Content Sources
Content Sources are the data inputs that populate your knowledge bases with up-to-date information. Seclai automatically fetches, processes, and indexes content from various sources, keeping your knowledge bases current without manual intervention.
What are Content Sources?
A source is a connection to external content that you want to make searchable. When you add a source to a knowledge base, Seclai handles the following:
- Initial Polling: Seclai fetches content based on your seeding preferences
- Content Processing: Text is extracted, chunked, and embedded for vector search
- Automatic Updates: Content is refreshed based on your polling schedule
- Multi-Phase Processing: Audio/video is transcribed, and all content is indexed
Content Sources can be shared across multiple knowledge bases or organizations, with each connection maintaining independent settings for polling, retention, and indexing.
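To make the "chunked" part of Content Processing concrete, here is a minimal sketch of word-based chunking with overlap. It is only an illustration of the general technique: the function name `chunk_text` and the `chunk_size` and `overlap` parameters are assumptions for this example, not Seclai's actual implementation or settings.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into word-based chunks; consecutive chunks share `overlap` words.

    Hypothetical helper for illustration only (not part of Seclai).
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in the range [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks
```

Overlap keeps a sentence that straddles a chunk boundary retrievable from either side, at the cost of embedding some words twice. Each resulting chunk would then be passed to an embedding model for vector search.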
Source Types
RSS Feeds
RSS feed sources automatically pull content from blogs, podcasts, news sites, and other syndicated publications.
Best For:
- Blog posts and articles
- Podcast episodes
- News feeds
- YouTube channels (via RSS)
Features:
- Automatic detection of RSS/Atom feeds
- Content metadata extraction (title, author, date)
- Support for full content or summary feeds
- Historical data seeding options
Example Use Cases:
- Monitor industry news and trends
- Track competitor blog posts
- Index podcast transcripts for searchability
- Aggregate content from multiple sources
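As a rough illustration of the metadata extraction listed under Features (title, author, date), the sketch below parses a small RSS 2.0 document with Python's standard library. The sample feed and the `parse_rss_items` helper are made up for this example; how Seclai parses feeds internally is not described here.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# A tiny, made-up RSS 2.0 feed used only for this example.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Blog</title>
    <item>
      <title>First post</title>
      <author>jane@example.com (Jane Doe)</author>
      <pubDate>Mon, 06 Jan 2025 09:30:00 GMT</pubDate>
      <link>https://example.com/first</link>
    </item>
  </channel>
</rss>"""


def parse_rss_items(xml_text: str) -> list[dict]:
    """Return title, author, publication date, and link for each <item>."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        pub_date = item.findtext("pubDate")
        items.append({
            "title": item.findtext("title"),
            "author": item.findtext("author"),
            # RSS 2.0 dates use the RFC 822 format, which email.utils can parse.
            "published": parsedate_to_datetime(pub_date) if pub_date else None,
            "link": item.findtext("link"),
        })
    return items


items = parse_rss_items(SAMPLE_FEED)
```

Atom feeds use different element names and XML namespaces, so a real ingester would need a second code path for them.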
File Uploads
Upload documents and files directly for indexing.
Best For:
- Internal documents
- Custom content
- One-time uploads
Supported Formats:
- Text: .txt, .md, .html, .csv, .json, .xml
- Documents: .pdf, .doc/.docx, .ppt/.pptx, .xls/.xlsx, .epub, .msg
- Images: .png, .jpg, .gif, .bmp, .tiff, .webp
- Audio (with transcription): .mp3, .wav, .m4a, .flac, .ogg
- Video (with transcription): .mp4, .mov, .avi
- Archives: .zip
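If you want to check files before uploading them, the format list above can be expressed as a simple lookup. This is a client-side sketch: the names `SUPPORTED_FORMATS`, `classify_upload`, and `needs_transcription` are hypothetical, and the extension sets are copied from the list above rather than from any Seclai API.

```python
from pathlib import Path
from typing import Optional

# Extensions copied from the Supported Formats list above.
SUPPORTED_FORMATS = {
    "text": {".txt", ".md", ".html", ".csv", ".json", ".xml"},
    "document": {".pdf", ".doc", ".docx", ".ppt", ".pptx",
                 ".xls", ".xlsx", ".epub", ".msg"},
    "image": {".png", ".jpg", ".gif", ".bmp", ".tiff", ".webp"},
    "audio": {".mp3", ".wav", ".m4a", ".flac", ".ogg"},
    "video": {".mp4", ".mov", ".avi"},
    "archive": {".zip"},
}


def classify_upload(filename: str) -> Optional[str]:
    """Return the format category for a file name, or None if it is not listed."""
    ext = Path(filename).suffix.lower()  # compare case-insensitively
    for category, extensions in SUPPORTED_FORMATS.items():
        if ext in extensions:
            return category
    return None


def needs_transcription(filename: str) -> bool:
    """Audio and video uploads are the ones listed as transcribed."""
    return classify_upload(filename) in {"audio", "video"}
```

For example, `classify_upload("Report.PDF")` returns `"document"`, while a file whose extension is not in the list above returns `None` and can be rejected or converted before upload.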
Features:
- No automatic polling (content changes only when you upload files manually)
- Content filtering options
- Custom embedding configuration
- Direct file storage integration
Custom Index
Build your own index programmatically using the API.
Best For:
- Integration with existing systems
- Programmatic content management
- Custom data sources
- Dynamic content generation
Features:
- API-driven content creation
- Full control over content and metadata
- Custom chunking and embedding
- No automatic polling
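Because a Custom Index gives you full control over content and metadata, it can help to model each record explicitly before sending it. The sketch below only builds a JSON body and makes no network call. The `IndexDocument` class, its field names, and the `documents` wrapper key are all assumptions made for illustration; the real endpoint, authentication, and request schema must come from the Seclai API reference.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class IndexDocument:
    """Hypothetical record shape; field names are NOT taken from the Seclai API."""
    title: str
    content: str
    metadata: dict = field(default_factory=dict)


def build_payload(documents: list[IndexDocument]) -> str:
    """Serialize documents to a JSON string (assumed wrapper key: 'documents')."""
    return json.dumps({"documents": [asdict(d) for d in documents]}, sort_keys=True)


payload = build_payload([
    IndexDocument(
        title="Runbook: database failover",
        content="Step 1: confirm replica lag is below the alert threshold.",
        metadata={"team": "ops", "source_system": "internal-wiki"},
    )
])
```

Since a Custom Index has no automatic polling, your own system decides when to send new or updated records.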