Content Sources

Content Sources are the data inputs that populate your knowledge bases with up-to-date information. Seclai automatically fetches, processes, and indexes content from various sources, keeping your knowledge bases current without manual intervention.

What are Content Sources?

A source is a connection to external content that you want to make searchable. When you add a source to a knowledge base:

  1. Initial Polling: Seclai fetches content based on your seeding preferences
  2. Content Processing: Text is extracted, chunked, and embedded for vector search
  3. Automatic Updates: Content is refreshed based on your polling schedule
  4. Multi-Phase Processing: Audio/video is transcribed, and all content is indexed
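The chunking part of step 2 can be illustrated with a minimal sketch. It assumes fixed-size word windows with a small overlap; the function name, the window size, and the overlap are illustrative choices, not Seclai's actual chunking strategy.

```python
def chunk_text(text: str, max_words: int = 50, overlap: int = 10) -> list[str]:
    """Split text into overlapping word windows (sizes are illustrative)."""
    if max_words <= 0 or not 0 <= overlap < max_words:
        raise ValueError("need max_words > 0 and 0 <= overlap < max_words")
    words = text.split()
    step = max_words - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(120))
    for i, chunk in enumerate(chunk_text(sample)):
        print(i, len(chunk.split()), chunk.split()[0])
```

Overlapping windows keep a sentence that straddles a boundary present in both neighbouring chunks, which tends to help retrieval; each chunk would then be passed to an embedding model before indexing.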

Content Sources can be shared across multiple knowledge bases or organizations, with each connection maintaining independent settings for polling, retention, and indexing.
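One way to picture a shared source with independent per-connection settings is the sketch below. The class and field names (Source, SourceConnection, polling_interval_hours, retention_days, indexing_enabled) are hypothetical and chosen only for illustration; they are not Seclai's data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Source:
    """A single external content source (hypothetical shape)."""
    source_id: str
    url: str


@dataclass
class SourceConnection:
    """Links one source to one knowledge base with its own settings."""
    source: Source
    knowledge_base: str
    polling_interval_hours: int
    retention_days: int
    indexing_enabled: bool = True


# One source object, connected to two knowledge bases.
feed = Source("src-1", "https://example.com/feed.xml")
connections = [
    SourceConnection(feed, "support-kb", polling_interval_hours=1, retention_days=30),
    SourceConnection(feed, "research-kb", polling_interval_hours=24, retention_days=365),
]

# Changing one connection's schedule leaves the other untouched.
connections[0].polling_interval_hours = 6
```

Because the settings live on the connection rather than on the source, the same feed can be polled hourly for one knowledge base and daily for another.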

Source Types

RSS Feeds

RSS feed sources let Seclai automatically pull content from blogs, podcasts, news sites, and other syndicated publications.

Best For:

  • Blog posts and articles
  • Podcast episodes
  • News feeds
  • YouTube channels (via RSS)

Features:

  • Automatic detection of RSS/Atom feeds
  • Content metadata extraction (title, author, date)
  • Support for full content or summary feeds
  • Historical data seeding options
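The metadata extraction listed above can be sketched with Python's standard library. This is an illustration only: it handles plain RSS 2.0 items (Atom feeds use different, namespaced elements), and the sample feed, the function name, and the output keys are assumptions rather than Seclai's implementation.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# A tiny made-up RSS 2.0 feed used only for demonstration.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Blog</title>
    <item>
      <title>First post</title>
      <author>jane@example.com (Jane Doe)</author>
      <pubDate>Mon, 06 Jan 2025 09:30:00 GMT</pubDate>
      <description>Short summary of the post.</description>
    </item>
  </channel>
</rss>"""


def extract_items(rss_xml: str) -> list[dict]:
    """Return title, author, date, and summary for each RSS <item>."""
    root = ET.fromstring(rss_xml)
    items = []
    for item in root.iter("item"):
        pub_date = item.findtext("pubDate")
        items.append({
            "title": item.findtext("title", default=""),
            "author": item.findtext("author", default=""),
            # RSS dates use the RFC 822 format, which email.utils can parse.
            "published": parsedate_to_datetime(pub_date) if pub_date else None,
            "summary": item.findtext("description", default=""),
        })
    return items


if __name__ == "__main__":
    for entry in extract_items(SAMPLE_RSS):
        print(entry["title"], entry["published"])
```

A summary-only feed would populate just the description field, so a full pipeline would likely need to fetch the linked page to obtain the complete article text.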

Example Use Cases:

  • Monitor industry news and trends
  • Track competitor blog posts
  • Index podcast transcripts for searchability
  • Aggregate content from multiple sources

File Uploads

Upload documents and files directly for indexing.

Best For:

  • Internal documents
  • Custom content
  • One-time uploads

Supported Formats:

  • Text: .txt, .md, .html, .csv, .json, .xml
  • Documents: .pdf, .doc/.docx, .ppt/.pptx, .xls/.xlsx, .epub, .msg
  • Images: .png, .jpg, .gif, .bmp, .tiff, .webp
  • Audio (with transcription): .mp3, .wav, .m4a, .flac, .ogg
  • Video (with transcription): .mp4, .mov, .avi
  • Archives: .zip
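Before uploading, a client could check a file name against the list above. The sketch below copies the extensions from that list; the function name and the returned tuple shape are illustrative assumptions.

```python
from pathlib import Path
from typing import Optional

# Extensions copied from the supported-formats list above.
SUPPORTED_FORMATS = {
    "text": {".txt", ".md", ".html", ".csv", ".json", ".xml"},
    "document": {".pdf", ".doc", ".docx", ".ppt", ".pptx",
                 ".xls", ".xlsx", ".epub", ".msg"},
    "image": {".png", ".jpg", ".gif", ".bmp", ".tiff", ".webp"},
    "audio": {".mp3", ".wav", ".m4a", ".flac", ".ogg"},
    "video": {".mp4", ".mov", ".avi"},
    "archive": {".zip"},
}

# Categories the list marks as "with transcription".
TRANSCRIBED = {"audio", "video"}


def classify_upload(filename: str) -> tuple[Optional[str], bool]:
    """Return (category, needs_transcription); category is None if unsupported."""
    ext = Path(filename).suffix.lower()
    for category, extensions in SUPPORTED_FORMATS.items():
        if ext in extensions:
            return category, category in TRANSCRIBED
    return None, False


if __name__ == "__main__":
    for name in ("Report.PDF", "episode.mp3", "notes.rtf"):
        print(name, classify_upload(name))
```

The check is case-insensitive and looks only at the final extension, so a file such as backup.tar.gz would be treated as unsupported.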

Features:

  • No automatic polling (content changes only when you upload files manually)
  • Content filtering options
  • Custom embedding configuration
  • Direct file storage integration

Custom Index

Build your own index programmatically using the API.

Best For:

  • Integration with existing systems
  • Programmatic content management
  • Custom data sources
  • Dynamic content generation

Features:

  • API-driven content creation
  • Full control over content and metadata
  • Custom chunking and embedding
  • No automatic polling
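
As a rough picture of what "full control over content and metadata" might look like on the client side, the sketch below assembles records into a JSON body. Everything here is hypothetical: the IndexRecord class, the "records" key, and the field names are placeholders, not Seclai's request schema, and no endpoint is called. Consult the API reference for the real format.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class IndexRecord:
    """One item to index (hypothetical shape, not the real API schema)."""
    content: str
    title: str
    metadata: dict = field(default_factory=dict)


def build_payload(records: list[IndexRecord]) -> str:
    """Validate records and serialize them to a JSON string."""
    for record in records:
        if not record.content.strip():
            raise ValueError(f"record {record.title!r} has no content")
    return json.dumps({"records": [asdict(r) for r in records]}, indent=2)


if __name__ == "__main__":
    body = build_payload([
        IndexRecord("Restart the service after each deploy.", "Runbook: deploys",
                    {"team": "ops", "lang": "en"}),
    ])
    print(body)
```

Validating records before sending them keeps empty or malformed items out of the index, which matters here because a custom index has no automatic polling to correct content later.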