Changelog

Stay up to date with all the latest changes and improvements to RAGaaS.


January 9, 2025

Added OneDrive integration and Jina AI embedding models for enhanced content search capabilities.

🚀 Features

  • Added OneDrive connector to connect your OneDrive with RAGaaS (docs):
    • New endpoint to add a new OneDrive connection
    • Secure OAuth2 flow for OneDrive access and file selection
    • Support for PDF, CSV, DOCX, TXT, MD, and PPTX files
    • Integration with both personal and business accounts
    • Required Microsoft Graph permissions: Files.Read.All, offline_access, openid, User.Read
  • Updated namespace:
    • Added OneDrive configuration to namespace
  • Added Jina AI as a new embedding model provider (docs):
    • High-performance jina-embeddings-v3 model with 1024 dimensions

🔄 API Changes

  • Updated /v1/connections endpoint for creating and managing OneDrive connections (docs)
  • Added /v1/ingest/onedrive endpoint for ingesting selected OneDrive files (docs)
  • Updated namespace configuration to support:
    • OneDrive connector settings (docs)
    • Jina embedding model in embeddingModelConfig (docs)
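
As a rough illustration of the namespace change, the sketch below builds a configuration fragment that selects the Jina model. Only `embeddingModelConfig`, `jina-embeddings-v3`, and the 1024 dimension count come from this changelog. The inner keys (`provider`, `model`, `dimensions`, `apiKey`) and the helper name are assumptions made for the example, so check the namespace reference for the actual schema.

```python
import json

JINA_MODEL = "jina-embeddings-v3"  # model name from the changelog
JINA_DIMENSIONS = 1024             # dimension count from the changelog


def build_jina_embedding_config(api_key: str) -> dict:
    """Return a namespace fragment that selects the Jina embedding model.

    Only the outer `embeddingModelConfig` key is named in the changelog;
    every inner key below is a hypothetical placeholder.
    """
    return {
        "embeddingModelConfig": {
            "provider": "jina",             # assumed key and value
            "model": JINA_MODEL,            # assumed key
            "dimensions": JINA_DIMENSIONS,  # assumed key
            "apiKey": api_key,              # assumed key
        }
    }


if __name__ == "__main__":
    # Print the fragment so it can be compared against the real schema.
    print(json.dumps(build_jina_embedding_config("YOUR_JINA_API_KEY"), indent=2))
```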

📝 Documentation

  • New guide on how to use the OneDrive connector (docs)
  • Added Jina embedding models to supported models list with configuration examples (docs)
  • Updated the namespace reference to include new configurations (docs)
  • Added new pages to sitemap.xml for improved SEO

January 8, 2025

Added Dropbox integration to connect and search your Dropbox content.

🚀 Features

  • Added Dropbox connector to connect your Dropbox with RAGaaS (docs):
    • New endpoint to add a new Dropbox connection
    • Secure OAuth2 flow for Dropbox access and file selection
    • Support for PDF, CSV, DOCX, TXT, MD, and PPTX files
  • Updated namespace:
    • Added Dropbox configuration to namespace
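
If you want to pre-check a file selection on the client side, the supported types listed above can be expressed as a small extension filter. This is only a sketch: the helper names are hypothetical, and matching extensions case-insensitively is an assumption rather than documented RAGaaS behavior.

```python
from pathlib import PurePosixPath

# File types listed in the changelog for the Dropbox connector.
SUPPORTED_EXTENSIONS = {".pdf", ".csv", ".docx", ".txt", ".md", ".pptx"}


def is_supported(filename: str) -> bool:
    """Return True if the file extension is in the supported list.

    The comparison ignores case, which is an assumption of this sketch.
    """
    return PurePosixPath(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def split_by_support(filenames):
    """Partition filenames into (supported, unsupported), preserving order."""
    supported = [name for name in filenames if is_supported(name)]
    unsupported = [name for name in filenames if not is_supported(name)]
    return supported, unsupported
```

For example, `split_by_support(["notes.md", "photo.png"])` puts `notes.md` in the first list and `photo.png` in the second.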

🔄 API Changes

  • Updated /v1/connections endpoint for creating and managing Dropbox connections (docs)
  • Added /v1/ingest/dropbox endpoint for ingesting selected Dropbox files (docs)
  • Updated namespace configuration to support Dropbox settings (docs)

📝 Documentation

  • New guide on how to use the Dropbox connector (docs)
  • Updated the namespace reference to include Dropbox configuration (docs)
  • Added new pages to sitemap.xml for improved SEO

January 7, 2025

Added Google Drive integration to connect and search your Drive content.

🚀 Features

  • Added Google Drive connector to connect your Drive with RAGaaS (docs):
    • New endpoint to add a new Drive connection
    • Secure OAuth2 flow for Drive access and file selection
    • Support for Google Docs, Sheets, and native files
  • Updated namespace:
    • Added Google Drive configuration to namespace

🔄 API Changes

  • Updated /v1/connections endpoint for creating and managing Drive connections (docs)
  • Added /v1/ingest/google-drive endpoint for ingesting selected Drive files (docs)
  • Updated namespace configuration to support Drive settings (docs)

📝 Documentation

  • New guide on how to use the Google Drive connector (docs)
  • Updated the namespace reference to include Drive configuration (docs)
  • Added new pages to sitemap.xml for improved SEO

January 6, 2025

Major improvements to document connectivity with the addition of the Notion connector.

🚀 Features

  • Added Notion connector to connect your Notion workspace with RAGaaS (docs):
    • New endpoint to add a new Notion connection
    • Secure OAuth2 flow for Notion workspace access and content ingestion
  • Updated namespace:
    • Added Notion configuration to namespace

🔄 API Changes

  • Added /v1/connections endpoint for creating and fetching connections (docs)
  • Added /v1/connections/:connectionId endpoint for getting and managing a particular connection (docs)
  • Added /v1/ingest/notion endpoint for ingesting Notion content from all the pages you select during the OAuth flow (docs)
  • Updated namespace configuration to support Notion settings (docs)
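
The endpoint paths above can be assembled with a small helper like the one sketched here. The paths `/v1/connections`, `/v1/connections/:connectionId`, and `/v1/ingest/notion` come from this changelog. The base URL is a placeholder and the function names are hypothetical. The sketch only builds URLs and does not send requests, since request bodies and authentication are described in the endpoint reference rather than here.

```python
from typing import Optional
from urllib.parse import quote

BASE_URL = "https://api.example.com"  # placeholder, not the real RAGaaS host

CONNECTIONS_PATH = "/v1/connections"      # from the changelog
NOTION_INGEST_PATH = "/v1/ingest/notion"  # from the changelog


def connections_url(connection_id: Optional[str] = None,
                    base_url: str = BASE_URL) -> str:
    """Build the URL for the connection collection or a single connection."""
    path = CONNECTIONS_PATH
    if connection_id is not None:
        # Percent-encode the ID so it always stays a single path segment.
        path += "/" + quote(connection_id, safe="")
    return base_url.rstrip("/") + path


def notion_ingest_url(base_url: str = BASE_URL) -> str:
    """Build the URL for the Notion ingestion endpoint."""
    return base_url.rstrip("/") + NOTION_INGEST_PATH
```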

📝 Documentation

  • New guide on how to use the Notion connector (docs)
  • New endpoint reference for connectors (docs)
  • Updated the namespace reference to include Notion configuration (docs)
  • Added new pages to sitemap.xml for improved SEO and discoverability

January 5, 2025

Major improvements to document ingestion capabilities, introducing direct file uploads and API standardization.

🚀 Features

  • Added direct file upload support with multiple formats (docs):
    • Documents: .pdf, .docx, .pptx, .xlsx
    • Text-based formats: .csv, .json, .xml, .html
    • Upload files directly through the API without needing public URLs
  • Enhanced document model with ingestion tracking (docs):
    • Added ingestJob field
    • Added ingestJobRun field

🔄 API Changes

  • Added new /ingest/file endpoint for direct file uploads (docs)
  • Added web scraper configuration support in namespaces
  • Standardized document filter parameters to camelCase (docs):
    • document_ids → documentIds
    • document_external_ids → documentExternalIds
    • document_types → documentTypes
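
Clients that still send the old snake_case filter names could migrate with a small renaming step like the sketch below. The three renames are the ones listed in this changelog; the function names and the pass-through of unrelated keys are assumptions of the example.

```python
# Old name -> new name, as listed in the changelog.
RENAMED_FILTERS = {
    "document_ids": "documentIds",
    "document_external_ids": "documentExternalIds",
    "document_types": "documentTypes",
}


def snake_to_camel(name: str) -> str:
    """Convert a lowercase snake_case name to camelCase."""
    head, *tail = name.split("_")
    return head + "".join(part.capitalize() for part in tail)


def migrate_filters(params: dict) -> dict:
    """Rename the three known filter keys and leave every other key untouched."""
    return {RENAMED_FILTERS.get(key, key): value for key, value in params.items()}
```

The explicit mapping is used for the migration so that unrelated parameters are never renamed by accident; `snake_to_camel` is included only to show that the new names follow the usual camelCase rule.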

📝 Documentation

  • Added sitemap.xml for improved SEO and discoverability
  • Updated RAG flow diagrams in "What is RAG?" guide:
    • Added light mode versions
    • Improved contrast and readability
    • Enhanced visual consistency across themes

January 4, 2025 🚀

We're excited to announce the official launch of RAGaaS - a privacy-focused, self-serve platform for building AI-powered search and Q&A applications!

🚀 Core Features

  • Privacy-First Architecture:

    • Bring your own S3-compatible storage (docs)
    • Bring your own vector database (docs)
    • Use your own LLM API keys (docs)
    • No data retention or training on your content
    • EU-based metadata storage
  • Document Processing:

    • Raw text ingestion (docs)
    • Web scraping capabilities (docs):
      • URLs: Process web pages and files available through public URLs
      • Sitemaps: Automatically process all pages in a sitemap
      • Websites: Intelligently crawl websites with custom rules
    • Custom chunking configuration (docs)
  • Search & Retrieval:

    • Hybrid search combining semantic and keyword approaches (docs)
    • Support for multiple embedding models (docs):
      • OpenAI: State-of-the-art embeddings
      • Cohere: Multilingual support
    • Basic metadata filtering (key=value pairs) (docs)
    • Multilingual search support (docs)
  • Developer Experience:

    • Comprehensive API access (docs)
    • Detailed documentation (docs)
    • Multiple code examples (cURL, JavaScript, Python)

📈 Launch Plans

  • Pilot ($99/month):

    • 5,000 monthly ingestion pages
    • 25,000 monthly retrieval calls
    • ~50 requests/minute
    • Email support
  • Pro ($299/month):

    • 25,000 monthly ingestion pages
    • 100,000 monthly retrieval calls
    • ~200 requests/minute
    • Standard support
  • Team ($999/month):

    • 100,000 monthly ingestion pages
    • 500,000 monthly retrieval calls
    • ~500 requests/minute
    • Priority support
  • Enterprise (Custom pricing):

    • Custom usage limits
    • Custom rate limits
    • Dedicated SLA
    • White-glove support
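
To stay under a plan's rate limit, a client can space its requests evenly. The sketch below turns the approximate per-minute figures above into a minimum interval between calls (for example, about 50 requests/minute works out to 60 / 50 = 1.2 seconds). The `Throttle` class is a hypothetical client-side helper, not part of RAGaaS, and because the published limits are approximate, the server's actual enforcement may differ.

```python
import time

# Approximate limits from the launch plans (requests per minute).
PLAN_REQUESTS_PER_MINUTE = {"pilot": 50, "pro": 200, "team": 500}


def min_interval_seconds(plan: str) -> float:
    """Return the even spacing between requests that matches the plan's limit."""
    return 60.0 / PLAN_REQUESTS_PER_MINUTE[plan]


class Throttle:
    """Block before each request so calls are at least one interval apart."""

    def __init__(self, plan: str, clock=time.monotonic, sleep=time.sleep):
        self.interval = min_interval_seconds(plan)
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = None  # time at which the next call may start

    def wait(self) -> float:
        """Sleep if needed and return the number of seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.interval
        return delay
```

Call `throttle.wait()` immediately before each API request; the first call returns without sleeping.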

🗺️ Roadmap

  • Enhanced Processing:

    • Direct file uploads (coming tomorrow!)
    • OCR for tables & images in PDFs
    • Advanced metadata filtering
    • Additional embedding models (Voyage, Mistral)
    • Additional vector databases (Qdrant, Weaviate)
  • Advanced Features:

    • Advanced RAG pipeline with reranking
    • Multi-step retrieval
    • Real-time content updates
    • Custom embeddings & fine-tuning
  • Integrations:

    • Google Drive
    • Notion
    • SharePoint
    • Dropbox
    • And more connectors
  • Developer Tools:

    • Webhooks for real-time notifications
    • Advanced analytics
    • Usage pattern monitoring
    • Search quality metrics