Document Management

OpenGateLLM provides a comprehensive document management system to help you perform Retrieval-Augmented Generation (RAG). This allows you to store, process, and search through your documents to enhance AI responses with relevant context.

What is RAG?

Retrieval-Augmented Generation (RAG) is a technique that combines a language model with an external knowledge source. Instead of relying only on its internal training data, the model first retrieves relevant information from a database or document store, then uses that context to generate more accurate, up-to-date, and domain-specific responses.

Prerequisites

To use document management features, you need to configure a vector store. OpenGateLLM supports two vector databases:

For detailed setup instructions, see the Vector Store documentation.

Concepts

Document management organizes data in a hierarchical structure with three main entities:

Collection: Storage space for documents and chunks
Document: Text extracted from a file
Chunk: A portion of text split from a document

How it Works

OpenGateLLM allow to upload files and process them into documents and chunks. Chunks are the smallest units in the vector store, representing portions of text from documents. Each chunk is vectorized and can be retrieved during search operations to add more context of you LLM requests. When you import a file, it goes through multiple phases:

File: The original file (PDF, JSON, Markdown, HTML, etc.)
Parsing: Text extraction from the file
Document: Extracted text with metadata
Chunking: Splitting the document into smaller pieces
Chunks: Text portions with their vectors
Vectorization: Converting chunks to embeddings using an embedding model
Indexation: Storing chunks and vectors in the database

When you upload a file to create a document, the system processes it through multiple stages involving validation, parsing, chunking, vectorization, and storage. Here's the complete flow:

The processing involves several key components:

API Endpoint: Handles HTTP requests and validation
Document Manager: Orchestrates the document creation process
Parser Manager: Extracts text from various file formats (PDF, JSON, Markdown, HTML)
Vector Store: Stores chunks and their vector embeddings (Qdrant or Elasticsearch)
Embedding Model: Converts text chunks into vector representations
PostgreSQL: Stores document metadata and relationships

What is RAG?​

Prerequisites​

Concepts​

How it Works​

What is RAG?

Prerequisites

Concepts

How it Works