Gaia RAG: PDF Question-Answering with Gaia and Qdrant

Gaia PDF RAG is a Retrieval-Augmented Generation (RAG) application that allows users to ask questions about PDF documents using a local Gaia node and Qdrant vector database. It combines the power of local LLMs with efficient vector search to provide accurate, context-aware answers.

Multiple Files Example

Features

📑 PDF document processing and chunking
🔍 Semantic search using Qdrant vector database
🤖 Local LLM integration through Gaia node
↗️ Cross-encoder reranking for improved relevance
💨 Streaming responses for better UX
🎯 Smart source citation
⚡ Relevance filtering to prevent hallucinations

Prerequisites

Before running GaiaRAG, ensure you have:

A local Gaia node running (Check this link to learn how to run your own local LLM: https://docs.gaianet.ai/node-guide/quick-start)
Qdrant server running
Python 3.8+
Required system libraries for PDF processing

Installation

Clone the repository:

git clone https://github.com/harishkotra/gaia-pdf-rag.git
cd gaiarag

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Setting Up Components

1. Gaia Node

Start your local Gaia node:

gaianet init
gaianet start

2. Qdrant Server

Start Qdrant using Docker:

docker run -d -p 6333:6333 -p 6334:6334 \
    -v $(pwd)/qdrant_storage:/qdrant/storage \
    qdrant/qdrant

Running the Application

Make sure both Gaia node and Qdrant are running
Start the Streamlit app:

streamlit run app.py

Open your browser at http://localhost:8501

Usage

Upload a PDF document using the sidebar
Click "Process Document" to index it
Ask questions in the main input field
View answers and relevant source documents

Configuration

You can modify the following parameters in app.py:

GAIA_NODE_URL: URL of your local Gaia node
QDRANT_HOST: Qdrant server host
QDRANT_PORT: Qdrant server port
VECTOR_SIZE: Embedding dimension size
COLLECTION_NAME: Name for vector database collection

Project Structure

gaia-pdf-rag/
├── app.py              # Main Streamlit application
├── requirements.txt    # Python dependencies
├── .gitignore          # Gitignore file
├── README.md           # This file

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Credits

Inspired by this example.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gaia_rag_app.py		gaia_rag_app.py
gaia_rap_app_multiple_files.py		gaia_rap_app_multiple_files.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gaia RAG: PDF Question-Answering with Gaia and Qdrant

Multiple Files Example

Features

Prerequisites

Installation

Setting Up Components

1. Gaia Node

2. Qdrant Server

Running the Application

Usage

Configuration

Project Structure

Contributing

Credits

About

Releases

Packages

Languages

License

harishkotra/gaia-pdf-rag

Folders and files

Latest commit

History

Repository files navigation

Gaia RAG: PDF Question-Answering with Gaia and Qdrant

Multiple Files Example

Features

Prerequisites

Installation

Setting Up Components

1. Gaia Node

2. Qdrant Server

Running the Application

Usage

Configuration

Project Structure

Contributing

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages