Production-ready Chainlit RAG application with a Pinecone pipeline, offering Groq and OpenAI models to chat with your documents.


RAG-nificent: An AI Chatbot Powered by LLMs for Citation of Custom PDFs, Reports, and Guidelines

Disclaimer: This project is maintained on a best-effort basis, as I am not actively using it. APIs may change, and some models have been decommissioned. Please refer to the demo video for the current recommended usage.

Now Supports Latest GPT-5, GPT-4.1, and Advanced LLM Models

Python · License: CC BY-NC 4.0 · Docker · LangChain · Chainlit · Pinecone

RAG-nificent Logo

RAG-nificent leverages Retrieval-Augmented Generation (RAG) to provide instant answers and references from a curated directory of PDFs on a given topic, such as WHO recommendation documents. The system is designed to help researchers, policymakers, and the public quickly find specific information within extensive documents.

Features

  • Conversational Interface: Engage with the system using natural language queries to receive responses directly sourced from the PDFs.
  • Direct Citation: Every response from the system includes a direct link to the source PDF page, ensuring traceability and verification.
  • PDF Directory: A predefined set of key PDF documents, currently including WHO recommendations on major health topics such as schistosomiasis and malaria.

Available Models

OpenAI Models

  • 🚀 gpt-5-mini-2025-08-07: Latest GPT-5 model with advanced capabilities
  • 🚀 gpt-5-nano-2025-08-07: GPT-5 nano version for efficient processing
  • 📘 gpt-4.1-2025-04-14: GPT-4.1 full version with enhanced reasoning
  • 📘 gpt-4.1-mini-2025-04-14: GPT-4.1 mini version for cost-effective use
  • 💬 chatgpt-4o-latest: Latest ChatGPT-4o with improved conversational abilities
  • gpt-4-turbo-2024-04-09: GPT-4 Turbo optimized for complex reasoning
  • 💎 gpt-4o-mini-2024-07-18: GPT-4o mini for efficient general-purpose tasks

Groq Models

  • 🦙 llama-3.1-8b-instant: Fast 8B model for quick interactions
  • 🦙 llama-3.3-70b-versatile: Versatile 70B model with enhanced capabilities
  • 🔬 deepseek-r1-distill-llama-70b: Distilled 70B model for efficient processing
  • 🌐 qwen/qwen3-32b: Qwen 32B model for multilingual tasks
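In Chainlit, a model picker like this is typically exposed through chat profiles. The following is a hypothetical sketch of how the model list could be surfaced, using Chainlit's chat-profile API; the actual app may wire model selection differently, and the model subset shown is illustrative.

# Hypothetical sketch: exposing the model list as Chainlit chat profiles.
import chainlit as cl

MODELS = [
    "gpt-5-mini-2025-08-07",
    "gpt-4.1-mini-2025-04-14",
    "llama-3.3-70b-versatile",
]

@cl.set_chat_profiles
async def chat_profiles():
    # One selectable profile per model name.
    return [
        cl.ChatProfile(name=m, markdown_description=f"Chat using {m}")
        for m in MODELS
    ]

@cl.on_chat_start
async def on_chat_start():
    # The profile the user picked is available in the user session.
    model = cl.user_session.get("chat_profile")
    cl.user_session.set("model", model)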

Demo

demo.mov

How It Works

The application combines OpenAI embeddings, Pinecone vector search, and a conversational interface to provide a seamless retrieval experience. When a query is made, the system:

  1. Converts the query into embeddings.
  2. Searches for the most relevant document sections using Pinecone's vector search.
  3. Returns the answer along with citations and links to the source documents.
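As a concrete illustration, here is a minimal sketch of that flow using the OpenAI and Pinecone Python clients. It is not the application's actual code: the embedding model and the metadata keys (text, source, page) are assumptions.

# Minimal sketch of the query flow; not the app's actual implementation.
import os
from openai import OpenAI
from pinecone import Pinecone

openai_client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
index = Pinecone(api_key=os.environ["PINECONE_API_KEY"]).Index(
    os.environ["PINECONE_INDEX_NAME"]
)

def retrieve(query: str, top_k: int = 4):
    # 1. Convert the query into a 1536-dim embedding.
    embedding = openai_client.embeddings.create(
        model="text-embedding-ada-002",  # assumed; must match the index dims
        input=query,
    ).data[0].embedding

    # 2. Vector-search Pinecone for the most relevant document chunks.
    results = index.query(
        vector=embedding,
        top_k=top_k,
        include_metadata=True,
        namespace=os.environ.get("PINECONE_NAME_SPACE"),
    )

    # 3. Return each chunk's text plus its source citation
    #    (metadata keys are assumed to be set at ingestion time).
    return [
        (m.metadata.get("text"), m.metadata.get("source"), m.metadata.get("page"))
        for m in results.matches
    ]

The retrieved chunks and citations are then passed to the selected LLM to compose the final answer.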

Setup

Option 1: Standard Setup

  1. Clone the repository:

    git clone https://github.com/yourusername/RAG-nificent.git
  2. Install dependencies:

    pip install -r requirements.txt
  3. Set environment variables in a .env file (also see .env.example):

    • PINECONE_INDEX_NAME
    • PINECONE_NAME_SPACE
    • OPENAI_API_KEY
    • PINECONE_API_KEY
    • GROQ_API_KEY
  4. Create a Pinecone index with the same name as PINECONE_INDEX_NAME. Set it up with dimensions=1536 and metric=cosine (a scripted sketch follows these steps).


  5. Place your PDFs in the pdf_data directory and run data_ingestion.py (also covered in the sketch below).

  6. Run the application:

    chainlit run src/app.py
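
Steps 4 and 5 can be scripted. The following is a minimal sketch, not the repository's actual data_ingestion.py: it assumes the current Pinecone client and LangChain's Pinecone integration (langchain-pinecone, langchain-openai, langchain-community), and the cloud region and chunking parameters are illustrative.

# Minimal sketch of index creation + PDF ingestion (steps 4 and 5).
import os

from pinecone import Pinecone, ServerlessSpec
from langchain_community.document_loaders import PyPDFDirectoryLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_openai import OpenAIEmbeddings
from langchain_pinecone import PineconeVectorStore

index_name = os.environ["PINECONE_INDEX_NAME"]
pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])

# Step 4: create the index if it does not exist (1536 dims, cosine metric).
if index_name not in pc.list_indexes().names():
    pc.create_index(
        name=index_name,
        dimension=1536,
        metric="cosine",
        spec=ServerlessSpec(cloud="aws", region="us-east-1"),  # illustrative
    )

# Step 5: load PDFs, split into chunks, embed, and upsert to Pinecone.
docs = PyPDFDirectoryLoader("pdf_data").load()
chunks = RecursiveCharacterTextSplitter(
    chunk_size=1000, chunk_overlap=100  # illustrative chunking parameters
).split_documents(docs)

PineconeVectorStore.from_documents(
    chunks,
    embedding=OpenAIEmbeddings(),  # 1536-dim ada-002 embeddings by default
    index_name=index_name,
    namespace=os.environ.get("PINECONE_NAME_SPACE"),
)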

Option 2: Docker Setup

  1. Clone the repository:

    git clone https://github.com/yourusername/RAG-nificent.git
  2. Create a .env file with the required environment variables (see .env.example):

    • PINECONE_INDEX_NAME
    • PINECONE_NAME_SPACE
    • OPENAI_API_KEY
    • PINECONE_API_KEY
    • GROQ_API_KEY
  3. Build and run using Docker Compose:

    docker-compose up -d
  4. Access the application at http://localhost:8000

  5. To ingest PDFs with Docker:

    # Place your PDFs in the pdf_data directory first
    docker-compose exec ragnificent python data_ingestion.py
  6. To stop the application:

    docker-compose down

Development with Docker

For development purposes, you can use the following commands:

# Build the Docker image
docker-compose build

# Run the application in development mode (with live reloading)
docker-compose up

# View logs
docker-compose logs -f

# Run a specific command inside the container
docker-compose exec ragnificent <command>

Source Documents

The system currently includes WHO guideline documents (e.g., on schistosomiasis and malaria), with direct links to each source PDF.
