RAG System with Synthetic Data Generation and Evaluation using RAGAS

This project implements a Retrieval-Augmented Generation (RAG) system with FastAPI for real-time querying and evaluation. The system integrates multiple technologies for document indexing, context retrieval, and conversational memory. It also includes functionality for generating synthetic data and evaluating the system's performance using RAGAS.

Features

RAG System: Utilizes LlamaIndex for document indexing and retrieval, LangChain for managing conversation history, and OpenAI models for generating responses.
FastAPI Integration: Provides a web API for querying the RAG system with support for real-time response streaming.
Synthetic Data Generation: Generates synthetic question-answer pairs relevant to your specific use case using OpenAI's GPT models.
Data Formatting and Evaluation: Formats generated data for RAGAS and evaluates the RAG system's performance.

Components

RAG System (RAGsystem.py):
- Sets up FastAPI with endpoints for querying.
- Uses LlamaIndex for document indexing and retrieval.
- Manages conversational history with LangChain.
Synthetic Data Generation (GenerateData.py):
- Generates synthetic question-answer pairs using OpenAI's GPT models.
- Formats and saves the data in JSON for RAGAS evaluation.
Data Formatting for RAGAS (formattedDS.py):
- Converts the synthetic data into a format suitable for RAGAS evaluation.
Evaluation Script (RagasEval.py):
- Loads the formatted dataset and evaluates the RAG system using RAGAS metrics.
- Analyzes and prints the results for various evaluation metrics.

Getting Started

Prerequisites

Python 3.8+
OpenAI API key
LlamaIndex API key
Required Python packages (listed in requirements.txt)

Installation

Clone the repository:

git clone https://github.com/qamar100/RAGSystem-with-RAGAS.git
cd RAGSystem-with-RAGAS

Install the required packages:
```
pip install -r requirements.txt
```
Set up environment variables:

Create a .env file in the root directory with the following content:
```
OPENAI_API_KEY=your_openai_api_key
LLAMA_CLOUD_API_KEY=your_llama_cloud_api_key
```

Usage

Generate Synthetic Data:

Run the script to generate synthetic question-answer pairs relevant to your specific use case:
```
python GenerateData.py
```
Note: Modify the prompts in GenerateData.py to match your specific use case. Adjust the system and user messages to ensure they are relevant to your domain.
Format Data for RAGAS:

Convert the generated data into RAGAS-compatible format:
```
python formattedDS.py
```
Note: After generating and formatting the data, review the qa_dataset.json and ragas_dataset.json files. Make any necessary adjustments to ensure data quality before proceeding with evaluation.
Run the RAG System:

Start the FastAPI server:
```
python RAGsystem.py
```
Note: Update the prompts in RAGsystem.py to suit your specific use case. Modify the query handling and response generation to fit your requirements.
Evaluate the RAG System:

Run the evaluation script to assess the system's performance:
```
python RagasEval.py
```

API Endpoints

POST /query: Accepts a JSON payload with a text field containing the query. Returns the RAG system's response.

Results

Evaluation results are printed to the console, including metrics such as faithfulness, answer relevancy, context recall, and context utilization.

Contributing

Feel free to submit issues or pull requests to improve the system.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
GenerateData.py		GenerateData.py
RAGsystem.py		RAGsystem.py
README.md		README.md
RagasEval.py		RagasEval.py
formattedDS.py		formattedDS.py
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG System with Synthetic Data Generation and Evaluation using RAGAS

Features

Components

Getting Started

Prerequisites

Installation

Usage

API Endpoints

Results

Contributing

About

Uh oh!

Releases

Packages

Languages

qamar100/RAGSystem-with-RAGAS

Folders and files

Latest commit

History

Repository files navigation

RAG System with Synthetic Data Generation and Evaluation using RAGAS

Features

Components

Getting Started

Prerequisites

Installation

Usage

API Endpoints

Results

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages