rag_chat_opensource_llm

An experiment with Retrieval Augmented Generation (RAG) over user-supplied data.

An AI chatbot with conversational memory that lets users discuss their CSV, PDF, and TXT data, web pages, and YouTube videos in a more intuitive manner. 🚀

For more background, see the Medium article 🖖: Build a chat-bot over your CSV data
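As a rough illustration of what "retrieval with conversational memory" means, here is a toy sketch in plain Python. It is not the code used in this repository: the function names (tokenize, retrieve, build_prompt) are hypothetical, and simple word overlap stands in for the embedding-based search a real RAG pipeline would use.

```python
import re

def tokenize(text):
    # Lowercase the text and split it into a set of alphanumeric words.
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, chunks, k=1):
    # Rank chunks by how many words they share with the question
    # (a stand-in for vector similarity) and keep the top k.
    q = tokenize(question)
    ranked = sorted(chunks, key=lambda c: len(q & tokenize(c)), reverse=True)
    return ranked[:k]

def build_prompt(question, chunks, history):
    # Combine retrieved context, earlier turns, and the new question
    # into the text that would be sent to the language model.
    context = "\n".join(retrieve(question, chunks))
    past = "\n".join(f"Q: {q}\nA: {a}" for q, a in history)
    return f"Context:\n{context}\n\nHistory:\n{past}\n\nQuestion: {question}"

# Hypothetical data: two CSV rows and one earlier exchange.
chunks = ["Alice,30,Paris", "Bob,25,Berlin"]
history = [("Who is listed?", "Alice and Bob.")]
prompt = build_prompt("Where does Bob live?", chunks, history)
```

Only the chunk that best matches the question is placed in the prompt, while the history keeps earlier turns available to the model.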

Running Locally 💻

Follow these steps to set up and run the service locally:

Prerequisites

Installation

Clone the repository:

git clone https://github.com/ly2xxx/rag_chat_opensource_llm

Navigate to the project directory:

cd rag_chat_opensource_llm

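Create and activate a virtual environment (the activation command below is for Windows; on macOS or Linux, use source .venv/bin/activate instead):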

python -m venv .venv
.\.venv\Scripts\activate

Install the required dependencies in the virtual environment:

pip install -r requirements.txt

Set up the GPT-2 tokenizer manually for the LangChain ConversationalRetrievalChain: copy models\models--gpt2 to C:\Users\[windows-username]\.cache\huggingface\hub\models--gpt2

gpt2 manual copy screenshot
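If you prefer to script the copy step, the following is a minimal sketch. The helper name copy_gpt2_cache is hypothetical, and the sketch assumes the Hugging Face cache sits at its default location under your home directory (.cache/huggingface/hub); if you have relocated the cache, pass the hub directory explicitly.

```python
import shutil
from pathlib import Path

def copy_gpt2_cache(repo_dir, hub_dir=None):
    # Source folder shipped with this repository.
    src = Path(repo_dir) / "models" / "models--gpt2"
    # Assumed default Hugging Face hub cache location.
    if hub_dir is None:
        hub_dir = Path.home() / ".cache" / "huggingface" / "hub"
    dst = Path(hub_dir) / "models--gpt2"
    # Create missing parent folders and merge into an existing copy
    # (dirs_exist_ok requires Python 3.8 or newer).
    shutil.copytree(src, dst, dirs_exist_ok=True)
    return dst

# Example, run from the project directory:
# copy_gpt2_cache(".")
```

On Windows, Path.home() resolves to C:\Users\[windows-username], so the destination matches the path given above.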

Launch the chat service locally:

streamlit run streamlit_app.py

That's it! The AI chatbot is now up and running locally. 🤗

website demo screenshot