Chat with PDFs: A Chainlit-Powered PDF Chatbot

Overview

This project is a PDF chatbot application powered by Chainlit, LangChain, and Chroma. The app allows users to upload PDF and text documents, process them into chunks, and then query them conversationally. By leveraging the latest in AI-based embeddings and language models, the chatbot provides insightful answers while referencing relevant document sources.

Features

File Upload Support: Users can upload PDF and plain text files to interact with their content.
Chunk Processing: Large documents are split into manageable chunks for efficient search and retrieval.
Vector Search with Chroma: Documents are embedded using SentenceTransformers and stored in Chroma, a fast and scalable vector store.
Conversational QA: Powered by ChatGroq, users can ask questions in natural language and receive detailed responses with cited sources.
Real-Time Document Updates: Uploaded files are dynamically processed and added to the document corpus.
Streamed Responses: Answers are streamed for an engaging user experience.

Technologies Used

Frameworks and Libraries

Chainlit: For building and deploying the chatbot interface.
LangChain: To manage document loaders, text splitting, and retrieval chains.
SentenceTransformers: For generating embeddings of document text and user queries.
Chroma: A high-performance vector database for storing and searching document embeddings.

Language Models

ChatGroq: A state-of-the-art LLM for generating responses to user queries.

Utilities

RecursiveCharacterTextSplitter: For chunking large documents.
PyPDFLoader: For extracting content from PDF files.
TextLoader: For loading plain text files.

Installation

Prerequisites

Python 3.8+
pip: Package manager for Python.

Steps

Clone the repository:

git clone https://github.com/oss-bit/PDF-Chat.git
cd PDF-Chat

Install the required packages:
```
pip install -r requirements.txt
```
Set up API keys for ChatGroq:
- Replace the placeholder qroq_api_key in the code with your actual API key.

Usage

Start the chatbot:
```
chainlit run main.py -w
```
Open the browser interface (usually at http://localhost:8000).
Upload a document (PDF or text) using the clip icon.
Ask questions about your uploaded documents, and get real-time answers with cited sources.

Key Functions

`process_file(files)`

Loads files and splits them into chunks for easier embedding and retrieval.

`get_vec_search(file)`

Processes the documents and stores them in a Chroma vector store.

`start()`

Initializes the chatbot with a welcome message.

`main(message)`

Handles user interactions, processes uploaded documents, and answers queries.

Example Workflow

Upload a File: Drag and drop your PDF or text file into the chatbot interface.
Ask Questions: Type in queries like:
- "What is the main topic of the document?"
- "Provide details from section X of the file."
Receive Answers: The bot responds with detailed answers and references to the source document.

Contribution

Feel free to open issues or submit pull requests. Contributions are welcome!

Fork the repository.
Create a feature branch.
Commit your changes and submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

Enjoy seamless interaction with your documents! 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
screenshoots		screenshoots
=2.6.0		=2.6.0
README.md		README.md
app.py		app.py
chainlit.md		chainlit.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chat with PDFs: A Chainlit-Powered PDF Chatbot

Overview

Features

Technologies Used

Frameworks and Libraries

Language Models

Utilities

Installation

Prerequisites

Steps

Usage

Key Functions

`process_file(files)`

`get_vec_search(file)`

`start()`

`main(message)`

Example Workflow

Contribution

License

Acknowledgments

About

Releases

Packages

Languages

oss-bit/PDF-Chat

Folders and files

Latest commit

History

Repository files navigation

Chat with PDFs: A Chainlit-Powered PDF Chatbot

Overview

Features

Technologies Used

Frameworks and Libraries

Language Models

Utilities

Installation

Prerequisites

Steps

Usage

Key Functions

process_file(files)

get_vec_search(file)

start()

main(message)

Example Workflow

Contribution

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`process_file(files)`

`get_vec_search(file)`

`start()`

`main(message)`

Packages