PDFChat

PDFChat is a chatbot that allows you to interact with your PDF files. You can:

Ask general questions about the contents of a PDF: These are questions that don't require summarization. Instead, these questions require that PDFBot look through the contents of the PDF to answer the user's question.
Ask PDFchat to summarize the document for you, at which point it'll allow you to download a sumamry PDF and a change diff in HTMl

Requirements

Python 3.10
Streamlit 1.25
An OpenAI API key

Running

Create virtual environment

virtualenv pdf_bot

Create a .streamlit folder at the root of the repository and add a secrets.toml file to it
Add OPENAI_API_KEY = "<YOUR-OPEN-AI-KEY>" to your secrets.toml file
Run as streamlit app

streamlit run pdf_chat/streamlit_app.py

How It Works

The bot is made up of three components, a master agent and two child agents. They interact in the following manner:

Master OpenAI Functions agent: This agent takes in a user prompt and delegates the task to one of the two bots it has access to using langchain's OpenAIFunctions Agent (see points two and three). It's a simple chat agent with memory, enabling a chat-like interface between the user and the bot
QA Retrieval Agent: This agent handles general questions about the contents of a PDF. Does so using Langchain's ConversationalRetrievalChain, which stores different document parts as embeddings in a vectore store, and performs a similarity search between these embeddings and the user's prompt. See here for more https://python.langchain.com/docs/use_cases/question_answering/
Summarization Agent: This agent handles the summarizatin of the PDF. Does so by using langchain's MapReduceDocumentsChain, which uses Map-Reduce to summarize all splits of document (Map) before combining them to produce one full summary of the entire document (Reduce). See here for more https://python.langchain.com/docs/use_cases/summarization

Why use master-child architecture?

All prompts sent to agents are much smaller since there's no requirement of constantly pre-pending the entire document to the prompt for the agent to have context. Smaller prompt means, incurring less costs and allowing for the master agent's memory not to get flooded
It's more modularized, so we aren’t restricted to a single model. You can use different models based on the different requirements of the agents

Demo

You can find an example PDF under the examples folder. Types of questions you can ask:

How can the definition of demand change based on who is defining it?
What is demand?
Summarize this document for me

The bot is tuned not to answer questions that don't pertain to the document. Questions like the following won't be answered:

How many continents are their?
What is water?

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
_static		_static
examples		examples
pdf_chat		pdf_chat
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDFChat

Requirements

Running

How It Works

Why use master-child architecture?

Demo

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

akhanafer/PDFChat

Folders and files

Latest commit

History

Repository files navigation

PDFChat

Requirements

Running

How It Works

Why use master-child architecture?

Demo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages