Medical Chatbot: MedBot

MedBot is an innovative healthcare chatbot project that leverages large Language Models along with Retrieval-Augmented Generation (RAG) from trusted databases created from PubMed datasets to facilitate seamless and intuitive communication in the realm of medical assistance. Designed to enhance the interaction between users and healthcare information, MedBot offers immediate responses to inquiries related to health, wellness, and medical queries. This intelligent chatbot employs RAG, natural language processing, and understanding to provide accurate and personalized responses, making it a reliable companion for individuals seeking information on symptoms, medications, and general healthcare advice. MedBot aims to bridge the gap between users and healthcare knowledge, offering a convenient and accessible platform for health-related conversations.

Project Overview

This project utilizes Large Language Models with Retrieval-Augmented Generation (RAG), trained on reliable medical datasets collected from PubMed. The bot demonstrates impressive performance metrics, including:

96.7% Content Precision 95% Context Recall 85% Faithfulness 73% Answer Relevancy 69.4% Answer Correctness

Example Responses

Notebook Breakdown of 'MedBot.ipynb':

Importing Required Resources including Data

Identifying and bringing in all necessary tools and resources required for the project, including programming languages, machine learning frameworks, data collection tools, and other dependencies. Collecting and preparing data relevant to the project, including data cleaning, preprocessing, and structuring the data in a format suitable for analysis or modeling.

Creating a Vector Database Using Only the Contexts

Transforming the collected data into numerical vectors while preserving semantic meaning, typically using word embedding models or contextual embedding models to represent words or sentences as dense vectors.

Testing the Vector Database:

Validating the effectiveness of the vector database by querying it with known inputs and verifying that the retrieved vectors match expectations.

Testing the Vector Database with Paraphrased Questions

Assessing the robustness of the vector database to handle paraphrased queries and verifying its ability to accurately retrieve relevant vectors even when the query is rephrased or expressed differently.

Creating the Retrieval-Augmented Generation (RAG) Pipeline Using LANGCHAIN

Building a pipeline that integrates retrieval and generation techniques using LANGCHAIN, based on the vector database.

Evaluating the RAG Pipeline Using RAGAS

Assessing the effectiveness of the RAG pipeline in generating relevant responses to queries using RAGAS (Retrieval Augmented Generation Assessment Suite) or a similar evaluation framework.

Performance Measures and Evaluation

Calculating various metrics to assess the efficiency and effectiveness of the RAG pipeline, including faithfulness, context precision, context recall, answer similarity, answer relevancy, and answer correctness. Summarizing and analyzing the evaluation results, highlighting the performance of the RAG pipeline based on the calculated metrics.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
MedBot.ipynb		MedBot.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Medical Chatbot: MedBot

Project Overview

Example Responses

Notebook Breakdown of 'MedBot.ipynb':

Importing Required Resources including Data

Creating a Vector Database Using Only the Contexts

Testing the Vector Database:

Testing the Vector Database with Paraphrased Questions

Creating the Retrieval-Augmented Generation (RAG) Pipeline Using LANGCHAIN

Evaluating the RAG Pipeline Using RAGAS

Performance Measures and Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

puja-urmi/Medical-Chatbot-LLM-RAG

Folders and files

Latest commit

History

Repository files navigation

Medical Chatbot: MedBot

Project Overview

Example Responses

Notebook Breakdown of 'MedBot.ipynb':

Importing Required Resources including Data

Creating a Vector Database Using Only the Contexts

Testing the Vector Database:

Testing the Vector Database with Paraphrased Questions

Creating the Retrieval-Augmented Generation (RAG) Pipeline Using LANGCHAIN

Evaluating the RAG Pipeline Using RAGAS

Performance Measures and Evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages