aidul23/search-from-webpageURL: An LLM-based RAG application that embeds a given web page into a vector database and answers queries using cosine vector similarity.

LLM-based RAG Application

This is a full-stack web application that takes a webpage URL, extracts the page's text, embeds it with an OpenAI embedding model, and stores the resulting vectors in a MongoDB vector database. Users can then query the stored data and receive answers grounded in the embedded content. The application consists of a Node.js backend and a React frontend.

Features

URL Embedding: Extracts and embeds webpage content into a MongoDB vector database.
Query System: Lets users ask questions by text or audio; the app answers based on the embedded content.
Responsive Design: Frontend styled using Tailwind CSS and designed to mimic ChatGPT’s interface.
Interactive Sidebar: Sidebar for adding URLs and a chat-like area for user interaction.
Real-Time Feedback: Provides loading indicators and success/error messages during operations.
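
The query system relies on vector similarity between the embedded question and the embedded page content. As a rough illustration of that idea only (in the application itself the similarity search is delegated to MongoDB, and the function names, document shape, and toy 3-dimensional vectors below are assumptions made for this sketch, not code from the repository), cosine similarity ranking can be written as:

```javascript
// Cosine similarity between two equal-length numeric vectors.
// Returns a value in [-1, 1]; returns 0 if either vector has zero length.
function cosineSimilarity(a, b) {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Rank stored documents against a query vector and keep the k best matches.
// The { text, embedding } document shape is hypothetical.
function topKBySimilarity(queryVector, documents, k = 3) {
  return documents
    .map((doc) => ({
      text: doc.text,
      score: cosineSimilarity(queryVector, doc.embedding),
    }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

// Toy vectors standing in for real embeddings, which have far more dimensions.
const docs = [
  { text: "pricing page", embedding: [1, 0, 0] },
  { text: "contact page", embedding: [0, 1, 0] },
  { text: "pricing FAQ", embedding: [0.9, 0.1, 0] },
];
const results = topKBySimilarity([1, 0, 0], docs, 2);
console.log(results.map((r) => r.text));
```

The passages with the highest scores are the ones that would be passed to the language model as context.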

Technologies Used

Frontend

React with Vite for a fast and responsive user interface.
Tailwind CSS for styling.
React Icons for UI enhancements.

Backend

Node.js with Express for API endpoints.
MongoDB with Atlas Search for storing and querying vectorized data.
OpenAI's text-embedding-ada-002 model for embedding text into vectors.
OpenAI's gpt-3.5-turbo-16k model for answering queries using context retrieved from the vector database.
OpenAI's Whisper model for speech-to-text conversion.
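
To make the backend flow more concrete, here is a minimal sketch of how a retrieval step and a context-based prompt might be assembled. It assumes the Atlas `$vectorSearch` aggregation stage is used for retrieval; the repository may do this differently. The index name `vector_index`, the field names `embedding`, `text`, and `url`, and both helper functions are hypothetical. The sketch only builds plain objects and makes no database or API calls.

```javascript
// Build an aggregation pipeline for an Atlas vector search.
// Index and field names are assumptions for illustration.
function buildVectorSearchPipeline(queryVector, { limit = 3, numCandidates = 100 } = {}) {
  return [
    {
      $vectorSearch: {
        index: "vector_index", // hypothetical index name
        path: "embedding",     // hypothetical field holding the stored vectors
        queryVector,
        numCandidates,
        limit,
      },
    },
    {
      // Keep only the fields needed to build the prompt, plus the match score.
      $project: { _id: 0, text: 1, url: 1, score: { $meta: "vectorSearchScore" } },
    },
  ];
}

// Turn retrieved passages and the user's question into chat messages
// in the role/content format used by OpenAI chat models.
function buildChatMessages(question, passages) {
  const context = passages.map((p, i) => `[${i + 1}] ${p.text}`).join("\n");
  return [
    {
      role: "system",
      content: "Answer the question using only the provided context. " +
        "If the context does not contain the answer, say so.",
    },
    { role: "user", content: `Context:\n${context}\n\nQuestion: ${question}` },
  ];
}

// Example with a toy query vector and hand-written passages.
const pipeline = buildVectorSearchPipeline([0.1, 0.2, 0.3], { limit: 2 });
const messages = buildChatMessages("What does the page say about pricing?", [
  { text: "The basic plan is free." },
  { text: "The pro plan is billed monthly." },
]);
console.log(JSON.stringify(pipeline, null, 2));
console.log(messages[1].content);
```

In a running server, the pipeline would be passed to the collection's `aggregate` call with the embedded question as `queryVector`, and the messages would be sent to the chat model along with the model name.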

Preview

Video: https://github.com/user-attachments/assets/7f0d4dcf-d72a-4689-9ea5-42b690e5243a

Future Improvements

Add user authentication and session management.
Enable bulk URL uploads for embedding.
Implement advanced query filtering and pagination.
Enhance error handling and logging.
