LLM-Powered-RAG-System

Frameworks

  • langchain - ⚡ Building applications with LLMs through composability ⚡
  • llama_index - LlamaIndex (formerly GPT Index) is a data framework for your LLM applications.
  • crewAI - Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
  • embedchain - Embedchain is an Open Source RAG Framework that makes it easy to create and deploy AI apps.
  • FlagEmbedding - Dense Retrieval and Retrieval-augmented LLMs.
  • TaskingAI - The open source platform for AI-native application development.
  • fastRAG - Efficient Retrieval Augmentation and Generation Framework.
  • llmware - Provides an enterprise-grade LLM-based development framework, tools, and fine-tuned models.
  • llm-applications - A comprehensive guide to building RAG-based LLM applications for production.
  • DB-GPT - An open-source AI native data app development framework with AWEL (Agentic Workflow Expression Language) and agents. Revolutionizing database interactions with private LLM technology.
  • langroid - Harness LLMs with Multi-Agent Programming.
  • pandas-ai - Chat with your data (SQL, CSV, pandas, polars, NoSQL, etc.). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
  • canopy - Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone.
  • autollm - Ship RAG-based LLM web apps in seconds. Note: it can quickly set up a FastAPI server.
  • GraphRAG - A modular graph-based Retrieval-Augmented Generation (RAG) system: a data pipeline and transformation suite designed to extract meaningful, structured data from unstructured text using LLMs.
  • DSPy - The framework for programming, not prompting, foundation models.
  • Haystack - an end-to-end LLM framework that allows you to build applications powered by LLMs, Transformer models, vector search and more.

Projects

  • Dify - an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
  • quivr - Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) and apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users.
  • Langchain-Chatchat - An open-source, offline-deployable RAG and Agent application built on large language models such as ChatGLM and application frameworks such as Langchain.
  • Chatbot UI - The open-source AI chat app for everyone.
  • Jan - Jan is an open-source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM).
  • fastGPT - FastGPT is a knowledge-base platform built on LLMs that offers out-of-the-box data processing and model invocation capabilities and allows workflow orchestration through Flow visualization.
  • LangChain-ChatGLM-Webui - Automated question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series.
  • anything-llm - Open-source multi-user ChatGPT for all LLMs, embedders, and vector databases. Unlimited documents, messages, and users in one privacy-focused app.
  • QAnything - Question and Answer based on Anything.
  • danswer - Ask questions in natural language and get answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.
  • RAGFlow - an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
  • Bisheng - an open LLM devops platform for next generation AI applications.
  • khoj - A copilot to search and chat (using RAG) with your knowledge base (pdf, markdown, org). Use powerful, online (e.g. gpt4) or private, offline (e.g. mistral) LLMs.
  • rags - Build ChatGPT over your data, all with natural language.
  • Verba - Retrieval Augmented Generation (RAG) chatbot powered by Weaviate.
  • SuperMemory - Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the Chrome extension (the extension on the web store is not updated; please use the one in the repo).
  • llm-app - LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker, ⚡ in sync with your data sources.
  • casibase - ⚡️ Open-source AI LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO ⚡️
  • trt-llm-rag-windows - A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM.
  • GPT-RAG - GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
  • rag-demystified - An LLM-powered advanced RAG pipeline built from scratch.
  • LARS - An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.
  • SemanticFinder - frontend-only live semantic search with transformers.js.
  • MindSearch - an open-source AI Search Engine Framework with Perplexity.ai Pro performance.
  • Anthropic Quickstarts - a collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API.

Components

AI Agents

  • AutoGroq - a tool that changes how users interact with AI assistants by dynamically generating tailored teams of AI agents based on your project requirements. 💥 💥
  • micro-agent - An AI agent that writes (actually useful) code for you, mainly JavaScript. 💥 💥
  • llama-agentic-system - This repo allows you to run Llama 3.1 as a system capable of performing "agentic" tasks.
  • Controllable Agent for Complex RAG - an advanced Retrieval-Augmented Generation (RAG) solution designed to tackle complex questions that simple semantic similarity-based retrieval cannot solve.

Chat with Documents

  • privateGPT - Interact with your documents using the power of GPT, 100% privately, no data leaks.
  • localGPT - Chat with your documents on your local device using GPT models. No data leaves your device; 100% private.
  • ChatFiles - Document Chatbot.
  • pdfGPT - PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
  • chatd - Chat with your documents using local AI.
  • IncarnaMind - Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and local open-source LLMs.
  • ArXivChatGuru - Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
  • h2ogpt - Private chat with local GPT with documents, images, video, etc.

Database

  • qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Qdrant is also available as a fully managed Qdrant Cloud ⛅ including a free tier.
  • vanna - 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
  • txtai - 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows.
  • infinity - The AI-native database built for LLM applications, providing incredibly fast vector and full-text search.
  • postgresml - The GPU-powered AI application database.
  • lancedb - Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Optimize/Evaluation Method

  • sparrow - Data extraction with ML and LLM.
  • fastembed - Fast, accurate, lightweight Python library for generating state-of-the-art embeddings.
  • self-rag - SELF-RAG: Learning to Retrieve, Generate and Critique through Self-reflection.
  • instructor - Your Gateway to Structured Outputs with OpenAI.
  • swirl-search - Swirl is open source software that simultaneously searches multiple content sources and returns AI-ranked results.
  • kernel-memory - Index and query any data using LLM and natural language, tracking sources and showing citations.
  • AutoRAG - a tool for finding the optimal RAG pipeline for "your data." 🔮
  • promptfoo - Test your prompts, agents, and RAGs. Use LLM evals to improve your app's quality and catch problems.
  • YiVal - Your Automatic Prompt Engineering Assistant for GenAI Applications.
  • LLM Comparator - LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR team.
  • Ragas - a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines.

Data Prep

  • OmniParse - OmniParse is a platform that ingests and parses any unstructured data into structured, actionable data optimized for GenAI (LLM) applications. Whether you are working with documents, tables, images, videos, audio files, or web pages, OmniParse prepares your data to be clean, structured, and ready for AI applications such as RAG, fine-tuning, and more.
  • Open Parse - Easily chunks complex documents the same way a human would, including Semantic Processing.
  • Unstructured - open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more.
  • ExtractThinker - Library to extract data from files and documents agnostically using LLMs. extract_thinker provides ORM-style interaction between files and LLMs, allowing for flexible and powerful document extraction workflows.
  • nlm-ingestor - This repo provides the service code for the llmsherpa API to parse PDF, HTML, DOCX, PPTX.
  • python-readability - Given an HTML document, extract and clean up the main body text and title.
  • llm_aided_ocr - The LLM-Aided OCR Project is an advanced system designed to significantly enhance the quality of Optical Character Recognition (OCR) output. By leveraging natural language processing techniques and large language models (LLMs), this project transforms raw OCR text into highly accurate, well-formatted, and readable documents.
  • Datachain - a modern Pythonic data-frame library designed for artificial intelligence. It is made to organize your unstructured data into datasets and wrangle it at scale on your local machine.
  • Jina Reader-LM - takes noisy HTML and parses it into clean and usable Markdown for training and reasoning.
  • Introducing Contextual Retrieval (blog post) - Anthropic shows how to prepend chunk-specific context to each chunk before embedding and indexing, which substantially improves retrieval performance while costing roughly $1 per million document tokens thanks to prompt caching.
  • thepi.pe - Python package that can extract clean markdown from PDFs, URLs, slides, videos, and more, ready for any LLM.

Fine-tuning

  • mistral-finetune - mistral-finetune is a light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
  • LLaMA-Factory - A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024).
  • axolotl - a tool designed to streamline the fine-tuning of various AI models, offering support for multiple configurations and architectures.
  • LlamaDuo - This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for outages of the service LLM.

Others

  • chatgpt-retrieval-plugin - The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
  • RAGxplorer - Open-source tool to visualise your RAG 🔮
  • deep-chat - a fully customizable AI chat component that can be injected into your website with minimal to no effort.
  • Ollama-Laravel - a Laravel package providing seamless integration with the Ollama API.
  • fabric - an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
  • n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
  • Langtrace - an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector DBs and more. Integrate using TypeScript or Python.
  • tokencost - Helps calculate the USD cost of using major Large Language Model (LLM) APIs by estimating the cost of prompts and completions.
  • quality-prompts - Use and evaluate prompting techniques quickly.
  • Laminar - Open-source observability, analytics, evals and prompt chains for complex LLM apps.

Inference server

  • mistral.rs - a fast LLM inference platform supporting inference on a variety of devices, quantization, and easy-to-use applications with an OpenAI-API-compatible HTTP server and Python bindings.
  • MInference - Speeds up long-context LLM inference by computing attention with approximate, dynamic sparse methods, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
  • LiteLLM - Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs).
  • LitServe - an easy-to-use, flexible serving engine for AI models built on FastAPI. Features like batching, streaming, and GPU autoscaling eliminate the need to rebuild a FastAPI server per model.
  • SGLang - a fast-serving framework for large language models and vision language models.

LLMs

  • MiniCPM-V - a series of end-side multimodal LLMs (MLLMs) designed for vision-language understanding. Models take image and text as inputs and provide high-quality text outputs.
  • DeepSeek-Coder-V2 - an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
  • Trol - Traversal of Layers for Large Language and Vision Models.
  • FunAudioLLM - This project hosts two applications that leverage advanced audio understanding and speech generation models: CosyVoice and SenseVoice.
  • MobileLLM - MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024).
  • awesome-whisper - Awesome list for Whisper, an open-source AI-powered speech recognition system developed by OpenAI.
  • SmolLM - A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos. Blazingly fast and remarkably powerful.
  • CogVideo - Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023).
  • MiniCPM-V - MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone.
  • Flux - the latest promising open-source AI image generation tool. It is capable of running on consumer-grade laptops.

Papers

  • Awesome-LLM-RAG - This repo aims to record advanced papers on Retrieval Augmented Generation (RAG) in LLMs.

Blog

Other Resources

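  • funNLP - A playground for NLP laborers: perhaps the most complete collection of Chinese NLP resources. While going from beginner to familiar with NLP, the author used many packages from GitHub, organized them, and shares them here.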
  • AGI-survey - Awesome AGI Survey. Must-read papers on Artificial General Intelligence.
  • rag-resources - A collection of curated RAG (Retrieval Augmented Generation) resources.
  • RAG-Survey
  • Awesome-LLM-RAG-Application - resources on applications built with LLMs using the RAG pattern.
