Knowledge Base Management Based on RAG (Retrieval-Augmented Generation)
Features • Quick Start • Deployment • Architecture • Development • Contributing
English | 简体中文
RAG Web UI is an intelligent dialogue system built on RAG (Retrieval-Augmented Generation). It helps enterprises and individuals build Q&A systems over their own knowledge bases: by combining document retrieval with large language models, it delivers accurate and reliable knowledge-based answers.
📚 Intelligent Document Management
- Support for multiple document formats (PDF, DOCX, Markdown, Text)
- Automatic document chunking and vectorization
- Support for async document processing and incremental updates
🤖 Advanced Dialogue Engine
- Precise retrieval and generation based on RAG
- Support for multi-turn contextual dialogue
- Support for reference citations in conversations
🎯 Robust Architecture
- Frontend-backend separation design
- Distributed file storage
- High-performance vector database
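The chunking step behind "Automatic document chunking and vectorization" can be sketched in a few lines of plain Python. This is an illustrative sliding-window splitter, not the project's actual implementation (the real pipeline uses Langchain's text splitters and an embedding model); the function name and default sizes are invented here:

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size chunks with overlap, so that context
    spanning a chunk boundary is not lost at retrieval time."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, max(len(text) - overlap, 1), step)]

# Each chunk would then be embedded and stored in the vector database,
# e.g. vectors = [embed(c) for c in chunks]; store.add(chunks, vectors)
```

Overlap between neighboring chunks is what lets a retrieved chunk carry enough surrounding context even when an answer straddles a split point.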
Screenshots: Knowledge Base Management Dashboard · Document Processing Dashboard · Document List · Intelligent Chat Interface with References · API Key Management · API Reference
```mermaid
graph TB
%% Role Definitions
client["Caller/User"]
open_api["Open API"]
subgraph import_process["Document Ingestion Process"]
direction TB
%% File Storage and Document Processing Flow
docs["Document Input<br/>(PDF/MD/TXT/DOCX)"]
job_id["Return Job ID"]
nfs["NFS"]
subgraph async_process["Asynchronous Document Processing"]
direction TB
preprocess["Document Preprocessing<br/>(Text Extraction/Cleaning)"]
split["Text Splitting<br/>(Segmentation/Overlap)"]
subgraph embedding_process["Embedding Service"]
direction LR
embedding_api["Embedding API"] --> embedding_server["Embedding Server"]
end
store[(Vector Database)]
%% Internal Flow of Asynchronous Processing
preprocess --> split
split --> embedding_api
embedding_server --> store
end
subgraph job_query["Job Status Query"]
direction TB
job_status["Job Status<br/>(Processing/Completed/Failed)"]
end
end
%% Query Service Flow
subgraph query_process["Query Service"]
direction LR
user_history["User History"] --> query["User Query<br/>(Based on User History)"]
query --> query_embed["Query Embedding"]
query_embed --> retrieve["Vector Retrieval"]
retrieve --> rerank["Re-ranking<br/>(Cross-Encoder)"]
rerank --> context["Context Assembly"]
context --> llm["LLM Generation"]
llm --> response["Final Response"]
query -.-> rerank
end
%% Main Flow Connections
client --> |"1. Upload Document"| docs
docs --> |"2. Generate"| job_id
docs --> |"3a. Trigger"| async_process
job_id --> |"3b. Return"| client
docs --> nfs
nfs --> preprocess
%% Open API Retrieval Flow
open_api --> |"Retrieve Context"| retrieval_service["Retrieval Service"]
retrieval_service --> |"Access"| store
retrieval_service --> |"Return Context"| open_api
%% Status Query Flow
client --> |"4. Poll"| job_status
job_status --> |"5. Return Progress"| client
%% Database connects to Query Service
store --> retrieve
%% Style Definitions (Adjusted to match GitHub theme colors)
classDef process fill:#d1ecf1,stroke:#0077b6,stroke-width:1px
classDef database fill:#e2eafc,stroke:#003566,stroke-width:1px
classDef input fill:#caf0f8,stroke:#0077b6,stroke-width:1px
classDef output fill:#ffc8dd,stroke:#d00000,stroke-width:1px
classDef rerank fill:#cdb4db,stroke:#5a189a,stroke-width:1px
classDef async fill:#f8edeb,stroke:#7f5539,stroke-width:1px,stroke-dasharray: 5 5
classDef actor fill:#fefae0,stroke:#606c38,stroke-width:1px
classDef jobQuery fill:#ffedd8,stroke:#ca6702,stroke-width:1px
classDef queryProcess fill:#d8f3dc,stroke:#40916c,stroke-width:1px
classDef embeddingService fill:#ffe5d9,stroke:#9d0208,stroke-width:1px
classDef importProcess fill:#e5e5e5,stroke:#495057,stroke-width:1px
%% Applying classes to nodes
class docs,query,retrieval_service input
class preprocess,split,query_embed,retrieve,context,llm process
class store,nfs database
class response,job_id,job_status output
class rerank rerank
class async_process async
class client,open_api actor
class job_query jobQuery
style query_process fill:#d8f3dc,stroke:#40916c,stroke-width:1px
style embedding_process fill:#ffe5d9,stroke:#9d0208,stroke-width:1px
style import_process fill:#e5e5e5,stroke:#495057,stroke-width:1px
style job_query fill:#ffedd8,stroke:#ca6702,stroke-width:1px
```
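The Query Service flow in the diagram (embed the query, retrieve candidates, re-rank with a cross-encoder, assemble context, generate) can be sketched as a single function. Every callable here (`embed`, `vector_search`, `rerank_score`, `llm_generate`) is a hypothetical stand-in for the real embedding service, vector database, cross-encoder, and LLM client:

```python
from typing import Callable

def answer_query(
    query: str,
    embed: Callable[[str], list[float]],                    # embedding service
    vector_search: Callable[[list[float], int], list[str]], # vector DB top-k lookup
    rerank_score: Callable[[str, str], float],              # cross-encoder (query, doc) -> score
    llm_generate: Callable[[str], str],                     # LLM completion
    top_k: int = 10,
    keep: int = 3,
) -> str:
    """Mirror of the diagram's query flow, step by step."""
    q_vec = embed(query)                         # Query Embedding
    candidates = vector_search(q_vec, top_k)     # Vector Retrieval
    ranked = sorted(candidates,                  # Re-ranking (Cross-Encoder)
                    key=lambda d: rerank_score(query, d), reverse=True)
    context = "\n\n".join(ranked[:keep])         # Context Assembly
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
    return llm_generate(prompt)                  # LLM Generation
```

Note the two-stage retrieval: the vector search is cheap and recall-oriented, while the cross-encoder re-scores only the `top_k` candidates for precision, which is why the diagram feeds the raw query into the re-ranker as well.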
- Docker & Docker Compose v2.0+
- Node.js 18+
- Python 3.9+
- 8GB+ RAM
- Clone the repository
git clone https://github.com/rag-web-ui/rag-web-ui.git
cd rag-web-ui
- Configure environment variables
cp backend/.env.example backend/.env
# Edit .env file with necessary configurations
- Start services
docker-compose up -d
Access the following URLs after service startup:
- 🌐 Frontend UI: http://localhost:3000
- 📚 API Documentation: /docs
- 💾 MinIO Console: http://localhost:9001
- 🐍 Python FastAPI: High-performance async web framework
- 🗄️ MySQL + ChromaDB: Relational + Vector databases
- 📦 MinIO: Distributed object storage
- 🔗 Langchain: LLM application framework
- 🔒 JWT + OAuth2: Authentication
- ⚛️ Next.js 14: React framework
- 📘 TypeScript: Type safety
- 🎨 Tailwind CSS: Utility-first CSS
- 🎯 Shadcn/UI: High-quality components
- 🤖 Vercel AI SDK: AI integration
The system includes the following performance optimizations:
- ⚡️ Incremental document processing and async chunking
- 🔄 Streaming responses and real-time feedback
- 📑 Vector database performance tuning
- 🎯 Distributed task processing
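"Streaming responses and real-time feedback" boils down to forwarding LLM tokens to the client as they arrive, rather than waiting for the full completion. A minimal sketch using Server-Sent Events framing; `fake_llm_stream` is an invented stand-in for a real streaming LLM client (in the actual stack, FastAPI serves the stream and the Vercel AI SDK consumes it in the browser):

```python
from typing import Iterator

def fake_llm_stream(answer: str) -> Iterator[str]:
    """Stand-in for an LLM client that yields tokens as they are produced."""
    for word in answer.split():
        yield word + " "

def sse_events(token_stream: Iterator[str]) -> Iterator[str]:
    """Wrap tokens as Server-Sent Events so the browser can render
    the answer incrementally instead of waiting for the full response."""
    for token in token_stream:
        yield f"data: {token}\n\n"
    yield "data: [DONE]\n\n"
```

On the server this generator would back a streaming HTTP response; the client appends each `data:` payload to the chat window as it arrives.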
cd backend
python -m venv venv
source venv/bin/activate # Linux/macOS
pip install -r requirements.txt
uvicorn app.main:app --reload
cd frontend
pnpm install
pnpm dev
cd backend
alembic revision --autogenerate -m "migration message"
alembic upgrade head
Parameter | Description | Default | Required |
---|---|---|---|
MYSQL_SERVER | MySQL Server Address | localhost | ✅ |
MYSQL_USER | MySQL Username | postgres | ✅ |
MYSQL_PASSWORD | MySQL Password | postgres | ✅ |
MYSQL_DATABASE | MySQL Database Name | ragwebui | ✅ |
SECRET_KEY | JWT Secret Key | - | ✅ |
ACCESS_TOKEN_EXPIRE_MINUTES | JWT Token Expiry (minutes) | 30 | ✅ |
CHROMA_DB_HOST | ChromaDB Server Address | localhost | ✅ |
CHROMA_DB_PORT | ChromaDB Port | 8000 | ✅ |
OPENAI_API_KEY | OpenAI API Key | - | ✅ |
OPENAI_API_BASE | OpenAI API Proxy URL | - | ❌ |
NEXT_PUBLIC_API_URL | Next.js API URL | - | ❌ |
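For illustration, a minimal `backend/.env` assembled from the table above might look like this (all values are placeholders; substitute your own credentials and keys):

```
MYSQL_SERVER=localhost
MYSQL_USER=postgres
MYSQL_PASSWORD=postgres
MYSQL_DATABASE=ragwebui
SECRET_KEY=change-me-to-a-long-random-string
ACCESS_TOKEN_EXPIRE_MINUTES=30
CHROMA_DB_HOST=localhost
CHROMA_DB_PORT=8000
OPENAI_API_KEY=sk-...
# Optional: only needed when routing OpenAI traffic through a proxy
# OPENAI_API_BASE=https://your-proxy.example.com/v1
```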
We welcome community contributions!
- Fork the repository
- Create a feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
- Follow Python PEP 8 coding standards
- Follow Conventional Commits
- Knowledge Base API Integration
- Workflow By Natural Language
- Multi-path Retrieval
- Support Multiple Models
- Support Multiple Vector Databases
This project is licensed under the Apache-2.0 License
This project is for learning and sharing RAG knowledge only. Please do not use it for commercial purposes. It is not ready for production use and is still under active development.
Thanks to these open source projects: