DeepWiki-Open

DeepWiki is my own implementation attempt of DeepWiki, automatically creates beautiful, interactive wikis for any GitHub, GitLab, or BitBucket repository! Just enter a repo name, and DeepWiki will:

Analyze the code structure
Generate comprehensive documentation
Create visual diagrams to explain how everything works
Organize it all into an easy-to-navigate wiki

✨ Features

Instant Documentation: Turn any GitHub, GitLab or BitBucket repo into a wiki in seconds
Private Repository Support: Securely access private repositories with personal access tokens
Smart Analysis: AI-powered understanding of code structure and relationships
Beautiful Diagrams: Automatic Mermaid diagrams to visualize architecture and data flow
Easy Navigation: Simple, intuitive interface to explore the wiki
Ask Feature: Chat with your repository using RAG-powered AI to get accurate answers
DeepResearch: Multi-turn research process that thoroughly investigates complex topics
Multiple Model Providers: Support for Google Gemini, OpenAI, OpenRouter, and local Ollama models

🚀 Quick Start (Super Easy!)

Option 1: Using Docker

# Clone the repository
git clone https://github.com/AsyncFuncAI/deepwiki-open.git
cd deepwiki-open

# Create a .env file with your API keys
echo "GOOGLE_API_KEY=your_google_api_key" > .env
echo "OPENAI_API_KEY=your_openai_api_key" >> .env
# Optional: Add OpenRouter API key if you want to use OpenRouter models
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env

# Run with Docker Compose
docker-compose up

💡 Where to get these keys:

Get a Google API key from Google AI Studio

Get an OpenAI API key from OpenAI Platform

Option 2: Manual Setup (Recommended)

Step 1: Set Up Your API Keys

Create a .env file in the project root with these keys:

GOOGLE_API_KEY=your_google_api_key
OPENAI_API_KEY=your_openai_api_key
# Optional: Add this if you want to use OpenRouter models
OPENROUTER_API_KEY=your_openrouter_api_key

Step 2: Start the Backend

# Install Python dependencies
pip install -r api/requirements.txt

# Start the API server
python -m api.main

Step 3: Start the Frontend

# Install JavaScript dependencies
npm install
# or
yarn install

# Start the web app
npm run dev
# or
yarn dev

Step 4: Use DeepWiki!

Open http://localhost:3000 in your browser
Enter a GitHub, GitLab, or Bitbucket repository (like https://github.com/openai/codex, https://github.com/microsoft/autogen, https://gitlab.com/gitlab-org/gitlab, or https://bitbucket.org/redradish/atlassian_app_versions)
For private repositories, click "+ Add access tokens" and enter your GitHub or GitLab personal access token
Click "Generate Wiki" and watch the magic happen!

🔍 How It Works

DeepWiki uses AI to:

Clone and analyze the GitHub, GitLab, or Bitbucket repository (including private repos with token authentication)
Create embeddings of the code for smart retrieval
Generate documentation with context-aware AI (using Google Gemini, OpenAI, OpenRouter, or local Ollama models)
Create visual diagrams to explain code relationships
Organize everything into a structured wiki
Enable intelligent Q&A with the repository through the Ask feature
Provide in-depth research capabilities with DeepResearch

graph TD
    A[User inputs GitHub/GitLab/Bitbucket repo] --> AA{Private repo?}
    AA -->|Yes| AB[Add access token]
    AA -->|No| B[Clone Repository]
    AB --> B
    B --> C[Analyze Code Structure]
    C --> D[Create Code Embeddings]

    D --> M{Select Model Provider}
    M -->|Google Gemini| E1[Generate with Gemini]
    M -->|OpenAI| E2[Generate with OpenAI]
    M -->|OpenRouter| E3[Generate with OpenRouter]
    M -->|Local Ollama| E4[Generate with Ollama]

    E1 --> E[Generate Documentation]
    E2 --> E
    E3 --> E
    E4 --> E

    D --> F[Create Visual Diagrams]
    E --> G[Organize as Wiki]
    F --> G
    G --> H[Interactive DeepWiki]

    classDef process stroke-width:2px;
    classDef data stroke-width:2px;
    classDef result stroke-width:2px;
    classDef decision stroke-width:2px;

    class A,D data;
    class AA,M decision;
    class B,C,E,F,G,AB,E1,E2,E3,E4 process;
    class H result;

🛠️ Project Structure

deepwiki/
├── api/                  # Backend API server
│   ├── main.py           # API entry point
│   ├── api.py            # FastAPI implementation
│   ├── rag.py            # Retrieval Augmented Generation
│   ├── data_pipeline.py  # Data processing utilities
│   └── requirements.txt  # Python dependencies
│
├── src/                  # Frontend Next.js app
│   ├── app/              # Next.js app directory
│   │   └── page.tsx      # Main application page
│   └── components/       # React components
│       └── Mermaid.tsx   # Mermaid diagram renderer
│
├── public/               # Static assets
├── package.json          # JavaScript dependencies
└── .env                  # Environment variables (create this)

🤖 Provider-Based Model Selection System

DeepWiki now implements a flexible provider-based model selection system supporting multiple LLM providers:

Supported Providers and Models

Google: Default gemini-2.0-flash, also supports gemini-1.5-flash, gemini-1.0-pro, etc.
OpenAI: Default gpt-4o, also supports o4-mini, etc.
OpenRouter: Access to multiple models via a unified API, including Claude, Llama, Mistral, etc.
Ollama: Support for locally running open-source models like llama3

Environment Variables

Each provider requires its corresponding API key environment variables:

# API Keys
GOOGLE_API_KEY=your_google_api_key        # Required for Google Gemini models
OPENAI_API_KEY=your_openai_api_key        # Required for OpenAI models
OPENROUTER_API_KEY=your_openrouter_api_key # Required for OpenRouter models

# OpenAI API Base URL Configuration
OPENAI_API_BASE=https://custom-api-endpoint.com/v1  # Optional, for custom OpenAI API endpoints

# Configuration Directory
DEEPWIKI_CONFIG_DIR=/path/to/custom/config/dir  # Optional, for custom config file location

Configuration Files

DeepWiki uses JSON configuration files to manage various aspects of the system:

generator.json: Configuration for text generation models
- Defines available model providers (Google, OpenAI, OpenRouter, Ollama)
- Specifies default and available models for each provider
- Contains model-specific parameters like temperature and top_p
embedder.json: Configuration for embedding models and text processing
- Defines embedding models for vector storage
- Contains retriever configuration for RAG
- Specifies text splitter settings for document chunking
repo.json: Configuration for repository handling
- Contains file filters to exclude certain files and directories
- Defines repository size limits and processing rules

By default, these files are located in the api/config/ directory. You can customize their location using the DEEPWIKI_CONFIG_DIR environment variable.

Custom Model Selection for Service Providers

The custom model selection feature is specifically designed for service providers who need to:

You can offer multiple AI model choices to users within your organization
You can quickly adapt to the rapidly evolving LLM landscape without code changes
You can support specialized or fine-tuned models that aren't in the predefined list

Service providers can implement their model offerings by selecting from the predefined options or entering custom model identifiers in the frontend interface.

Base URL Configuration for Enterprise Private Channels

The OpenAI Client's base_url configuration is designed primarily for enterprise users with private API channels. This feature:

Enables connection to private or enterprise-specific API endpoints
Allows organizations to use their own self-hosted or custom-deployed LLM services
Supports integration with third-party OpenAI API-compatible services

Coming Soon: In future updates, DeepWiki will support a mode where users need to provide their own API keys in requests. This will allow enterprise customers with private channels to use their existing API arrangements without sharing credentials with the DeepWiki deployment.

🛠️ Advanced Setup

Environment Variables

Variable	Description	Required	Note
`GOOGLE_API_KEY`	Google Gemini API key for AI generation	No	Required only if you want to use Google Gemini models
`OPENAI_API_KEY`	OpenAI API key for embeddings	Yes	Note: This is required even if you're not using OpenAI models, as it's used for embeddings.
`OPENROUTER_API_KEY`	OpenRouter API key for alternative models	No	Required only if you want to use OpenRouter models
`PORT`	Port for the API server (default: 8001)	No	If you host API and frontend on the same machine, make sure change port of `SERVER_BASE_URL` accordingly
`SERVER_BASE_URL`	Base URL for the API server (default: http://localhost:8001)	No

If you're not using ollama mode, you need to configure an OpenAI API key for embeddings. Other API keys are only required when configuring and using models from the corresponding providers.

Docker Setup

You can use Docker to run DeepWiki:

# Pull the image from GitHub Container Registry
docker pull ghcr.io/asyncfuncai/deepwiki-open:latest

# Run the container with environment variables
docker run -p 8001:8001 -p 3000:3000 \
  -e GOOGLE_API_KEY=your_google_api_key \
  -e OPENAI_API_KEY=your_openai_api_key \
  -e OPENROUTER_API_KEY=your_openrouter_api_key \
  -v ~/.adalflow:/root/.adalflow \
  ghcr.io/asyncfuncai/deepwiki-open:latest

This command also mounts ~/.adalflow on your host to /root/.adalflow in the container. This path is used to store:

Cloned repositories (~/.adalflow/repos/)
Their embeddings and indexes (~/.adalflow/databases/)
Cached generated wiki content (~/.adalflow/wikicache/)

This ensures that your data persists even if the container is stopped or removed.

Or use the provided docker-compose.yml file:

# Edit the .env file with your API keys first
docker-compose up

(The docker-compose.yml file is pre-configured to mount ~/.adalflow for data persistence, similar to the docker run command above.)

Using a .env file with Docker

You can also mount a .env file to the container:

# Create a .env file with your API keys
echo "GOOGLE_API_KEY=your_google_api_key" > .env
echo "OPENAI_API_KEY=your_openai_api_key" >> .env
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env

# Run the container with the .env file mounted
docker run -p 8001:8001 -p 3000:3000 \
  -v $(pwd)/.env:/app/.env \
  -v ~/.adalflow:/root/.adalflow \
  ghcr.io/asyncfuncai/deepwiki-open:latest

This command also mounts ~/.adalflow on your host to /root/.adalflow in the container. This path is used to store:

Cloned repositories (~/.adalflow/repos/)
Their embeddings and indexes (~/.adalflow/databases/)
Cached generated wiki content (~/.adalflow/wikicache/)

This ensures that your data persists even if the container is stopped or removed.

Building the Docker image locally

If you want to build the Docker image locally:

# Clone the repository
git clone https://github.com/AsyncFuncAI/deepwiki-open.git
cd deepwiki-open

# Build the Docker image
docker build -t deepwiki-open .

# Run the container
docker run -p 8001:8001 -p 3000:3000 \
  -e GOOGLE_API_KEY=your_google_api_key \
  -e OPENAI_API_KEY=your_openai_api_key \
  -e OPENROUTER_API_KEY=your_openrouter_api_key \
  deepwiki-open

API Server Details

The API server provides:

Repository cloning and indexing
RAG (Retrieval Augmented Generation)
Streaming chat completions

For more details, see the API README.

🔌 OpenRouter Integration

DeepWiki now supports OpenRouter as a model provider, giving you access to hundreds of AI models through a single API:

Multiple Model Options: Access models from OpenAI, Anthropic, Google, Meta, Mistral, and more
Simple Configuration: Just add your OpenRouter API key and select the model you want to use
Cost Efficiency: Choose models that fit your budget and performance needs
Easy Switching: Toggle between different models without changing your code

How to Use OpenRouter with DeepWiki

Get an API Key: Sign up at OpenRouter and get your API key
Add to Environment: Add OPENROUTER_API_KEY=your_key to your .env file
Enable in UI: Check the "Use OpenRouter API" option on the homepage
Select Model: Choose from popular models like GPT-4o, Claude 3.5 Sonnet, Gemini 2.0, and more

OpenRouter is particularly useful if you want to:

Try different models without signing up for multiple services
Access models that might be restricted in your region
Compare performance across different model providers
Optimize for cost vs. performance based on your needs

🤖 Ask & DeepResearch Features

Ask Feature

The Ask feature allows you to chat with your repository using Retrieval Augmented Generation (RAG):

Context-Aware Responses: Get accurate answers based on the actual code in your repository
RAG-Powered: The system retrieves relevant code snippets to provide grounded responses
Real-Time Streaming: See responses as they're generated for a more interactive experience
Conversation History: The system maintains context between questions for more coherent interactions

DeepResearch Feature

DeepResearch takes repository analysis to the next level with a multi-turn research process:

In-Depth Investigation: Thoroughly explores complex topics through multiple research iterations
Structured Process: Follows a clear research plan with updates and a comprehensive conclusion
Automatic Continuation: The AI automatically continues research until reaching a conclusion (up to 5 iterations)
Research Stages:
1. Research Plan: Outlines the approach and initial findings
2. Research Updates: Builds on previous iterations with new insights
3. Final Conclusion: Provides a comprehensive answer based on all iterations

To use DeepResearch, simply toggle the "Deep Research" switch in the Ask interface before submitting your question.

📱 Screenshots

The main interface of DeepWiki

Access private repositories with personal access tokens

DeepResearch conducts multi-turn investigations for complex topics

Demo Video

Watch DeepWiki in action!

❓ Troubleshooting

API Key Issues

"Missing environment variables": Make sure your .env file is in the project root and contains the required API keys
"API key not valid": Check that you've copied the full key correctly with no extra spaces
"OpenRouter API error": Verify your OpenRouter API key is valid and has sufficient credits

Connection Problems

"Cannot connect to API server": Make sure the API server is running on port 8001
"CORS error": The API is configured to allow all origins, but if you're having issues, try running both frontend and backend on the same machine

Generation Issues

"Error generating wiki": For very large repositories, try a smaller one first
"Invalid repository format": Make sure you're using a valid GitHub, GitLab or Bitbucket URL format
"Could not fetch repository structure": For private repositories, ensure you've entered a valid personal access token with appropriate permissions
"Diagram rendering error": The app will automatically try to fix broken diagrams

Common Solutions

Restart both servers: Sometimes a simple restart fixes most issues
Check console logs: Open browser developer tools to see any JavaScript errors
Check API logs: Look at the terminal where the API is running for Python errors

🤝 Contributing

Contributions are welcome! Feel free to:

Open issues for bugs or feature requests
Submit pull requests to improve the code
Share your feedback and ideas

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
.github/workflows		.github/workflows
.vscode		.vscode
api		api
public		public
screenshots		screenshots
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
Ollama-instruction.md		Ollama-instruction.md
README.es.md		README.es.md
README.ja.md		README.ja.md
README.kr.md		README.kr.md
README.md		README.md
README.vi.md		README.vi.md
README.zh.md		README.zh.md
docker-compose.yml		docker-compose.yml
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
pyproject.toml		pyproject.toml
run.sh		run.sh
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
uv.lock		uv.lock

License

AsyncFuncAI/deepwiki-open

Folders and files

Latest commit

History

Repository files navigation

DeepWiki-Open

✨ Features

🚀 Quick Start (Super Easy!)

Option 1: Using Docker

Option 2: Manual Setup (Recommended)

Step 1: Set Up Your API Keys

Step 2: Start the Backend

Step 3: Start the Frontend

Step 4: Use DeepWiki!

🔍 How It Works

🛠️ Project Structure

🤖 Provider-Based Model Selection System

Supported Providers and Models

Environment Variables

Configuration Files

Custom Model Selection for Service Providers

Base URL Configuration for Enterprise Private Channels

🛠️ Advanced Setup

Environment Variables

Docker Setup

Using a .env file with Docker

Building the Docker image locally

API Server Details

🔌 OpenRouter Integration

How to Use OpenRouter with DeepWiki

🤖 Ask & DeepResearch Features

Ask Feature

DeepResearch Feature

📱 Screenshots

Demo Video

❓ Troubleshooting

API Key Issues

Connection Problems

Generation Issues

Common Solutions

🤝 Contributing

📄 License

⭐ Star History

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 13

Languages

Packages