RAGNA Studio - RAG Server

A high-performance Retrieval-Augmented Generation (RAG) server built with TypeScript, Express.js, and modern vector databases. This server provides APIs for document parsing, embedding, and semantic search capabilities.

🚀 Features

Document Processing: Parse various document formats using Apache Tika
Text Embedding: Support for multiple embedding providers (OpenAI, Cohere)
Vector Storage: Qdrant integration for efficient vector storage and retrieval
Semantic Search: Fast similarity search across embedded documents
Text Tokenization: Built-in tokenization with tiktoken
Docker Support: Complete Docker Compose setup for easy deployment
TypeScript: Full type safety and modern development experience

📋 API Endpoints

The server exposes the following REST API endpoints under /api/v1:

/parse - Document parsing and text extraction
/embed - Document embedding and vector storage
/search - Semantic search across embedded documents
/tokenize - Text tokenization services

🛠️ Tech Stack

Runtime: Node.js with TypeScript
Framework: Express.js
Vector Database: Qdrant
Document Processing: Apache Tika
Embedding Providers: OpenAI, Cohere
Validation: Zod schemas
Logging: Consola
Containerization: Docker & Docker Compose

🏃‍♂️ Quick Start

Prerequisites

Node.js 22+
Docker and Docker Compose
Environment variables configured (see Configuration)

Development Setup

Clone the repository

git clone https://github.com/hopkins385/rag-server-ts.git
cd rag-server-ts

Install dependencies
```
npm install
```

Configure environment

cp .env.example .env
# Edit .env with your configuration

Start development server
```
npm run dev
```

Docker Deployment

Start the complete stack
```
docker-compose up -d
```
This will start:
- RAG Server (API)
- Qdrant (Vector Database)
- Apache Tika (Document Processing)

For development with hot reload

docker-compose -f docker-compose.dev.yml up -d

📖 API Documentation

Document Embedding

POST /api/v1/embed/file
Content-Type: application/json

{
  "mediaId": "unique-media-id",
  "recordId": "unique-record-id",
  "mimeType": "application/pdf",
  "filePath": "/path/to/document.pdf"
}

Semantic Search

POST /api/v1/search/vector
Content-Type: application/json

{
  "query": "your search query",
  "recordIds": ["record-id-1", "record-id-2"]
}

Text Tokenization

POST /api/v1/tokenize/text
Content-Type: application/json

{
  "text": "text to tokenize"
}

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Development Guidelines

Follow TypeScript best practices
Add tests for new features
Update documentation as needed
Run linting and type checking before committing
Use conventional commit messages

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🔗 Related Projects

Qdrant - Vector similarity search engine
Apache Tika - Content analysis toolkit
OpenAI - AI platform for embeddings
Cohere - Natural language AI platform

📞 Support

If you have any questions or run into issues, please:

Check the Issues page
Create a new issue with detailed information
Join our community discussions

Built with ❤️ and Appreciation by Sven Stadhouders

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
src		src
.editorconfig		.editorconfig
.env.example		.env.example
.eslintrc		.eslintrc
.gitignore		.gitignore
.ncurc.json		.ncurc.json
.prettierrc.json		.prettierrc.json
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
build.sh		build.sh
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAGNA Studio - RAG Server

🚀 Features

📋 API Endpoints

🛠️ Tech Stack

🏃‍♂️ Quick Start

Prerequisites

Development Setup

Docker Deployment

📖 API Documentation

Document Embedding

Semantic Search

Text Tokenization

🤝 Contributing

Development Guidelines

📄 License

🔗 Related Projects

📞 Support

About

Uh oh!

Releases 1

Packages

Languages

License

hopkins385/rag-server-ts

Folders and files

Latest commit

History

Repository files navigation

RAGNA Studio - RAG Server

🚀 Features

📋 API Endpoints

🛠️ Tech Stack

🏃‍♂️ Quick Start

Prerequisites

Development Setup

Docker Deployment

📖 API Documentation

Document Embedding

Semantic Search

Text Tokenization

🤝 Contributing

Development Guidelines

📄 License

🔗 Related Projects

📞 Support

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages