A hyper-efficient, lightweight AI Gateway that provides a unified interface to access various AI model providers through a single endpoint. Built for edge deployment using Cloudflare Workers, it offers seamless integration with popular AI providers while maintaining high performance and low latency.
- Edge-Optimized Performance: Built on Cloudflare Workers for minimal latency
- Universal Interface: Single endpoint for multiple AI providers
- Provider Agnostic: Easily switch between different AI providers
- Streaming Support: Real-time streaming responses for all supported providers
- Extensible Middleware: Customizable request/response pipeline
- Built-in Validation: Automatic request validation and error handling
- Auto-Transform: Automatic request/response transformation
- Detailed Metrics: Comprehensive request metrics and cost tracking
- Comprehensive Logging: Detailed logging for monitoring and debugging
- Type-Safe: Built with TypeScript for robust type safety
- OpenAI Compatible: Drop-in replacement for OpenAI's API
| Provider  | Streaming | OpenAI Compatible |
|-----------|-----------|-------------------|
| OpenAI    | ✅        | Native            |
| Anthropic | ✅        | ✅                |
| GROQ      | ✅        | ✅                |
| Fireworks | ✅        | ✅                |
| Together  | ✅        | ✅                |
```bash
# Install Wrangler CLI
npm install -g wrangler

# Clone and Setup
git clone https://github.com/Noveum/ai-gateway.git
cd ai-gateway
npm install

# Login to Cloudflare
wrangler login

# Development
npm run dev  # Server starts at http://localhost:3000

# Deploy
npm run deploy
```
Alternatively, run the gateway with Docker:

```bash
docker pull noveum/ai-gateway:latest
docker run -p 3000:3000 noveum/ai-gateway:latest
```
The gateway is a drop-in replacement for OpenAI's API: you can keep your existing OpenAI client libraries and simply change the base URL:
```typescript
// TypeScript/JavaScript
import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:3000/v1',
  apiKey: 'your-provider-api-key',
  defaultHeaders: { 'x-provider': 'openai' },
});

const response = await openai.chat.completions.create({
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Hello!' }],
});
```
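Streaming works with the same client. A minimal sketch, assuming the openai npm package v4+, where `stream: true` returns an async iterable:

```typescript
// Streaming sketch: reuses the `openai` client configured above.
const stream = await openai.chat.completions.create({
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Write a story' }],
  stream: true, // the gateway streams responses for all supported providers
});

for await (const chunk of stream) {
  // Each chunk carries an incremental delta, as in OpenAI's API.
  process.stdout.write(chunk.choices[0]?.delta?.content ?? '');
}
```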
Using curl with Anthropic:

```bash
curl -X POST http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-provider: anthropic" \
  -H "Authorization: Bearer your-anthropic-api-key" \
  -d '{
    "model": "claude-3-sonnet-20240229",
    "messages": [{"role": "user", "content": "Hello!"}],
    "temperature": 0.7,
    "max_tokens": 1000
  }'
```
Using curl with GROQ:

```bash
curl -X POST http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "x-provider: groq" \
  -H "Authorization: Bearer your-groq-api-key" \
  -d '{
    "model": "mixtral-8x7b-32768",
    "messages": [{"role": "user", "content": "Hello!"}],
    "temperature": 0.7,
    "max_tokens": 1000
  }'
```
Streaming with the Python SDK:

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:3000/v1",
    api_key="your-provider-api-key",
    default_headers={"x-provider": "anthropic"},  # or any other provider
)

stream = client.chat.completions.create(
    model="claude-3-sonnet-20240229",
    messages=[{"role": "user", "content": "Write a story"}],
    stream=True,
)

for chunk in stream:
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")
```
Responses follow OpenAI's format, with an additional gateway-specific `metrics` block:

```json
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "created": 1709312768,
  "model": "gpt-4",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I help you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 9,
    "total_tokens": 19
  },
  "system_fingerprint": "fp_1234",
  "metrics": {
    "latency_ms": 450,
    "tokens_per_second": 42.2,
    "cost": {
      "input_cost": 0.0003,
      "output_cost": 0.0006,
      "total_cost": 0.0009
    }
  }
}
```
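Because `metrics` is a gateway extension rather than part of OpenAI's response shape, typed SDK clients won't expose it directly. One way to read it, sketched against the field names shown above and the `openai` client from the Quick Start:

```typescript
// `metrics` is not in the OpenAI SDK's response types, so cast before reading it.
const completion = await openai.chat.completions.create({
  model: 'gpt-4',
  messages: [{ role: 'user', content: 'Hello!' }],
});

const metrics = (completion as any).metrics;
if (metrics) {
  console.log(`latency: ${metrics.latency_ms} ms`);
  console.log(`throughput: ${metrics.tokens_per_second} tokens/s`);
  console.log(`total cost: $${metrics.cost.total_cost}`);
}
```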
We welcome contributions! Here are some tasks we're actively looking for help with:
- AWS Bedrock Integration
  - Add support for AWS Bedrock models
  - Implement authentication and cost tracking
  - Get Started →
- Testing Framework
  - Set up unit and integration tests
  - Add provider-specific test cases
  - Get Started →
- Performance Benchmarks
  - Create benchmarking suite
  - Compare with other AI gateways
  - Get Started →
- Prometheus Integration
  - Add metrics exporter
  - Create Grafana dashboards
  - Get Started →
- Response Caching
  - Implement caching layer
  - Add cache invalidation
  - Get Started →
- Rate Limiting
  - Add per-user rate limits
  - Implement a token bucket algorithm (see the sketch after this list)
  - Get Started →
- Provider Guides
  - Create setup guides for each provider
  - Add troubleshooting sections
  - Get Started →
- Deployment Examples
  - Add Docker Compose examples
  - Create cloud deployment guides
  - Get Started →
Want to contribute?

1. Pick a task from above
2. Open an issue to discuss your approach
3. Submit a pull request
Need help? Join our Discord or check existing issues.
The gateway collects detailed metrics for every request, providing insights into:
- Real-time performance tracking
- Token usage and cost calculation
- Streaming metrics support
- Provider-specific metadata
- Latency and TTFB (time to first byte) monitoring
- Detailed debugging information
For detailed metrics documentation, see METRICS.md
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
- GitHub Issues: https://github.com/Noveum/ai-gateway/issues
- Twitter: @NoveumAI
Copyright 2024 Noveum AI
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
- Built with Hono
- Deployed on Cloudflare Workers