Esperanto 🌐


Esperanto is a powerful Python library that provides a unified interface for interacting with various Large Language Model (LLM) providers. It simplifies working with the APIs of different AI models (LLMs, embedders, transcribers) by offering a consistent interface while preserving provider-specific optimizations.

Features ✨

  • Unified Interface: Work with multiple LLM providers using a consistent API

  • Provider Support:

    • OpenAI (GPT-4, GPT-3.5, o1, Whisper, TTS)
    • Anthropic (Claude 3)
    • OpenRouter (Access to multiple models)
    • xAI (Grok)
    • Groq (Mixtral, Llama, Whisper)
    • Google GenAI (Gemini LLM, Text To Speech, Embedding)
    • Vertex AI (Google Cloud)
    • Ollama (Local deployment)
    • ElevenLabs (Text-to-Speech)
  • Embedding Support: Multiple embedding providers for vector representations

  • Speech-to-Text Support: Transcribe audio using multiple providers

  • Text-to-Speech Support: Generate speech using multiple providers

  • Async Support: Both synchronous and asynchronous API calls (see the sketch after this list)

  • Streaming: Support for streaming responses

  • Structured Output: JSON output formatting (where supported)

  • LangChain Integration: Easy conversion to LangChain chat models (also sketched below)
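
Both the async and LangChain features take only a few lines. A minimal sketch, assuming the async counterpart of `chat_complete` is named `achat_complete` and the LangChain bridge is `to_langchain()` (both names are assumptions, not verified signatures):

```python
import asyncio

from esperanto.factory import AIFactory

model = AIFactory.create_language("openai", "gpt-3.5-turbo")
messages = [{"role": "user", "content": "Hello!"}]

async def main():
    # Assumed async counterpart of chat_complete()
    response = await model.achat_complete(messages)
    print(response.choices[0].message.content)

asyncio.run(main())

# Assumed LangChain bridge: returns a LangChain-compatible chat model
langchain_model = model.to_langchain()
```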


Installation 🚀

Install Esperanto using pip:

```bash
pip install esperanto
```

For specific providers, install with their extras:

```bash
# For OpenAI support
pip install "esperanto[openai]"

# For Anthropic support
pip install "esperanto[anthropic]"

# For Google (GenAI) support
pip install "esperanto[google]"

# For Vertex AI support
pip install "esperanto[vertex]"

# For Groq support
pip install "esperanto[groq]"

# For Ollama support
pip install "esperanto[ollama]"

# For all providers
pip install "esperanto[all]"
```

Provider Support Matrix

| Provider       | LLM Support | Embedding Support | Speech-to-Text | Text-to-Speech | JSON Mode |
|----------------|-------------|-------------------|----------------|----------------|-----------|
| OpenAI         | ✅          | ✅                | ✅             | ✅             | ✅        |
| Anthropic      | ✅          | ❌                | ❌             | ❌             | ✅        |
| Groq           | ✅          | ❌                | ✅             | ❌             | ✅        |
| Google (GenAI) | ✅          | ✅                | ❌             | ✅             | ✅        |
| Vertex AI      | ❌          | ✅                | ❌             | ❌             | ❌        |
| Ollama         | ✅          | ✅                | ❌             | ❌             | ✅        |
| ElevenLabs     | ❌          | ❌                | ❌             | ✅             | ❌        |

Quick Start 🏃‍♂️

You can use Esperanto in two ways: directly with provider-specific classes or through the AI Factory.

Using AI Factory

```python
from esperanto.factory import AIFactory

# Get available providers for each model type
providers = AIFactory.get_available_providers()
print(providers)
# Output:
# {
#     'language': ['openai', 'anthropic', 'google', 'groq', 'ollama', 'openrouter', 'xai'],
#     'embedding': ['openai', 'google', 'ollama', 'vertex'],
#     'speech_to_text': ['openai', 'groq'],
#     'text_to_speech': ['openai', 'elevenlabs', 'google']
# }

# Create a language model instance with structured output (JSON)
model = AIFactory.create_language(
    "openai",
    "gpt-3.5-turbo",
    structured={"type": "json"},
)
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What's the capital of France?"},
]
response = model.chat_complete(messages)  # Response will be in JSON format

# Create an embedding instance
model = AIFactory.create_embedding("openai", "text-embedding-3-small")
texts = ["Hello, world!", "Another text"]
embeddings = model.embed(texts)
```
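
Because the language model above was created with `structured={"type": "json"}`, the chat response content should be parseable JSON. A minimal sketch of consuming it, guarded since JSON mode is provider-dependent:

```python
import json

# response here is the chat_complete() result from the structured model above.
try:
    data = json.loads(response.choices[0].message.content)
    print(data)  # e.g. a dict describing the answer
except json.JSONDecodeError:
    print("Provider returned non-JSON content")
```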

Standardized Responses

All providers in Esperanto return standardized response objects, making it easy to work with different models without changing your code.

LLM Responses

```python
from esperanto.factory import AIFactory

model = AIFactory.create_language("openai", "gpt-3.5-turbo")
messages = [{"role": "user", "content": "Hello!"}]

# All LLM responses follow this structure
response = model.chat_complete(messages)
print(response.choices[0].message.content)  # The actual response text
print(response.choices[0].message.role)     # 'assistant'
print(response.model)                       # The model used
print(response.usage.total_tokens)          # Token usage information

# For streaming responses
for chunk in model.chat_complete(messages):
    print(chunk.choices[0].delta.content)   # Partial response text
```
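
When streaming, it is common to accumulate the partial deltas into one string. A minimal sketch that reuses `model` and `messages` from the block above, assuming `delta.content` can be `None` on some chunks (e.g. role-only chunks) and so must be filtered:

```python
# Collect a streamed reply into a single string.
parts = []
for chunk in model.chat_complete(messages):
    if chunk.choices and chunk.choices[0].delta.content:
        parts.append(chunk.choices[0].delta.content)
full_text = "".join(parts)
print(full_text)
```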

Embedding Responses

```python
from esperanto.factory import AIFactory

model = AIFactory.create_embedding("openai", "text-embedding-3-small")
texts = ["Hello, world!", "Another text"]

# All embedding responses follow this structure
response = model.embed(texts)
print(response.data[0].embedding)     # Vector for first text
print(response.data[0].index)         # Index of the text (0)
print(response.model)                 # The model used
print(response.usage.total_tokens)    # Token usage information
```
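
The standardized vectors can be fed straight into ordinary math. A toy sketch computing cosine similarity between the two embedded texts from the block above:

```python
import math

a = response.data[0].embedding  # vector for "Hello, world!"
b = response.data[1].embedding  # vector for "Another text"
dot = sum(x * y for x, y in zip(a, b))
norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
print(f"cosine similarity: {dot / norm:.3f}")
```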

The standardized response objects ensure consistency across different providers, making it easy to:

  • Switch between providers without changing your application code
  • Handle responses in a uniform way
  • Access common attributes like token usage and model information
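
For example, pointing the same calling code at a different provider is just a change of factory arguments. A minimal sketch (the Anthropic model name below is illustrative):

```python
from esperanto.factory import AIFactory

messages = [{"role": "user", "content": "Hello!"}]

# Only the factory arguments change; the response shape stays the same.
# "claude-3-haiku-20240307" is an illustrative model name.
model = AIFactory.create_language("anthropic", "claude-3-haiku-20240307")
response = model.chat_complete(messages)
print(response.choices[0].message.content)
```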


License 📄

This project is licensed under the MIT License - see the LICENSE file for details.