Inferless

pyannote-speaker-diarization-3.1 Public template
A state-of-the-art model that segments and labels audio recordings by accurately distinguishing different speakers. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

inferless/pyannote-speaker-diarization-3.1’s past year of commit activity

Python 5 3 0 0 Updated Aug 14, 2025
facebook-bart-cnn Public template
A variant of the BART model designed specifically for natural language summarization. It was pre-trained on a large corpus of English text and later fine-tuned on the CNN/Daily Mail dataset. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

inferless/facebook-bart-cnn’s past year of commit activity

Python 9 3 0 1 Updated Aug 14, 2025
qwen-image Public

inferless/qwen-image’s past year of commit activity

Python 0 0 0 0 Updated Aug 12, 2025
Qwen3-30B-A3B-Instruct-2507 Public

inferless/Qwen3-30B-A3B-Instruct-2507’s past year of commit activity

0 0 0 0 Updated Aug 12, 2025
qwen3-coder-30B-a3B-instruct Public
30.5B MoE code generation model purpose-tuned for code generation and agentic tool use. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

inferless/qwen3-coder-30B-a3B-instruct’s past year of commit activity

Python 0 0 0 0 Updated Aug 12, 2025
gpt-oss-20b Public template
A 21B open‑weight language model (with ~3.6 billion active parameters per token) developed by OpenAI for reasoning, tool integration, and low‑latency usage. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

inferless/gpt-oss-20b’s past year of commit activity

Python 0 0 0 0 Updated Aug 6, 2025
voxtral-mini-3b Public template
3B parameter audio-language model with speech transcription, translation, and audio understanding capabilities. <metadata> gpu: A10 | collections:["HF_Transformers"] </metadata>

inferless/voxtral-mini-3b’s past year of commit activity

Python 0 1 0 0 Updated Jul 30, 2025
kyutai-tts-1.6b Public template
1.6B parameter text-to-speech model that supports real-time streaming text input with ultra-low latency and voice conditioning capabilities.<metadata> gpu: A10 | collections:["HF_Transformers"] </metadata>

inferless/kyutai-tts-1.6b’s past year of commit activity

Python 0 0 0 0 Updated Jul 30, 2025
llama-3.1-8b-instruct-gguf Public template
An 8B-parameter, instruction-tuned variant of Meta's Llama-3.1 model, optimized in GGUF format for efficient inference. <metadata> gpu: A100 | collections: ["lama.cpp"] </metadata>

inferless/llama-3.1-8b-instruct-gguf’s past year of commit activity

Python 1 7 0 0 Updated Jul 17, 2025
stable-diffusion-3-5-large-turbo Public template
A fast, optimized diffusion model that generates high-quality images from text prompts, ideal for creative visual content. <metadata> gpu: A100 | collections: ["Diffusers"] </metadata>

inferless/stable-diffusion-3-5-large-turbo’s past year of commit activity

Python 0 11 0 0 Updated Jul 17, 2025

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inferless

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!