gemma
Here are 110 public repositories matching this topic...
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
-
Updated
Jul 1, 2024 - C++
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
-
Updated
Jul 1, 2024 - Python
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
-
Updated
Jul 1, 2024 - TypeScript
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
-
Updated
Jul 1, 2024 - Python
EmbeddedLLM: API server for Embedded Device Deployment. Currently support ONNX-DirectML.
-
Updated
Jul 1, 2024 - Python
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
-
Updated
Jul 1, 2024 - Python
A collection of guides and examples for the Gemma open models from Google.
-
Updated
Jul 1, 2024 - Jupyter Notebook
🤖 Adds AI to Google Search. Ask from any site. Powered by Google Gemma + GPT-4o!
-
Updated
Jul 1, 2024 - JavaScript
NAACL '24 (Demo) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
-
Updated
Jul 1, 2024 - Python
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
-
Updated
Jul 1, 2024 - Go
This repository highlights the LLMs reasoning capabilities of ✨ Mistral / LLaMA-3 / Phi-3 / Gemma / Flan-T5 / GPT-4o ✨ in Targeted Sentiment Analysis in Russian / Translated to English mass-media 📊
-
Updated
Jun 30, 2024 - Python
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
-
Updated
Jul 1, 2024 - Python
A collection of Jupyter notebook experiments and applications centered around Generative AI with LLMs.
-
Updated
Jun 28, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to the gemma topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gemma topic, visit your repo's landing page and select "manage topics."