Stars
Interact with your documents using the power of GPT, 100% privately, no data leaks
A Gradio web UI for Large Language Models with support for multiple inference backends.
Python tool for converting files and office documents to Markdown.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (V…
Pretrain and finetune ANY AI model of ANY size on multiple GPUs or TPUs with zero code changes.
A modular graph-based Retrieval-Augmented Generation (RAG) system
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Fully open reproduction of DeepSeek-R1
Agno is a lightweight library for building Multimodal Agents. Use it to give LLMs superpowers like memory, knowledge, tools and reasoning.
DSPy: The framework for programming—not prompting—language models
LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
PyTorch implementations of Generative Adversarial Networks.
State-of-the-Art Text Embeddings
🤗 smolagents: a barebones library for agents that think in Python code.
A very simple framework for state-of-the-art Natural Language Processing (NLP)
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
An open-source NLP research library, built on PyTorch.
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)