GPT4o

Community Open Source Implementation of GPT4o in PyTorch

Install

TikToken Tokenzier: We know fursure the tokenizer. Which is here
Model understands Images and Audio Natively. There are 2 approaches, process them natively or use encoders for each. I think here they're using encoders like whisper and vit for simplicity and brevity.
Using DALLE3 as the output head to generate images
Tokens to denote when to generate an image or audio
Whisper output head for the audio outputs

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github		.github
gpt4o		gpt4o
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
agorabanner.png		agorabanner.png
example.py		example.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt