Open-Superintelligence-Lab / omni-diffusion-model Public

Notifications You must be signed in to change notification settings
Fork 0
Star 2

Text, image, audio, video unified into a single diffusion model with 1 latent space.

2 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Repository files navigation

omni-diffusion-model

Text, image, audio, video unified into a single diffusion model with 1 latent space.

inspired by https://arxiv.org/abs/2510.13721 - NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching

About

Text, image, audio, video unified into a single diffusion model with 1 latent space.

Custom properties

Report repository

Releases

No releases published

Packages

No packages published