omni-diffusion-model

Text, image, audio, video unified into a single diffusion model with 1 latent space.

inspired by https://arxiv.org/abs/2510.13721 - NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback