Skip to content

Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Notifications You must be signed in to change notification settings

yuvraj108c/ComfyUI-FLOAT

Repository files navigation

ComfyUI FLOAT

python arXiv by-nc-sa/4.0

This project provides a ComfyUI wrapper of FLOAT for Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

0506.4.mp4

⭐ Support

If you like my projects and wish to see updates and new features, please consider supporting me. It helps a lot!

ComfyUI-Depth-Anything-Tensorrt ComfyUI-Upscaler-Tensorrt ComfyUI-Dwpose-Tensorrt ComfyUI-Rife-Tensorrt

ComfyUI-Whisper ComfyUI_InvSR ComfyUI-FLOAT ComfyUI-Thera ComfyUI-Video-Depth-Anything ComfyUI-PiperTTS

buy-me-coffees paypal-donation

🚀 Installation

git clone https://github.com/yuvraj108c/ComfyUI-FLOAT.git
cd ./ComfyUI-FLOAT
pip install -r requirements.txt

☀️ Usage

  • Load example workflow
  • Upload driving image and audio, click queue
  • Models autodownload to /ComfyUI/models/float

Citation

@article{ki2024float,
  title={FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait},
  author={Ki, Taekyung and Min, Dongchan and Chae, Gyeongsu},
  journal={arXiv preprint arXiv:2412.01064},
  year={2024}
}

Acknowledgments

Thanks to simplepod.ai for providing GPU servers

License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

About

Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages