Skip to content

Conversation

@nalexand
Copy link

5.43 it/sec on 3070ti laptop 8Gb. (2.3 sec for each second of generated music)

Linear layers loading in Fp8, can be some degradation in quality. But still usable.

Before 280sec ~ 2hr
After 280sec ~ 10min

Python 3.12
torch==2.8.0+cu128

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant