Skip to content

Switch to per-channel FP8: 3.5x throughput improvement (41.1ms TPOT, …

2bf3af8
Select commit
Loading
Failed to load commit list.
Open

Add Kimi-K2-Instruct-0905 contrib model (1T MoE on trn2.48xlarge) #131

Switch to per-channel FP8: 3.5x throughput improvement (41.1ms TPOT, …
2bf3af8
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs