Skip to content

feat(cuda): add fused GDN decode and RMSNorm+SiLU gating kernels for …

2cf50f4
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

feat(pymllm): VocabParallelEmbedding & pymllm's cuda infra init #640

feat(cuda): add fused GDN decode and RMSNorm+SiLU gating kernels for …
2cf50f4
Select commit
Loading
Failed to load commit list.