[mcore] moonlight (small model with deepseekv3 arch) (#1284) #7
Annotations
1 error
e2e_gsm8k_megatron
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
|