Skip to content

Non-record: 30-epoch Cosine TTT on SwiGLU architecture (1xH100, val_b…

04738a3
Select commit
Loading
Failed to load commit list.
Open

Non-record: 30ep Cosine TTT on SwiGLU + U-Net (1xH100, val_bpb=1.1175) #661

Non-record: 30-epoch Cosine TTT on SwiGLU architecture (1xH100, val_b…
04738a3
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs