Skip to content

v0.6.0

Choose a tag to compare

@eitanturok eitanturok released this 30 Aug 18:55

What's New

1. Torch 2.4 Compatibility (#145)

MegaBlocks now supports Torch 2.4!

2. New CI/CD

MegaBlocks has new Github Actions for better CI/CD! Now on every PR, MegaBlocks will automatically perform code linting and formatting (#131) and run tests on a GPU (#127).

3. Remove Weight Parallelism (#137)

Weight parallelism was not in use and so we removed it.

4. Shared Experts (#109)
Implement shared experts, based on the DeepSeekMoE paper.

Bug Fixes

  1. Better handle incompatible ffn sizes (#108)
  2. Fix AMP for memory optimized options (#111)
  3. Don't save moe lb-loss tensors (#119)

What's Changed

New Contributors

Full Changelog: v0.5.1...v0.6.0