Hi! MegaBlocks isn't a standalone training framework, but it's relatively easy to use from any framework. We use Megatron-LM and maintain a fork of it with MegaBlocks support. You could also use MegaBlocks from another framework, such as HuggingFace.
Do you have SFT scripts, and the hyperparameters you used to fine-tune the instruct version of your model? It would mean the world to the open-source community!