We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
If not, could I learn about the training parameters (e.g. effective batch size, learning rate, clipping, etc.)
The text was updated successfully, but these errors were encountered:
Hello, the settings specified in the config_X_mira.yaml file are ready to be used for training Mira on A100 40G GPU.
Sorry, something went wrong.
Wait, so the model was trained with 1 A100 and no gradient accumulation?
No, the Mira-v0 model was trained on 32 A100 GPUs for approximately two days.
No branches or pull requests
If not, could I learn about the training parameters (e.g. effective batch size, learning rate, clipping, etc.)
The text was updated successfully, but these errors were encountered: