-
Notifications
You must be signed in to change notification settings - Fork 17
Issues: NVIDIA/NeMo-Run
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
AssertionError from nemo_run/core/runners/fdl_runner.py when using PreTrainingDataModule
#104
opened Nov 9, 2024 by
RachitBansal
zlib.error: Error -3 while decompressing data: incorrect header check
#97
opened Oct 29, 2024 by
RachitBansal
When running with detach=False on slurm, add an option to cancel experiment when ctrl+C is pressed
#71
opened Sep 30, 2024 by
Kipok
Print the content of slurm script in the sbatch command, so that it's visible in the logs
#68
opened Sep 28, 2024 by
Kipok
The error message when a factory is ill typed is still incorrect
#60
opened Sep 21, 2024 by
hemildesai
Logs are not being streamed with exp.run(detach=False, tail_logs=True)
#58
opened Sep 20, 2024 by
Kipok
Allow running multiple nemo run tasks in parallel with DockerExecutor
#57
opened Sep 19, 2024 by
Kipok
Check for invalid characters in task/experiment names and raise a clear error
#38
opened Sep 10, 2024 by
Kipok
Make an option to run task groups sequentially instead of in-parallel
#37
opened Sep 10, 2024 by
Kipok
Make job_name customizable or add ability to reuse code across tasks with different name
#36
opened Sep 9, 2024 by
Kipok
Add slurm_template parameter to SlurmExecutor (and use it in SlurmBatchRequest)
#15
opened Aug 28, 2024 by
Kipok
Can we remove the "title" parameter from experiment and only keep "id"?
#9
opened Aug 26, 2024 by
Kipok
ProTip!
What’s not been updated in a month: updated:<2024-10-24.