Skip to content

Commit

Permalink
updates for new container 24.09
Browse files Browse the repository at this point in the history
  • Loading branch information
pablo-garay committed Nov 18, 2024
1 parent 597f83a commit f77e3e6
Show file tree
Hide file tree
Showing 6 changed files with 6 additions and 6 deletions.
2 changes: 1 addition & 1 deletion auto_configurator/conf/config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ launcher_scripts_path: ${auto_configurator_path}/../launcher_scripts
base_results_dir: ${auto_configurator_path}/results
data_dir: ${launcher_scripts_path}/data

training_container: nvcr.io/nvidia/nemo:24.07
training_container: nvcr.io/nvidia/nemo:24.09
container_mounts:
- null

Expand Down
2 changes: 1 addition & 1 deletion auto_configurator/tests/config_tests/test_main_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ def test_config(self):
base_results_dir: ${auto_configurator_path}/results
data_dir: ${launcher_scripts_path}/data
training_container: nvcr.io/nvidia/nemo:24.07
training_container: nvcr.io/nvidia/nemo:24.09
container_mounts:
- null
Expand Down
2 changes: 1 addition & 1 deletion csp_tools/aws/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
# See the License for the specific language governing permissions and
# limitations under the License.

FROM nvcr.io/nvidia/nemo:24.07
FROM nvcr.io/nvidia/nemo:24.09

ARG NCCL_VERSION=2.18.5-1+cuda12.2
ARG EFA_INSTALLER_VERSION=1.28.0
Expand Down
2 changes: 1 addition & 1 deletion examples/peft/llama/a100/lora_4gpu_k8s.sh
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ HYDRA_FULL_ERROR=1 python3 ${NEMO_FRAMEWORK_LAUNCHER_DIR}/launcher_scripts/main.
cluster=k8s_v2 \
cluster_type=k8s \
cluster.ib_interfaces=null \
container=nvcr.io/nvidia/nemo:24.07 \
container=nvcr.io/nvidia/nemo:24.09 \
stages=[peft] \
peft=${PEFT_CONFIG} \
launcher_scripts_path=${NEMO_FRAMEWORK_LAUNCHER_DIR}/launcher_scripts \
Expand Down
2 changes: 1 addition & 1 deletion launcher_scripts/conf/config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ data_dir: ${launcher_scripts_path}/data # Location to store and read the data.
base_results_dir: ${launcher_scripts_path}/results # Location to store the results, checkpoints and logs.
container_mounts: # List of additional paths to mount to container. They will be mounted to same path.
- null
container: nvcr.io/nvidia/nemo:24.07
container: nvcr.io/nvidia/nemo:24.09

wandb_api_key_file: null # File where the w&B api key is stored. Key must be on the first line.
wandb_api_bcp_secret_key: null # For BCP clusters, read the W&B api key directly from the environment variable set as a secret from BCP. The value must match the name of the environment variable in BCP, such as WANDB_TOKEN.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@ def test_config(self):
base_results_dir: ${launcher_scripts_path}/results # Location to store the results, checkpoints and logs.
container_mounts: # List of additional paths to mount to container. They will be mounted to same path.
- null
container: nvcr.io/nvidia/nemo:24.07
container: nvcr.io/nvidia/nemo:24.09
wandb_api_key_file: null # File where the w&B api key is stored. Key must be on the first line.
wandb_api_bcp_secret_key: null # For BCP clusters, read the W&B api key directly from the environment variable set as a secret from BCP. The value must match the name of the environment variable in BCP, such as WANDB_TOKEN.
Expand Down

0 comments on commit f77e3e6

Please sign in to comment.