-
This is a continuation of #115, which worked on 23.10, but now I am testing on 24.08. I got an output in the end, but with errors:
I modified the answer #115 (reply in thread) by replacing ens3 with eth0, because mpirun wants it to be either eth0 or lo. Otherwise, there would be an extra error message
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @rht, Thank you for the inquiry. For more details, please refer to the "Software in the container" section of the NGC overview page: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/cuquantum-appliance. |
Beta Was this translation helpful? Give feedback.
Hi @rht,
Thank you for the inquiry.
For 24.08 container images, HCOLL is enabled by default for optimal collective operations on systems using Mellanox InfiniBand adapters.
It can be disabled by adding an option
--mca coll_hcoll_enable 0
.For more details, please refer to the "Software in the container" section of the NGC overview page: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/cuquantum-appliance.