Unequal consumption of Resources among all GPUs in multi-node execution #140
Replies: 1 comment 4 replies
-
Hi @intelligi123 , With the mpirun command above, the device index mapping will be as follows: |
Beta Was this translation helpful? Give feedback.
-
Hi,
I am running a quantum volume circuit on a multi-node environment where there are two nodes and each has two GPUs.
I have kept total 4 processes and when I look at resource consumption at each GPU, its not uniform. Each node has one GPU which is consumed most and other almost stayed idle (can be seen with RAM usage and GPU Utility). Below is output of nvidia-smi on both nodes:
and this one is second node
The command I am executing along with code is below:
/usr/local/openmpi/bin/mpirun --map-by node -x UCX_TLS=all -x UCX_NET_DEVICES=ens34 -x UCX_LOG_LEVEL=info --hostfile hostfile -n 4 /home/cuquantum/conda/envs/cuquantum-23.10/bin/python quantum_vol.py --nbits 33
Quantum Volume Generation
Beta Was this translation helpful? Give feedback.
All reactions