[ROCm] Add script to run multi gpu tests #25

Merged

@hsharsha hsharsha commented Jul 2, 2024

No description provided.

hsharsha commented Jul 3, 2024

Updated with PJRT distributed tests and a check for the number of GPUs.

@hsharsha hsharsha requested a review from i-chaochen July 3, 2024 11:10
@hsharsha hsharsha force-pushed the rocm-jaxlib-v0.4.28-qa-multi-gpu-tets branch from af476e8 to c1abbb8 Compare July 3, 2024 11:19
TF_GPU_COUNT=$(rocm-smi -i|grep 'Device ID' |grep 'GPU' |wc -l)
fi
if [[ $TF_GPU_COUNT -lt 4 ]]; then
echo "Found only ${TF_GPU_COUNT} gpus, multi-gpu tests need atleast 4 gpus."

Could you add a space in "atleast"? :)
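
For context, the GPU-count check in the fragment above could be sketched as two small functions. This is only an illustration, not the merged script: the function names `count_rocm_gpus` and `require_min_gpus` are hypothetical, the fallback to 0 when `rocm-smi` is missing is an assumption, and the message uses the "at least" spelling suggested in this review. The `rocm-smi -i` pipeline is taken from the diff, with `grep -c` standing in for `grep ... | wc -l`.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical and do not come from the PR.

# Count GPUs with the same rocm-smi pipeline as the diff.
# Assumption: print 0 when rocm-smi is not installed.
count_rocm_gpus() {
    if ! command -v rocm-smi >/dev/null 2>&1; then
        echo 0
        return 0
    fi
    # grep -c prints the number of matching lines (equivalent to grep | wc -l).
    rocm-smi -i | grep 'Device ID' | grep -c 'GPU'
}

# Succeed when the GPU count ($1) meets the required minimum ($2);
# otherwise print a message and return a non-zero status.
require_min_gpus() {
    local found="$1" needed="$2"
    if [[ "$found" -lt "$needed" ]]; then
        echo "Found only ${found} gpus, multi-gpu tests need at least ${needed} gpus."
        return 1
    fi
    return 0
}
```

A caller might then run `require_min_gpus "$(count_rocm_gpus)" 4 || exit 1` before starting the multi-GPU tests. Keeping the comparison separate from the detection means the threshold logic can be exercised on machines without ROCm installed.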

@i-chaochen i-chaochen merged commit a52b6a3 into rocm-jaxlib-v0.4.28-qa Jul 4, 2024
3 of 5 checks passed
2 participants