-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove atomic add #21
Comments
Do you know the 30% is for what grain size? How many atoms/MPIrank? |
Total Number of Atom 10752. Program was submitted on 1 KNL nodes with 8 MPI rank and 8 OpenMP thread per rank. No hyper-threading is used. |
So, 1344 atoms/MPIrank with 8 OpenMP threads? I guess the atomic op impacts less on large grain size. It would be nice to investigate it wrt the grain size and # of threads. EDIT: |
I think when we go for larger number of threads, the atomic only makes the problem worse. |
Need to remove atomic add. Consuming almost 30% of time.
The text was updated successfully, but these errors were encountered: