Thread ID Cleanup, main branch (2025.01.09.) #810

krasznaa · 2025-01-09T06:26:04Z

Following #808, I thought it would be time to clean up the rest of the device functions as well with how they receive thread identifiers.

Made all functions that need a "global index" use traccc::device::global_index_t;
- This allowed removing a number of static_cast-s from these functions;
All functions that receive a thread_id and/or barrier object, now do so using constant references;
- All such types provide only const functions. So I didn't see much reason for using non-const references here. 🤔

After this, I went and harmonized the CUDA, SYCL and ALPAKA codes a little as well.

Moved all thread_id types to be private headers in their libraries;
Added automatic, compile-time checks that they would all fulfill the traccc::device::concepts::thread_id1 concept;
Introduced traccc::cuda::details::global_index1() and traccc::sycl::details::global_index(...) as helper functions for generating traccc::device::global_index_t values;

While doing all of this, I tried to fix up the includes in all the touched files a bit. Since many of them were doing very questionable things. (Including way more files than necessary, hiding missing includes in some of the common device headers.)

I'm pretty happy with these updates myself, but am interested in your opinions.

Made all of them into "private headers", and added automated tests that they would fulfill the appropriate concept.

While also cleaning up the includes of the files a bit.

stephenswat

I generally dislike this idea that because vecmem only supports 32-bit integers we should be downcasting all out accesses in a 64-bit memory space to 32-bit at the generation site of those indices, rather than generating them at the appropriate size and only downcasting them necessary.

device/common/include/traccc/fitting/device/impl/fit.ipp

device/cuda/src/sanity/contiguous_on.cuh

device/cuda/src/utils/global_index.hpp

stephenswat · 2025-01-09T09:20:33Z

device/cuda/src/utils/global_index.hpp

+/// Function creating a global index in a 1D CUDA kernel
+__device__ inline device::global_index_t global_index1() {
+
+    return blockIdx.x * blockDim.x + threadIdx.x;


Also, I don't think we need this; the thread identifier classes already serve this purpose. 😕

I thought about it. But note that global_index_t is sort of "its own thing" in the code, it's not directly tied to the thread_id classes. Those are still allowed to return any integer if they really need to. (That's another story by itself.)

This just seemed like a nicely readable way of expressing what we want. 🤔 In both the CUDA and SYCL code.

device/cuda/src/utils/thread_id.hpp

device/sycl/src/utils/global_index.hpp

device/sycl/src/utils/thread_id.hpp

sonarqubecloud · 2025-01-09T10:06:59Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

krasznaa added cleanup Makes the code all clean and tidy cuda Changes related to CUDA sycl Changes related to SYCL alpaka Changes related to Alpaka labels Jan 9, 2025

krasznaa requested review from stephenswat and beomki-yeo January 9, 2025 06:26

krasznaa added 4 commits January 9, 2025 08:10

Harmonized the different thread_id classes with each other.

c29320f

Made all of them into "private headers", and added automated tests that they would fulfill the appropriate concept.

Synchronized how thread IDs would be used in the device functions.

ceb4a81

While also cleaning up the includes of the files a bit.

Call device functions consistently from CUDA.

2aa7731

Call device functions consistently from SYCL.

74badb9

krasznaa force-pushed the ThreadIdCleanup-main-20250108 branch from ced855e to 74badb9 Compare January 9, 2025 07:14

stephenswat requested changes Jan 9, 2025

View reviewed changes

Implement (some of) Stephen's suggestions.

b0ffe4c

stephenswat approved these changes Jan 9, 2025

View reviewed changes

stephenswat enabled auto-merge (squash) January 9, 2025 10:11

stephenswat merged commit bdfa3f5 into acts-project:main Jan 9, 2025
27 of 29 checks passed

krasznaa deleted the ThreadIdCleanup-main-20250108 branch January 9, 2025 10:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thread ID Cleanup, main branch (2025.01.09.) #810

Thread ID Cleanup, main branch (2025.01.09.) #810

krasznaa commented Jan 9, 2025

stephenswat left a comment

stephenswat Jan 9, 2025

krasznaa Jan 9, 2025

sonarqubecloud bot commented Jan 9, 2025

Thread ID Cleanup, main branch (2025.01.09.) #810

Thread ID Cleanup, main branch (2025.01.09.) #810

Conversation

krasznaa commented Jan 9, 2025

stephenswat left a comment

Choose a reason for hiding this comment

stephenswat Jan 9, 2025

Choose a reason for hiding this comment

krasznaa Jan 9, 2025

Choose a reason for hiding this comment

sonarqubecloud bot commented Jan 9, 2025

Quality Gate passed