Adds support for large number of items in DeviceSelect and DevicePartition #2400

Merged
merged 44 commits into from
Oct 8, 2024

@elstehle (Collaborator) commented Sep 10, 2024

Description

This PR implements streaming DeviceSelect and DevicePartition. For very large inputs that exceed INT_MAX items, the input is split into partitions of at most INT_MAX items each, and one partition is processed at a time.

Closes #2238
Closes #1422
Closes #1437
Closes #1614

TODOs

  • Adapt DeviceSelect interface to take num_items as int64_t.
  • Adapt DevicePartition interface to take templatized num_items, use the kernel with minimal changes for int32_t and smaller NumItemsT and use full-fledged streaming kernel for the remaining types.
  • Add tests for large number of items for DeviceSelect::Flagged.
  • Add tests for large number of items for DevicePartition::If.
  • Add tests for large number of items for DevicePartition::Flagged.
  • Add tests for large number of items for DeviceSelect::Unique.
  • Add benchmarks for DevicePartition with two distinct iterators (one for selected and one for rejected items).
  • Mitigate performance degradation for DeviceSelect::If.
  • Mitigate performance degradation for DeviceSelect::Flagged.
  • Mitigate performance degradation for DeviceSelect::Unique.
  • Mitigate performance degradation for DevicePartition::If.
  • Mitigate performance degradation for DevicePartition::Flagged.
  • Mitigate performance degradation for DevicePartition::If with distinct partitions.
  • Update tests for large number of items for DeviceSelect::If.
  • Open issue for adding support for large number of items for DevicePartition::If using ThreeWayPartition and tests.
  • Account in thrust dispatch for copy_if et al. for the optimal choice of offset types.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Latest benchmark results on 95de26f

Approach

  • The streaming DeviceSelect and DevicePartition split inputs larger than INT_MAX into partitions of up to INT_MAX items each and invoke the respective algorithm once per partition.
  • We provide a streaming_context object to the algorithm that provides all information about the current partition (i.e., for the current kernel invocation), like offsets into the input and output iterators.
  • For DevicePartition::{If,Flagged}
    • we use the streaming_context iff the user-provided offset type is uint32_t or wider than 4 bytes
    • otherwise, we use a dummy streaming_context that basically returns a 0 immediate value for the offsets et al. to keep performance impact minimal
  • For DeviceSelect::{If,Flagged,Unique} we always use the streaming_context, as the performance downside is negligible. Always using the streaming_context means we compile a single kernel no matter the user-provided offset type. Another benefit is that, in the future, we only have to tune one kernel template specialization.
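The partition-at-a-time approach described above can be sketched on the host in plain C++. This is only an illustration under stated assumptions, not the code in this PR: `streaming_context_sketch`, `select_partition`, `streaming_select_if`, and the `max_partition_size` parameter are hypothetical names, and the sequential loop stands in for a kernel invocation. The point it shows is that indexing within a partition stays 32-bit, while the context carries the 64-bit offsets into the overall input and output.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical stand-in for the streaming_context: it carries the offsets of
// the current partition into the overall input and output.
struct streaming_context_sketch {
  std::int64_t input_offset;   // index of the partition's first input item
  std::int64_t output_offset;  // items already selected by earlier partitions
};

// Stand-in for one kernel invocation: local indices are 32-bit, and the
// 64-bit offsets from the context are only added when touching memory.
template <typename T, typename Pred>
std::int32_t select_partition(const T* in, T* out, std::int32_t num_items,
                              const streaming_context_sketch& ctx, Pred pred) {
  std::int32_t num_selected = 0;
  for (std::int32_t i = 0; i < num_items; ++i) {
    const T& item = in[ctx.input_offset + i];
    if (pred(item)) {
      out[ctx.output_offset + num_selected] = item;
      ++num_selected;
    }
  }
  return num_selected;
}

// Splits the input into partitions of at most max_partition_size items and
// processes them one at a time. Returns the total number of selected items.
template <typename T, typename Pred>
std::int64_t streaming_select_if(
    const T* in, T* out, std::int64_t num_items, Pred pred,
    std::int64_t max_partition_size = std::numeric_limits<std::int32_t>::max()) {
  assert(max_partition_size > 0 &&
         max_partition_size <= std::numeric_limits<std::int32_t>::max());
  streaming_context_sketch ctx{0, 0};
  while (ctx.input_offset < num_items) {
    const auto current = static_cast<std::int32_t>(
        std::min(max_partition_size, num_items - ctx.input_offset));
    const std::int32_t selected = select_partition(in, out, current, ctx, pred);
    ctx.input_offset += current;    // advance to the next partition
    ctx.output_offset += selected;  // later partitions append after these
  }
  return ctx.output_offset;
}
```

The partition size is a parameter here only so that the splitting logic can be exercised with small inputs; the description above fixes it at INT_MAX.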

Summaries

How to interpret:

  • Diff i32 vs i32.main: We use only a dummy streaming_context. These columns compare the algorithm with a dummy streaming_context to the performance numbers we got in main (using i32 offsets) today.
  • Diff i64 vs i32.main: We use a streaming_context that provides offsets using i64. That is, indexing into the overall inputs and outputs across partitions uses i64 offsets, while all other indexing uses i32. These columns compare the algorithm with a proper streaming_context to what we have in main today using i32(!) offsets.
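As a reading aid, the following sketch assumes that the Diff percentages are relative differences of a compared time against a reference time, so negative values mean the compared run is faster. That assumption is consistent with the detailed rows further down (for example, a Ref Time of 9.615 us and a Cmp Time of 9.756 us are listed with a %Diff of 1.47%). The function name `percent_diff` is hypothetical.

```cpp
#include <cassert>

// Relative difference, in percent, of a compared time against a reference
// time. Both times must use the same unit. Negative means "faster than ref".
inline double percent_diff(double ref_time, double cmp_time) {
  assert(ref_time > 0.0);
  return (cmp_time - ref_time) / ref_time * 100.0;
}
```

For instance, `percent_diff(9.615, 9.756)` evaluates to roughly 1.47.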

What to focus on:

  • DeviceSelect is, from now on, always using i64, because there is only a very limited performance downside from using i64 instead of i32 with the streaming approach, while we have the benefit of having to maintain and tune just a single kernel template instantiation going forward. Given that, we want to focus on the rightmost two columns for DeviceSelect in the following summary.
  • DevicePartition is using a static dispatch, i.e., i32 (dummy streaming context) or i64 (full-fledged streaming), depending on the user-provided OffsetT. Using i32 retains SASS compatibility with what we have in main today, so performance is basically unchanged for a user-provided i32 offset type.
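The static dispatch for DevicePartition can be sketched with a compile-time type selection. This is a minimal illustration, not the PR's implementation: `dummy_context_sketch`, `full_context_sketch`, `needs_streaming_v`, and `context_for_t` are hypothetical names. The selection rule follows the Approach section: the full context is used iff the offset type is a 32-bit unsigned integer or wider than 4 bytes, and otherwise a dummy context returns constant zeros that the compiler can fold away.

```cpp
#include <cstdint>
#include <type_traits>

// Hypothetical dummy context: all offsets are compile-time zeros, so adding
// them to an index should cost nothing after constant folding.
struct dummy_context_sketch {
  static constexpr bool is_streaming = false;
  constexpr std::int32_t input_offset() const { return 0; }
  constexpr std::int32_t output_offset() const { return 0; }
};

// Hypothetical full context: carries 64-bit offsets of the current partition.
struct full_context_sketch {
  static constexpr bool is_streaming = true;
  std::int64_t in_off;
  std::int64_t out_off;
  std::int64_t input_offset() const { return in_off; }
  std::int64_t output_offset() const { return out_off; }
};

// Selection rule as stated in the description: stream iff the offset type is
// uint32_t or wider than 4 bytes.
template <typename OffsetT>
constexpr bool needs_streaming_v =
    std::is_same<OffsetT, std::uint32_t>::value || (sizeof(OffsetT) > 4);

// Picks the context type at compile time from the user-provided offset type.
template <typename OffsetT>
using context_for_t =
    std::conditional_t<needs_streaming_v<OffsetT>, full_context_sketch,
                       dummy_context_sketch>;
```

Because the choice is made per offset type at compile time, an int32_t offset type never sees the 64-bit arithmetic, which is the property the i32 columns in the summaries are meant to confirm.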

Summary of the benefits of streaming i64 versus the i64 offset type in main:
The following compares the worst-case slowdown at 2^28 items for the two approaches:

  • DeviceSelect::If: 4.64% versus 169%
  • DeviceSelect::Flagged: 1.5% versus 91%
  • DevicePartition::If: 18.9% versus 30.95%
  • DevicePartition::Flagged: 11.8% versus 36.9%

Select.If

| | Diff i32 vs i32.main (any num items) | Diff i32 vs i32.main (2^28 num items) | Diff i64 vs i32.main (any num items) | Diff i64 vs i32.main (2^28 num items) |
| --- | --- | --- | --- | --- |
| min | -3.84% | -0.29% | -3.37% | -1.97% |
| max | 2.46% | 0.41% | 8.27% | 4.64% |
| avg | -0.30% | 0.06% | 0.81% | 0.85% |

Select.Flagged

| | Diff i32 vs i32.main (any num items) | Diff i32 vs i32.main (2^28 num items) | Diff i64 vs i32.main (any num items) | Diff i64 vs i32.main (2^28 num items) |
| --- | --- | --- | --- | --- |
| min | -2.37% | -0.03% | -3.91% | -0.31% |
| max | 2.59% | 0.15% | 3.74% | 1.51% |
| avg | 0.05% | 0.03% | 0.71% | 0.40% |

Select.Unique

| | Diff i32 vs i32.main (any num items) | Diff i32 vs i32.main (2^28 num items) | Diff i64 vs i32.main (any num items) | Diff i64 vs i32.main (2^28 num items) |
| --- | --- | --- | --- | --- |
| min | -3.95% | -0.35% | -20.17% | -20.17% |
| max | 4.20% | 0.49% | 10.61% | 5.62% |
| avg | 0.11% | 0.06% | 0.74% | -1.51% |

Partition.If

| | Diff i32 vs i32.main (any num items) | Diff i32 vs i32.main (2^28 num items) | Diff i64 vs i32.main (any num items) | Diff i64 vs i32.main (2^28 num items) |
| --- | --- | --- | --- | --- |
| min | -5.59% | -0.67% | -4.68% | -0.97% |
| max | 4.70% | 0.30% | 18.93% | 18.93% |
| avg | 0.23% | -0.02% | 3.18% | 2.43% |

Partition.Flagged

| | Diff i32 vs i32.main (any num items) | Diff i32 vs i32.main (2^28 num items) | Diff i64 vs i32.main (any num items) | Diff i64 vs i32.main (2^28 num items) |
| --- | --- | --- | --- | --- |
| min | -6.00% | -0.10% | -8.04% | -2.40% |
| max | 3.11% | 0.23% | 14.96% | 11.78% |
| avg | -0.29% | 0.01% | 2.68% | 3.08% |

Detailed benchmark results

H100 select.if
T{ct} OffsetT{ct} IsInPlace{ct} Elements{io} Entropy Ref Time Ref Noise Cmp Time Cmp Noise Diff %Diff Status
I8 I32 false 2^16 1 9.615 us 2.23% 9.756 us 1.95% 0.142 us 1.47% PASS
I8 I32 false 2^20 1 12.369 us 1.88% 12.387 us 2.03% 0.017 us 0.14% PASS
I8 I32 false 2^24 1 51.682 us 1.28% 51.498 us 1.26% -0.184 us -0.36% PASS
I8 I32 false 2^28 1 693.828 us 0.50% 695.389 us 0.50% 1.561 us 0.23% PASS
I8 I32 false 2^16 0.544 10.134 us 1.87% 10.096 us 2.31% -0.038 us -0.37% PASS
I8 I32 false 2^20 0.544 12.299 us 2.24% 12.216 us 2.23% -0.083 us -0.67% PASS
I8 I32 false 2^24 0.544 48.397 us 1.18% 48.352 us 1.13% -0.044 us -0.09% PASS
I8 I32 false 2^28 0.544 638.261 us 0.50% 639.985 us 0.50% 1.724 us 0.27% PASS
I8 I32 false 2^16 0 9.894 us 2.15% 9.728 us 2.73% -0.166 us -1.68% PASS
I8 I32 false 2^20 0 12.024 us 1.91% 12.027 us 2.21% 0.004 us 0.03% PASS
I8 I32 false 2^24 0 42.395 us 1.16% 42.447 us 1.09% 0.053 us 0.12% PASS
I8 I32 false 2^28 0 525.106 us 0.51% 526.334 us 0.50% 1.228 us 0.23% PASS
I8 I64 false 2^16 1 9.655 us 2.75% 10.085 us 1.82% 0.430 us 4.46% FAIL
I8 I64 false 2^20 1 13.305 us 1.39% 12.691 us 2.39% -0.615 us -4.62% FAIL
I8 I64 false 2^24 1 71.144 us 0.56% 52.994 us 1.16% -18.150 us -25.51% FAIL
I8 I64 false 2^28 1 1.030 ms 0.26% 720.246 us 0.50% -310.104 us -30.10% FAIL
I8 I64 false 2^16 0.544 9.787 us 2.33% 10.081 us 2.04% 0.295 us 3.01% FAIL
I8 I64 false 2^20 0.544 13.228 us 1.48% 12.539 us 2.21% -0.690 us -5.21% FAIL
I8 I64 false 2^24 0.544 68.402 us 0.53% 50.107 us 1.04% -18.295 us -26.75% FAIL
I8 I64 false 2^28 0.544 979.551 us 0.29% 667.859 us 0.44% -311.692 us -31.82% FAIL
I8 I64 false 2^16 0 9.545 us 1.84% 9.729 us 1.81% 0.184 us 1.93% FAIL
I8 I64 false 2^20 0 13.040 us 1.67% 12.117 us 1.72% -0.923 us -7.08% FAIL
I8 I64 false 2^24 0 63.764 us 0.57% 43.668 us 1.04% -20.095 us -31.52% FAIL
I8 I64 false 2^28 0 880.409 us 0.50% 546.558 us 0.50% -333.850 us -37.92% FAIL
I16 I32 false 2^16 1 10.188 us 2.16% 10.091 us 2.17% -0.097 us -0.95% PASS
I16 I32 false 2^20 1 13.681 us 1.69% 13.293 us 1.53% -0.389 us -2.84% FAIL
I16 I32 false 2^24 1 63.912 us 3.31% 63.712 us 3.31% -0.200 us -0.31% PASS
I16 I32 false 2^28 1 879.192 us 1.37% 876.910 us 1.42% -2.282 us -0.26% PASS
I16 I32 false 2^16 0.544 10.050 us 1.83% 9.862 us 2.59% -0.189 us -1.88% FAIL
I16 I32 false 2^20 0.544 13.371 us 2.12% 13.277 us 1.93% -0.094 us -0.70% PASS
I16 I32 false 2^24 0.544 56.241 us 2.94% 56.252 us 2.97% 0.012 us 0.02% PASS
I16 I32 false 2^28 0.544 704.237 us 1.47% 703.930 us 1.59% -0.308 us -0.04% PASS
I16 I32 false 2^16 0 9.960 us 2.11% 9.851 us 2.11% -0.108 us -1.09% PASS
I16 I32 false 2^20 0 12.916 us 2.07% 12.726 us 1.67% -0.190 us -1.47% PASS
I16 I32 false 2^24 0 48.137 us 2.66% 47.944 us 2.68% -0.193 us -0.40% PASS
I16 I32 false 2^28 0 509.084 us 1.04% 507.599 us 1.07% -1.485 us -0.29% PASS
I16 I64 false 2^16 1 9.992 us 2.32% 10.357 us 2.13% 0.365 us 3.65% FAIL
I16 I64 false 2^20 1 13.909 us 1.46% 13.221 us 1.50% -0.688 us -4.95% FAIL
I16 I64 false 2^24 1 77.217 us 0.74% 64.186 us 3.31% -13.031 us -16.88% FAIL
I16 I64 false 2^28 1 1.108 ms 0.50% 880.682 us 1.62% -226.909 us -20.49% FAIL
I16 I64 false 2^16 0.544 10.086 us 2.59% 10.301 us 1.88% 0.215 us 2.13% FAIL
I16 I64 false 2^20 0.544 14.001 us 1.84% 13.278 us 1.60% -0.723 us -5.16% FAIL
I16 I64 false 2^24 0.544 75.116 us 0.65% 57.275 us 2.97% -17.841 us -23.75% FAIL
I16 I64 false 2^28 0.544 1.066 ms 0.50% 716.201 us 1.63% -349.492 us -32.79% FAIL
I16 I64 false 2^16 0 9.824 us 2.06% 10.133 us 1.93% 0.308 us 3.14% FAIL
I16 I64 false 2^20 0 13.607 us 2.04% 12.857 us 1.54% -0.750 us -5.51% FAIL
I16 I64 false 2^24 0 67.763 us 0.69% 49.303 us 2.80% -18.461 us -27.24% FAIL
I16 I64 false 2^28 0 924.306 us 0.50% 523.233 us 1.06% -401.073 us -43.39% FAIL
I32 I32 false 2^16 1 10.382 us 2.15% 10.275 us 1.69% -0.107 us -1.03% PASS
I32 I32 false 2^20 1 14.838 us 1.32% 14.837 us 1.42% -0.001 us -0.00% PASS
I32 I32 false 2^24 1 90.727 us 2.76% 90.766 us 2.73% 0.038 us 0.04% PASS
I32 I32 false 2^28 1 1.342 ms 0.97% 1.342 ms 0.96% 0.238 us 0.02% PASS
I32 I32 false 2^16 0.544 10.344 us 2.22% 10.329 us 2.18% -0.015 us -0.15% PASS
I32 I32 false 2^20 0.544 14.715 us 1.64% 14.605 us 2.15% -0.110 us -0.75% PASS
I32 I32 false 2^24 0.544 78.514 us 2.88% 78.510 us 2.87% -0.004 us -0.01% PASS
I32 I32 false 2^28 0.544 1.071 ms 1.11% 1.072 ms 1.10% 1.617 us 0.15% PASS
I32 I32 false 2^16 0 9.948 us 2.68% 9.950 us 1.95% 0.002 us 0.02% PASS
I32 I32 false 2^20 0 13.623 us 1.61% 13.662 us 1.52% 0.040 us 0.29% PASS
I32 I32 false 2^24 0 62.246 us 2.16% 62.345 us 2.15% 0.100 us 0.16% PASS
I32 I32 false 2^28 0 647.513 us 1.51% 650.172 us 1.63% 2.659 us 0.41% PASS
I32 I64 false 2^16 1 9.697 us 2.26% 10.443 us 1.82% 0.746 us 7.69% FAIL
I32 I64 false 2^20 1 15.087 us 1.67% 14.762 us 1.70% -0.325 us -2.15% FAIL
I32 I64 false 2^24 1 100.485 us 1.45% 89.800 us 2.84% -10.685 us -10.63% FAIL
I32 I64 false 2^28 1 1.501 ms 0.65% 1.327 ms 0.98% -173.847 us -11.58% FAIL
I32 I64 false 2^16 0.544 9.875 us 2.23% 10.490 us 1.75% 0.615 us 6.23% FAIL
I32 I64 false 2^20 0.544 14.882 us 1.34% 14.373 us 1.31% -0.509 us -3.42% FAIL
I32 I64 false 2^24 0.544 88.945 us 1.06% 77.898 us 2.84% -11.046 us -12.42% FAIL
I32 I64 false 2^28 0.544 1.250 ms 1.16% 1.068 ms 1.11% -181.766 us -14.54% FAIL
I32 I64 false 2^16 0 9.578 us 2.34% 9.917 us 2.03% 0.339 us 3.54% FAIL
I32 I64 false 2^20 0 14.588 us 1.39% 13.952 us 1.64% -0.636 us -4.36% FAIL
I32 I64 false 2^24 0 76.622 us 0.84% 63.028 us 2.22% -13.593 us -17.74% FAIL
I32 I64 false 2^28 0 971.795 us 0.50% 657.466 us 2.13% -314.328 us -32.35% FAIL
I64 I32 false 2^16 1 10.087 us 1.62% 10.143 us 1.75% 0.056 us 0.56% PASS
I64 I32 false 2^20 1 18.297 us 1.73% 18.462 us 1.33% 0.165 us 0.90% PASS
I64 I32 false 2^24 1 166.099 us 2.00% 166.365 us 2.00% 0.266 us 0.16% PASS
I64 I32 false 2^28 1 2.561 ms 0.54% 2.565 ms 0.54% 3.934 us 0.15% PASS
I64 I32 false 2^16 0.544 10.366 us 2.50% 10.133 us 1.89% -0.233 us -2.25% FAIL
I64 I32 false 2^20 0.544 17.783 us 1.36% 17.842 us 1.40% 0.059 us 0.33% PASS
I64 I32 false 2^24 0.544 137.691 us 2.70% 137.958 us 2.66% 0.266 us 0.19% PASS
I64 I32 false 2^28 0.544 2.027 ms 0.76% 2.030 ms 0.79% 3.105 us 0.15% PASS
I64 I32 false 2^16 0 9.789 us 1.70% 9.847 us 2.69% 0.058 us 0.59% PASS
I64 I32 false 2^20 0 16.812 us 1.23% 16.873 us 1.23% 0.061 us 0.36% PASS
I64 I32 false 2^24 0 98.824 us 2.12% 98.988 us 2.14% 0.164 us 0.17% PASS
I64 I32 false 2^28 0 1.231 ms 0.90% 1.232 ms 0.86% 1.250 us 0.10% PASS
I64 I64 false 2^16 1 10.367 us 2.78% 10.342 us 2.41% -0.025 us -0.24% PASS
I64 I64 false 2^20 1 20.541 us 2.73% 18.714 us 1.63% -1.827 us -8.89% FAIL
I64 I64 false 2^24 1 182.687 us 1.47% 166.276 us 2.04% -16.410 us -8.98% FAIL
I64 I64 false 2^28 1 2.817 ms 0.39% 2.550 ms 0.57% -266.268 us -9.45% FAIL
I64 I64 false 2^16 0.544 10.222 us 1.83% 10.339 us 2.72% 0.118 us 1.15% PASS
I64 I64 false 2^20 0.544 19.929 us 2.50% 18.202 us 1.37% -1.727 us -8.67% FAIL
I64 I64 false 2^24 0.544 153.464 us 1.33% 137.986 us 2.77% -15.478 us -10.09% FAIL
I64 I64 false 2^28 0.544 2.287 ms 0.44% 2.020 ms 0.73% -266.897 us -11.67% FAIL
I64 I64 false 2^16 0 10.144 us 1.64% 9.892 us 2.95% -0.251 us -2.48% FAIL
I64 I64 false 2^20 0 19.226 us 2.44% 16.995 us 1.16% -2.231 us -11.60% FAIL
I64 I64 false 2^24 0 121.284 us 0.96% 99.785 us 2.35% -21.499 us -17.73% FAIL
I64 I64 false 2^28 0 1.621 ms 0.50% 1.238 ms 0.90% -382.691 us -23.61% FAIL
I128 I32 false 2^16 1 11.583 us 2.04% 11.566 us 2.04% -0.017 us -0.15% PASS
I128 I32 false 2^20 1 31.011 us 2.39% 30.768 us 2.26% -0.243 us -0.78% PASS
I128 I32 false 2^24 1 343.806 us 1.37% 343.900 us 1.34% 0.094 us 0.03% PASS
I128 I32 false 2^28 1 5.357 ms 0.39% 5.357 ms 0.37% -0.333 us -0.01% PASS
I128 I32 false 2^16 0.544 11.592 us 2.15% 11.247 us 2.04% -0.345 us -2.98% FAIL
I128 I32 false 2^20 0.544 28.548 us 2.17% 28.362 us 2.23% -0.186 us -0.65% PASS
I128 I32 false 2^24 0.544 280.557 us 1.40% 280.092 us 1.45% -0.465 us -0.17% PASS
I128 I32 false 2^28 0.544 4.281 ms 0.41% 4.280 ms 0.38% -1.637 us -0.04% PASS
I128 I32 false 2^16 0 10.743 us 2.57% 10.549 us 1.98% -0.195 us -1.81% PASS
I128 I32 false 2^20 0 25.425 us 1.59% 25.017 us 1.67% -0.408 us -1.60% FAIL
I128 I32 false 2^24 0 182.875 us 1.10% 182.701 us 1.05% -0.174 us -0.09% PASS
I128 I32 false 2^28 0 2.516 ms 0.27% 2.518 ms 0.27% 2.180 us 0.09% PASS
I128 I64 false 2^16 1 11.910 us 1.89% 11.454 us 2.02% -0.456 us -3.83% FAIL
I128 I64 false 2^20 1 33.251 us 2.25% 30.489 us 2.41% -2.762 us -8.31% FAIL
I128 I64 false 2^24 1 362.310 us 1.11% 342.496 us 1.34% -19.814 us -5.47% FAIL
I128 I64 false 2^28 1 5.699 ms 0.31% 5.339 ms 0.36% -360.036 us -6.32% FAIL
I128 I64 false 2^16 0.544 11.616 us 2.32% 11.327 us 1.98% -0.289 us -2.49% FAIL
I128 I64 false 2^20 0.544 30.883 us 2.25% 28.584 us 2.25% -2.300 us -7.45% FAIL
I128 I64 false 2^24 0.544 295.823 us 1.05% 280.570 us 1.47% -15.253 us -5.16% FAIL
I128 I64 false 2^28 0.544 4.611 ms 0.25% 4.284 ms 0.39% -326.787 us -7.09% FAIL
I128 I64 false 2^16 0 11.377 us 1.46% 10.443 us 2.20% -0.934 us -8.21% FAIL
I128 I64 false 2^20 0 28.637 us 1.85% 25.352 us 1.59% -3.285 us -11.47% FAIL
I128 I64 false 2^24 0 233.011 us 0.48% 183.262 us 1.08% -49.749 us -21.35% FAIL
I128 I64 false 2^28 0 3.420 ms 0.09% 2.523 ms 0.27% -897.169 us -26.23% FAIL
F32 I32 false 2^16 1 10.390 us 2.13% 9.991 us 2.15% -0.399 us -3.84% FAIL
F32 I32 false 2^20 1 14.942 us 2.13% 15.157 us 1.40% 0.215 us 1.44% FAIL
F32 I32 false 2^24 1 90.789 us 2.80% 91.176 us 2.79% 0.387 us 0.43% PASS
F32 I32 false 2^28 1 1.366 ms 0.98% 1.367 ms 0.94% 1.122 us 0.08% PASS
F32 I32 false 2^16 0.544 9.705 us 2.45% 9.918 us 2.16% 0.213 us 2.19% FAIL
F32 I32 false 2^20 0.544 13.853 us 1.55% 14.194 us 1.57% 0.341 us 2.46% FAIL
F32 I32 false 2^24 0.544 64.509 us 2.27% 64.461 us 2.23% -0.048 us -0.07% PASS
F32 I32 false 2^28 0.544 771.832 us 1.06% 772.142 us 1.05% 0.310 us 0.04% PASS
F32 I32 false 2^16 0 9.891 us 1.91% 10.084 us 2.30% 0.193 us 1.95% FAIL
F32 I32 false 2^20 0 13.818 us 1.74% 14.138 us 1.66% 0.320 us 2.32% FAIL
F32 I32 false 2^24 0 63.206 us 2.12% 62.700 us 2.08% -0.505 us -0.80% PASS
F32 I32 false 2^28 0 648.461 us 1.47% 648.896 us 1.50% 0.435 us 0.07% PASS
F32 I64 false 2^16 1 9.842 us 2.16% 10.650 us 2.35% 0.807 us 8.20% FAIL
F32 I64 false 2^20 1 15.107 us 1.47% 14.944 us 1.53% -0.163 us -1.08% PASS
F32 I64 false 2^24 1 100.245 us 1.55% 90.145 us 2.75% -10.100 us -10.08% FAIL
F32 I64 false 2^28 1 1.505 ms 0.66% 1.339 ms 0.97% -165.525 us -11.00% FAIL
F32 I64 false 2^16 0.544 9.731 us 2.18% 10.508 us 1.52% 0.777 us 7.99% FAIL
F32 I64 false 2^20 0.544 14.849 us 1.68% 14.830 us 1.34% -0.019 us -0.13% PASS
F32 I64 false 2^24 0.544 79.124 us 0.93% 64.997 us 2.26% -14.127 us -17.85% FAIL
F32 I64 false 2^28 0.544 1.045 ms 0.50% 774.265 us 1.07% -270.347 us -25.88% FAIL
F32 I64 false 2^16 0 9.756 us 2.28% 10.183 us 2.21% 0.427 us 4.38% FAIL
F32 I64 false 2^20 0 14.739 us 1.22% 14.244 us 1.49% -0.495 us -3.36% FAIL
F32 I64 false 2^24 0 76.771 us 0.82% 63.433 us 2.24% -13.338 us -17.37% FAIL
F32 I64 false 2^28 0 973.874 us 0.50% 654.323 us 1.48% -319.551 us -32.81% FAIL
F64 I32 false 2^16 1 10.543 us 1.55% 10.282 us 1.84% -0.260 us -2.47% FAIL
F64 I32 false 2^20 1 18.402 us 1.72% 18.114 us 1.19% -0.287 us -1.56% FAIL
F64 I32 false 2^24 1 166.407 us 1.99% 165.972 us 2.07% -0.435 us -0.26% PASS
F64 I32 false 2^28 1 2.560 ms 0.54% 2.560 ms 0.52% 0.283 us 0.01% PASS
F64 I32 false 2^16 0.544 10.043 us 1.83% 9.754 us 2.14% -0.289 us -2.88% FAIL
F64 I32 false 2^20 0.544 17.167 us 1.44% 16.839 us 1.46% -0.329 us -1.91% FAIL
F64 I32 false 2^24 0.544 108.678 us 2.69% 108.116 us 2.72% -0.562 us -0.52% PASS
F64 I32 false 2^28 0.544 1.467 ms 0.82% 1.467 ms 0.83% 0.071 us 0.00% PASS
F64 I32 false 2^16 0 9.912 us 2.26% 9.675 us 2.18% -0.237 us -2.39% FAIL
F64 I32 false 2^20 0 16.795 us 1.35% 16.669 us 1.47% -0.126 us -0.75% PASS
F64 I32 false 2^24 0 98.651 us 2.09% 98.460 us 2.09% -0.191 us -0.19% PASS
F64 I32 false 2^28 0 1.230 ms 0.90% 1.229 ms 0.91% -0.443 us -0.04% PASS
F64 I64 false 2^16 1 10.615 us 2.65% 10.343 us 2.55% -0.272 us -2.56% FAIL
F64 I64 false 2^20 1 20.607 us 2.98% 18.308 us 1.47% -2.299 us -11.16% FAIL
F64 I64 false 2^24 1 183.174 us 1.47% 165.650 us 2.05% -17.524 us -9.57% FAIL
F64 I64 false 2^28 1 2.820 ms 0.38% 2.548 ms 0.50% -272.545 us -9.66% FAIL
F64 I64 false 2^16 0.544 10.274 us 2.97% 9.886 us 2.20% -0.388 us -3.77% FAIL
F64 I64 false 2^20 0.544 19.481 us 2.37% 17.259 us 1.30% -2.223 us -11.41% FAIL
F64 I64 false 2^24 0.544 129.070 us 1.15% 110.843 us 2.80% -18.228 us -14.12% FAIL
F64 I64 false 2^28 0.544 1.781 ms 1.33% 1.490 ms 0.89% -291.622 us -16.37% FAIL
F64 I64 false 2^16 0 9.913 us 2.12% 9.890 us 1.59% -0.022 us -0.23% PASS
F64 I64 false 2^20 0 19.201 us 2.19% 16.929 us 1.65% -2.272 us -11.83% FAIL
F64 I64 false 2^24 0 120.914 us 0.94% 99.418 us 2.31% -21.496 us -17.78% FAIL
F64 I64 false 2^28 0 1.616 ms 0.55% 1.234 ms 0.92% -381.272 us -23.60% FAIL
H100 select.flagged
T{ct} OffsetT{ct} IsInPlace{ct} Elements{io} Entropy Ref Time Ref Noise Cmp Time Cmp Noise Diff %Diff Status
I8 I32 false 2^16 1 10.642 us 2.22% 10.512 us 1.88% -0.130 us -1.22% PASS
I8 I32 false 2^20 1 13.225 us 1.64% 13.079 us 1.32% -0.146 us -1.10% PASS
I8 I32 false 2^24 1 60.702 us 1.51% 60.794 us 1.53% 0.092 us 0.15% PASS
I8 I32 false 2^28 1 790.258 us 0.50% 790.413 us 0.50% 0.155 us 0.02% PASS
I8 I32 false 2^16 0.544 10.673 us 2.31% 10.720 us 1.81% 0.047 us 0.44% PASS
I8 I32 false 2^20 0.544 13.317 us 1.65% 13.263 us 1.68% -0.054 us -0.41% PASS
I8 I32 false 2^24 0.544 59.003 us 1.38% 59.017 us 1.51% 0.015 us 0.02% PASS
I8 I32 false 2^28 0.544 765.974 us 0.50% 766.285 us 0.50% 0.311 us 0.04% PASS
I8 I32 false 2^16 0 10.300 us 2.30% 10.310 us 2.68% 0.010 us 0.10% PASS
I8 I32 false 2^20 0 12.845 us 2.00% 12.687 us 2.18% -0.159 us -1.23% PASS
I8 I32 false 2^24 0 52.267 us 1.62% 52.354 us 1.68% 0.086 us 0.17% PASS
I8 I32 false 2^28 0 626.874 us 0.25% 626.930 us 0.26% 0.056 us 0.01% PASS
I8 I64 false 2^16 1 9.858 us 1.82% 10.842 us 1.72% 0.984 us 9.98% FAIL
I8 I64 false 2^20 1 14.088 us 1.62% 13.490 us 2.02% -0.598 us -4.24% FAIL
I8 I64 false 2^24 1 77.259 us 0.61% 61.535 us 1.74% -15.724 us -20.35% FAIL
I8 I64 false 2^28 1 1.097 ms 0.31% 793.862 us 0.50% -302.658 us -27.60% FAIL
I8 I64 false 2^16 0.544 9.823 us 3.02% 10.673 us 2.15% 0.850 us 8.65% FAIL
I8 I64 false 2^20 0.544 14.196 us 1.68% 13.715 us 1.86% -0.480 us -3.38% FAIL
I8 I64 false 2^24 0.544 75.283 us 0.61% 60.351 us 1.75% -14.932 us -19.83% FAIL
I8 I64 false 2^28 0.544 1.062 ms 0.38% 777.499 us 0.50% -284.641 us -26.80% FAIL
I8 I64 false 2^16 0 9.612 us 1.94% 10.321 us 2.50% 0.709 us 7.38% FAIL
I8 I64 false 2^20 0 13.700 us 1.82% 12.743 us 1.88% -0.956 us -6.98% FAIL
I8 I64 false 2^24 0 69.928 us 0.69% 53.549 us 1.72% -16.380 us -23.42% FAIL
I8 I64 false 2^28 0 944.196 us 0.14% 631.125 us 0.31% -313.070 us -33.16% FAIL
I16 I32 false 2^16 1 11.057 us 1.87% 11.065 us 1.62% 0.008 us 0.07% PASS
I16 I32 false 2^20 1 14.143 us 1.34% 14.158 us 1.61% 0.015 us 0.11% PASS
I16 I32 false 2^24 1 78.279 us 2.11% 78.320 us 2.08% 0.041 us 0.05% PASS
I16 I32 false 2^28 1 991.335 us 0.94% 991.462 us 0.94% 0.126 us 0.01% PASS
I16 I32 false 2^16 0.544 10.925 us 1.79% 10.748 us 2.62% -0.176 us -1.61% PASS
I16 I32 false 2^20 0.544 14.612 us 1.44% 14.584 us 1.69% -0.027 us -0.19% PASS
I16 I32 false 2^24 0.544 73.370 us 2.13% 73.273 us 2.13% -0.097 us -0.13% PASS
I16 I32 false 2^28 0.544 905.254 us 0.69% 905.461 us 0.68% 0.207 us 0.02% PASS
I16 I32 false 2^16 0 10.374 us 1.40% 10.402 us 2.06% 0.028 us 0.27% PASS
I16 I32 false 2^20 0 13.607 us 1.77% 13.528 us 1.41% -0.079 us -0.58% PASS
I16 I32 false 2^24 0 64.793 us 1.97% 64.838 us 1.99% 0.045 us 0.07% PASS
I16 I32 false 2^28 0 681.464 us 0.28% 681.679 us 0.30% 0.215 us 0.03% PASS
I16 I64 false 2^16 1 9.638 us 2.14% 11.176 us 2.20% 1.538 us 15.95% FAIL
I16 I64 false 2^20 1 14.241 us 1.49% 14.322 us 1.47% 0.081 us 0.57% PASS
I16 I64 false 2^24 1 82.856 us 0.87% 79.151 us 2.07% -3.705 us -4.47% FAIL
I16 I64 false 2^28 1 1.161 ms 0.50% 993.472 us 0.99% -167.250 us -14.41% FAIL
I16 I64 false 2^16 0.544 9.697 us 1.57% 11.108 us 1.57% 1.411 us 14.55% FAIL
I16 I64 false 2^20 0.544 14.643 us 1.50% 14.876 us 1.41% 0.233 us 1.59% FAIL
I16 I64 false 2^24 0.544 80.738 us 0.79% 73.895 us 2.16% -6.843 us -8.48% FAIL
I16 I64 false 2^28 0.544 1.130 ms 0.50% 905.962 us 0.65% -224.089 us -19.83% FAIL
I16 I64 false 2^16 0 9.573 us 2.40% 10.746 us 2.37% 1.173 us 12.26% FAIL
I16 I64 false 2^20 0 14.089 us 1.59% 13.774 us 1.71% -0.315 us -2.23% FAIL
I16 I64 false 2^24 0 73.942 us 0.87% 66.095 us 1.95% -7.847 us -10.61% FAIL
I16 I64 false 2^28 0 978.560 us 0.15% 691.732 us 0.31% -286.828 us -29.31% FAIL
I32 I32 false 2^16 1 10.516 us 2.56% 10.551 us 2.00% 0.035 us 0.33% PASS
I32 I32 false 2^20 1 15.285 us 1.76% 15.589 us 1.46% 0.304 us 1.99% FAIL
I32 I32 false 2^24 1 104.666 us 1.41% 104.954 us 1.40% 0.287 us 0.27% PASS
I32 I32 false 2^28 1 1.511 ms 0.62% 1.512 ms 0.62% 0.500 us 0.03% PASS
I32 I32 false 2^16 0.544 10.437 us 2.62% 10.486 us 2.45% 0.049 us 0.47% PASS
I32 I32 false 2^20 0.544 15.461 us 1.71% 15.796 us 1.73% 0.335 us 2.17% FAIL
I32 I32 false 2^24 0.544 93.532 us 1.35% 93.976 us 1.33% 0.444 us 0.47% PASS
I32 I32 false 2^28 0.544 1.265 ms 0.75% 1.266 ms 0.67% 1.590 us 0.13% PASS
I32 I32 false 2^16 0 9.900 us 2.02% 10.156 us 1.96% 0.256 us 2.59% FAIL
I32 I32 false 2^20 0 15.046 us 1.76% 15.074 us 1.68% 0.029 us 0.19% PASS
I32 I32 false 2^24 0 78.658 us 1.37% 78.974 us 1.35% 0.317 us 0.40% PASS
I32 I32 false 2^28 0 846.775 us 0.32% 848.046 us 0.35% 1.270 us 0.15% PASS
I32 I64 false 2^16 1 10.240 us 2.12% 10.761 us 1.97% 0.520 us 5.08% FAIL
I32 I64 false 2^20 1 16.274 us 1.61% 15.857 us 1.66% -0.418 us -2.57% FAIL
I32 I64 false 2^24 1 110.168 us 1.27% 105.553 us 1.49% -4.615 us -4.19% FAIL
I32 I64 false 2^28 1 1.623 ms 0.56% 1.514 ms 0.59% -109.258 us -6.73% FAIL
I32 I64 false 2^16 0.544 10.143 us 1.75% 10.659 us 2.55% 0.516 us 5.09% FAIL
I32 I64 false 2^20 0.544 16.146 us 1.27% 15.816 us 1.66% -0.330 us -2.04% FAIL
I32 I64 false 2^24 0.544 98.427 us 1.06% 94.093 us 1.50% -4.335 us -4.40% FAIL
I32 I64 false 2^28 0.544 1.356 ms 0.62% 1.266 ms 0.71% -89.485 us -6.60% FAIL
I32 I64 false 2^16 0 9.938 us 2.61% 10.256 us 2.83% 0.319 us 3.21% FAIL
I32 I64 false 2^20 0 15.826 us 1.35% 15.384 us 1.74% -0.442 us -2.79% FAIL
I32 I64 false 2^24 0 84.242 us 0.91% 79.775 us 1.51% -4.467 us -5.30% FAIL
I32 I64 false 2^28 0 1.050 ms 0.18% 851.447 us 0.37% -198.948 us -18.94% FAIL
I64 I32 false 2^16 1 10.729 us 1.23% 10.711 us 2.28% -0.018 us -0.17% PASS
I64 I32 false 2^20 1 20.523 us 1.98% 20.713 us 1.89% 0.190 us 0.93% PASS
I64 I32 false 2^24 1 181.016 us 1.42% 181.368 us 1.40% 0.352 us 0.19% PASS
I64 I32 false 2^28 1 2.743 ms 0.41% 2.744 ms 0.40% 0.618 us 0.02% PASS
I64 I32 false 2^16 0.544 10.724 us 1.61% 10.705 us 1.68% -0.019 us -0.18% PASS
I64 I32 false 2^20 0.544 20.356 us 1.61% 20.328 us 1.57% -0.029 us -0.14% PASS
I64 I32 false 2^24 0.544 152.404 us 1.50% 152.650 us 1.51% 0.245 us 0.16% PASS
I64 I32 false 2^28 0.544 2.243 ms 0.65% 2.243 ms 0.63% -0.552 us -0.02% PASS
I64 I32 false 2^16 0 9.817 us 1.95% 9.689 us 2.17% -0.128 us -1.30% PASS
I64 I32 false 2^20 0 18.624 us 1.65% 18.599 us 1.64% -0.025 us -0.14% PASS
I64 I32 false 2^24 0 112.796 us 1.40% 112.704 us 1.38% -0.092 us -0.08% PASS
I64 I32 false 2^28 0 1.380 ms 0.40% 1.380 ms 0.41% -0.001 us -0.00% PASS
I64 I64 false 2^16 1 10.120 us 1.98% 10.561 us 1.88% 0.441 us 4.36% FAIL
I64 I64 false 2^20 1 20.437 us 1.91% 20.349 us 1.96% -0.088 us -0.43% PASS
I64 I64 false 2^24 1 191.133 us 1.43% 179.614 us 1.25% -11.518 us -6.03% FAIL
I64 I64 false 2^28 1 2.947 ms 0.40% 2.735 ms 0.38% -212.575 us -7.21% FAIL
I64 I64 false 2^16 0.544 10.031 us 1.98% 10.331 us 1.63% 0.300 us 2.99% FAIL
I64 I64 false 2^20 0.544 20.133 us 1.65% 19.976 us 1.66% -0.156 us -0.78% PASS
I64 I64 false 2^24 0.544 161.361 us 1.34% 151.804 us 1.49% -9.557 us -5.92% FAIL
I64 I64 false 2^28 0.544 2.414 ms 0.58% 2.241 ms 0.64% -172.494 us -7.15% FAIL
I64 I64 false 2^16 0 10.057 us 2.07% 9.868 us 2.06% -0.189 us -1.88% PASS
I64 I64 false 2^20 0 19.613 us 1.61% 18.769 us 1.79% -0.845 us -4.31% FAIL
I64 I64 false 2^24 0 126.745 us 1.05% 113.789 us 1.52% -12.956 us -10.22% FAIL
I64 I64 false 2^28 0 1.685 ms 0.18% 1.399 ms 0.44% -286.295 us -16.99% FAIL
I128 I32 false 2^16 1 10.861 us 2.04% 10.758 us 1.87% -0.103 us -0.95% PASS
I128 I32 false 2^20 1 30.212 us 2.49% 30.277 us 2.51% 0.065 us 0.21% PASS
I128 I32 false 2^24 1 344.317 us 1.40% 344.460 us 1.46% 0.143 us 0.04% PASS
I128 I32 false 2^28 1 5.399 ms 0.39% 5.397 ms 0.38% -1.795 us -0.03% PASS
I128 I32 false 2^16 0.544 11.041 us 2.56% 11.095 us 2.12% 0.054 us 0.49% PASS
I128 I32 false 2^20 0.544 28.082 us 2.55% 28.065 us 2.51% -0.017 us -0.06% PASS
I128 I32 false 2^24 0.544 278.520 us 1.53% 278.575 us 1.51% 0.055 us 0.02% PASS
I128 I32 false 2^28 0.544 4.268 ms 0.72% 4.267 ms 0.72% -0.465 us -0.01% PASS
I128 I32 false 2^16 0 10.728 us 1.77% 10.684 us 1.69% -0.044 us -0.41% PASS
I128 I32 false 2^20 0 25.798 us 1.49% 25.846 us 1.57% 0.048 us 0.19% PASS
I128 I32 false 2^24 0 183.919 us 1.04% 183.796 us 1.07% -0.123 us -0.07% PASS
I128 I32 false 2^28 0 2.586 ms 0.26% 2.586 ms 0.28% 0.728 us 0.03% PASS
I128 I64 false 2^16 1 11.776 us 2.15% 11.257 us 1.70% -0.519 us -4.41% FAIL
I128 I64 false 2^20 1 33.620 us 2.84% 30.475 us 2.49% -3.146 us -9.36% FAIL
I128 I64 false 2^24 1 380.930 us 1.00% 345.478 us 1.42% -35.452 us -9.31% FAIL
I128 I64 false 2^28 1 6.011 ms 0.24% 5.408 ms 0.39% -602.774 us -10.03% FAIL
I128 I64 false 2^16 0.544 11.698 us 2.33% 11.025 us 1.67% -0.673 us -5.75% FAIL
I128 I64 false 2^20 0.544 31.222 us 2.37% 28.318 us 2.35% -2.904 us -9.30% FAIL
I128 I64 false 2^24 0.544 314.758 us 0.81% 279.716 us 1.51% -35.042 us -11.13% FAIL
I128 I64 false 2^28 0.544 4.860 ms 0.61% 4.282 ms 0.72% -578.244 us -11.90% FAIL
I128 I64 false 2^16 0 11.466 us 1.80% 10.550 us 2.85% -0.916 us -7.99% FAIL
I128 I64 false 2^20 0 28.602 us 1.25% 25.985 us 1.53% -2.617 us -9.15% FAIL
I128 I64 false 2^24 0 253.403 us 0.36% 185.124 us 1.06% -68.278 us -26.94% FAIL
I128 I64 false 2^28 0 3.781 ms 0.07% 2.592 ms 0.29% -1188.596 us -31.44% FAIL
F32 I32 false 2^16 1 10.482 us 1.72% 10.453 us 1.81% -0.029 us -0.27% PASS
F32 I32 false 2^20 1 15.581 us 1.72% 15.616 us 1.90% 0.035 us 0.22% PASS
F32 I32 false 2^24 1 104.867 us 1.40% 105.006 us 1.44% 0.139 us 0.13% PASS
F32 I32 false 2^28 1 1.513 ms 0.59% 1.513 ms 0.62% -0.195 us -0.01% PASS
F32 I32 false 2^16 0.544 10.178 us 1.94% 10.372 us 2.10% 0.194 us 1.91% PASS
F32 I32 false 2^20 0.544 15.603 us 1.70% 15.433 us 1.60% -0.170 us -1.09% PASS
F32 I32 false 2^24 0.544 93.831 us 1.35% 93.691 us 1.36% -0.140 us -0.15% PASS
F32 I32 false 2^28 0.544 1.266 ms 0.68% 1.266 ms 0.67% 0.194 us 0.02% PASS
F32 I32 false 2^16 0 9.954 us 2.02% 9.965 us 1.75% 0.011 us 0.11% PASS
F32 I32 false 2^20 0 14.771 us 1.74% 14.753 us 1.70% -0.018 us -0.12% PASS
F32 I32 false 2^24 0 78.442 us 1.34% 78.600 us 1.39% 0.158 us 0.20% PASS
F32 I32 false 2^28 0 847.837 us 0.33% 847.678 us 0.31% -0.160 us -0.02% PASS
F32 I64 false 2^16 1 9.945 us 2.09% 10.486 us 2.28% 0.541 us 5.44% FAIL
F32 I64 false 2^20 1 16.030 us 1.41% 15.550 us 1.68% -0.480 us -3.00% FAIL
F32 I64 false 2^24 1 110.083 us 1.27% 104.860 us 1.48% -5.223 us -4.74% FAIL
F32 I64 false 2^28 1 1.623 ms 0.53% 1.513 ms 0.60% -110.116 us -6.78% FAIL
F32 I64 false 2^16 0.544 9.779 us 1.55% 10.434 us 2.49% 0.656 us 6.71% FAIL
F32 I64 false 2^20 0.544 15.927 us 1.29% 15.424 us 1.49% -0.503 us -3.16% FAIL
F32 I64 false 2^24 0.544 98.312 us 1.08% 93.433 us 1.54% -4.879 us -4.96% FAIL
F32 I64 false 2^28 0.544 1.356 ms 0.62% 1.266 ms 0.70% -90.307 us -6.66% FAIL
F32 I64 false 2^16 0 9.803 us 2.18% 9.980 us 2.27% 0.176 us 1.80% PASS
F32 I64 false 2^20 0 15.631 us 1.42% 14.850 us 1.70% -0.782 us -5.00% FAIL
F32 I64 false 2^24 0 84.088 us 0.85% 79.082 us 1.49% -5.006 us -5.95% FAIL
F32 I64 false 2^28 0 1.051 ms 0.17% 851.240 us 0.37% -199.573 us -18.99% FAIL
F64 I32 false 2^16 1 10.605 us 1.70% 10.639 us 2.13% 0.034 us 0.32% PASS
F64 I32 false 2^20 1 20.293 us 1.74% 20.282 us 1.76% -0.011 us -0.05% PASS
F64 I32 false 2^24 1 180.718 us 1.42% 180.745 us 1.42% 0.028 us 0.02% PASS
F64 I32 false 2^28 1 2.741 ms 0.37% 2.745 ms 0.39% 3.910 us 0.14% PASS
F64 I32 false 2^16 0.544 10.707 us 2.53% 10.454 us 1.84% -0.254 us -2.37% FAIL
F64 I32 false 2^20 0.544 19.786 us 1.56% 20.038 us 1.56% 0.252 us 1.28% PASS
F64 I32 false 2^24 0.544 152.112 us 1.53% 152.308 us 1.56% 0.197 us 0.13% PASS
F64 I32 false 2^28 0.544 2.244 ms 0.64% 2.243 ms 0.63% -0.311 us -0.01% PASS
F64 I32 false 2^16 0 9.937 us 2.22% 10.034 us 2.60% 0.096 us 0.97% PASS
F64 I32 false 2^20 0 18.443 us 1.69% 18.370 us 1.69% -0.073 us -0.40% PASS
F64 I32 false 2^24 0 112.667 us 1.38% 112.703 us 1.37% 0.036 us 0.03% PASS
F64 I32 false 2^28 0 1.381 ms 0.41% 1.381 ms 0.41% -0.110 us -0.01% PASS
F64 I64 false 2^16 1 10.359 us 2.34% 10.615 us 1.92% 0.256 us 2.47% FAIL
F64 I64 false 2^20 1 20.746 us 1.85% 20.438 us 1.96% -0.308 us -1.49% PASS
F64 I64 false 2^24 1 191.157 us 1.41% 179.715 us 1.32% -11.443 us -5.99% FAIL
F64 I64 false 2^28 1 2.948 ms 0.39% 2.735 ms 0.39% -213.533 us -7.24% FAIL
F64 I64 false 2^16 0.544 10.114 us 2.16% 10.289 us 1.81% 0.174 us 1.72% PASS
F64 I64 false 2^20 0.544 20.151 us 1.71% 20.049 us 1.86% -0.102 us -0.51% PASS
F64 I64 false 2^24 0.544 161.486 us 1.33% 151.786 us 1.46% -9.700 us -6.01% FAIL
F64 I64 false 2^28 0.544 2.414 ms 0.57% 2.242 ms 0.66% -171.599 us -7.11% FAIL
F64 I64 false 2^16 0 10.013 us 2.12% 10.188 us 1.98% 0.175 us 1.74% PASS
F64 I64 false 2^20 0 19.554 us 1.61% 19.009 us 1.82% -0.545 us -2.79% FAIL
F64 I64 false 2^24 0 126.766 us 1.07% 114.033 us 1.50% -12.732 us -10.04% FAIL
F64 I64 false 2^28 0 1.685 ms 0.18% 1.399 ms 0.44% -285.959 us -16.97% FAIL
H100 select.unique
OffsetT{ct} T{ct} IsInPlace{ct} Elements{io} MaxSegSize GPU Time (us) Ref Time Ref Noise Cmp Time Cmp Noise Cmp Time %Diff Status
I32 I8 FALSE 2^16 2^1 9.333 9.223 us 1.67% 9.333 us 1.92% 9.333 us 1.19% PASS
I32 I8 FALSE 2^20 2^1 12.328 12.053 us 1.44% 12.328 us 2.36% 12.328 us 2.28% FAIL
I32 I8 FALSE 2^24 2^1 59.963 59.445 us 0.67% 59.963 us 0.67% 59.963 us 0.87% FAIL
I32 I8 FALSE 2^28 2^1 832.33 829.746 us 0.42% 832.330 us 0.39% 832.330 us 0.31% PASS
I32 I8 FALSE 2^16 2^4 9.492 9.218 us 2.42% 9.492 us 1.77% 9.492 us 2.97% FAIL
I32 I8 FALSE 2^20 2^4 12.085 11.756 us 1.93% 12.085 us 1.89% 12.085 us 2.80% FAIL
I32 I8 FALSE 2^24 2^4 53.775 53.234 us 0.77% 53.775 us 0.77% 53.775 us 1.02% FAIL
I32 I8 FALSE 2^28 2^4 710.587 708.514 us 0.50% 710.587 us 0.50% 710.587 us 0.29% PASS
I32 I8 FALSE 2^16 2^8 9.593 9.244 us 2.15% 9.593 us 1.90% 9.593 us 3.77% FAIL
I32 I8 FALSE 2^20 2^8 11.862 11.459 us 2.05% 11.862 us 2.12% 11.862 us 3.51% FAIL
I32 I8 FALSE 2^24 2^8 53.205 52.736 us 0.78% 53.205 us 0.73% 53.205 us 0.89% FAIL
I32 I8 FALSE 2^28 2^8 699.52 697.433 us 0.48% 699.520 us 0.46% 699.520 us 0.30% PASS
I32 I8 TRUE 2^16 2^1 10.786 10.508 us 1.81% 10.786 us 2.10% 10.786 us 2.64% FAIL
I32 I8 TRUE 2^20 2^1 13.387 13.103 us 1.62% 13.387 us 2.37% 13.387 us 2.17% FAIL
I32 I8 TRUE 2^24 2^1 72.918 72.475 us 0.61% 72.918 us 0.65% 72.918 us 0.61% FAIL
I32 I8 TRUE 2^28 2^1 1041 1.036 ms 0.47% 1.041 ms 0.45% 1.041 ms 0.49% FAIL
I32 I8 TRUE 2^16 2^4 10.288 10.310 us 2.17% 10.288 us 2.11% 10.288 us -0.21% PASS
I32 I8 TRUE 2^20 2^4 12.735 12.767 us 1.76% 12.735 us 1.87% 12.735 us -0.25% PASS
I32 I8 TRUE 2^24 2^4 65.957 65.639 us 0.78% 65.957 us 0.76% 65.957 us 0.48% PASS
I32 I8 TRUE 2^28 2^4 912.316 908.079 us 0.50% 912.316 us 0.50% 912.316 us 0.47% PASS
I32 I8 TRUE 2^16 2^8 10.383 10.334 us 2.34% 10.383 us 2.01% 10.383 us 0.47% PASS
I32 I8 TRUE 2^20 2^8 12.785 12.754 us 1.82% 12.785 us 1.77% 12.785 us 0.24% PASS
I32 I8 TRUE 2^24 2^8 65.413 65.178 us 0.79% 65.413 us 0.80% 65.413 us 0.36% PASS
I32 I8 TRUE 2^28 2^8 894.493 891.029 us 0.50% 894.493 us 0.50% 894.493 us 0.39% PASS
I64 I8 FALSE 2^16 2^1 9.962 9.863 us 1.90% 9.962 us 2.03% 9.962 us 1.00% PASS
I64 I8 FALSE 2^20 2^1 12.093 13.197 us 1.19% 12.093 us 1.84% 12.093 us -8.37% FAIL
I64 I8 FALSE 2^24 2^1 51.366 69.805 us 0.50% 51.366 us 1.00% 51.366 us -26.42% FAIL
I64 I8 FALSE 2^28 2^1 686.141 1.012 ms 0.38% 686.141 us 0.50% 686.141 us -32.23% FAIL
I64 I8 FALSE 2^16 2^4 9.701 9.827 us 2.36% 9.701 us 2.35% 9.701 us -1.28% PASS
I64 I8 FALSE 2^20 2^4 12.339 12.948 us 1.58% 12.339 us 1.92% 12.339 us -4.70% FAIL
I64 I8 FALSE 2^24 2^4 44.541 65.654 us 0.54% 44.541 us 1.03% 44.541 us -32.16% FAIL
I64 I8 FALSE 2^28 2^4 566.333 924.168 us 0.49% 566.333 us 0.50% 566.333 us -38.72% FAIL
I64 I8 FALSE 2^16 2^8 9.997 9.669 us 2.40% 9.997 us 2.07% 9.997 us 3.39% FAIL
I64 I8 FALSE 2^20 2^8 11.858 12.927 us 1.53% 11.858 us 1.94% 11.858 us -8.27% FAIL
I64 I8 FALSE 2^24 2^8 46.086 64.735 us 0.51% 46.086 us 0.94% 46.086 us -28.81% FAIL
I64 I8 FALSE 2^28 2^8 585.084 900.683 us 0.42% 585.084 us 0.50% 585.084 us -35.04% FAIL
I64 I8 TRUE 2^16 2^1 10.871 10.911 us 1.92% 10.871 us 2.32% 10.871 us -0.37% PASS
I64 I8 TRUE 2^20 2^1 13.846 15.788 us 1.45% 13.846 us 1.77% 13.846 us -12.30% FAIL
I64 I8 TRUE 2^24 2^1 60.6 95.073 us 0.51% 60.600 us 0.88% 60.600 us -36.26% FAIL
I64 I8 TRUE 2^28 2^1 838.543 1.439 ms 0.39% 838.543 us 0.43% 838.543 us -41.74% FAIL
I64 I8 TRUE 2^16 2^4 10.418 10.621 us 1.93% 10.418 us 2.27% 10.418 us -1.91% PASS
I64 I8 TRUE 2^20 2^4 14.122 15.553 us 1.67% 14.122 us 2.06% 14.122 us -9.20% FAIL
I64 I8 TRUE 2^24 2^4 54.262 90.902 us 0.47% 54.262 us 0.82% 54.262 us -40.31% FAIL
I64 I8 TRUE 2^28 2^4 724.87 1.348 ms 0.42% 724.870 us 0.49% 724.870 us -46.23% FAIL
I64 I8 TRUE 2^16 2^8 10.656 10.403 us 2.53% 10.656 us 2.74% 10.656 us 2.43% PASS
I64 I8 TRUE 2^20 2^8 13.793 15.398 us 1.87% 13.793 us 1.96% 13.793 us -10.42% FAIL
I64 I8 TRUE 2^24 2^8 57.067 90.535 us 0.48% 57.067 us 0.73% 57.067 us -36.97% FAIL
I64 I8 TRUE 2^28 2^8 759.182 1.332 ms 0.28% 759.182 us 0.43% 759.182 us -43.01% FAIL
I32 I16 FALSE 2^16 2^1 9.859 10.030 us 1.78% 9.859 us 2.49% 9.859 us -1.71% PASS
I32 I16 FALSE 2^20 2^1 13.527 13.498 us 2.20% 13.527 us 1.81% 13.527 us 0.22% PASS
I32 I16 FALSE 2^24 2^1 58.823 58.892 us 2.98% 58.823 us 2.98% 58.823 us -0.12% PASS
I32 I16 FALSE 2^28 2^1 760.404 763.049 us 1.76% 760.404 us 1.74% 760.404 us -0.35% PASS
I32 I16 FALSE 2^16 2^4 9.908 9.957 us 2.22% 9.908 us 2.30% 9.908 us -0.49% PASS
I32 I16 FALSE 2^20 2^4 13.088 13.222 us 1.61% 13.088 us 2.23% 13.088 us -1.01% PASS
I32 I16 FALSE 2^24 2^4 51.252 51.422 us 2.54% 51.252 us 2.52% 51.252 us -0.33% PASS
I32 I16 FALSE 2^28 2^4 582.381 583.987 us 0.97% 582.381 us 0.96% 582.381 us -0.27% PASS
I32 I16 FALSE 2^16 2^8 9.735 9.719 us 2.66% 9.735 us 3.06% 9.735 us 0.17% PASS
I32 I16 FALSE 2^20 2^8 12.702 13.146 us 2.03% 12.702 us 1.89% 12.702 us -3.38% FAIL
I32 I16 FALSE 2^24 2^8 48.975 49.525 us 2.53% 48.975 us 2.57% 48.975 us -1.11% PASS
I32 I16 FALSE 2^28 2^8 535.998 536.911 us 0.93% 535.998 us 0.94% 535.998 us -0.17% PASS
I32 I16 TRUE 2^16 2^1 9.753 10.154 us 2.35% 9.753 us 2.24% 9.753 us -3.95% FAIL
I32 I16 TRUE 2^20 2^1 13.25 13.374 us 2.07% 13.250 us 2.19% 13.250 us -0.93% PASS
I32 I16 TRUE 2^24 2^1 58.583 58.789 us 3.00% 58.583 us 2.97% 58.583 us -0.35% PASS
I32 I16 TRUE 2^28 2^1 761.16 763.505 us 1.78% 761.160 us 1.80% 761.160 us -0.31% PASS
I32 I16 TRUE 2^16 2^4 9.693 9.947 us 2.67% 9.693 us 3.09% 9.693 us -2.56% PASS
I32 I16 TRUE 2^20 2^4 12.88 13.155 us 2.10% 12.880 us 1.67% 12.880 us -2.09% FAIL
I32 I16 TRUE 2^24 2^4 51.004 51.221 us 2.52% 51.004 us 2.53% 51.004 us -0.42% PASS
I32 I16 TRUE 2^28 2^4 583.492 583.958 us 1.01% 583.492 us 0.98% 583.492 us -0.08% PASS
I32 I16 TRUE 2^16 2^8 9.526 9.760 us 2.53% 9.526 us 2.22% 9.526 us -2.39% FAIL
I32 I16 TRUE 2^20 2^8 12.556 12.863 us 1.77% 12.556 us 1.77% 12.556 us -2.39% FAIL
I32 I16 TRUE 2^24 2^8 49.119 49.450 us 2.61% 49.119 us 2.58% 49.119 us -0.67% PASS
I32 I16 TRUE 2^28 2^8 536.408 536.782 us 0.91% 536.408 us 0.94% 536.408 us -0.07% PASS
I64 I16 FALSE 2^16 2^1 10.046 9.915 us 2.09% 10.046 us 2.18% 10.046 us 1.32% PASS
I64 I16 FALSE 2^20 2^1 13.157 14.042 us 1.87% 13.157 us 1.75% 13.157 us -6.31% FAIL
I64 I16 FALSE 2^24 2^1 59.483 77.191 us 0.65% 59.483 us 2.98% 59.483 us -22.94% FAIL
I64 I16 FALSE 2^28 2^1 774.309 1.101 ms 0.50% 774.309 us 1.81% 774.309 us -29.64% FAIL
I64 I16 FALSE 2^16 2^4 9.837 9.924 us 2.24% 9.837 us 2.40% 9.837 us -0.87% PASS
I64 I16 FALSE 2^20 2^4 13.554 13.873 us 1.61% 13.554 us 2.05% 13.554 us -2.30% FAIL
I64 I16 FALSE 2^24 2^4 51.446 71.013 us 0.64% 51.446 us 2.51% 51.446 us -27.55% FAIL
I64 I16 FALSE 2^28 2^4 589.338 985.858 us 0.50% 589.338 us 0.97% 589.338 us -40.22% FAIL
I64 I16 FALSE 2^16 2^8 10.135 9.786 us 2.00% 10.135 us 2.04% 10.135 us 3.57% FAIL
I64 I16 FALSE 2^20 2^8 12.965 13.831 us 1.85% 12.965 us 1.77% 12.965 us -6.26% FAIL
I64 I16 FALSE 2^24 2^8 51.444 69.675 us 0.64% 51.444 us 2.64% 51.444 us -26.17% FAIL
I64 I16 FALSE 2^28 2^8 567.098 948.194 us 0.50% 567.098 us 0.92% 567.098 us -40.19% FAIL
I64 I16 TRUE 2^16 2^1 10.066 10.745 us 2.03% 10.066 us 2.18% 10.066 us -6.32% FAIL
I64 I16 TRUE 2^20 2^1 13.132 16.089 us 1.59% 13.132 us 1.75% 13.132 us -18.38% FAIL
I64 I16 TRUE 2^24 2^1 59.459 103.600 us 0.59% 59.459 us 2.97% 59.459 us -42.61% FAIL
I64 I16 TRUE 2^28 2^1 774.021 1.544 ms 0.44% 774.021 us 1.83% 774.021 us -49.88% FAIL
I64 I16 TRUE 2^16 2^4 9.787 10.581 us 2.21% 9.787 us 2.46% 9.787 us -7.51% FAIL
I64 I16 TRUE 2^20 2^4 13.57 16.009 us 1.46% 13.570 us 1.75% 13.570 us -15.23% FAIL
I64 I16 TRUE 2^24 2^4 51.549 96.185 us 0.58% 51.549 us 2.54% 51.549 us -46.41% FAIL
I64 I16 TRUE 2^28 2^4 589.176 1.412 ms 0.47% 589.176 us 0.96% 589.176 us -58.28% FAIL
I64 I16 TRUE 2^16 2^8 10.212 10.539 us 2.07% 10.212 us 2.04% 10.212 us -3.10% FAIL
I64 I16 TRUE 2^20 2^8 12.965 15.943 us 1.71% 12.965 us 1.70% 12.965 us -18.68% FAIL
I64 I16 TRUE 2^24 2^8 51.418 95.192 us 0.64% 51.418 us 2.64% 51.418 us -45.99% FAIL
I64 I16 TRUE 2^28 2^8 566.825 1.378 ms 0.41% 566.825 us 0.90% 566.825 us -58.86% FAIL
I32 I32 FALSE 2^16 2^1 10.127 10.404 us 1.97% 10.127 us 2.14% 10.127 us -2.67% FAIL
I32 I32 FALSE 2^20 2^1 14.982 15.178 us 1.84% 14.982 us 1.77% 14.982 us -1.29% PASS
I32 I32 FALSE 2^24 2^1 83.709 83.857 us 2.67% 83.709 us 2.68% 83.709 us -0.18% PASS
I32 I32 FALSE 2^28 2^1 1160 1.163 ms 1.27% 1.160 ms 1.28% 1.160 ms -0.22% PASS
I32 I32 FALSE 2^16 2^4 9.507 9.603 us 2.14% 9.507 us 2.10% 9.507 us -1.00% PASS
I32 I32 FALSE 2^20 2^4 14.202 14.361 us 1.48% 14.202 us 1.53% 14.202 us -1.11% PASS
I32 I32 FALSE 2^24 2^4 66.532 66.589 us 2.21% 66.532 us 2.20% 66.532 us -0.09% PASS
I32 I32 FALSE 2^28 2^4 799.908 800.557 us 1.09% 799.908 us 1.11% 799.908 us -0.08% PASS
I32 I32 FALSE 2^16 2^8 9.453 9.521 us 2.12% 9.453 us 2.00% 9.453 us -0.72% PASS
I32 I32 FALSE 2^20 2^8 14.045 14.143 us 1.52% 14.045 us 1.48% 14.045 us -0.69% PASS
I32 I32 FALSE 2^24 2^8 64.458 64.547 us 2.16% 64.458 us 2.14% 64.458 us -0.14% PASS
I32 I32 FALSE 2^28 2^8 677.687 677.664 us 1.15% 677.687 us 1.11% 677.687 us 0.00% PASS
I32 I32 TRUE 2^16 2^1 9.867 9.857 us 2.02% 9.867 us 1.93% 9.867 us 0.10% PASS
I32 I32 TRUE 2^20 2^1 14.952 14.938 us 1.43% 14.952 us 1.56% 14.952 us 0.09% PASS
I32 I32 TRUE 2^24 2^1 83.575 83.695 us 2.67% 83.575 us 2.67% 83.575 us -0.14% PASS
I32 I32 TRUE 2^28 2^1 1161 1.162 ms 1.33% 1.161 ms 1.31% 1.161 ms -0.07% PASS
I32 I32 TRUE 2^16 2^4 9.615 9.584 us 2.00% 9.615 us 2.40% 9.615 us 0.32% PASS
I32 I32 TRUE 2^20 2^4 13.997 14.120 us 1.42% 13.997 us 1.43% 13.997 us -0.87% PASS
I32 I32 TRUE 2^24 2^4 66.583 66.533 us 2.22% 66.583 us 2.22% 66.583 us 0.08% PASS
I32 I32 TRUE 2^28 2^4 799.891 800.304 us 1.12% 799.891 us 1.08% 799.891 us -0.05% PASS
I32 I32 TRUE 2^16 2^8 9.94 9.865 us 1.89% 9.940 us 2.10% 9.940 us 0.76% PASS
I32 I32 TRUE 2^20 2^8 13.837 13.956 us 1.58% 13.837 us 1.52% 13.837 us -0.85% PASS
I32 I32 TRUE 2^24 2^8 64.523 64.617 us 2.18% 64.523 us 2.17% 64.523 us -0.15% PASS
I32 I32 TRUE 2^28 2^8 677.822 677.519 us 1.18% 677.822 us 1.14% 677.822 us 0.04% PASS
I64 I32 FALSE 2^16 2^1 10.663 9.882 us 2.19% 10.663 us 1.79% 10.663 us 7.91% FAIL
I64 I32 FALSE 2^20 2^1 14.869 15.239 us 1.38% 14.869 us 1.35% 14.869 us -2.43% FAIL
I64 I32 FALSE 2^24 2^1 83.916 92.946 us 1.28% 83.916 us 2.67% 83.916 us -9.72% FAIL
I64 I32 FALSE 2^28 2^1 1164 1.313 ms 0.88% 1.164 ms 1.28% 1.164 ms -11.32% FAIL
I64 I32 FALSE 2^16 2^4 10.419 9.807 us 2.36% 10.419 us 2.07% 10.419 us 6.24% FAIL
I64 I32 FALSE 2^20 2^4 14.456 15.006 us 1.47% 14.456 us 1.41% 14.456 us -3.67% FAIL
I64 I32 FALSE 2^24 2^4 67.406 80.391 us 0.82% 67.406 us 2.24% 67.406 us -16.15% FAIL
I64 I32 FALSE 2^28 2^4 806.936 1.066 ms 0.50% 806.936 us 1.06% 806.936 us -24.33% FAIL
I64 I32 FALSE 2^16 2^8 9.966 9.487 us 2.37% 9.966 us 2.64% 9.966 us 5.05% FAIL
I64 I32 FALSE 2^20 2^8 14.433 15.003 us 1.48% 14.433 us 1.50% 14.433 us -3.80% FAIL
I64 I32 FALSE 2^24 2^8 65.99 77.905 us 0.81% 65.990 us 2.38% 65.990 us -15.29% FAIL
I64 I32 FALSE 2^28 2^8 699.739 994.830 us 0.50% 699.739 us 1.14% 699.739 us -29.66% FAIL
I64 I32 TRUE 2^16 2^1 10.3 10.344 us 1.78% 10.300 us 1.84% 10.300 us -0.42% PASS
I64 I32 TRUE 2^20 2^1 15.097 17.382 us 1.50% 15.097 us 1.56% 15.097 us -13.15% FAIL
I64 I32 TRUE 2^24 2^1 83.956 116.842 us 0.81% 83.956 us 2.65% 83.956 us -28.15% FAIL
I64 I32 TRUE 2^28 2^1 1165 1.715 ms 0.50% 1.165 ms 1.23% 1.165 ms -32.11% FAIL
I64 I32 TRUE 2^16 2^4 10.235 10.210 us 1.97% 10.235 us 1.91% 10.235 us 0.25% PASS
I64 I32 TRUE 2^20 2^4 14.365 17.128 us 1.38% 14.365 us 1.39% 14.365 us -16.13% FAIL
I64 I32 TRUE 2^24 2^4 67.264 105.799 us 0.75% 67.264 us 2.22% 67.264 us -36.42% FAIL
I64 I32 TRUE 2^28 2^4 807.155 1.506 ms 0.50% 807.155 us 1.07% 807.155 us -46.40% FAIL
I64 I32 TRUE 2^16 2^8 9.802 10.229 us 2.04% 9.802 us 2.27% 9.802 us -4.17% FAIL
I64 I32 TRUE 2^20 2^8 14.433 17.053 us 1.28% 14.433 us 1.17% 14.433 us -15.36% FAIL
I64 I32 TRUE 2^24 2^8 65.891 103.375 us 0.78% 65.891 us 2.33% 65.891 us -36.26% FAIL
I64 I32 TRUE 2^28 2^8 699.537 1.442 ms 0.31% 699.537 us 1.10% 699.537 us -51.47% FAIL
I32 I64 FALSE 2^16 2^1 10.191 10.468 us 1.70% 10.191 us 1.90% 10.191 us -2.65% FAIL
I32 I64 FALSE 2^20 2^1 18.44 18.714 us 1.47% 18.440 us 1.30% 18.440 us -1.47% FAIL
I32 I64 FALSE 2^24 2^1 148.263 148.710 us 2.47% 148.263 us 2.45% 148.263 us -0.30% PASS
I32 I64 FALSE 2^28 2^1 2201 2.202 ms 1.06% 2.201 ms 1.05% 2.201 ms -0.07% PASS
I32 I64 FALSE 2^16 2^4 9.67 9.768 us 2.17% 9.670 us 2.13% 9.670 us -1.00% PASS
I32 I64 FALSE 2^20 2^4 17.198 17.337 us 1.55% 17.198 us 1.51% 17.198 us -0.81% PASS
I32 I64 FALSE 2^24 2^4 108.961 109.136 us 2.13% 108.961 us 2.14% 108.961 us -0.16% PASS
I32 I64 FALSE 2^28 2^4 1498 1.498 ms 0.79% 1.498 ms 0.76% 1.498 ms 0.01% PASS
I32 I64 FALSE 2^16 2^8 9.581 9.885 us 2.15% 9.581 us 1.99% 9.581 us -3.08% FAIL
I32 I64 FALSE 2^20 2^8 16.68 16.903 us 1.08% 16.680 us 1.31% 16.680 us -1.32% FAIL
I32 I64 FALSE 2^24 2^8 100.282 100.337 us 1.69% 100.282 us 1.67% 100.282 us -0.06% PASS
I32 I64 FALSE 2^28 2^8 1289 1.289 ms 0.67% 1.289 ms 0.61% 1.289 ms 0.03% PASS
I32 I64 TRUE 2^16 2^1 10.229 10.369 us 1.84% 10.229 us 1.81% 10.229 us -1.35% PASS
I32 I64 TRUE 2^20 2^1 18.301 18.526 us 1.77% 18.301 us 1.34% 18.301 us -1.22% PASS
I32 I64 TRUE 2^24 2^1 148.435 148.669 us 2.47% 148.435 us 2.48% 148.435 us -0.16% PASS
I32 I64 TRUE 2^28 2^1 2202 2.202 ms 1.04% 2.202 ms 1.06% 2.202 ms 0.01% PASS
I32 I64 TRUE 2^16 2^4 9.81 10.092 us 1.61% 9.810 us 1.96% 9.810 us -2.79% FAIL
I32 I64 TRUE 2^20 2^4 16.797 17.125 us 1.56% 16.797 us 1.19% 16.797 us -1.91% FAIL
I32 I64 TRUE 2^24 2^4 108.959 109.133 us 2.13% 108.959 us 2.07% 108.959 us -0.16% PASS
I32 I64 TRUE 2^28 2^4 1498 1.497 ms 0.78% 1.498 ms 0.77% 1.498 ms 0.05% PASS
I32 I64 TRUE 2^16 2^8 9.936 9.661 us 2.12% 9.936 us 1.61% 9.936 us 2.85% FAIL
I32 I64 TRUE 2^20 2^8 16.868 16.485 us 1.23% 16.868 us 1.23% 16.868 us 2.32% FAIL
I32 I64 TRUE 2^24 2^8 100.662 100.255 us 1.68% 100.662 us 1.70% 100.662 us 0.41% PASS
I32 I64 TRUE 2^28 2^8 1289 1.288 ms 0.61% 1.289 ms 0.67% 1.289 ms 0.03% PASS
I64 I64 FALSE 2^16 2^1 10.791 10.234 us 1.78% 10.791 us 2.35% 10.791 us 5.44% FAIL
I64 I64 FALSE 2^20 2^1 18.516 19.849 us 2.05% 18.516 us 1.77% 18.516 us -6.71% FAIL
I64 I64 FALSE 2^24 2^1 148.794 163.405 us 1.46% 148.794 us 2.52% 148.794 us -8.94% FAIL
I64 I64 FALSE 2^28 2^1 2201 2.463 ms 0.71% 2.201 ms 1.04% 2.201 ms -10.65% FAIL
I64 I64 FALSE 2^16 2^4 10.491 10.080 us 1.74% 10.491 us 2.47% 10.491 us 4.08% FAIL
I64 I64 FALSE 2^20 2^4 17.507 18.984 us 1.84% 17.507 us 1.43% 17.507 us -7.78% FAIL
I64 I64 FALSE 2^24 2^4 109.814 131.272 us 1.12% 109.814 us 2.22% 109.814 us -16.35% FAIL
I64 I64 FALSE 2^28 2^4 1505 1.821 ms 0.50% 1.505 ms 0.79% 1.505 ms -17.33% FAIL
I64 I64 FALSE 2^16 2^8 10.181 10.069 us 1.66% 10.181 us 2.48% 10.181 us 1.11% PASS
I64 I64 FALSE 2^20 2^8 16.975 18.782 us 1.71% 16.975 us 1.59% 16.975 us -9.62% FAIL
I64 I64 FALSE 2^24 2^8 102.454 123.054 us 0.89% 102.454 us 2.11% 102.454 us -16.74% FAIL
I64 I64 FALSE 2^28 2^8 1306 1.659 ms 0.16% 1.306 ms 0.63% 1.306 ms -21.28% FAIL
I64 I64 TRUE 2^16 2^1 10.83 11.161 us 1.72% 10.830 us 2.16% 10.830 us -2.96% FAIL
I64 I64 TRUE 2^20 2^1 18.491 22.630 us 1.31% 18.491 us 1.13% 18.491 us -18.29% FAIL
I64 I64 TRUE 2^24 2^1 148.663 196.722 us 0.69% 148.663 us 2.56% 148.663 us -24.43% FAIL
I64 I64 TRUE 2^28 2^1 2202 2.943 ms 0.50% 2.202 ms 1.06% 2.202 ms -25.18% FAIL
I64 I64 TRUE 2^16 2^4 10.463 10.979 us 1.51% 10.463 us 1.88% 10.463 us -4.70% FAIL
I64 I64 TRUE 2^20 2^4 17.648 21.901 us 1.28% 17.648 us 1.44% 17.648 us -19.42% FAIL
I64 I64 TRUE 2^24 2^4 109.67 173.008 us 0.62% 109.670 us 2.16% 109.670 us -36.61% FAIL
I64 I64 TRUE 2^28 2^4 1505 2.537 ms 0.15% 1.505 ms 0.79% 1.505 ms -40.68% FAIL
I64 I64 TRUE 2^16 2^8 10.098 10.813 us 1.88% 10.098 us 1.56% 10.098 us -6.62% FAIL
I64 I64 TRUE 2^20 2^8 17.193 21.634 us 1.22% 17.193 us 1.57% 17.193 us -20.53% FAIL
I64 I64 TRUE 2^24 2^8 102.407 165.594 us 0.59% 102.407 us 2.05% 102.407 us -38.16% FAIL
I64 I64 TRUE 2^28 2^8 1306 2.396 ms 0.12% 1.306 ms 0.63% 1.306 ms -45.48% FAIL
I32 I128 FALSE 2^16 2^1 11.283 11.144 us 2.06% 11.283 us 2.31% 11.283 us 1.25% PASS
I32 I128 FALSE 2^20 2^1 29.451 29.101 us 2.14% 29.451 us 2.22% 29.451 us 1.20% PASS
I32 I128 FALSE 2^24 2^1 299.321 299.134 us 1.47% 299.321 us 1.48% 299.321 us 0.06% PASS
I32 I128 FALSE 2^28 2^1 4603 4.595 ms 0.60% 4.603 ms 0.69% 4.603 ms 0.18% PASS
I32 I128 FALSE 2^16 2^4 10.768 10.334 us 2.19% 10.768 us 2.09% 10.768 us 4.20% FAIL
I32 I128 FALSE 2^20 2^4 26.979 26.546 us 2.68% 26.979 us 2.42% 26.979 us 1.63% PASS
I32 I128 FALSE 2^24 2^4 215.357 214.937 us 1.33% 215.357 us 1.34% 215.357 us 0.20% PASS
I32 I128 FALSE 2^28 2^4 3183 3.183 ms 0.28% 3.183 ms 0.30% 3.183 ms 0.01% PASS
I32 I128 FALSE 2^16 2^8 10.411 10.218 us 2.15% 10.411 us 2.36% 10.411 us 1.89% PASS
I32 I128 FALSE 2^20 2^8 26.204 25.935 us 2.32% 26.204 us 2.17% 26.204 us 1.04% PASS
I32 I128 FALSE 2^24 2^8 188.378 188.171 us 1.19% 188.378 us 1.18% 188.378 us 0.11% PASS
I32 I128 FALSE 2^28 2^8 2617 2.615 ms 0.31% 2.617 ms 0.26% 2.617 ms 0.06% PASS
I32 I128 TRUE 2^16 2^1 11.94 11.724 us 1.92% 11.940 us 2.06% 11.940 us 1.85% PASS
I32 I128 TRUE 2^20 2^1 31.008 30.604 us 1.85% 31.008 us 1.88% 31.008 us 1.32% PASS
I32 I128 TRUE 2^24 2^1 310.191 309.659 us 1.20% 310.191 us 1.15% 310.191 us 0.17% PASS
I32 I128 TRUE 2^28 2^1 4800 4.790 ms 0.50% 4.800 ms 0.57% 4.800 ms 0.21% PASS
I32 I128 TRUE 2^16 2^4 11.263 11.078 us 1.83% 11.263 us 2.15% 11.263 us 1.68% PASS
I32 I128 TRUE 2^20 2^4 28.48 28.077 us 1.90% 28.480 us 1.97% 28.480 us 1.44% PASS
I32 I128 TRUE 2^24 2^4 227.577 226.878 us 1.03% 227.577 us 1.02% 227.577 us 0.31% PASS
I32 I128 TRUE 2^28 2^4 3311 3.308 ms 0.26% 3.311 ms 0.27% 3.311 ms 0.10% PASS
I32 I128 TRUE 2^16 2^8 11.349 11.139 us 2.22% 11.349 us 1.90% 11.349 us 1.89% PASS
I32 I128 TRUE 2^20 2^8 27.492 27.150 us 1.82% 27.492 us 2.05% 27.492 us 1.26% PASS
I32 I128 TRUE 2^24 2^8 204.807 204.161 us 0.93% 204.807 us 0.94% 204.807 us 0.32% PASS
I32 I128 TRUE 2^28 2^8 2895 2.892 ms 0.22% 2.895 ms 0.22% 2.895 ms 0.11% PASS
I64 I128 FALSE 2^16 2^1 11.72 11.488 us 1.76% 11.720 us 2.35% 11.720 us 2.02% FAIL
I64 I128 FALSE 2^20 2^1 29.233 30.824 us 2.54% 29.233 us 2.29% 29.233 us -5.16% FAIL
I64 I128 FALSE 2^24 2^1 299.031 323.681 us 0.94% 299.031 us 1.44% 299.031 us -7.62% FAIL
I64 I128 FALSE 2^28 2^1 4598 5.089 ms 0.50% 4.598 ms 0.74% 4.598 ms -9.64% FAIL
I64 I128 FALSE 2^16 2^4 11.161 11.244 us 1.74% 11.161 us 1.82% 11.161 us -0.74% PASS
I64 I128 FALSE 2^20 2^4 26.628 28.950 us 2.05% 26.628 us 2.52% 26.628 us -8.02% FAIL
I64 I128 FALSE 2^24 2^4 214.735 269.709 us 0.41% 214.735 us 1.33% 214.735 us -20.38% FAIL
I64 I128 FALSE 2^28 2^4 3183 4.090 ms 0.10% 3.183 ms 0.32% 3.183 ms -22.17% FAIL
I64 I128 FALSE 2^16 2^8 11.105 11.312 us 2.10% 11.105 us 2.19% 11.105 us -1.84% PASS
I64 I128 FALSE 2^20 2^8 26.084 28.252 us 1.53% 26.084 us 2.17% 26.084 us -7.67% FAIL
I64 I128 FALSE 2^24 2^8 188.859 255.333 us 0.32% 188.859 us 1.14% 188.859 us -26.03% FAIL
I64 I128 FALSE 2^28 2^8 2621 3.836 ms 0.08% 2.621 ms 0.25% 2.621 ms -31.69% FAIL
I64 I128 TRUE 2^16 2^1 12.54 12.841 us 1.34% 12.540 us 1.94% 12.540 us -2.34% FAIL
I64 I128 TRUE 2^20 2^1 30.976 37.201 us 1.39% 30.976 us 1.94% 30.976 us -16.73% FAIL
I64 I128 TRUE 2^24 2^1 310.935 425.195 us 0.39% 310.935 us 1.16% 310.935 us -26.87% FAIL
I64 I128 TRUE 2^28 2^1 4831 6.649 ms 0.50% 4.831 ms 0.62% 4.831 ms -27.35% FAIL
I64 I128 TRUE 2^16 2^4 11.566 12.436 us 1.62% 11.566 us 1.81% 11.566 us -7.00% FAIL
I64 I128 TRUE 2^20 2^4 28.323 35.528 us 1.22% 28.323 us 1.86% 28.323 us -20.28% FAIL
I64 I128 TRUE 2^24 2^4 228.25 381.924 us 0.33% 228.250 us 1.00% 228.250 us -40.24% FAIL
I64 I128 TRUE 2^28 2^4 3325 5.925 ms 0.08% 3.325 ms 0.24% 3.325 ms -43.88% FAIL
I64 I128 TRUE 2^16 2^8 11.256 12.505 us 1.71% 11.256 us 2.20% 11.256 us -9.99% FAIL
I64 I128 TRUE 2^20 2^8 27.9 35.049 us 1.17% 27.900 us 1.91% 27.900 us -20.40% FAIL
I64 I128 TRUE 2^24 2^8 206.725 367.810 us 0.30% 206.725 us 0.89% 206.725 us -43.80% FAIL
I64 I128 TRUE 2^28 2^8 2928 5.677 ms 0.08% 2.928 ms 0.20% 2.928 ms -48.42% FAIL
I32 F32 FALSE 2^16 2^1 10.114 9.865 us 2.08% 10.114 us 2.80% 10.114 us 2.53% FAIL
I32 F32 FALSE 2^20 2^1 15.232 14.974 us 1.40% 15.232 us 1.21% 15.232 us 1.72% FAIL
I32 F32 FALSE 2^24 2^1 84.12 83.875 us 2.63% 84.120 us 2.64% 84.120 us 0.29% PASS
I32 F32 FALSE 2^28 2^1 842.161 839.903 us 1.40% 842.161 us 1.42% 842.161 us 0.27% PASS
I32 F32 FALSE 2^16 2^4 9.743 9.625 us 2.11% 9.743 us 2.36% 9.743 us 1.23% PASS
I32 F32 FALSE 2^20 2^4 14.385 14.065 us 1.46% 14.385 us 2.05% 14.385 us 2.28% FAIL
I32 F32 FALSE 2^24 2^4 67.238 66.819 us 2.19% 67.238 us 2.24% 67.238 us 0.63% PASS
I32 F32 FALSE 2^28 2^4 782.775 781.657 us 1.20% 782.775 us 1.24% 782.775 us 0.14% PASS
I32 F32 FALSE 2^16 2^8 10.19 9.856 us 1.87% 10.190 us 2.48% 10.190 us 3.39% FAIL
I32 F32 FALSE 2^20 2^8 13.909 13.961 us 1.51% 13.909 us 1.46% 13.909 us -0.38% PASS
I32 F32 FALSE 2^24 2^8 64.756 64.746 us 2.19% 64.756 us 2.15% 64.756 us 0.02% PASS
I32 F32 FALSE 2^28 2^8 679.94 679.438 us 1.16% 679.940 us 1.14% 679.940 us 0.07% PASS
I32 F32 TRUE 2^16 2^1 10.23 10.260 us 1.91% 10.230 us 1.99% 10.230 us -0.29% PASS
I32 F32 TRUE 2^20 2^1 14.993 14.780 us 1.47% 14.993 us 1.92% 14.993 us 1.44% PASS
I32 F32 TRUE 2^24 2^1 84.257 83.917 us 2.66% 84.257 us 2.66% 84.257 us 0.41% PASS
I32 F32 TRUE 2^28 2^1 841.32 839.663 us 1.40% 841.320 us 1.40% 841.320 us 0.20% PASS
I32 F32 TRUE 2^16 2^4 9.959 9.951 us 2.41% 9.959 us 2.51% 9.959 us 0.08% PASS
I32 F32 TRUE 2^20 2^4 14.113 14.015 us 1.38% 14.113 us 1.40% 14.113 us 0.70% PASS
I32 F32 TRUE 2^24 2^4 66.826 66.872 us 2.26% 66.826 us 2.22% 66.826 us -0.07% PASS
I32 F32 TRUE 2^28 2^4 782.105 781.727 us 1.17% 782.105 us 1.19% 782.105 us 0.05% PASS
I32 F32 TRUE 2^16 2^8 9.934 9.831 us 1.84% 9.934 us 1.68% 9.934 us 1.05% PASS
I32 F32 TRUE 2^20 2^8 13.951 14.027 us 1.54% 13.951 us 1.48% 13.951 us -0.54% PASS
I32 F32 TRUE 2^24 2^8 64.797 64.841 us 2.20% 64.797 us 2.21% 64.797 us -0.07% PASS
I32 F32 TRUE 2^28 2^8 680.272 679.451 us 1.15% 680.272 us 1.15% 680.272 us 0.12% PASS
I64 F32 FALSE 2^16 2^1 10.6 9.967 us 2.59% 10.600 us 1.95% 10.600 us 6.35% FAIL
I64 F32 FALSE 2^20 2^1 14.954 15.294 us 1.56% 14.954 us 1.39% 14.954 us -2.22% FAIL
I64 F32 FALSE 2^24 2^1 84.031 93.085 us 1.24% 84.031 us 2.72% 84.031 us -9.73% FAIL
I64 F32 FALSE 2^28 2^1 850.624 1.090 ms 0.61% 850.624 us 1.40% 850.624 us -21.93% FAIL
I64 F32 FALSE 2^16 2^4 10.632 9.850 us 2.51% 10.632 us 1.70% 10.632 us 7.95% FAIL
I64 F32 FALSE 2^20 2^4 14.325 14.997 us 1.42% 14.325 us 1.46% 14.325 us -4.48% FAIL
I64 F32 FALSE 2^24 2^4 67.541 80.396 us 0.82% 67.541 us 2.23% 67.541 us -15.99% FAIL
I64 F32 FALSE 2^28 2^4 786.15 1.052 ms 0.50% 786.150 us 1.23% 786.150 us -25.26% FAIL
I64 F32 FALSE 2^16 2^8 10.48 10.092 us 2.08% 10.480 us 2.46% 10.480 us 3.84% FAIL
I64 F32 FALSE 2^20 2^8 14.737 15.342 us 1.39% 14.737 us 1.67% 14.737 us -3.94% FAIL
I64 F32 FALSE 2^24 2^8 66.28 78.257 us 0.82% 66.280 us 2.34% 66.280 us -15.31% FAIL
I64 F32 FALSE 2^28 2^8 698.819 993.722 us 0.50% 698.819 us 1.10% 698.819 us -29.68% FAIL
I64 F32 TRUE 2^16 2^1 10.579 10.589 us 2.73% 10.579 us 2.65% 10.579 us -0.09% PASS
I64 F32 TRUE 2^20 2^1 15.495 17.673 us 1.15% 15.495 us 1.51% 15.495 us -12.33% FAIL
I64 F32 TRUE 2^24 2^1 84.287 117.320 us 0.83% 84.287 us 2.67% 84.287 us -28.16% FAIL
I64 F32 TRUE 2^28 2^1 851.472 1.532 ms 0.50% 851.472 us 1.45% 851.472 us -44.41% FAIL
I64 F32 TRUE 2^16 2^4 10.612 10.667 us 1.82% 10.612 us 1.96% 10.612 us -0.51% PASS
I64 F32 TRUE 2^20 2^4 14.854 17.496 us 1.52% 14.854 us 1.47% 14.854 us -15.10% FAIL
I64 F32 TRUE 2^24 2^4 67.86 105.966 us 0.75% 67.860 us 2.22% 67.860 us -35.96% FAIL
I64 F32 TRUE 2^28 2^4 786.561 1.496 ms 0.50% 786.561 us 1.23% 786.561 us -47.42% FAIL
I64 F32 TRUE 2^16 2^8 10.257 10.542 us 2.63% 10.257 us 1.85% 10.257 us -2.70% FAIL
I64 F32 TRUE 2^20 2^8 14.773 17.390 us 1.37% 14.773 us 1.69% 14.773 us -15.05% FAIL
I64 F32 TRUE 2^24 2^8 66.293 103.624 us 0.79% 66.293 us 2.32% 66.293 us -36.03% FAIL
I64 F32 TRUE 2^28 2^8 698.844 1.440 ms 0.31% 698.844 us 1.13% 698.844 us -51.47% FAIL
I32 F64 FALSE 2^16 2^1 10.404 10.402 us 2.90% 10.404 us 2.61% 10.404 us 0.02% PASS
I32 F64 FALSE 2^20 2^1 18.651 18.686 us 1.16% 18.651 us 1.65% 18.651 us -0.19% PASS
I32 F64 FALSE 2^24 2^1 148.457 148.385 us 2.48% 148.457 us 2.51% 148.457 us 0.05% PASS
I32 F64 FALSE 2^28 2^1 2200 2.197 ms 1.04% 2.200 ms 1.08% 2.200 ms 0.13% PASS
I32 F64 FALSE 2^16 2^4 9.887 9.885 us 1.56% 9.887 us 2.96% 9.887 us 0.03% PASS
I32 F64 FALSE 2^20 2^4 17.142 17.242 us 1.46% 17.142 us 1.68% 17.142 us -0.58% PASS
I32 F64 FALSE 2^24 2^4 108.907 108.702 us 2.09% 108.907 us 2.10% 108.907 us 0.19% PASS
I32 F64 FALSE 2^28 2^4 1495 1.494 ms 0.79% 1.495 ms 0.77% 1.495 ms 0.03% PASS
I32 F64 FALSE 2^16 2^8 9.744 9.882 us 2.02% 9.744 us 2.64% 9.744 us -1.40% PASS
I32 F64 FALSE 2^20 2^8 16.95 16.887 us 1.50% 16.950 us 1.20% 16.950 us 0.37% PASS
I32 F64 FALSE 2^24 2^8 100.253 100.304 us 1.68% 100.253 us 1.66% 100.253 us -0.05% PASS
I32 F64 FALSE 2^28 2^8 1288 1.288 ms 0.65% 1.288 ms 0.61% 1.288 ms 0.02% PASS
I32 F64 TRUE 2^16 2^1 10.455 10.319 us 2.60% 10.455 us 1.53% 10.455 us 1.31% PASS
I32 F64 TRUE 2^20 2^1 18.434 18.550 us 1.32% 18.434 us 1.07% 18.434 us -0.62% PASS
I32 F64 TRUE 2^24 2^1 148.37 148.351 us 2.54% 148.370 us 2.46% 148.370 us 0.01% PASS
I32 F64 TRUE 2^28 2^1 2199 2.198 ms 1.08% 2.199 ms 1.05% 2.199 ms 0.06% PASS
I32 F64 TRUE 2^16 2^4 10.16 10.028 us 1.79% 10.160 us 2.38% 10.160 us 1.31% PASS
I32 F64 TRUE 2^20 2^4 16.949 17.066 us 1.40% 16.949 us 1.04% 16.949 us -0.69% PASS
I32 F64 TRUE 2^24 2^4 108.876 108.558 us 2.05% 108.876 us 2.08% 108.876 us 0.29% PASS
I32 F64 TRUE 2^28 2^4 1494 1.494 ms 0.76% 1.494 ms 0.77% 1.494 ms 0.04% PASS
I32 F64 TRUE 2^16 2^8 9.847 9.899 us 2.49% 9.847 us 1.66% 9.847 us -0.52% PASS
I32 F64 TRUE 2^20 2^8 16.625 16.651 us 1.90% 16.625 us 1.49% 16.625 us -0.15% PASS
I32 F64 TRUE 2^24 2^8 100.468 100.386 us 1.68% 100.468 us 1.66% 100.468 us 0.08% PASS
I32 F64 TRUE 2^28 2^8 1288 1.288 ms 0.56% 1.288 ms 0.65% 1.288 ms 0.00% PASS
I64 F64 FALSE 2^16 2^1 10.66 10.490 us 2.44% 10.660 us 2.48% 10.660 us 1.62% PASS
I64 F64 FALSE 2^20 2^1 18.152 19.658 us 2.11% 18.152 us 1.37% 18.152 us -7.66% FAIL
I64 F64 FALSE 2^24 2^1 148.175 162.802 us 1.44% 148.175 us 2.53% 148.175 us -8.98% FAIL
I64 F64 FALSE 2^28 2^1 2200 2.461 ms 0.73% 2.200 ms 1.07% 2.200 ms -10.63% FAIL
I64 F64 FALSE 2^16 2^4 10.153 9.987 us 2.07% 10.153 us 2.11% 10.153 us 1.66% PASS
I64 F64 FALSE 2^20 2^4 17.083 18.962 us 1.92% 17.083 us 1.19% 17.083 us -9.91% FAIL
I64 F64 FALSE 2^24 2^4 109.092 130.379 us 1.16% 109.092 us 2.18% 109.092 us -16.33% FAIL
I64 F64 FALSE 2^28 2^4 1502 1.810 ms 0.50% 1.502 ms 0.79% 1.502 ms -17.04% FAIL
I64 F64 FALSE 2^16 2^8 9.916 10.079 us 2.14% 9.916 us 1.97% 9.916 us -1.62% PASS
I64 F64 FALSE 2^20 2^8 16.706 18.712 us 1.77% 16.706 us 1.26% 16.706 us -10.72% FAIL
I64 F64 FALSE 2^24 2^8 101.521 121.786 us 0.87% 101.521 us 2.00% 101.521 us -16.64% FAIL
I64 F64 FALSE 2^28 2^8 1301 1.642 ms 0.21% 1.301 ms 0.65% 1.301 ms -20.74% FAIL
I64 F64 TRUE 2^16 2^1 10.432 11.181 us 1.40% 10.432 us 1.98% 10.432 us -6.70% FAIL
I64 F64 TRUE 2^20 2^1 18.14 22.506 us 1.36% 18.140 us 1.34% 18.140 us -19.40% FAIL
I64 F64 TRUE 2^24 2^1 148.15 196.024 us 0.69% 148.150 us 2.56% 148.150 us -24.42% FAIL
I64 F64 TRUE 2^28 2^1 2200 2.937 ms 0.50% 2.200 ms 1.09% 2.200 ms -25.09% FAIL
I64 F64 TRUE 2^16 2^4 10.249 10.941 us 1.74% 10.249 us 1.97% 10.249 us -6.32% FAIL
I64 F64 TRUE 2^20 2^4 17.03 21.859 us 1.46% 17.030 us 1.43% 17.030 us -22.09% FAIL
I64 F64 TRUE 2^24 2^4 109.106 172.046 us 0.61% 109.106 us 2.15% 109.106 us -36.58% FAIL
I64 F64 TRUE 2^28 2^4 1501 2.523 ms 0.15% 1.501 ms 0.76% 1.501 ms -40.50% FAIL
I64 F64 TRUE 2^16 2^8 9.98 10.978 us 1.86% 9.980 us 1.92% 9.980 us -9.09% FAIL
I64 F64 TRUE 2^20 2^8 16.841 21.527 us 1.37% 16.841 us 1.48% 16.841 us -21.77% FAIL
I64 F64 TRUE 2^24 2^8 101.513 164.315 us 0.60% 101.513 us 1.98% 101.513 us -38.22% FAIL
I64 F64 TRUE 2^28 2^8 1302 2.379 ms 0.14% 1.302 ms 0.62% 1.302 ms -45.26% FAIL
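Note on reading the select.unique table above: its seventh value column repeats Cmp Time instead of giving the absolute Diff, so the difference has to be recomputed from Ref Time and Cmp Time. The sketch below does that for two rows copied from the table. The PASS/FAIL rule it uses (a row passes only when |%Diff| is no larger than the smaller of the two noise figures) is an assumption inferred from the rows in these tables, not taken from the comparison script itself, and `compare` is a hypothetical helper name.

```python
def compare(ref_us, ref_noise_pct, cmp_us, cmp_noise_pct):
    """Return (diff_us, pct_diff, status) for one benchmark row.

    Times are in microseconds, noise figures in percent.
    """
    diff_us = cmp_us - ref_us
    pct_diff = 100.0 * diff_us / ref_us
    # Assumed rule (inferred from the tables): the change counts as
    # significant when it exceeds the smaller of the two noise figures.
    threshold = min(ref_noise_pct, cmp_noise_pct)
    status = "PASS" if abs(pct_diff) <= threshold else "FAIL"
    return diff_us, pct_diff, status


# Rows copied from the select.unique table
# (OffsetT, T, IsInPlace, Elements, MaxSegSize).
rows = {
    "I32 I8 FALSE 2^16 2^1": (9.223, 1.67, 9.333, 1.92),
    "I64 I8 FALSE 2^24 2^1": (69.805, 0.50, 51.366, 1.00),
}

for name, args in rows.items():
    diff_us, pct_diff, status = compare(*args)
    print(f"{name}: {diff_us:+.3f} us, {pct_diff:+.2f}%, {status}")
```

Under this reading, FAIL only marks a change larger than the measurement noise. A FAIL with a negative %Diff, as in most of the I64 offset rows, means Cmp ran faster than Ref.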
H100 partition.if
T{ct} OffsetT{ct} DistinctPartitions{ct} Elements{io} Entropy Ref Time Ref Noise Cmp Time Cmp Noise Diff %Diff Status
I8 I32 false 2^16 1 9.977 us 1.85% 9.946 us 1.47% -0.031 us -0.31% PASS
I8 I32 false 2^20 1 12.823 us 1.57% 12.593 us 1.86% -0.229 us -1.79% FAIL
I8 I32 false 2^24 1 56.131 us 0.84% 56.007 us 0.82% -0.124 us -0.22% PASS
I8 I32 false 2^28 1 762.150 us 0.33% 764.417 us 0.33% 2.267 us 0.30% PASS
I8 I32 false 2^16 0.544 10.327 us 2.26% 10.342 us 1.67% 0.015 us 0.14% PASS
I8 I32 false 2^20 0.544 13.129 us 1.89% 12.903 us 2.51% -0.226 us -1.72% PASS
I8 I32 false 2^24 0.544 58.360 us 0.79% 58.156 us 0.80% -0.204 us -0.35% PASS
I8 I32 false 2^28 0.544 792.869 us 0.32% 795.253 us 0.34% 2.384 us 0.30% PASS
I8 I32 false 2^16 0 10.084 us 2.35% 9.947 us 2.02% -0.137 us -1.35% PASS
I8 I32 false 2^20 0 12.867 us 2.21% 12.682 us 2.07% -0.185 us -1.44% PASS
I8 I32 false 2^24 0 54.925 us 0.81% 54.794 us 0.87% -0.131 us -0.24% PASS
I8 I32 false 2^28 0 743.024 us 0.31% 744.596 us 0.33% 1.572 us 0.21% PASS
I8 I32 true 2^16 1 10.319 us 1.64% 10.337 us 2.26% 0.017 us 0.17% PASS
I8 I32 true 2^20 1 13.275 us 1.97% 13.092 us 2.09% -0.182 us -1.37% PASS
I8 I32 true 2^24 1 57.376 us 0.76% 57.067 us 0.81% -0.310 us -0.54% PASS
I8 I32 true 2^28 1 772.965 us 0.33% 774.699 us 0.34% 1.734 us 0.22% PASS
I8 I32 true 2^16 0.544 10.106 us 1.79% 10.526 us 1.86% 0.420 us 4.16% FAIL
I8 I32 true 2^20 0.544 13.185 us 2.17% 13.562 us 2.43% 0.377 us 2.86% FAIL
I8 I32 true 2^24 0.544 59.504 us 0.73% 60.095 us 0.73% 0.590 us 0.99% FAIL
I8 I32 true 2^28 0.544 809.819 us 0.27% 811.663 us 0.30% 1.844 us 0.23% PASS
I8 I32 true 2^16 0 10.158 us 1.76% 10.355 us 1.55% 0.198 us 1.95% FAIL
I8 I32 true 2^20 0 12.888 us 1.95% 13.387 us 1.91% 0.499 us 3.87% FAIL
I8 I32 true 2^24 0 56.615 us 0.82% 56.990 us 0.76% 0.375 us 0.66% PASS
I8 I32 true 2^28 0 770.494 us 0.30% 771.660 us 0.29% 1.167 us 0.15% PASS
I8 I64 false 2^16 1 9.394 us 2.71% 11.232 us 1.04% 1.838 us 19.56% FAIL
I8 I64 false 2^20 1 13.478 us 1.69% 14.168 us 1.83% 0.690 us 5.12% FAIL
I8 I64 false 2^24 1 69.962 us 0.45% 62.423 us 0.67% -7.539 us -10.78% FAIL
I8 I64 false 2^28 1 997.934 us 0.24% 839.967 us 0.31% -157.967 us -15.83% FAIL
I8 I64 false 2^16 0.544 9.635 us 2.33% 11.387 us 1.42% 1.752 us 18.19% FAIL
I8 I64 false 2^20 0.544 13.431 us 1.39% 14.406 us 1.49% 0.975 us 7.26% FAIL
I8 I64 false 2^24 0.544 70.161 us 0.50% 67.155 us 0.62% -3.006 us -4.29% FAIL
I8 I64 false 2^28 0.544 999.017 us 0.24% 919.809 us 0.23% -79.208 us -7.93% FAIL
I8 I64 false 2^16 0 9.446 us 2.56% 11.478 us 1.53% 2.033 us 21.52% FAIL
I8 I64 false 2^20 0 13.302 us 1.51% 14.418 us 1.74% 1.116 us 8.39% FAIL
I8 I64 false 2^24 0 69.571 us 0.48% 65.087 us 0.68% -4.483 us -6.44% FAIL
I8 I64 false 2^28 0 992.318 us 0.24% 883.679 us 0.25% -108.639 us -10.95% FAIL
I8 I64 true 2^16 1 9.544 us 2.39% 10.558 us 1.88% 1.013 us 10.62% FAIL
I8 I64 true 2^20 1 13.371 us 1.50% 13.398 us 1.80% 0.026 us 0.20% PASS
I8 I64 true 2^24 1 70.752 us 0.50% 59.365 us 0.81% -11.387 us -16.09% FAIL
I8 I64 true 2^28 1 1.007 ms 0.23% 805.290 us 0.33% -201.468 us -20.01% FAIL
I8 I64 true 2^16 0.544 9.675 us 2.34% 10.737 us 1.76% 1.063 us 10.98% FAIL
I8 I64 true 2^20 0.544 13.378 us 1.48% 13.658 us 1.56% 0.279 us 2.09% FAIL
I8 I64 true 2^24 0.544 71.135 us 0.49% 61.269 us 0.71% -9.866 us -13.87% FAIL
I8 I64 true 2^28 0.544 1.011 ms 0.24% 834.775 us 0.29% -176.497 us -17.45% FAIL
I8 I64 true 2^16 0 9.773 us 2.26% 10.718 us 2.30% 0.944 us 9.66% FAIL
I8 I64 true 2^20 0 13.293 us 1.54% 13.269 us 1.63% -0.024 us -0.18% PASS
I8 I64 true 2^24 0 69.947 us 0.46% 59.009 us 0.75% -10.938 us -15.64% FAIL
I8 I64 true 2^28 0 997.857 us 0.23% 801.080 us 0.29% -196.778 us -19.72% FAIL
I16 I32 false 2^16 1 9.805 us 2.23% 10.072 us 2.44% 0.268 us 2.73% FAIL
I16 I32 false 2^20 1 12.899 us 2.02% 13.327 us 1.77% 0.428 us 3.32% FAIL
I16 I32 false 2^24 1 65.428 us 2.56% 65.815 us 2.52% 0.387 us 0.59% PASS
I16 I32 false 2^28 1 900.492 us 1.14% 900.182 us 1.17% -0.310 us -0.03% PASS
I16 I32 false 2^16 0.544 9.979 us 1.90% 10.238 us 2.01% 0.259 us 2.60% FAIL
I16 I32 false 2^20 0.544 13.230 us 1.73% 13.457 us 1.96% 0.227 us 1.71% PASS
I16 I32 false 2^24 0.544 65.730 us 2.02% 65.998 us 2.02% 0.267 us 0.41% PASS
I16 I32 false 2^28 0.544 909.833 us 1.27% 905.552 us 0.95% -4.281 us -0.47% PASS
I16 I32 false 2^16 0 9.719 us 2.10% 10.174 us 2.00% 0.455 us 4.68% FAIL
I16 I32 false 2^20 0 12.841 us 1.96% 13.173 us 1.99% 0.332 us 2.59% FAIL
I16 I32 false 2^24 0 62.141 us 2.28% 62.390 us 2.30% 0.250 us 0.40% PASS
I16 I32 false 2^28 0 848.690 us 1.18% 848.943 us 1.18% 0.252 us 0.03% PASS
I16 I32 true 2^16 1 10.158 us 2.60% 10.487 us 2.14% 0.330 us 3.24% FAIL
I16 I32 true 2^20 1 13.298 us 2.29% 13.431 us 2.14% 0.134 us 1.01% PASS
I16 I32 true 2^24 1 64.714 us 2.69% 64.848 us 2.69% 0.134 us 0.21% PASS
I16 I32 true 2^28 1 891.790 us 1.60% 886.678 us 1.31% -5.112 us -0.57% PASS
I16 I32 true 2^16 0.544 10.197 us 2.28% 10.425 us 1.69% 0.227 us 2.23% FAIL
I16 I32 true 2^20 0.544 13.232 us 1.70% 13.629 us 1.63% 0.397 us 3.00% FAIL
I16 I32 true 2^24 0.544 64.784 us 2.17% 65.104 us 2.16% 0.320 us 0.49% PASS
I16 I32 true 2^28 0.544 897.504 us 1.97% 891.503 us 1.57% -6.001 us -0.67% PASS
I16 I32 true 2^16 0 10.165 us 2.02% 10.319 us 2.48% 0.155 us 1.52% PASS
I16 I32 true 2^20 0 13.367 us 2.24% 13.544 us 1.85% 0.177 us 1.32% PASS
I16 I32 true 2^24 0 63.480 us 2.32% 63.300 us 2.34% -0.180 us -0.28% PASS
I16 I32 true 2^28 0 856.726 us 1.38% 853.886 us 1.10% -2.840 us -0.33% PASS
I16 I64 false 2^16 1 9.738 us 2.10% 10.783 us 1.95% 1.045 us 10.73% FAIL
I16 I64 false 2^20 1 14.317 us 1.67% 14.169 us 1.87% -0.148 us -1.03% PASS
I16 I64 false 2^24 1 77.004 us 0.63% 66.282 us 2.59% -10.722 us -13.92% FAIL
I16 I64 false 2^28 1 1.084 ms 0.50% 909.191 us 1.69% -174.732 us -16.12% FAIL
I16 I64 false 2^16 0.544 9.706 us 2.05% 10.858 us 1.96% 1.152 us 11.87% FAIL
I16 I64 false 2^20 0.544 14.181 us 1.92% 14.486 us 1.94% 0.305 us 2.15% FAIL
I16 I64 false 2^24 0.544 77.494 us 0.62% 68.051 us 1.92% -9.443 us -12.19% FAIL
I16 I64 false 2^28 0.544 1.092 ms 0.50% 951.190 us 2.01% -140.406 us -12.86% FAIL
I16 I64 false 2^16 0 9.996 us 2.15% 10.879 us 1.48% 0.883 us 8.83% FAIL
I16 I64 false 2^20 0 14.447 us 1.77% 14.425 us 2.02% -0.022 us -0.15% PASS
I16 I64 false 2^24 0 76.925 us 0.63% 66.267 us 1.81% -10.658 us -13.85% FAIL
I16 I64 false 2^28 0 1.076 ms 0.50% 905.602 us 1.87% -170.717 us -15.86% FAIL
I16 I64 true 2^16 1 9.888 us 2.81% 10.528 us 1.78% 0.640 us 6.47% FAIL
I16 I64 true 2^20 1 14.493 us 1.57% 13.931 us 1.79% -0.562 us -3.88% FAIL
I16 I64 true 2^24 1 77.387 us 0.70% 65.776 us 2.10% -11.611 us -15.00% FAIL
I16 I64 true 2^28 1 1.084 ms 0.50% 905.585 us 1.86% -178.013 us -16.43% FAIL
I16 I64 true 2^16 0.544 9.951 us 1.93% 10.475 us 1.92% 0.524 us 5.26% FAIL
I16 I64 true 2^20 0.544 14.563 us 1.64% 14.239 us 1.81% -0.324 us -2.23% FAIL
I16 I64 true 2^24 0.544 78.408 us 0.65% 66.737 us 1.71% -11.672 us -14.89% FAIL
I16 I64 true 2^28 0.544 1.099 ms 0.50% 925.692 us 2.60% -172.830 us -15.73% FAIL
I16 I64 true 2^16 0 10.010 us 2.24% 10.429 us 1.55% 0.419 us 4.19% FAIL
I16 I64 true 2^20 0 14.426 us 1.89% 13.958 us 1.63% -0.468 us -3.24% FAIL
I16 I64 true 2^24 0 77.170 us 0.68% 64.416 us 1.65% -12.753 us -16.53% FAIL
I16 I64 true 2^28 0 1.080 ms 0.49% 873.730 us 1.48% -206.714 us -19.13% FAIL
I32 I32 false 2^16 1 9.891 us 2.11% 9.882 us 2.15% -0.009 us -0.09% PASS
I32 I32 false 2^20 1 15.063 us 1.39% 15.004 us 2.18% -0.059 us -0.39% PASS
I32 I32 false 2^24 1 93.088 us 2.14% 92.841 us 2.15% -0.248 us -0.27% PASS
I32 I32 false 2^28 1 1.367 ms 0.90% 1.365 ms 0.95% -2.360 us -0.17% PASS
I32 I32 false 2^16 0.544 9.906 us 2.92% 9.784 us 2.16% -0.122 us -1.23% PASS
I32 I32 false 2^20 0.544 15.242 us 1.56% 15.155 us 1.62% -0.087 us -0.57% PASS
I32 I32 false 2^24 0.544 95.889 us 2.12% 95.647 us 2.15% -0.242 us -0.25% PASS
I32 I32 false 2^28 0.544 1.398 ms 0.89% 1.396 ms 0.96% -1.447 us -0.10% PASS
I32 I32 false 2^16 0 9.691 us 2.02% 9.637 us 1.98% -0.054 us -0.56% PASS
I32 I32 false 2^20 0 14.768 us 1.45% 14.735 us 1.53% -0.033 us -0.22% PASS
I32 I32 false 2^24 0 92.777 us 2.17% 92.687 us 2.16% -0.090 us -0.10% PASS
I32 I32 false 2^28 0 1.367 ms 0.94% 1.366 ms 0.92% -0.992 us -0.07% PASS
I32 I32 true 2^16 1 9.668 us 1.62% 9.725 us 1.96% 0.057 us 0.59% PASS
I32 I32 true 2^20 1 14.855 us 1.72% 14.986 us 1.72% 0.131 us 0.88% PASS
I32 I32 true 2^24 1 92.322 us 2.15% 92.249 us 2.17% -0.073 us -0.08% PASS
I32 I32 true 2^28 1 1.361 ms 0.93% 1.361 ms 0.91% -0.030 us -0.00% PASS
I32 I32 true 2^16 0.544 9.602 us 2.26% 9.661 us 1.99% 0.059 us 0.62% PASS
I32 I32 true 2^20 0.544 15.211 us 1.56% 15.244 us 1.52% 0.034 us 0.22% PASS
I32 I32 true 2^24 0.544 93.833 us 2.24% 93.870 us 2.20% 0.037 us 0.04% PASS
I32 I32 true 2^28 0.544 1.382 ms 1.00% 1.382 ms 0.99% -0.558 us -0.04% PASS
I32 I32 true 2^16 0 9.677 us 2.23% 9.590 us 2.09% -0.086 us -0.89% PASS
I32 I32 true 2^20 0 15.014 us 1.72% 15.090 us 1.64% 0.077 us 0.51% PASS
I32 I32 true 2^24 0 92.093 us 2.16% 92.027 us 2.15% -0.066 us -0.07% PASS
I32 I32 true 2^28 0 1.361 ms 0.93% 1.360 ms 0.93% -1.347 us -0.10% PASS
I32 I64 false 2^16 1 9.719 us 2.46% 10.232 us 1.98% 0.513 us 5.28% FAIL
I32 I64 false 2^20 1 15.485 us 1.34% 15.765 us 1.77% 0.280 us 1.81% FAIL
I32 I64 false 2^24 1 98.714 us 1.32% 94.815 us 2.60% -3.899 us -3.95% FAIL
I32 I64 false 2^28 1 1.465 ms 0.93% 1.388 ms 1.19% -76.986 us -5.26% FAIL
I32 I64 false 2^16 0.544 9.730 us 2.00% 10.660 us 1.81% 0.930 us 9.56% FAIL
I32 I64 false 2^20 0.544 15.602 us 1.66% 15.884 us 1.58% 0.282 us 1.81% FAIL
I32 I64 false 2^24 0.544 100.040 us 1.33% 99.843 us 2.73% -0.197 us -0.20% PASS
I32 I64 false 2^28 0.544 1.485 ms 0.96% 1.441 ms 1.62% -44.093 us -2.97% FAIL
I32 I64 false 2^16 0 9.901 us 2.06% 10.652 us 1.74% 0.751 us 7.59% FAIL
I32 I64 false 2^20 0 15.536 us 1.44% 15.785 us 1.43% 0.249 us 1.60% FAIL
I32 I64 false 2^24 0 98.607 us 1.35% 96.479 us 2.75% -2.128 us -2.16% FAIL
I32 I64 false 2^28 0 1.464 ms 0.90% 1.403 ms 1.36% -61.114 us -4.17% FAIL
I32 I64 true 2^16 1 10.019 us 2.99% 9.944 us 2.39% -0.075 us -0.74% PASS
I32 I64 true 2^20 1 15.538 us 1.48% 15.501 us 1.58% -0.036 us -0.23% PASS
I32 I64 true 2^24 1 98.476 us 1.39% 93.647 us 2.22% -4.828 us -4.90% FAIL
I32 I64 true 2^28 1 1.465 ms 1.10% 1.377 ms 1.12% -87.996 us -6.01% FAIL
I32 I64 true 2^16 0.544 9.905 us 2.36% 10.106 us 2.40% 0.201 us 2.02% PASS
I32 I64 true 2^20 0.544 15.581 us 1.39% 15.564 us 1.97% -0.017 us -0.11% PASS
I32 I64 true 2^24 0.544 100.427 us 1.41% 95.090 us 2.21% -5.338 us -5.32% FAIL
I32 I64 true 2^28 0.544 1.495 ms 1.17% 1.395 ms 1.20% -100.690 us -6.73% FAIL
I32 I64 true 2^16 0 10.010 us 2.45% 10.006 us 1.97% -0.004 us -0.04% PASS
I32 I64 true 2^20 0 15.476 us 1.35% 15.528 us 1.54% 0.052 us 0.33% PASS
I32 I64 true 2^24 0 98.578 us 1.40% 93.751 us 2.25% -4.826 us -4.90% FAIL
I32 I64 true 2^28 0 1.466 ms 1.13% 1.377 ms 1.00% -89.582 us -6.11% FAIL
I64 I32 false 2^16 1 10.715 us 2.40% 10.770 us 2.63% 0.055 us 0.51% PASS
I64 I32 false 2^20 1 19.166 us 1.83% 19.263 us 1.67% 0.097 us 0.51% PASS
I64 I32 false 2^24 1 169.829 us 1.71% 169.753 us 1.75% -0.077 us -0.05% PASS
I64 I32 false 2^28 1 2.592 ms 0.51% 2.591 ms 0.51% -0.330 us -0.01% PASS
I64 I32 false 2^16 0.544 10.586 us 2.44% 10.633 us 2.35% 0.047 us 0.44% PASS
I64 I32 false 2^20 0.544 19.061 us 1.62% 19.032 us 1.46% -0.029 us -0.15% PASS
I64 I32 false 2^24 0.544 170.921 us 1.70% 171.076 us 1.74% 0.155 us 0.09% PASS
I64 I32 false 2^28 0.544 2.609 ms 0.51% 2.609 ms 0.50% 0.070 us 0.00% PASS
I64 I32 false 2^16 0 10.667 us 2.39% 10.746 us 2.70% 0.079 us 0.74% PASS
I64 I32 false 2^20 0 19.197 us 1.48% 19.164 us 1.40% -0.033 us -0.17% PASS
I64 I32 false 2^24 0 169.840 us 1.72% 169.853 us 1.73% 0.012 us 0.01% PASS
I64 I32 false 2^28 0 2.593 ms 0.51% 2.592 ms 0.52% -0.364 us -0.01% PASS
I64 I32 true 2^16 1 10.881 us 2.83% 10.910 us 2.82% 0.028 us 0.26% PASS
I64 I32 true 2^20 1 19.131 us 1.46% 19.148 us 1.43% 0.016 us 0.09% PASS
I64 I32 true 2^24 1 169.154 us 1.66% 169.370 us 1.73% 0.216 us 0.13% PASS
I64 I32 true 2^28 1 2.585 ms 0.46% 2.585 ms 0.43% -0.069 us -0.00% PASS
I64 I32 true 2^16 0.544 10.648 us 2.90% 10.705 us 2.68% 0.057 us 0.53% PASS
I64 I32 true 2^20 0.544 18.842 us 1.40% 18.969 us 1.38% 0.127 us 0.67% PASS
I64 I32 true 2^24 0.544 170.392 us 1.73% 170.327 us 1.68% -0.065 us -0.04% PASS
I64 I32 true 2^28 0.544 2.601 ms 0.51% 2.600 ms 0.50% -0.350 us -0.01% PASS
I64 I32 true 2^16 0 10.709 us 2.69% 10.561 us 3.23% -0.148 us -1.38% PASS
I64 I32 true 2^20 0 19.066 us 1.40% 18.760 us 1.60% -0.306 us -1.60% FAIL
I64 I32 true 2^24 0 169.421 us 1.74% 169.188 us 1.73% -0.233 us -0.14% PASS
I64 I32 true 2^28 0 2.584 ms 0.45% 2.587 ms 0.47% 3.121 us 0.12% PASS
I64 I64 false 2^16 1 10.481 us 2.06% 10.730 us 2.52% 0.249 us 2.38% FAIL
I64 I64 false 2^20 1 20.540 us 2.66% 19.149 us 1.59% -1.390 us -6.77% FAIL
I64 I64 false 2^24 1 182.701 us 1.40% 173.053 us 2.26% -9.648 us -5.28% FAIL
I64 I64 false 2^28 1 2.820 ms 0.43% 2.623 ms 0.68% -197.316 us -7.00% FAIL
I64 I64 false 2^16 0.544 10.483 us 2.00% 10.821 us 2.47% 0.337 us 3.22% FAIL
I64 I64 false 2^20 0.544 20.588 us 2.54% 19.057 us 1.68% -1.532 us -7.44% FAIL
I64 I64 false 2^24 0.544 184.171 us 1.27% 174.097 us 2.17% -10.074 us -5.47% FAIL
I64 I64 false 2^28 0.544 2.847 ms 0.44% 2.633 ms 0.68% -214.152 us -7.52% FAIL
I64 I64 false 2^16 0 10.471 us 2.14% 10.988 us 2.39% 0.517 us 4.93% FAIL
I64 I64 false 2^20 0 20.422 us 2.45% 19.307 us 1.36% -1.115 us -5.46% FAIL
I64 I64 false 2^24 0 182.778 us 1.37% 175.619 us 2.52% -7.159 us -3.92% FAIL
I64 I64 false 2^28 0 2.820 ms 0.47% 2.646 ms 0.73% -174.496 us -6.19% FAIL
I64 I64 true 2^16 1 10.465 us 2.94% 10.372 us 2.79% -0.092 us -0.88% PASS
I64 I64 true 2^20 1 20.500 us 2.65% 18.916 us 1.45% -1.584 us -7.73% FAIL
I64 I64 true 2^24 1 182.644 us 1.41% 169.434 us 1.73% -13.210 us -7.23% FAIL
I64 I64 true 2^28 1 2.818 ms 0.34% 2.584 ms 0.50% -234.197 us -8.31% FAIL
I64 I64 true 2^16 0.544 10.514 us 1.82% 10.516 us 2.75% 0.002 us 0.02% PASS
I64 I64 true 2^20 0.544 20.496 us 2.47% 18.984 us 1.48% -1.513 us -7.38% FAIL
I64 I64 true 2^24 0.544 184.167 us 1.32% 170.288 us 1.70% -13.878 us -7.54% FAIL
I64 I64 true 2^28 0.544 2.849 ms 0.38% 2.598 ms 0.49% -251.218 us -8.82% FAIL
I64 I64 true 2^16 0 10.449 us 2.29% 10.947 us 2.62% 0.498 us 4.77% FAIL
I64 I64 true 2^20 0 20.249 us 2.81% 19.228 us 1.50% -1.022 us -5.05% FAIL
I64 I64 true 2^24 0 182.288 us 1.44% 169.539 us 1.75% -12.749 us -6.99% FAIL
I64 I64 true 2^28 0 2.818 ms 0.39% 2.587 ms 0.49% -231.637 us -8.22% FAIL
I128 I32 false 2^16 1 10.943 us 2.73% 11.279 us 2.34% 0.336 us 3.07% FAIL
I128 I32 false 2^20 1 30.264 us 4.47% 30.826 us 4.20% 0.562 us 1.86% PASS
I128 I32 false 2^24 1 335.012 us 1.64% 335.166 us 1.65% 0.154 us 0.05% PASS
I128 I32 false 2^28 1 5.256 ms 0.46% 5.257 ms 0.42% 0.926 us 0.02% PASS
I128 I32 false 2^16 0.544 11.015 us 2.74% 11.332 us 2.18% 0.317 us 2.88% FAIL
I128 I32 false 2^20 0.544 30.836 us 4.24% 31.205 us 4.26% 0.370 us 1.20% PASS
I128 I32 false 2^24 0.544 339.466 us 1.69% 339.952 us 1.66% 0.486 us 0.14% PASS
I128 I32 false 2^28 0.544 5.314 ms 0.43% 5.314 ms 0.38% -0.072 us -0.00% PASS
I128 I32 false 2^16 0 10.894 us 2.37% 11.407 us 2.30% 0.512 us 4.70% FAIL
I128 I32 false 2^20 0 30.343 us 4.41% 30.777 us 4.26% 0.434 us 1.43% PASS
I128 I32 false 2^24 0 335.262 us 1.63% 335.946 us 1.64% 0.684 us 0.20% PASS
I128 I32 false 2^28 0 5.254 ms 0.44% 5.253 ms 0.40% -1.462 us -0.03% PASS
I128 I32 true 2^16 1 11.107 us 2.66% 11.389 us 2.28% 0.282 us 2.54% FAIL
I128 I32 true 2^20 1 30.423 us 4.01% 30.900 us 3.98% 0.477 us 1.57% PASS
I128 I32 true 2^24 1 334.021 us 1.57% 334.542 us 1.60% 0.521 us 0.16% PASS
I128 I32 true 2^28 1 5.250 ms 0.38% 5.247 ms 0.43% -3.200 us -0.06% PASS
I128 I32 true 2^16 0.544 11.368 us 2.81% 11.535 us 2.39% 0.167 us 1.47% PASS
I128 I32 true 2^20 0.544 31.247 us 3.89% 31.633 us 3.83% 0.386 us 1.24% PASS
I128 I32 true 2^24 0.544 338.913 us 1.65% 338.641 us 1.61% -0.272 us -0.08% PASS
I128 I32 true 2^28 0.544 5.302 ms 0.39% 5.308 ms 0.41% 6.189 us 0.12% PASS
I128 I32 true 2^16 0 11.398 us 2.27% 11.403 us 2.35% 0.005 us 0.05% PASS
I128 I32 true 2^20 0 30.799 us 3.96% 30.887 us 3.96% 0.088 us 0.29% PASS
I128 I32 true 2^24 0 334.524 us 1.60% 334.547 us 1.59% 0.023 us 0.01% PASS
I128 I32 true 2^28 0 5.250 ms 0.42% 5.250 ms 0.42% -0.078 us -0.00% PASS
I128 I64 false 2^16 1 11.637 us 2.59% 11.283 us 2.18% -0.354 us -3.04% FAIL
I128 I64 false 2^20 1 33.213 us 2.28% 31.742 us 3.51% -1.471 us -4.43% FAIL
I128 I64 false 2^24 1 362.620 us 1.08% 333.153 us 1.58% -29.467 us -8.13% FAIL
I128 I64 false 2^28 1 5.701 ms 0.30% 5.205 ms 0.36% -495.588 us -8.69% FAIL
I128 I64 false 2^16 0.544 11.778 us 1.66% 11.403 us 1.93% -0.375 us -3.18% FAIL
I128 I64 false 2^20 0.544 33.189 us 2.14% 32.182 us 3.46% -1.007 us -3.03% FAIL
I128 I64 false 2^24 0.544 365.502 us 1.09% 338.419 us 1.75% -27.083 us -7.41% FAIL
I128 I64 false 2^28 0.544 5.744 ms 0.29% 5.268 ms 0.44% -475.115 us -8.27% FAIL
I128 I64 false 2^16 0 11.758 us 1.61% 11.309 us 2.81% -0.450 us -3.83% FAIL
I128 I64 false 2^20 0 33.079 us 2.21% 31.831 us 3.46% -1.248 us -3.77% FAIL
I128 I64 false 2^24 0 362.738 us 1.05% 334.314 us 1.63% -28.424 us -7.84% FAIL
I128 I64 false 2^28 0 5.698 ms 0.33% 5.217 ms 0.39% -481.876 us -8.46% FAIL
I128 I64 true 2^16 1 11.552 us 1.59% 11.261 us 2.15% -0.291 us -2.52% FAIL
I128 I64 true 2^20 1 32.755 us 2.21% 31.428 us 3.42% -1.328 us -4.05% FAIL
I128 I64 true 2^24 1 362.195 us 1.07% 335.316 us 1.59% -26.879 us -7.42% FAIL
I128 I64 true 2^28 1 5.702 ms 0.31% 5.259 ms 0.44% -443.698 us -7.78% FAIL
I128 I64 true 2^16 0.544 11.606 us 1.67% 11.649 us 2.27% 0.043 us 0.37% PASS
I128 I64 true 2^20 0.544 32.761 us 2.14% 32.027 us 3.27% -0.734 us -2.24% FAIL
I128 I64 true 2^24 0.544 365.123 us 1.10% 339.217 us 1.61% -25.906 us -7.10% FAIL
I128 I64 true 2^28 0.544 5.745 ms 0.28% 5.308 ms 0.43% -436.834 us -7.60% FAIL
I128 I64 true 2^16 0 11.532 us 1.92% 11.394 us 2.77% -0.138 us -1.20% PASS
I128 I64 true 2^20 0 32.722 us 2.20% 31.461 us 3.50% -1.260 us -3.85% FAIL
I128 I64 true 2^24 0 362.223 us 1.10% 335.186 us 1.61% -27.037 us -7.46% FAIL
I128 I64 true 2^28 0 5.698 ms 0.35% 5.254 ms 0.46% -443.753 us -7.79% FAIL
F32 I32 false 2^16 1 9.583 us 2.56% 9.923 us 1.91% 0.340 us 3.55% FAIL
F32 I32 false 2^20 1 14.732 us 1.61% 15.075 us 1.66% 0.342 us 2.32% FAIL
F32 I32 false 2^24 1 92.560 us 2.13% 93.055 us 2.11% 0.495 us 0.53% PASS
F32 I32 false 2^28 1 1.375 ms 0.93% 1.375 ms 0.98% 0.229 us 0.02% PASS
F32 I32 false 2^16 0.544 9.755 us 2.15% 10.171 us 2.27% 0.416 us 4.27% FAIL
F32 I32 false 2^20 0.544 15.362 us 1.75% 15.631 us 1.34% 0.269 us 1.75% FAIL
F32 I32 false 2^24 0.544 95.611 us 2.19% 96.041 us 2.19% 0.431 us 0.45% PASS
F32 I32 false 2^28 0.544 1.395 ms 0.93% 1.395 ms 0.89% 0.117 us 0.01% PASS
F32 I32 false 2^16 0 9.593 us 2.10% 9.893 us 2.05% 0.300 us 3.13% FAIL
F32 I32 false 2^20 0 15.232 us 1.40% 14.982 us 1.42% -0.251 us -1.64% FAIL
F32 I32 false 2^24 0 92.960 us 2.15% 92.633 us 2.15% -0.327 us -0.35% PASS
F32 I32 false 2^28 0 1.366 ms 0.93% 1.366 ms 0.97% -0.347 us -0.03% PASS
F32 I32 true 2^16 1 10.155 us 2.19% 9.587 us 2.09% -0.568 us -5.59% FAIL
F32 I32 true 2^20 1 15.328 us 1.62% 15.244 us 1.92% -0.083 us -0.54% PASS
F32 I32 true 2^24 1 92.200 us 2.14% 92.016 us 2.14% -0.184 us -0.20% PASS
F32 I32 true 2^28 1 1.367 ms 0.90% 1.367 ms 0.89% -0.232 us -0.02% PASS
F32 I32 true 2^16 0.544 10.034 us 2.93% 9.686 us 1.97% -0.348 us -3.47% FAIL
F32 I32 true 2^20 0.544 15.525 us 1.56% 15.274 us 1.53% -0.251 us -1.62% FAIL
F32 I32 true 2^24 0.544 94.314 us 2.21% 94.110 us 2.19% -0.204 us -0.22% PASS
F32 I32 true 2^28 0.544 1.379 ms 0.91% 1.379 ms 0.91% -0.589 us -0.04% PASS
F32 I32 true 2^16 0 9.841 us 2.80% 9.627 us 2.02% -0.215 us -2.18% FAIL
F32 I32 true 2^20 0 15.332 us 2.04% 14.997 us 1.69% -0.335 us -2.19% FAIL
F32 I32 true 2^24 0 92.355 us 2.13% 92.074 us 2.18% -0.281 us -0.30% PASS
F32 I32 true 2^28 0 1.358 ms 0.91% 1.358 ms 0.88% 0.293 us 0.02% PASS
F32 I64 false 2^16 1 10.029 us 2.03% 10.227 us 1.73% 0.199 us 1.98% FAIL
F32 I64 false 2^20 1 15.751 us 1.32% 15.441 us 1.68% -0.310 us -1.97% FAIL
F32 I64 false 2^24 1 98.959 us 1.35% 94.459 us 2.62% -4.500 us -4.55% FAIL
F32 I64 false 2^28 1 1.467 ms 0.73% 1.395 ms 1.16% -72.613 us -4.95% FAIL
F32 I64 false 2^16 0.544 10.150 us 2.18% 10.398 us 2.16% 0.248 us 2.44% FAIL
F32 I64 false 2^20 0.544 15.813 us 1.49% 15.844 us 1.40% 0.030 us 0.19% PASS
F32 I64 false 2^24 0.544 100.171 us 1.33% 99.543 us 2.78% -0.628 us -0.63% PASS
F32 I64 false 2^28 0.544 1.481 ms 0.95% 1.442 ms 1.48% -39.527 us -2.67% FAIL
F32 I64 false 2^16 0 10.102 us 2.32% 10.635 us 1.80% 0.533 us 5.28% FAIL
F32 I64 false 2^20 0 15.848 us 1.41% 15.853 us 1.57% 0.005 us 0.03% PASS
F32 I64 false 2^24 0 98.737 us 1.35% 96.517 us 2.80% -2.221 us -2.25% FAIL
F32 I64 false 2^28 0 1.460 ms 0.71% 1.399 ms 1.14% -60.506 us -4.14% FAIL
F32 I64 true 2^16 1 10.200 us 2.72% 10.084 us 2.17% -0.116 us -1.14% PASS
F32 I64 true 2^20 1 15.792 us 1.84% 15.479 us 1.64% -0.313 us -1.98% FAIL
F32 I64 true 2^24 1 98.660 us 1.40% 93.713 us 2.24% -4.948 us -5.01% FAIL
F32 I64 true 2^28 1 1.471 ms 0.97% 1.381 ms 0.95% -90.041 us -6.12% FAIL
F32 I64 true 2^16 0.544 10.123 us 2.73% 10.104 us 2.43% -0.020 us -0.20% PASS
F32 I64 true 2^20 0.544 15.785 us 1.79% 15.631 us 1.47% -0.154 us -0.97% PASS
F32 I64 true 2^24 0.544 100.436 us 1.41% 95.231 us 2.15% -5.205 us -5.18% FAIL
F32 I64 true 2^28 0.544 1.490 ms 1.12% 1.390 ms 1.02% -99.092 us -6.65% FAIL
F32 I64 true 2^16 0 10.218 us 1.96% 10.006 us 2.68% -0.212 us -2.07% FAIL
F32 I64 true 2^20 0 15.825 us 1.45% 15.522 us 1.66% -0.303 us -1.91% FAIL
F32 I64 true 2^24 0 98.911 us 1.41% 93.845 us 2.22% -5.066 us -5.12% FAIL
F32 I64 true 2^28 0 1.462 ms 0.77% 1.374 ms 0.94% -87.291 us -5.97% FAIL
F64 I32 false 2^16 1 10.581 us 2.83% 10.723 us 2.77% 0.142 us 1.34% PASS
F64 I32 false 2^20 1 19.096 us 1.80% 19.236 us 1.62% 0.139 us 0.73% PASS
F64 I32 false 2^24 1 169.753 us 1.75% 169.697 us 1.70% -0.056 us -0.03% PASS
F64 I32 false 2^28 1 2.593 ms 0.52% 2.592 ms 0.53% -0.751 us -0.03% PASS
F64 I32 false 2^16 0.544 10.583 us 2.41% 10.308 us 2.65% -0.275 us -2.60% FAIL
F64 I32 false 2^20 0.544 19.019 us 1.51% 18.753 us 1.64% -0.267 us -1.40% PASS
F64 I32 false 2^24 0.544 170.998 us 1.70% 170.708 us 1.73% -0.290 us -0.17% PASS
F64 I32 false 2^28 0.544 2.608 ms 0.50% 2.608 ms 0.54% 0.212 us 0.01% PASS
F64 I32 false 2^16 0 10.658 us 2.95% 10.470 us 2.71% -0.188 us -1.76% PASS
F64 I32 false 2^20 0 19.058 us 1.85% 18.857 us 1.58% -0.201 us -1.05% PASS
F64 I32 false 2^24 0 169.910 us 1.70% 169.646 us 1.72% -0.264 us -0.16% PASS
F64 I32 false 2^28 0 2.593 ms 0.52% 2.593 ms 0.52% -0.551 us -0.02% PASS
F64 I32 true 2^16 1 10.771 us 2.62% 10.572 us 2.72% -0.199 us -1.85% PASS
F64 I32 true 2^20 1 19.078 us 1.50% 18.769 us 1.55% -0.308 us -1.62% FAIL
F64 I32 true 2^24 1 169.477 us 1.75% 169.072 us 1.71% -0.405 us -0.24% PASS
F64 I32 true 2^28 1 2.586 ms 0.48% 2.587 ms 0.45% 0.272 us 0.01% PASS
F64 I32 true 2^16 0.544 10.515 us 3.41% 10.311 us 2.81% -0.204 us -1.94% PASS
F64 I32 true 2^20 0.544 18.892 us 1.54% 18.656 us 1.87% -0.236 us -1.25% PASS
F64 I32 true 2^24 0.544 170.374 us 1.76% 170.163 us 1.74% -0.211 us -0.12% PASS
F64 I32 true 2^28 0.544 2.600 ms 0.45% 2.600 ms 0.45% 0.797 us 0.03% PASS
F64 I32 true 2^16 0 10.679 us 2.62% 10.363 us 2.96% -0.316 us -2.96% FAIL
F64 I32 true 2^20 0 18.968 us 1.76% 18.750 us 1.46% -0.218 us -1.15% PASS
F64 I32 true 2^24 0 169.344 us 1.74% 169.190 us 1.77% -0.154 us -0.09% PASS
F64 I32 true 2^28 0 2.585 ms 0.52% 2.586 ms 0.50% 1.524 us 0.06% PASS
F64 I64 false 2^16 1 10.434 us 2.12% 10.804 us 2.51% 0.370 us 3.54% FAIL
F64 I64 false 2^20 1 20.490 us 2.73% 19.082 us 1.42% -1.408 us -6.87% FAIL
F64 I64 false 2^24 1 182.621 us 1.36% 172.926 us 2.27% -9.695 us -5.31% FAIL
F64 I64 false 2^28 1 2.818 ms 0.40% 2.620 ms 0.69% -198.526 us -7.04% FAIL
F64 I64 false 2^16 0.544 10.428 us 2.13% 10.956 us 2.34% 0.528 us 5.06% FAIL
F64 I64 false 2^20 0.544 20.419 us 2.51% 19.146 us 1.39% -1.273 us -6.23% FAIL
F64 I64 false 2^24 0.544 183.995 us 1.31% 174.516 us 2.25% -9.479 us -5.15% FAIL
F64 I64 false 2^28 0.544 2.843 ms 0.43% 2.637 ms 0.73% -205.520 us -7.23% FAIL
F64 I64 false 2^16 0 10.317 us 2.94% 10.843 us 2.45% 0.526 us 5.10% FAIL
F64 I64 false 2^20 0 20.274 us 2.74% 19.289 us 1.35% -0.985 us -4.86% FAIL
F64 I64 false 2^24 0 182.591 us 1.38% 175.846 us 2.56% -6.744 us -3.69% FAIL
F64 I64 false 2^28 0 2.820 ms 0.45% 2.648 ms 0.86% -172.130 us -6.10% FAIL
F64 I64 true 2^16 1 10.509 us 2.12% 10.445 us 2.86% -0.064 us -0.61% PASS
F64 I64 true 2^20 1 20.485 us 2.76% 18.833 us 1.44% -1.653 us -8.07% FAIL
F64 I64 true 2^24 1 182.408 us 1.39% 169.105 us 1.73% -13.303 us -7.29% FAIL
F64 I64 true 2^28 1 2.816 ms 0.36% 2.585 ms 0.48% -231.566 us -8.22% FAIL
F64 I64 true 2^16 0.544 10.525 us 2.10% 10.383 us 2.70% -0.143 us -1.35% PASS
F64 I64 true 2^20 0.544 20.570 us 2.58% 18.623 us 1.43% -1.947 us -9.46% FAIL
F64 I64 true 2^24 0.544 184.319 us 1.32% 170.135 us 1.74% -14.184 us -7.70% FAIL
F64 I64 true 2^28 0.544 2.847 ms 0.39% 2.598 ms 0.48% -248.409 us -8.73% FAIL
F64 I64 true 2^16 0 10.465 us 1.71% 10.488 us 2.60% 0.023 us 0.22% PASS
F64 I64 true 2^20 0 20.470 us 2.73% 18.858 us 1.50% -1.612 us -7.87% FAIL
F64 I64 true 2^24 0 182.573 us 1.39% 169.220 us 1.75% -13.352 us -7.31% FAIL
F64 I64 true 2^28 0 2.818 ms 0.39% 2.582 ms 0.45% -235.453 us -8.36% FAIL
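A note on reading these tables: the Diff, %Diff and Status columns appear to be derived from the four measured columns (Ref Time, Ref Noise, Cmp Time, Cmp Noise). The sketch below reproduces that relationship. It is an assumption inferred from the rows themselves, not taken from the benchmark tooling's source, and the function name `compare_row` is hypothetical. In particular, the PASS/FAIL rule (relative change within the smaller of the two noise figures) is a guess that happens to match the rows checked.

```python
def compare_row(ref_time_us, ref_noise_pct, cmp_time_us, cmp_noise_pct):
    """Recompute Diff, %Diff and Status for one table row (times in microseconds)."""
    # Diff is the compared time minus the reference time; negative means Cmp is faster.
    diff_us = cmp_time_us - ref_time_us
    # %Diff is Diff relative to the reference time.
    pct_diff = 100.0 * diff_us / ref_time_us
    # Assumed rule: the row passes only if the change is within the smaller noise figure.
    threshold_pct = min(ref_noise_pct, cmp_noise_pct)
    status = "PASS" if abs(pct_diff) <= threshold_pct else "FAIL"
    return diff_us, pct_diff, status


# Row "F64 I64 true 2^24 0" from the table above.
# The table lists -13.352 us, -7.31%, FAIL; the last digit of Diff can differ
# slightly here because the inputs are already rounded.
diff_us, pct_diff, status = compare_row(182.573, 1.39, 169.220, 1.75)
print(f"{diff_us:.3f} us {pct_diff:.2f}% {status}")

# Row "F64 I64 true 2^16 0": a change well inside the noise, listed as PASS.
print(compare_row(10.465, 1.71, 10.488, 2.60))
```

Under this reading, FAIL only means the difference exceeds the measurement noise. A FAIL with a negative %Diff is a speedup of Cmp over Ref, not a regression.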
H100 partition.flagged
T{ct} OffsetT{ct} DistinctPartitions{ct} Elements{io} Entropy Ref Time Ref Noise Cmp Time Cmp Noise Diff %Diff Status
I8 I32 false 2^16 1 9.854 us 1.91% 9.755 us 1.62% -0.099 us -1.00% PASS
I8 I32 false 2^20 1 12.834 us 1.67% 12.852 us 1.71% 0.017 us 0.13% PASS
I8 I32 false 2^24 1 58.800 us 1.39% 58.982 us 1.40% 0.182 us 0.31% PASS
I8 I32 false 2^28 1 763.924 us 0.52% 763.855 us 0.50% -0.069 us -0.01% PASS
I8 I32 false 2^16 0.544 10.116 us 2.06% 10.057 us 2.05% -0.059 us -0.59% PASS
I8 I32 false 2^20 0.544 13.088 us 1.70% 13.073 us 1.65% -0.014 us -0.11% PASS
I8 I32 false 2^24 0.544 61.212 us 1.32% 61.238 us 1.33% 0.026 us 0.04% PASS
I8 I32 false 2^28 0.544 800.936 us 0.50% 800.939 us 0.50% 0.003 us 0.00% PASS
I8 I32 false 2^16 0 9.990 us 2.46% 9.997 us 2.18% 0.007 us 0.07% PASS
I8 I32 false 2^20 0 13.215 us 1.91% 13.133 us 1.99% -0.082 us -0.62% PASS
I8 I32 false 2^24 0 58.963 us 1.34% 58.857 us 1.35% -0.106 us -0.18% PASS
I8 I32 false 2^28 0 763.696 us 0.32% 763.945 us 0.34% 0.248 us 0.03% PASS
I8 I32 true 2^16 1 10.345 us 2.01% 10.406 us 2.00% 0.061 us 0.59% PASS
I8 I32 true 2^20 1 13.693 us 2.11% 13.843 us 2.10% 0.150 us 1.10% PASS
I8 I32 true 2^24 1 60.300 us 1.36% 60.425 us 1.39% 0.126 us 0.21% PASS
I8 I32 true 2^28 1 778.153 us 0.50% 778.201 us 0.50% 0.048 us 0.01% PASS
I8 I32 true 2^16 0.544 10.830 us 2.21% 10.810 us 1.95% -0.020 us -0.19% PASS
I8 I32 true 2^20 0.544 14.043 us 1.93% 13.997 us 2.02% -0.046 us -0.33% PASS
I8 I32 true 2^24 0.544 62.294 us 1.35% 62.212 us 1.38% -0.081 us -0.13% PASS
I8 I32 true 2^28 0.544 810.458 us 0.50% 810.280 us 0.50% -0.178 us -0.02% PASS
I8 I32 true 2^16 0 10.606 us 2.58% 10.670 us 1.75% 0.064 us 0.60% PASS
I8 I32 true 2^20 0 14.001 us 1.90% 13.899 us 2.01% -0.102 us -0.73% PASS
I8 I32 true 2^24 0 60.420 us 1.33% 60.357 us 1.33% -0.062 us -0.10% PASS
I8 I32 true 2^28 0 779.607 us 0.32% 779.429 us 0.32% -0.178 us -0.02% PASS
I8 I64 false 2^16 1 10.011 us 2.27% 11.199 us 2.68% 1.188 us 11.87% FAIL
I8 I64 false 2^20 1 14.551 us 1.38% 14.437 us 1.84% -0.114 us -0.78% PASS
I8 I64 false 2^24 1 75.662 us 0.60% 63.301 us 1.61% -12.361 us -16.34% FAIL
I8 I64 false 2^28 1 1.050 ms 0.32% 821.768 us 0.50% -228.446 us -21.75% FAIL
I8 I64 false 2^16 0.544 9.884 us 1.89% 11.528 us 1.70% 1.644 us 16.63% FAIL
I8 I64 false 2^20 0.544 14.565 us 1.75% 14.706 us 1.63% 0.141 us 0.97% PASS
I8 I64 false 2^24 0.544 76.285 us 0.60% 67.244 us 1.40% -9.041 us -11.85% FAIL
I8 I64 false 2^28 0.544 1.059 ms 0.36% 893.159 us 0.50% -166.148 us -15.68% FAIL
I8 I64 false 2^16 0 9.816 us 2.81% 11.485 us 1.49% 1.669 us 17.00% FAIL
I8 I64 false 2^20 0 14.557 us 1.21% 14.791 us 1.70% 0.234 us 1.60% FAIL
I8 I64 false 2^24 0 75.435 us 0.65% 64.879 us 1.25% -10.556 us -13.99% FAIL
I8 I64 false 2^28 0 1.048 ms 0.13% 853.665 us 0.32% -194.577 us -18.56% FAIL
I8 I64 true 2^16 1 10.078 us 2.41% 10.532 us 3.26% 0.454 us 4.50% FAIL
I8 I64 true 2^20 1 14.577 us 1.91% 13.588 us 2.04% -0.990 us -6.79% FAIL
I8 I64 true 2^24 1 76.142 us 0.64% 59.263 us 1.64% -16.879 us -22.17% FAIL
I8 I64 true 2^28 1 1.065 ms 0.30% 759.411 us 0.57% -305.961 us -28.72% FAIL
I8 I64 true 2^16 0.544 10.156 us 1.66% 10.775 us 1.72% 0.619 us 6.10% FAIL
I8 I64 true 2^20 0.544 14.714 us 1.36% 13.616 us 1.96% -1.098 us -7.46% FAIL
I8 I64 true 2^24 0.544 77.324 us 0.59% 61.829 us 1.58% -15.496 us -20.04% FAIL
I8 I64 true 2^28 0.544 1.084 ms 0.35% 804.412 us 0.55% -279.560 us -25.79% FAIL
I8 I64 true 2^16 0 10.231 us 2.44% 10.747 us 2.20% 0.516 us 5.04% FAIL
I8 I64 true 2^20 0 14.574 us 1.86% 13.591 us 1.99% -0.984 us -6.75% FAIL
I8 I64 true 2^24 0 76.515 us 0.61% 59.660 us 1.61% -16.855 us -22.03% FAIL
I8 I64 true 2^28 0 1.069 ms 0.12% 762.090 us 0.39% -307.296 us -28.74% FAIL
I16 I32 false 2^16 1 11.494 us 1.47% 11.291 us 1.78% -0.203 us -1.77% FAIL
I16 I32 false 2^20 1 14.169 us 1.39% 14.131 us 1.81% -0.038 us -0.27% PASS
I16 I32 false 2^24 1 74.429 us 1.62% 74.216 us 1.65% -0.213 us -0.29% PASS
I16 I32 false 2^28 1 935.754 us 0.87% 935.954 us 0.73% 0.200 us 0.02% PASS
I16 I32 false 2^16 0.544 12.026 us 1.92% 11.794 us 1.55% -0.232 us -1.93% FAIL
I16 I32 false 2^20 0.544 14.733 us 1.30% 14.729 us 1.42% -0.003 us -0.02% PASS
I16 I32 false 2^24 0.544 78.461 us 1.21% 78.503 us 1.21% 0.042 us 0.05% PASS
I16 I32 false 2^28 0.544 1.009 ms 0.68% 1.009 ms 0.68% 0.602 us 0.06% PASS
I16 I32 false 2^16 0 11.636 us 1.71% 11.272 us 2.15% -0.364 us -3.13% FAIL
I16 I32 false 2^20 0 14.301 us 1.37% 14.272 us 1.44% -0.029 us -0.20% PASS
I16 I32 false 2^24 0 74.791 us 1.54% 74.713 us 1.54% -0.078 us -0.10% PASS
I16 I32 false 2^28 0 935.854 us 0.32% 936.479 us 0.34% 0.625 us 0.07% PASS
I16 I32 true 2^16 1 11.854 us 1.52% 12.053 us 1.49% 0.199 us 1.68% FAIL
I16 I32 true 2^20 1 14.528 us 1.16% 14.461 us 2.11% -0.067 us -0.46% PASS
I16 I32 true 2^24 1 75.439 us 1.47% 75.385 us 1.53% -0.054 us -0.07% PASS
I16 I32 true 2^28 1 950.958 us 1.05% 953.112 us 1.19% 2.154 us 0.23% PASS
I16 I32 true 2^16 0.544 12.186 us 1.26% 12.080 us 1.45% -0.106 us -0.87% PASS
I16 I32 true 2^20 0.544 14.946 us 1.52% 14.877 us 1.10% -0.070 us -0.47% PASS
I16 I32 true 2^24 0.544 78.244 us 1.54% 78.378 us 1.52% 0.134 us 0.17% PASS
I16 I32 true 2^28 0.544 989.611 us 0.69% 990.247 us 0.67% 0.636 us 0.06% PASS
I16 I32 true 2^16 0 11.869 us 1.89% 11.847 us 1.79% -0.022 us -0.18% PASS
I16 I32 true 2^20 0 14.718 us 1.35% 14.668 us 1.36% -0.050 us -0.34% PASS
I16 I32 true 2^24 0 75.701 us 1.36% 75.752 us 1.38% 0.051 us 0.07% PASS
I16 I32 true 2^28 0 948.490 us 0.27% 948.488 us 0.28% -0.002 us -0.00% PASS
I16 I64 false 2^16 1 10.068 us 2.00% 12.607 us 1.29% 2.539 us 25.22% FAIL
I16 I64 false 2^20 1 15.425 us 1.45% 15.478 us 1.47% 0.052 us 0.34% PASS
I16 I64 false 2^24 1 82.780 us 0.87% 80.817 us 1.76% -1.963 us -2.37% FAIL
I16 I64 false 2^28 1 1.139 ms 1.27% 990.083 us 1.37% -149.189 us -13.10% FAIL
I16 I64 false 2^16 0.544 10.003 us 1.95% 12.691 us 2.11% 2.688 us 26.87% FAIL
I16 I64 false 2^20 0.544 15.385 us 1.36% 15.597 us 1.65% 0.212 us 1.38% FAIL
I16 I64 false 2^24 0.544 83.452 us 0.85% 83.955 us 1.35% 0.504 us 0.60% PASS
I16 I64 false 2^28 0.544 1.146 ms 0.50% 1.050 ms 0.50% -95.970 us -8.37% FAIL
I16 I64 false 2^16 0 10.077 us 2.10% 12.375 us 1.74% 2.298 us 22.81% FAIL
I16 I64 false 2^20 0 15.425 us 1.47% 15.389 us 1.79% -0.036 us -0.23% PASS
I16 I64 false 2^24 0 82.640 us 0.87% 81.719 us 1.45% -0.920 us -1.11% FAIL
I16 I64 false 2^28 0 1.128 ms 0.19% 1.019 ms 0.18% -108.870 us -9.65% FAIL
I16 I64 true 2^16 1 10.010 us 1.77% 11.661 us 1.67% 1.651 us 16.49% FAIL
I16 I64 true 2^20 1 15.343 us 1.47% 14.602 us 1.44% -0.741 us -4.83% FAIL
I16 I64 true 2^24 1 83.159 us 0.90% 76.019 us 1.32% -7.140 us -8.59% FAIL
I16 I64 true 2^28 1 1.147 ms 1.38% 974.377 us 1.52% -172.722 us -15.06% FAIL
I16 I64 true 2^16 0.544 9.986 us 1.85% 12.093 us 1.68% 2.107 us 21.10% FAIL
I16 I64 true 2^20 0.544 15.390 us 1.68% 14.762 us 1.54% -0.629 us -4.09% FAIL
I16 I64 true 2^24 0.544 84.797 us 0.84% 79.191 us 1.41% -5.606 us -6.61% FAIL
I16 I64 true 2^28 0.544 1.170 ms 0.50% 1.004 ms 0.60% -165.675 us -14.16% FAIL
I16 I64 true 2^16 0 10.080 us 2.13% 11.595 us 1.55% 1.515 us 15.03% FAIL
I16 I64 true 2^20 0 15.239 us 1.25% 14.716 us 1.48% -0.523 us -3.43% FAIL
I16 I64 true 2^24 0 83.088 us 0.94% 76.574 us 1.33% -6.514 us -7.84% FAIL
I16 I64 true 2^28 0 1.137 ms 0.20% 967.365 us 0.22% -169.978 us -14.95% FAIL
I32 I32 false 2^16 1 10.600 us 2.02% 10.482 us 2.42% -0.118 us -1.12% PASS
I32 I32 false 2^20 1 16.651 us 1.88% 16.478 us 1.66% -0.174 us -1.04% PASS
I32 I32 false 2^24 1 104.328 us 0.87% 103.965 us 0.84% -0.363 us -0.35% PASS
I32 I32 false 2^28 1 1.521 ms 0.56% 1.522 ms 0.57% 0.503 us 0.03% PASS
I32 I32 false 2^16 0.544 10.706 us 2.22% 10.952 us 2.10% 0.246 us 2.30% FAIL
I32 I32 false 2^20 0.544 16.469 us 1.62% 16.629 us 1.23% 0.160 us 0.97% PASS
I32 I32 false 2^24 0.544 105.189 us 0.84% 105.345 us 0.85% 0.156 us 0.15% PASS
I32 I32 false 2^28 0.544 1.533 ms 0.55% 1.534 ms 0.59% 0.082 us 0.01% PASS
I32 I32 false 2^16 0 10.709 us 2.28% 10.805 us 2.42% 0.096 us 0.89% PASS
I32 I32 false 2^20 0 16.239 us 1.47% 16.643 us 1.47% 0.403 us 2.48% FAIL
I32 I32 false 2^24 0 103.987 us 0.86% 104.060 us 0.85% 0.073 us 0.07% PASS
I32 I32 false 2^28 0 1.519 ms 0.32% 1.519 ms 0.33% 0.210 us 0.01% PASS
I32 I32 true 2^16 1 11.183 us 1.92% 11.422 us 1.67% 0.239 us 2.14% FAIL
I32 I32 true 2^20 1 16.636 us 1.69% 16.654 us 1.39% 0.017 us 0.10% PASS
I32 I32 true 2^24 1 103.652 us 0.92% 103.984 us 0.90% 0.332 us 0.32% PASS
I32 I32 true 2^28 1 1.517 ms 0.64% 1.518 ms 0.65% 1.262 us 0.08% PASS
I32 I32 true 2^16 0.544 11.192 us 2.08% 11.447 us 2.08% 0.255 us 2.27% FAIL
I32 I32 true 2^20 0.544 16.518 us 1.83% 16.683 us 1.42% 0.165 us 1.00% PASS
I32 I32 true 2^24 0.544 104.116 us 0.93% 104.513 us 0.92% 0.397 us 0.38% PASS
I32 I32 true 2^28 0.544 1.521 ms 0.58% 1.521 ms 0.57% 0.104 us 0.01% PASS
I32 I32 true 2^16 0 11.163 us 2.14% 11.264 us 1.85% 0.101 us 0.91% PASS
I32 I32 true 2^20 0 16.537 us 1.68% 16.811 us 1.47% 0.274 us 1.66% FAIL
I32 I32 true 2^24 0 103.604 us 0.92% 103.850 us 0.92% 0.246 us 0.24% PASS
I32 I32 true 2^28 0 1.513 ms 0.35% 1.512 ms 0.34% -0.219 us -0.01% PASS
I32 I64 false 2^16 1 9.794 us 2.22% 11.917 us 1.69% 2.123 us 21.68% FAIL
I32 I64 false 2^20 1 16.634 us 1.48% 17.128 us 1.62% 0.494 us 2.97% FAIL
I32 I64 false 2^24 1 108.461 us 1.07% 107.269 us 1.25% -1.192 us -1.10% FAIL
I32 I64 false 2^28 1 1.595 ms 0.87% 1.569 ms 0.89% -26.209 us -1.64% FAIL
I32 I64 false 2^16 0.544 10.191 us 2.07% 11.623 us 1.62% 1.432 us 14.05% FAIL
I32 I64 false 2^20 0.544 16.845 us 1.51% 17.332 us 1.45% 0.488 us 2.89% FAIL
I32 I64 false 2^24 0.544 109.433 us 1.10% 108.332 us 1.24% -1.101 us -1.01% PASS
I32 I64 false 2^28 0.544 1.598 ms 0.58% 1.572 ms 0.67% -25.792 us -1.61% FAIL
I32 I64 false 2^16 0 10.303 us 2.27% 12.027 us 2.06% 1.724 us 16.73% FAIL
I32 I64 false 2^20 0 16.714 us 1.29% 17.283 us 1.49% 0.569 us 3.41% FAIL
I32 I64 false 2^24 0 108.157 us 1.07% 108.494 us 1.19% 0.337 us 0.31% PASS
I32 I64 false 2^28 0 1.582 ms 0.37% 1.570 ms 0.44% -12.345 us -0.78% FAIL
I32 I64 true 2^16 1 10.281 us 2.43% 11.276 us 1.93% 0.995 us 9.68% FAIL
I32 I64 true 2^20 1 16.627 us 1.73% 16.696 us 1.72% 0.068 us 0.41% PASS
I32 I64 true 2^24 1 108.543 us 1.10% 104.449 us 0.90% -4.094 us -3.77% FAIL
I32 I64 true 2^28 1 1.600 ms 0.95% 1.530 ms 0.74% -69.635 us -4.35% FAIL
I32 I64 true 2^16 0.544 10.326 us 1.89% 11.163 us 1.87% 0.837 us 8.10% FAIL
I32 I64 true 2^20 0.544 17.010 us 1.30% 16.679 us 1.48% -0.330 us -1.94% FAIL
I32 I64 true 2^24 0.544 110.565 us 1.11% 104.749 us 0.89% -5.815 us -5.26% FAIL
I32 I64 true 2^28 0.544 1.609 ms 0.58% 1.530 ms 0.57% -79.244 us -4.93% FAIL
I32 I64 true 2^16 0 10.338 us 1.87% 11.218 us 2.13% 0.880 us 8.52% FAIL
I32 I64 true 2^20 0 16.671 us 1.54% 16.770 us 1.61% 0.099 us 0.59% PASS
I32 I64 true 2^24 0 108.527 us 1.12% 104.245 us 0.89% -4.281 us -3.94% FAIL
I32 I64 true 2^28 0 1.585 ms 0.37% 1.522 ms 0.35% -63.256 us -3.99% FAIL
I64 I32 false 2^16 1 10.454 us 2.70% 10.088 us 2.00% -0.366 us -3.50% FAIL
I64 I32 false 2^20 1 19.665 us 1.72% 19.361 us 1.44% -0.304 us -1.54% FAIL
I64 I32 false 2^24 1 178.833 us 1.39% 178.542 us 1.42% -0.290 us -0.16% PASS
I64 I32 false 2^28 1 2.743 ms 0.42% 2.744 ms 0.43% 1.163 us 0.04% PASS
I64 I32 false 2^16 0.544 10.588 us 2.36% 10.234 us 2.13% -0.353 us -3.34% FAIL
I64 I32 false 2^20 0.544 20.130 us 1.41% 19.789 us 1.27% -0.341 us -1.70% FAIL
I64 I32 false 2^24 0.544 181.197 us 1.38% 180.996 us 1.38% -0.201 us -0.11% PASS
I64 I32 false 2^28 0.544 2.771 ms 0.54% 2.771 ms 0.53% 0.047 us 0.00% PASS
I64 I32 false 2^16 0 10.468 us 2.01% 10.125 us 2.31% -0.343 us -3.28% FAIL
I64 I32 false 2^20 0 19.733 us 1.48% 19.362 us 1.40% -0.371 us -1.88% FAIL
I64 I32 false 2^24 0 178.861 us 1.41% 178.587 us 1.41% -0.274 us -0.15% PASS
I64 I32 false 2^28 0 2.742 ms 0.41% 2.740 ms 0.40% -1.888 us -0.07% PASS
I64 I32 true 2^16 1 10.345 us 2.54% 10.253 us 2.59% -0.092 us -0.89% PASS
I64 I32 true 2^20 1 19.924 us 2.68% 19.528 us 2.72% -0.396 us -1.99% PASS
I64 I32 true 2^24 1 186.321 us 1.95% 186.079 us 1.98% -0.243 us -0.13% PASS
I64 I32 true 2^28 1 2.879 ms 0.49% 2.879 ms 0.53% -0.550 us -0.02% PASS
I64 I32 true 2^16 0.544 10.320 us 2.24% 10.151 us 2.23% -0.169 us -1.63% PASS
I64 I32 true 2^20 0.544 20.222 us 2.86% 19.925 us 2.80% -0.297 us -1.47% PASS
I64 I32 true 2^24 0.544 189.594 us 1.99% 189.387 us 2.06% -0.207 us -0.11% PASS
I64 I32 true 2^28 0.544 2.923 ms 0.62% 2.923 ms 0.63% 0.388 us 0.01% PASS
I64 I32 true 2^16 0 10.518 us 1.99% 9.994 us 2.12% -0.524 us -4.98% FAIL
I64 I32 true 2^20 0 19.763 us 2.92% 19.513 us 2.74% -0.250 us -1.26% PASS
I64 I32 true 2^24 0 186.644 us 1.99% 186.311 us 2.04% -0.333 us -0.18% PASS
I64 I32 true 2^28 0 2.878 ms 0.56% 2.878 ms 0.58% 0.047 us 0.00% PASS
I64 I64 false 2^16 1 10.562 us 2.71% 10.160 us 2.11% -0.401 us -3.80% FAIL
I64 I64 false 2^20 1 20.832 us 2.02% 19.709 us 3.08% -1.123 us -5.39% FAIL
I64 I64 false 2^24 1 191.044 us 1.38% 185.822 us 2.09% -5.223 us -2.73% FAIL
I64 I64 false 2^28 1 2.943 ms 0.38% 2.873 ms 0.60% -69.563 us -2.36% FAIL
I64 I64 false 2^16 0.544 10.572 us 1.89% 10.228 us 2.08% -0.345 us -3.26% FAIL
I64 I64 false 2^20 0.544 20.979 us 1.97% 20.283 us 2.90% -0.696 us -3.32% FAIL
I64 I64 false 2^24 0.544 192.212 us 1.27% 189.645 us 2.19% -2.567 us -1.34% FAIL
I64 I64 false 2^28 0.544 2.964 ms 0.50% 2.925 ms 0.70% -38.482 us -1.30% FAIL
I64 I64 false 2^16 0 10.753 us 2.21% 10.382 us 1.94% -0.371 us -3.45% FAIL
I64 I64 false 2^20 0 20.626 us 2.13% 19.949 us 2.99% -0.677 us -3.28% FAIL
I64 I64 false 2^24 0 190.871 us 1.33% 187.040 us 2.25% -3.831 us -2.01% FAIL
I64 I64 false 2^28 0 2.938 ms 0.38% 2.887 ms 0.59% -51.282 us -1.75% FAIL
I64 I64 true 2^16 1 10.577 us 2.42% 10.021 us 2.26% -0.556 us -5.26% FAIL
I64 I64 true 2^20 1 20.741 us 1.91% 19.443 us 2.66% -1.299 us -6.26% FAIL
I64 I64 true 2^24 1 190.831 us 1.38% 185.448 us 1.97% -5.383 us -2.82% FAIL
I64 I64 true 2^28 1 2.941 ms 0.39% 2.873 ms 0.56% -68.554 us -2.33% FAIL
I64 I64 true 2^16 0.544 10.694 us 2.00% 10.123 us 2.11% -0.570 us -5.33% FAIL
I64 I64 true 2^20 0.544 21.076 us 2.01% 20.216 us 2.94% -0.860 us -4.08% FAIL
I64 I64 true 2^24 0.544 192.555 us 1.29% 189.550 us 2.00% -3.004 us -1.56% FAIL
I64 I64 true 2^28 0.544 2.969 ms 0.50% 2.921 ms 0.64% -48.292 us -1.63% FAIL
I64 I64 true 2^16 0 10.829 us 1.83% 10.422 us 2.04% -0.407 us -3.76% FAIL
I64 I64 true 2^20 0 20.850 us 2.24% 19.919 us 2.73% -0.930 us -4.46% FAIL
I64 I64 true 2^24 0 190.628 us 1.42% 186.228 us 1.99% -4.400 us -2.31% FAIL
I64 I64 true 2^28 0 2.938 ms 0.39% 2.872 ms 0.56% -66.087 us -2.25% FAIL
I128 I32 false 2^16 1 11.257 us 2.06% 11.210 us 2.05% -0.048 us -0.42% PASS
I128 I32 false 2^20 1 30.127 us 3.67% 29.953 us 3.79% -0.174 us -0.58% PASS
I128 I32 false 2^24 1 338.354 us 0.93% 338.348 us 0.96% -0.005 us -0.00% PASS
I128 I32 false 2^28 1 5.297 ms 0.27% 5.294 ms 0.26% -2.775 us -0.05% PASS
I128 I32 false 2^16 0.544 11.275 us 1.97% 11.168 us 2.43% -0.107 us -0.95% PASS
I128 I32 false 2^20 0.544 30.473 us 3.41% 30.411 us 3.48% -0.062 us -0.20% PASS
I128 I32 false 2^24 0.544 340.951 us 0.96% 340.902 us 0.95% -0.049 us -0.01% PASS
I128 I32 false 2^28 0.544 5.337 ms 0.58% 5.338 ms 0.57% 1.046 us 0.02% PASS
I128 I32 false 2^16 0 11.237 us 2.38% 11.156 us 1.74% -0.080 us -0.72% PASS
I128 I32 false 2^20 0 29.917 us 3.65% 29.965 us 3.61% 0.047 us 0.16% PASS
I128 I32 false 2^24 0 338.292 us 0.96% 338.217 us 0.95% -0.076 us -0.02% PASS
I128 I32 false 2^28 0 5.294 ms 0.24% 5.294 ms 0.25% 0.250 us 0.00% PASS
I128 I32 true 2^16 1 11.346 us 2.14% 11.161 us 2.37% -0.186 us -1.64% PASS
I128 I32 true 2^20 1 30.635 us 3.63% 30.408 us 3.65% -0.227 us -0.74% PASS
I128 I32 true 2^24 1 341.296 us 0.90% 341.339 us 0.88% 0.043 us 0.01% PASS
I128 I32 true 2^28 1 5.335 ms 0.22% 5.338 ms 0.26% 2.443 us 0.05% PASS
I128 I32 true 2^16 0.544 11.445 us 2.10% 11.121 us 2.31% -0.324 us -2.83% FAIL
I128 I32 true 2^20 0.544 31.104 us 3.36% 30.704 us 3.44% -0.400 us -1.28% PASS
I128 I32 true 2^24 0.544 343.894 us 0.91% 343.904 us 0.89% 0.011 us 0.00% PASS
I128 I32 true 2^28 0.544 5.381 ms 0.60% 5.381 ms 0.59% -0.238 us -0.00% PASS
I128 I32 true 2^16 0 11.660 us 2.36% 10.960 us 1.90% -0.700 us -6.00% FAIL
I128 I32 true 2^20 0 30.597 us 3.61% 30.190 us 3.68% -0.407 us -1.33% PASS
I128 I32 true 2^24 0 341.122 us 0.88% 341.111 us 0.90% -0.011 us -0.00% PASS
I128 I32 true 2^28 0 5.340 ms 0.23% 5.341 ms 0.23% 1.147 us 0.02% PASS
I128 I64 false 2^16 1 11.481 us 2.44% 10.887 us 2.01% -0.594 us -5.18% FAIL
I128 I64 false 2^20 1 33.416 us 2.76% 30.518 us 3.21% -2.898 us -8.67% FAIL
I128 I64 false 2^24 1 380.759 us 0.95% 352.947 us 1.24% -27.812 us -7.30% FAIL
I128 I64 false 2^28 1 6.010 ms 0.19% 5.521 ms 0.34% -489.141 us -8.14% FAIL
I128 I64 false 2^16 0.544 11.565 us 1.50% 11.000 us 1.88% -0.565 us -4.88% FAIL
I128 I64 false 2^20 0.544 33.541 us 2.72% 31.010 us 3.21% -2.530 us -7.54% FAIL
I128 I64 false 2^24 0.544 384.123 us 0.97% 357.328 us 1.26% -26.795 us -6.98% FAIL
I128 I64 false 2^28 0.544 6.063 ms 0.55% 5.587 ms 0.59% -475.432 us -7.84% FAIL
I128 I64 false 2^16 0 11.508 us 1.55% 11.036 us 1.85% -0.472 us -4.10% FAIL
I128 I64 false 2^20 0 33.401 us 2.81% 30.606 us 3.21% -2.795 us -8.37% FAIL
I128 I64 false 2^24 0 380.977 us 0.98% 352.595 us 1.26% -28.382 us -7.45% FAIL
I128 I64 false 2^28 0 6.010 ms 0.28% 5.520 ms 0.37% -489.904 us -8.15% FAIL
I128 I64 true 2^16 1 11.604 us 1.77% 10.606 us 2.23% -0.998 us -8.60% FAIL
I128 I64 true 2^20 1 33.314 us 2.79% 30.255 us 3.32% -3.059 us -9.18% FAIL
I128 I64 true 2^24 1 380.748 us 0.94% 356.045 us 1.21% -24.703 us -6.49% FAIL
I128 I64 true 2^28 1 6.017 ms 0.26% 5.584 ms 0.35% -433.088 us -7.20% FAIL
I128 I64 true 2^16 0.544 11.730 us 1.77% 10.875 us 2.01% -0.855 us -7.29% FAIL
I128 I64 true 2^20 0.544 33.565 us 2.67% 30.811 us 3.28% -2.755 us -8.21% FAIL
I128 I64 true 2^24 0.544 384.292 us 0.96% 359.780 us 1.16% -24.513 us -6.38% FAIL
I128 I64 true 2^28 0.544 6.064 ms 0.54% 5.643 ms 0.58% -421.311 us -6.95% FAIL
I128 I64 true 2^16 0 11.958 us 2.12% 10.722 us 2.27% -1.236 us -10.34% FAIL
I128 I64 true 2^20 0 33.516 us 2.76% 30.391 us 3.35% -3.125 us -9.32% FAIL
I128 I64 true 2^24 0 380.786 us 0.94% 356.491 us 1.22% -24.295 us -6.38% FAIL
I128 I64 true 2^28 0 6.014 ms 0.25% 5.581 ms 0.35% -433.475 us -7.21% FAIL
F32 I32 false 2^16 1 11.166 us 2.16% 10.725 us 2.39% -0.441 us -3.95% FAIL
F32 I32 false 2^20 1 16.619 us 1.94% 16.275 us 1.89% -0.344 us -2.07% FAIL
F32 I32 false 2^24 1 104.250 us 0.86% 103.854 us 0.84% -0.396 us -0.38% PASS
F32 I32 false 2^28 1 1.523 ms 0.60% 1.522 ms 0.59% -1.041 us -0.07% PASS
F32 I32 false 2^16 0.544 11.073 us 1.84% 10.619 us 2.11% -0.454 us -4.10% FAIL
F32 I32 false 2^20 0.544 16.944 us 1.79% 16.562 us 1.46% -0.382 us -2.26% FAIL
F32 I32 false 2^24 0.544 105.536 us 0.81% 105.229 us 0.82% -0.308 us -0.29% PASS
F32 I32 false 2^28 0.544 1.533 ms 0.56% 1.534 ms 0.59% 0.078 us 0.01% PASS
F32 I32 false 2^16 0 10.871 us 1.32% 10.288 us 1.86% -0.583 us -5.37% FAIL
F32 I32 false 2^20 0 16.818 us 1.28% 16.745 us 1.71% -0.073 us -0.43% PASS
F32 I32 false 2^24 0 104.293 us 0.85% 104.136 us 0.86% -0.157 us -0.15% PASS
F32 I32 false 2^28 0 1.519 ms 0.34% 1.519 ms 0.33% 0.011 us 0.00% PASS
F32 I32 true 2^16 1 11.317 us 1.97% 11.262 us 2.00% -0.055 us -0.49% PASS
F32 I32 true 2^20 1 16.803 us 1.90% 16.853 us 1.75% 0.050 us 0.30% PASS
F32 I32 true 2^24 1 104.092 us 0.90% 103.998 us 0.91% -0.094 us -0.09% PASS
F32 I32 true 2^28 1 1.519 ms 0.67% 1.517 ms 0.58% -1.564 us -0.10% PASS
F32 I32 true 2^16 0.544 11.219 us 1.55% 11.180 us 1.74% -0.039 us -0.34% PASS
F32 I32 true 2^20 0.544 16.735 us 1.77% 16.752 us 1.52% 0.017 us 0.10% PASS
F32 I32 true 2^24 0.544 104.188 us 0.92% 104.243 us 0.91% 0.055 us 0.05% PASS
F32 I32 true 2^28 0.544 1.521 ms 0.57% 1.521 ms 0.58% -0.365 us -0.02% PASS
F32 I32 true 2^16 0 11.260 us 1.44% 11.201 us 1.46% -0.060 us -0.53% PASS
F32 I32 true 2^20 0 16.729 us 1.77% 16.688 us 1.74% -0.040 us -0.24% PASS
F32 I32 true 2^24 0 103.844 us 0.94% 103.795 us 0.94% -0.050 us -0.05% PASS
F32 I32 true 2^28 0 1.513 ms 0.31% 1.512 ms 0.33% -0.830 us -0.05% PASS
F32 I64 false 2^16 1 10.199 us 2.12% 11.875 us 1.71% 1.676 us 16.43% FAIL
F32 I64 false 2^20 1 16.566 us 1.45% 17.508 us 1.69% 0.942 us 5.69% FAIL
F32 I64 false 2^24 1 108.404 us 1.07% 107.521 us 1.27% -0.883 us -0.81% PASS
F32 I64 false 2^28 1 1.595 ms 0.93% 1.568 ms 0.90% -27.383 us -1.72% FAIL
F32 I64 false 2^16 0.544 10.255 us 1.99% 11.956 us 1.43% 1.701 us 16.59% FAIL
F32 I64 false 2^20 0.544 16.691 us 1.43% 17.532 us 1.33% 0.841 us 5.04% FAIL
F32 I64 false 2^24 0.544 109.537 us 1.08% 108.612 us 1.25% -0.925 us -0.84% PASS
F32 I64 false 2^28 0.544 1.598 ms 0.58% 1.573 ms 0.67% -25.289 us -1.58% FAIL
F32 I64 false 2^16 0 9.914 us 2.12% 11.930 us 1.42% 2.017 us 20.34% FAIL
F32 I64 false 2^20 0 16.656 us 1.50% 17.273 us 1.49% 0.617 us 3.70% FAIL
F32 I64 false 2^24 0 108.053 us 1.12% 108.493 us 1.19% 0.440 us 0.41% PASS
F32 I64 false 2^28 0 1.582 ms 0.39% 1.571 ms 0.46% -11.494 us -0.73% FAIL
F32 I64 true 2^16 1 10.090 us 2.33% 11.270 us 2.34% 1.180 us 11.70% FAIL
F32 I64 true 2^20 1 16.578 us 1.45% 16.998 us 1.70% 0.420 us 2.53% FAIL
F32 I64 true 2^24 1 108.482 us 1.14% 104.695 us 0.90% -3.786 us -3.49% FAIL
F32 I64 true 2^28 1 1.600 ms 1.02% 1.528 ms 0.68% -71.580 us -4.47% FAIL
F32 I64 true 2^16 0.544 10.147 us 2.33% 11.358 us 2.73% 1.210 us 11.93% FAIL
F32 I64 true 2^20 0.544 16.586 us 1.54% 16.783 us 1.73% 0.196 us 1.18% PASS
F32 I64 true 2^24 0.544 110.301 us 1.13% 105.060 us 0.90% -5.241 us -4.75% FAIL
F32 I64 true 2^28 0.544 1.610 ms 0.58% 1.530 ms 0.56% -80.173 us -4.98% FAIL
F32 I64 true 2^16 0 10.124 us 2.46% 11.363 us 1.76% 1.239 us 12.24% FAIL
F32 I64 true 2^20 0 16.458 us 1.34% 16.701 us 1.62% 0.243 us 1.48% FAIL
F32 I64 true 2^24 0 108.204 us 1.10% 104.252 us 0.92% -3.952 us -3.65% FAIL
F32 I64 true 2^28 0 1.586 ms 0.38% 1.521 ms 0.35% -65.005 us -4.10% FAIL
F64 I32 false 2^16 1 10.186 us 2.21% 10.091 us 2.30% -0.095 us -0.93% PASS
F64 I32 false 2^20 1 19.517 us 1.49% 19.385 us 1.54% -0.132 us -0.68% PASS
F64 I32 false 2^24 1 178.546 us 1.38% 178.607 us 1.40% 0.061 us 0.03% PASS
F64 I32 false 2^28 1 2.745 ms 0.41% 2.744 ms 0.39% -1.547 us -0.06% PASS
F64 I32 false 2^16 0.544 10.299 us 2.34% 10.484 us 2.06% 0.185 us 1.80% PASS
F64 I32 false 2^20 0.544 19.792 us 1.31% 20.079 us 1.48% 0.287 us 1.45% FAIL
F64 I32 false 2^24 0.544 181.011 us 1.39% 181.238 us 1.37% 0.228 us 0.13% PASS
F64 I32 false 2^28 0.544 2.771 ms 0.55% 2.771 ms 0.55% 0.287 us 0.01% PASS
F64 I32 false 2^16 0 10.159 us 2.04% 10.475 us 1.86% 0.316 us 3.11% FAIL
F64 I32 false 2^20 0 19.369 us 1.41% 19.553 us 1.75% 0.184 us 0.95% PASS
F64 I32 false 2^24 0 178.653 us 1.41% 178.824 us 1.37% 0.171 us 0.10% PASS
F64 I32 false 2^28 0 2.742 ms 0.43% 2.741 ms 0.39% -1.427 us -0.05% PASS
F64 I32 true 2^16 1 10.469 us 2.48% 10.378 us 2.05% -0.091 us -0.87% PASS
F64 I32 true 2^20 1 19.698 us 2.73% 19.808 us 2.66% 0.109 us 0.55% PASS
F64 I32 true 2^24 1 186.231 us 1.98% 186.256 us 1.92% 0.025 us 0.01% PASS
F64 I32 true 2^28 1 2.880 ms 0.54% 2.879 ms 0.56% -0.622 us -0.02% PASS
F64 I32 true 2^16 0.544 10.349 us 2.41% 10.407 us 2.24% 0.058 us 0.56% PASS
F64 I32 true 2^20 0.544 19.933 us 2.71% 20.243 us 2.74% 0.311 us 1.56% PASS
F64 I32 true 2^24 0.544 189.458 us 2.01% 189.831 us 2.01% 0.372 us 0.20% PASS
F64 I32 true 2^28 0.544 2.923 ms 0.60% 2.925 ms 0.66% 1.358 us 0.05% PASS
F64 I32 true 2^16 0 10.033 us 2.53% 10.322 us 2.12% 0.289 us 2.88% FAIL
F64 I32 true 2^20 0 19.512 us 2.72% 19.708 us 2.85% 0.195 us 1.00% PASS
F64 I32 true 2^24 0 186.373 us 2.01% 186.600 us 1.98% 0.227 us 0.12% PASS
F64 I32 true 2^28 0 2.878 ms 0.53% 2.878 ms 0.56% 0.353 us 0.01% PASS
F64 I64 false 2^16 1 10.188 us 2.01% 10.297 us 2.48% 0.109 us 1.07% PASS
F64 I64 false 2^20 1 20.554 us 1.99% 20.022 us 2.88% -0.532 us -2.59% FAIL
F64 I64 false 2^24 1 190.633 us 1.34% 186.082 us 2.08% -4.551 us -2.39% FAIL
F64 I64 false 2^28 1 2.942 ms 0.38% 2.872 ms 0.62% -70.295 us -2.39% FAIL
F64 I64 false 2^16 0.544 10.345 us 1.81% 10.628 us 1.93% 0.283 us 2.73% FAIL
F64 I64 false 2^20 0.544 20.672 us 1.88% 20.424 us 3.06% -0.248 us -1.20% PASS
F64 I64 false 2^24 0.544 192.202 us 1.23% 190.023 us 2.27% -2.179 us -1.13% PASS
F64 I64 false 2^28 0.544 2.965 ms 0.50% 2.924 ms 0.71% -41.133 us -1.39% FAIL
F64 I64 false 2^16 0 10.161 us 2.00% 10.630 us 2.08% 0.468 us 4.61% FAIL
F64 I64 false 2^20 0 20.488 us 1.94% 20.188 us 2.86% -0.299 us -1.46% PASS
F64 I64 false 2^24 0 190.770 us 1.37% 187.519 us 2.32% -3.250 us -1.70% FAIL
F64 I64 false 2^28 0 2.939 ms 0.37% 2.888 ms 0.60% -51.215 us -1.74% FAIL
F64 I64 true 2^16 1 10.306 us 2.14% 10.351 us 2.22% 0.045 us 0.43% PASS
F64 I64 true 2^20 1 20.460 us 1.91% 19.747 us 2.62% -0.713 us -3.48% FAIL
F64 I64 true 2^24 1 190.798 us 1.43% 185.977 us 1.99% -4.821 us -2.53% FAIL
F64 I64 true 2^28 1 2.942 ms 0.38% 2.873 ms 0.51% -69.472 us -2.36% FAIL
F64 I64 true 2^16 0.544 10.300 us 1.98% 10.310 us 2.08% 0.010 us 0.10% PASS
F64 I64 true 2^20 0.544 20.515 us 1.85% 20.197 us 2.71% -0.319 us -1.55% PASS
F64 I64 true 2^24 0.544 192.522 us 1.28% 189.557 us 2.02% -2.965 us -1.54% FAIL
F64 I64 true 2^28 0.544 2.969 ms 0.50% 2.920 ms 0.60% -48.681 us -1.64% FAIL
F64 I64 true 2^16 0 10.231 us 1.99% 10.404 us 2.11% 0.173 us 1.69% PASS
F64 I64 true 2^20 0 20.790 us 2.20% 19.879 us 2.77% -0.911 us -4.38% FAIL
F64 I64 true 2^24 0 190.585 us 1.42% 186.164 us 2.00% -4.421 us -2.32% FAIL
F64 I64 true 2^28 0 2.938 ms 0.37% 2.873 ms 0.61% -64.492 us -2.20% FAIL
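
A note on reading the status column: the rows above are consistent with a rule where a row is marked FAIL whenever the absolute relative difference exceeds the smaller of the two noise values, regardless of sign. Under that reading, a FAIL with a negative difference means the compared build is faster than the reference by more than the noise, not that it regressed. The sketch below is an inference from the data, not the comparison tool's actual code. The function name `compare` and the column interpretation (reference time, reference noise, compared time, compared noise, difference, relative difference, status) are assumptions.

```python
def compare(ref_time, ref_noise_pct, cmp_time, cmp_noise_pct):
    """Classify one benchmark row.

    Assumed rule (inferred from the table, not from the tool's source):
    a row passes when |relative difference| <= min(ref noise, cmp noise).
    Times must share a unit; noise values are percentages.
    """
    diff = cmp_time - ref_time              # negative means the compared build is faster
    pct_diff = 100.0 * diff / ref_time      # relative difference in percent
    threshold = min(ref_noise_pct, cmp_noise_pct)
    status = "PASS" if abs(pct_diff) <= threshold else "FAIL"
    return diff, pct_diff, status


# Two rows from the table above (times in microseconds).
# I32 I32 true 2^16 0.544: slower by more than the noise.
print(compare(11.192, 2.08, 11.447, 2.08)[2])
# I32 I64 false 2^24 1: faster by more than the smaller noise value.
print(compare(108.461, 1.07, 107.269, 1.25)[2])
```

Because the table prints rounded times, percentages recomputed this way can differ from the printed ones in the last digit.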

@elstehle changed the title from "Enh/streaming selection" to "Adds support for large number of items in DeviceSelect and DevicePartition" on Sep 11, 2024

🟨 CI finished in 3h 40m: Pass: 97%/251 | Total: 2d 01h | Avg: 11m 45s | Max: 1h 29m | Hits: 99%/20079
  • 🟨 cub: Pass: 95%/132 | Total: 1d 11h | Avg: 16m 11s | Max: 1h 29m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/124 | Total:  1d 10h | Avg: 16m 45s | Max:  1h 29m
      🟩 arm64              Pass: 100%/8   | Total: 58m 41s | Avg:  7m 20s | Max:  8m 59s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  5m 04s
      🔍 nvcc               Pass:  95%/130 | Total:  1d 11h | Avg: 16m 22s | Max:  1h 29m
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/59  | Total: 12h 02m | Avg: 12m 15s | Max: 41m 54s
      🟩 GCC                Pass: 100%/64  | Total: 20h 37m | Avg: 19m 19s | Max:  1h 29m
      🟩 Intel              Pass: 100%/3   | Total: 22m 27s | Avg:  7m 29s | Max:  7m 52s
      🔥 MSVC               Pass:   0%/6   | Total:  2h 35m | Avg: 25m 51s | Max: 35m 38s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  93%/99  | Total: 22h 40m | Avg: 13m 44s | Max: 43m 09s
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 36m | Avg: 19m 34s | Max: 22m 55s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 13m | Avg: 16m 38s | Max: 21m 19s
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 46m | Avg: 28m 19s | Max:  1h 29m
      🟩 SmallGMem          Pass: 100%/1   | Total: 45m 28s | Avg: 45m 28s | Max: 45m 28s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 35m | Avg: 26m 53s | Max: 32m 02s
    🟨 ctk
      🟨 11.1               Pass:  93%/15  | Total:  2h 23m | Avg:  9m 33s | Max: 41m 35s
      🟩 11.8               Pass: 100%/3   | Total: 21m 22s | Avg:  7m 07s | Max:  7m 29s
      🟨 12.5               Pass:  95%/114 | Total:  1d 08h | Avg: 17m 18s | Max:  1h 29m
    🟨 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  5m 04s
      🟨 nvcc11.1           Pass:  93%/15  | Total:  2h 23m | Avg:  9m 33s | Max: 41m 35s
      🟩 nvcc11.8           Pass: 100%/3   | Total: 21m 22s | Avg:  7m 07s | Max:  7m 29s
      🟨 nvcc12.5           Pass:  95%/112 | Total:  1d 08h | Avg: 17m 31s | Max:  1h 29m
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 36m 34s | Avg:  6m 05s | Max:  7m 11s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 52s | Avg:  6m 57s | Max:  7m 04s
      🟩 Clang11            Pass: 100%/4   | Total: 24m 55s | Avg:  6m 13s | Max:  6m 26s
      🟩 Clang12            Pass: 100%/4   | Total: 24m 54s | Avg:  6m 13s | Max:  6m 40s
      🟩 Clang13            Pass: 100%/4   | Total: 24m 59s | Avg:  6m 14s | Max:  6m 28s
      🟩 Clang14            Pass: 100%/4   | Total: 24m 43s | Avg:  6m 10s | Max:  6m 16s
      🟩 Clang15            Pass: 100%/4   | Total: 25m 14s | Avg:  6m 18s | Max:  6m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 26m 47s | Avg:  6m 41s | Max:  7m 16s
      🟩 Clang17            Pass: 100%/26  | Total:  8h 33m | Avg: 19m 45s | Max: 41m 54s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  4m 51s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 30m | Avg: 25m 08s | Max: 41m 35s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 56m | Avg: 19m 21s | Max: 35m 28s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 56m | Avg: 19m 22s | Max: 34m 34s
      🟩 GCC10              Pass: 100%/4   | Total: 25m 15s | Avg:  6m 18s | Max:  6m 27s
      🟩 GCC11              Pass: 100%/7   | Total: 46m 15s | Avg:  6m 36s | Max:  7m 29s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 47m | Avg: 26m 55s | Max: 34m 57s
      🟩 GCC13              Pass: 100%/29  | Total: 11h 04m | Avg: 22m 55s | Max:  1h 29m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 22m 27s | Avg:  7m 29s | Max:  7m 52s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 35m 38s | Avg: 35m 38s | Max: 35m 38s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 49m 33s | Avg: 24m 46s | Max: 35m 26s
      🟥 MSVC14.39          Pass:   0%/3   | Total:  1h 10m | Avg: 23m 20s | Max: 35m 25s
    🟨 std
      🟩 11                 Pass: 100%/34  | Total:  8h 52m | Avg: 15m 40s | Max:  1h 29m
      🟨 14                 Pass:  91%/37  | Total: 10h 28m | Avg: 16m 59s | Max: 41m 35s
      🟨 17                 Pass:  94%/37  | Total:  9h 47m | Avg: 15m 53s | Max: 45m 28s
      🟨 20                 Pass:  95%/24  | Total:  6h 27m | Avg: 16m 09s | Max: 43m 09s
    🟨 gpu
      🟨 v100               Pass:  95%/132 | Total:  1d 11h | Avg: 16m 11s | Max:  1h 29m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 21m 22s | Avg:  7m 07s | Max:  7m 29s
      🟩 90a                Pass: 100%/4   | Total: 19m 12s | Avg:  4m 48s | Max:  5m 00s
    
  • 🟩 thrust: Pass: 100%/118 | Total: 13h 19m | Avg: 6m 46s | Max: 25m 42s | Hits: 99%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 12h 45m | Avg:  6m 57s | Max: 25m 42s | Hits:  99%/20079 
      🟩 arm64              Pass: 100%/8   | Total: 34m 16s | Avg:  4m 17s | Max:  4m 54s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 09m | Avg:  4m 36s | Max: 15m 35s | Hits:  99%/2231  
      🟩 11.8               Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  5m 09s
      🟩 12.5               Pass: 100%/100 | Total: 11h 56m | Avg:  7m 09s | Max: 25m 42s | Hits:  99%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  9m 38s | Avg:  4m 49s | Max:  4m 53s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 09m | Avg:  4m 36s | Max: 15m 35s | Hits:  99%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  5m 09s
      🟩 nvcc12.5           Pass: 100%/98  | Total: 11h 46m | Avg:  7m 12s | Max: 25m 42s | Hits:  99%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 38s | Avg:  4m 49s | Max:  4m 53s
      🟩 nvcc               Pass: 100%/116 | Total: 13h 09m | Avg:  6m 48s | Max: 25m 42s | Hits:  99%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 28m 38s | Avg:  4m 46s | Max:  5m 43s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 21s | Avg:  5m 47s | Max:  6m 09s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 37s | Avg:  4m 39s | Max:  4m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 34s | Avg:  4m 38s | Max:  5m 07s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  4m 57s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 17s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 06s | Avg:  4m 46s | Max:  5m 15s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 12s | Avg:  4m 33s | Max:  4m 49s
      🟩 Clang17            Pass: 100%/18  | Total:  2h 10m | Avg:  7m 15s | Max: 15m 00s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  3m 52s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 22s | Avg:  4m 03s | Max:  4m 55s
      🟩 GCC8               Pass: 100%/6   | Total: 24m 31s | Avg:  4m 05s | Max:  4m 49s
      🟩 GCC9               Pass: 100%/6   | Total: 25m 42s | Avg:  4m 17s | Max:  5m 08s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 01s | Avg:  4m 45s | Max:  4m 58s
      🟩 GCC11              Pass: 100%/7   | Total: 33m 32s | Avg:  4m 47s | Max:  5m 25s
      🟩 GCC12              Pass: 100%/4   | Total: 41m 21s | Avg: 10m 20s | Max: 25m 42s
      🟩 GCC13              Pass: 100%/20  | Total:  2h 31m | Avg:  7m 34s | Max: 16m 29s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 28s | Avg:  6m 09s | Max:  6m 25s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 35s | Avg: 15m 35s | Max: 15m 35s | Hits:  99%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 33m 55s | Avg: 16m 57s | Max: 17m 31s | Hits:  99%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 53m | Avg: 18m 57s | Max: 23m 07s | Hits:  99%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 50m | Avg:  5m 41s | Max: 15m 00s
      🟩 GCC                Pass: 100%/55  | Total:  5h 27m | Avg:  5m 57s | Max: 25m 42s
      🟩 Intel              Pass: 100%/3   | Total: 18m 28s | Avg:  6m 09s | Max:  6m 25s
      🟩 MSVC               Pass: 100%/9   | Total:  2h 43m | Avg: 18m 08s | Max: 23m 07s | Hits:  99%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 13h 19m | Avg:  6m 46s | Max: 25m 42s | Hits:  99%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  9h 30m | Avg:  5m 45s | Max: 25m 42s | Hits:  99%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 02m | Avg: 11m 07s | Max: 23m 07s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 46m | Avg: 13m 19s | Max: 15m 00s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 58s | Avg:  4m 39s | Max:  5m 09s
      🟩 90a                Pass: 100%/4   | Total: 15m 59s | Avg:  3m 59s | Max:  4m 08s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 51m | Avg:  5m 43s | Max: 25m 42s
      🟩 14                 Pass: 100%/34  | Total:  3h 50m | Avg:  6m 47s | Max: 19m 24s | Hits:  99%/8924  
      🟩 17                 Pass: 100%/33  | Total:  3h 48m | Avg:  6m 56s | Max: 22m 25s | Hits:  99%/6693  
      🟩 20                 Pass: 100%/21  | Total:  2h 47m | Avg:  7m 59s | Max: 23m 07s | Hits:  99%/4462  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
    

👃 Inspect Changes

Modifications in project?

      Project
      CCCL Infrastructure
      libcu++
+/-   CUB
      Thrust
      CUDA Experimental
      pycuda
      CUDA C Core Library

Modifications in project or dependencies?

      Project
      CCCL Infrastructure
      libcu++
+/-   CUB
+/-   Thrust
      CUDA Experimental
+/-   pycuda
+/-   CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

  #   Runner
178   linux-amd64-cpu16
 42   linux-amd64-gpu-v100-latest-1
 16   linux-arm64-cpu16
 15   windows-amd64-cpu16


🟩 CI finished in 4h 03m: Pass: 100%/251 | Total: 5d 11h | Avg: 31m 24s | Max: 1h 05m | Hits: 86%/24441
  • 🟩 cub: Pass: 100%/132 | Total: 3d 08h | Avg: 36m 38s | Max: 1h 05m | Hits: 85%/4362

    🟩 cpu
      🟩 amd64              Pass: 100%/124 | Total:  3d 02h | Avg: 35m 59s | Max:  1h 05m | Hits:  85%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  6h 14m | Avg: 46m 51s | Max: 49m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 07m | Avg: 36m 31s | Max: 43m 57s | Hits:  83%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 56m | Avg: 58m 41s | Max:  1h 05m
      🟩 12.5               Pass: 100%/114 | Total:  2d 20h | Avg: 36m 04s | Max:  1h 04m | Hits:  86%/3635  
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 28m 28s | Avg: 14m 14s | Max: 14m 36s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 07m | Avg: 36m 31s | Max: 43m 57s | Hits:  83%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 56m | Avg: 58m 41s | Max:  1h 05m
      🟩 nvcc12.5           Pass: 100%/112 | Total:  2d 20h | Avg: 36m 28s | Max:  1h 04m | Hits:  86%/3635  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 28m 28s | Avg: 14m 14s | Max: 14m 36s
      🟩 nvcc               Pass: 100%/130 | Total:  3d 08h | Avg: 36m 59s | Max:  1h 05m | Hits:  85%/4362  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 59m | Avg: 39m 54s | Max: 44m 35s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 07m | Avg: 42m 37s | Max: 47m 23s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 43m | Avg: 40m 58s | Max: 44m 17s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 51s | Max: 40m 57s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 38m | Avg: 39m 43s | Max: 39m 53s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 48m | Avg: 42m 11s | Max: 45m 17s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 40m | Avg: 40m 12s | Max: 41m 12s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 46m | Avg: 41m 31s | Max: 45m 11s
      🟩 Clang17            Pass: 100%/26  | Total: 12h 32m | Avg: 28m 56s | Max: 52m 10s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 15m | Avg: 37m 36s | Max: 38m 08s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 59m | Avg: 39m 52s | Max: 44m 35s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 41m | Avg: 36m 56s | Max: 40m 07s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 53m | Avg: 38m 56s | Max: 44m 30s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 54m | Avg: 43m 38s | Max: 46m 05s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 37m | Avg: 48m 14s | Max:  1h 05m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 40m | Avg: 40m 10s | Max: 40m 45s
      🟩 GCC13              Pass: 100%/29  | Total: 13h 49m | Avg: 28m 36s | Max: 49m 42s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 07m | Avg: 42m 29s | Max: 43m 50s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 43m 57s | Avg: 43m 57s | Max: 43m 57s | Hits:  83%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 27s | Max:  1h 04m | Hits:  86%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 57m | Avg: 59m 02s | Max:  1h 03m | Hits:  86%/2181  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 10h | Avg: 35m 32s | Max: 52m 10s
      🟩 GCC                Pass: 100%/64  | Total:  1d 13h | Avg: 35m 30s | Max:  1h 05m
      🟩 Intel              Pass: 100%/3   | Total:  2h 07m | Avg: 42m 29s | Max: 43m 50s
      🟩 MSVC               Pass: 100%/6   | Total:  5h 39m | Avg: 56m 39s | Max:  1h 04m | Hits:  85%/4362  
    🟩 gpu
      🟩 v100               Pass: 100%/132 | Total:  3d 08h | Avg: 36m 38s | Max:  1h 05m | Hits:  85%/4362  
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 19h | Avg: 40m 58s | Max:  1h 05m | Hits:  85%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 59m | Avg: 22m 29s | Max: 41m 48s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 23m | Avg: 17m 55s | Max: 22m 56s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 43m | Avg: 20m 26s | Max: 27m 06s
      🟩 SmallGMem          Pass: 100%/1   | Total: 45m 10s | Avg: 45m 10s | Max: 45m 10s
      🟩 TestGPU            Pass: 100%/8   | Total:  4h 09m | Avg: 31m 07s | Max: 52m 10s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 56m | Avg: 58m 41s | Max:  1h 05m
      🟩 90a                Pass: 100%/4   | Total:  1h 11m | Avg: 17m 45s | Max: 18m 15s
    🟩 std
      🟩 11                 Pass: 100%/34  | Total: 20h 58m | Avg: 37m 00s | Max:  1h 05m
      🟩 14                 Pass: 100%/37  | Total: 23h 05m | Avg: 37m 26s | Max:  1h 04m | Hits:  83%/2181  
      🟩 17                 Pass: 100%/37  | Total: 23h 16m | Avg: 37m 44s | Max:  1h 03m | Hits:  88%/1454  
      🟩 20                 Pass: 100%/24  | Total: 13h 16m | Avg: 33m 12s | Max: 55m 49s | Hits:  88%/727   
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 02h | Avg: 25m 41s | Max: 56m 57s | Hits: 86%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 22h | Avg: 25m 37s | Max: 56m 57s | Hits:  86%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  3h 33m | Avg: 26m 42s | Max: 31m 43s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 17m | Avg: 25m 11s | Max: 47m 47s | Hits:  80%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  1h 44m | Avg: 34m 41s | Max: 36m 36s
      🟩 12.5               Pass: 100%/100 | Total:  1d 18h | Avg: 25m 30s | Max: 56m 57s | Hits:  87%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 52m 00s | Avg: 26m 00s | Max: 28m 02s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 17m | Avg: 25m 11s | Max: 47m 47s | Hits:  80%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 44m | Avg: 34m 41s | Max: 36m 36s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 17h | Avg: 25m 29s | Max: 56m 57s | Hits:  87%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 00s | Avg: 26m 00s | Max: 28m 02s
      🟩 nvcc               Pass: 100%/116 | Total:  2d 01h | Avg: 25m 41s | Max: 56m 57s | Hits:  86%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 34m | Avg: 25m 47s | Max: 31m 55s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 20m | Avg: 26m 49s | Max: 29m 56s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 04s | Max: 30m 25s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 42m | Avg: 25m 31s | Max: 27m 18s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 41m | Avg: 25m 24s | Max: 27m 36s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 47m | Avg: 26m 49s | Max: 30m 45s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 42m | Avg: 25m 43s | Max: 27m 58s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 20s | Max: 29m 52s
      🟩 Clang17            Pass: 100%/18  | Total:  5h 43m | Avg: 19m 03s | Max: 29m 45s
      🟩 GCC6               Pass: 100%/2   | Total: 45m 41s | Avg: 22m 50s | Max: 24m 23s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 26m | Avg: 24m 21s | Max: 29m 12s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 31m | Avg: 25m 15s | Max: 31m 36s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 42m | Avg: 27m 00s | Max: 31m 48s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 49m | Avg: 27m 29s | Max: 30m 21s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 38m | Avg: 31m 17s | Max: 36m 36s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 53m | Avg: 28m 23s | Max: 30m 39s
      🟩 GCC13              Pass: 100%/20  | Total:  6h 32m | Avg: 19m 38s | Max: 32m 04s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 41m | Avg: 33m 56s | Max: 37m 47s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 47m 47s | Avg: 47m 47s | Max: 47m 47s | Hits:  80%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 13s | Max: 56m 57s | Hits:  80%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 45m | Avg: 37m 38s | Max: 56m 56s | Hits:  89%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 20h 05m | Avg: 23m 38s | Max: 31m 55s
      🟩 GCC                Pass: 100%/55  | Total: 22h 20m | Avg: 24m 22s | Max: 36m 36s
      🟩 Intel              Pass: 100%/3   | Total:  1h 41m | Avg: 33m 56s | Max: 37m 47s
      🟩 MSVC               Pass: 100%/9   | Total:  6h 24m | Avg: 42m 40s | Max: 56m 57s | Hits:  86%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 02h | Avg: 25m 41s | Max: 56m 57s | Hits:  86%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  1d 22h | Avg: 28m 06s | Max: 56m 57s | Hits:  80%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 59m | Avg: 10m 51s | Max: 20m 58s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 10m | Avg: 16m 17s | Max: 32m 04s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 44m | Avg: 34m 41s | Max: 36m 36s
      🟩 90a                Pass: 100%/4   | Total:  1h 02m | Avg: 15m 32s | Max: 17m 21s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 10h 22m | Avg: 20m 45s | Max: 30m 58s
      🟩 14                 Pass: 100%/34  | Total: 15h 31m | Avg: 27m 23s | Max: 56m 57s | Hits:  85%/8924  
      🟩 17                 Pass: 100%/33  | Total: 15h 21m | Avg: 27m 55s | Max: 55m 17s | Hits:  86%/6693  
      🟩 20                 Pass: 100%/21  | Total:  9h 17m | Avg: 26m 31s | Max: 56m 56s | Hits:  89%/4462  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

# Runner
178 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

🟨 CI finished in 5h 40m: Pass: 98%/251 | Total: 5d 12h | Avg: 31m 46s | Max: 1h 19m | Hits: 84%/22260
  • 🟨 cub: Pass: 96%/132 | Total: 3d 13h | Avg: 38m 48s | Max: 1h 19m | Hits: 72%/2181

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/124 | Total:  3d 06h | Avg: 38m 11s | Max:  1h 19m | Hits:  72%/2181  
      🟩 arm64              Pass: 100%/8   | Total:  6h 28m | Avg: 48m 36s | Max: 50m 39s
    🔍 ctk: 12.5 🔍
      🟩 11.1               Pass: 100%/15  | Total:  9h 40m | Avg: 38m 41s | Max: 42m 55s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 19m
      🔍 12.5               Pass:  96%/114 | Total:  2d 23h | Avg: 37m 48s | Max:  1h 00m | Hits:  59%/1454  
    🔍 cudacxx: nvcc12.5 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 29m 58s | Avg: 14m 59s | Max: 15m 35s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 40m | Avg: 38m 41s | Max: 42m 55s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 19m
      🔍 nvcc12.5           Pass:  96%/112 | Total:  2d 23h | Avg: 38m 12s | Max:  1h 00m | Hits:  59%/1454  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 29m 58s | Avg: 14m 59s | Max: 15m 35s
      🔍 nvcc               Pass:  96%/130 | Total:  3d 12h | Avg: 39m 10s | Max:  1h 19m | Hits:  72%/2181  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 35m | Avg: 45m 55s | Max: 54m 13s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 24m | Avg: 48m 03s | Max: 48m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 41s | Max: 55m 31s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 17m | Avg: 49m 17s | Max: 51m 13s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 16m | Avg: 49m 00s | Max: 50m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 19m | Avg: 49m 52s | Max: 52m 45s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 11m | Avg: 47m 53s | Max: 48m 28s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 11m | Avg: 47m 45s | Max: 47m 55s
      🟩 Clang17            Pass: 100%/26  | Total: 12h 58m | Avg: 29m 56s | Max: 50m 39s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 20m | Avg: 40m 14s | Max: 41m 32s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 58m | Avg: 39m 45s | Max: 40m 25s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 04m | Avg: 40m 42s | Max: 43m 29s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 04m | Avg: 40m 45s | Max: 42m 19s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 40m | Avg: 40m 00s | Max: 40m 31s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 39m | Avg: 57m 04s | Max:  1h 19m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 44m | Avg: 41m 06s | Max: 42m 56s
      🟨 GCC13              Pass:  96%/29  | Total: 12h 27m | Avg: 25m 46s | Max:  1h 00m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 44m | Avg: 54m 54s | Max: 56m 06s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 36s | Avg: 15m 36s | Max: 15m 36s | Hits:  99%/727   
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 53m | Avg: 56m 36s | Max: 59m 54s | Hits:  59%/727   
      🟨 MSVC14.39          Pass:  33%/3   | Total:  2h 50m | Avg: 56m 56s | Max:  1h 00m | Hits:  59%/727   
    🟨 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 15h | Avg: 40m 20s | Max: 55m 31s
      🟨 GCC                Pass:  98%/64  | Total:  1d 13h | Avg: 35m 36s | Max:  1h 19m
      🟩 Intel              Pass: 100%/3   | Total:  2h 44m | Avg: 54m 54s | Max: 56m 06s
      🟨 MSVC               Pass:  50%/6   | Total:  4h 59m | Avg: 49m 56s | Max:  1h 00m | Hits:  72%/2181  
    🟨 jobs
      🟨 Build              Pass:  96%/99  | Total:  3d 01h | Avg: 44m 17s | Max:  1h 19m | Hits:  72%/2181  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 48m | Avg: 21m 04s | Max: 33m 46s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 23m | Avg: 17m 53s | Max: 29m 07s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 39m | Avg: 19m 53s | Max: 30m 41s
      🟥 SmallGMem          Pass:   0%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 28m | Avg: 26m 00s | Max: 37m 23s
    🟨 std
      🟩 11                 Pass: 100%/34  | Total: 21h 42m | Avg: 38m 18s | Max:  1h 14m
      🟩 14                 Pass: 100%/37  | Total:  1d 00h | Avg: 39m 11s | Max:  1h 19m | Hits:  72%/2181  
      🟨 17                 Pass:  91%/37  | Total:  1d 00h | Avg: 40m 09s | Max:  1h 19m
      🟨 20                 Pass:  95%/24  | Total: 14h 45m | Avg: 36m 52s | Max: 55m 15s
    🟨 gpu
      🟨 v100               Pass:  96%/132 | Total:  3d 13h | Avg: 38m 48s | Max:  1h 19m | Hits:  72%/2181  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 19m
      🟩 90a                Pass: 100%/4   | Total: 14m 50s | Avg:  3m 42s | Max:  3m 57s
    
  • 🟩 thrust: Pass: 100%/118 | Total: 1d 23h | Avg: 24m 02s | Max: 1h 07m | Hits: 85%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 19h | Avg: 23m 46s | Max:  1h 07m | Hits:  85%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  3h 41m | Avg: 27m 41s | Max: 31m 34s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 12m | Avg:  4m 48s | Max: 19m 40s | Hits:  99%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  1h 51m | Avg: 37m 09s | Max: 40m 08s
      🟩 12.5               Pass: 100%/100 | Total:  1d 20h | Avg: 26m 31s | Max:  1h 07m | Hits:  83%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 51m 03s | Avg: 25m 31s | Max: 26m 50s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 12m | Avg:  4m 48s | Max: 19m 40s | Hits:  99%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 51m | Avg: 37m 09s | Max: 40m 08s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 19h | Avg: 26m 32s | Max:  1h 07m | Hits:  83%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 03s | Avg: 25m 31s | Max: 26m 50s
      🟩 nvcc               Pass: 100%/116 | Total:  1d 22h | Avg: 24m 00s | Max:  1h 07m | Hits:  85%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 37m | Avg: 16m 10s | Max: 31m 23s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 24m | Avg: 28m 02s | Max: 30m 03s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 15s | Max: 30m 03s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 36s | Max: 30m 26s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 57m | Avg: 29m 28s | Max: 33m 01s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 14s | Max: 28m 42s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 44s | Max: 30m 14s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 21s | Max: 30m 12s
      🟩 Clang17            Pass: 100%/18  | Total:  5h 52m | Avg: 19m 36s | Max: 30m 40s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 18s | Avg:  3m 39s | Max:  3m 42s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 32m | Avg: 15m 26s | Max: 29m 30s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 32m | Avg: 15m 25s | Max: 29m 35s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 40m | Avg: 16m 48s | Max: 34m 14s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 00m | Avg: 30m 04s | Max: 33m 54s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 57m | Avg: 33m 51s | Max: 40m 08s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 05m | Avg: 31m 21s | Max: 35m 53s
      🟩 GCC13              Pass: 100%/20  | Total:  6h 08m | Avg: 18m 24s | Max: 38m 26s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 43m | Avg: 34m 37s | Max: 40m 18s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 40s | Avg: 19m 40s | Max: 19m 40s | Hits:  99%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 34s | Max: 55m 03s | Hits:  74%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 18m | Avg: 43m 05s | Max:  1h 07m | Hits:  86%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 20h 00m | Avg: 23m 32s | Max: 33m 01s
      🟩 GCC                Pass: 100%/55  | Total: 19h 04m | Avg: 20m 48s | Max: 40m 08s
      🟩 Intel              Pass: 100%/3   | Total:  1h 43m | Avg: 34m 37s | Max: 40m 18s
      🟩 MSVC               Pass: 100%/9   | Total:  6h 27m | Avg: 43m 02s | Max:  1h 07m | Hits:  85%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  1d 23h | Avg: 24m 02s | Max:  1h 07m | Hits:  85%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  1d 18h | Avg: 25m 53s | Max:  1h 07m | Hits:  78%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 33m | Avg: 13m 56s | Max: 38m 26s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 00m | Avg: 15m 01s | Max: 17m 31s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 51m | Avg: 37m 09s | Max: 40m 08s
      🟩 90a                Pass: 100%/4   | Total: 15m 35s | Avg:  3m 53s | Max:  4m 04s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  9h 17m | Avg: 18m 34s | Max: 33m 34s
      🟩 14                 Pass: 100%/34  | Total: 13h 48m | Avg: 24m 21s | Max:  1h 07m | Hits:  86%/8924  
      🟩 17                 Pass: 100%/33  | Total: 14h 22m | Avg: 26m 07s | Max:  1h 04m | Hits:  82%/6693  
      🟩 20                 Pass: 100%/21  | Total:  9h 48m | Avg: 28m 02s | Max:  1h 00m | Hits:  86%/4462  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 28s | Avg: 16m 28s | Max: 16m 28s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

# Runner
178 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

🟩 CI finished in 3h 33m: Pass: 100%/251 | Total: 6d 10h | Avg: 36m 57s | Max: 1h 41m | Hits: 78%/24441
  • 🟩 cub: Pass: 100%/132 | Total: 4d 05h | Avg: 46m 01s | Max: 1h 41m | Hits: 59%/4362

    🟩 cpu
      🟩 amd64              Pass: 100%/124 | Total:  3d 21h | Avg: 45m 25s | Max:  1h 41m | Hits:  59%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  7h 22m | Avg: 55m 20s | Max:  1h 01m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 16h 30m | Avg:  1h 06m | Max:  1h 12m | Hits:  59%/727   
      🟩 11.8               Pass: 100%/3   | Total:  4h 53m | Avg:  1h 37m | Max:  1h 41m
      🟩 12.5               Pass: 100%/114 | Total:  3d 07h | Avg: 42m 01s | Max:  1h 22m | Hits:  59%/3635  
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 48m 55s | Avg: 24m 27s | Max: 24m 43s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 16h 30m | Avg:  1h 06m | Max:  1h 12m | Hits:  59%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  4h 53m | Avg:  1h 37m | Max:  1h 41m
      🟩 nvcc12.5           Pass: 100%/112 | Total:  3d 07h | Avg: 42m 20s | Max:  1h 22m | Hits:  59%/3635  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 55s | Avg: 24m 27s | Max: 24m 43s
      🟩 nvcc               Pass: 100%/130 | Total:  4d 04h | Avg: 46m 21s | Max:  1h 41m | Hits:  59%/4362  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 48m | Avg: 58m 06s | Max:  1h 10m
      🟩 Clang10            Pass: 100%/3   | Total:  2h 36m | Avg: 52m 13s | Max: 54m 41s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 47s | Max: 56m 57s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 22s | Max: 56m 01s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 33s | Max: 55m 47s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 11s | Max: 56m 30s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 33s | Max: 53m 44s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 09s | Max: 56m 25s
      🟩 Clang17            Pass: 100%/26  | Total: 13h 03m | Avg: 30m 08s | Max:  1h 01m
      🟩 GCC6               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m
      🟩 GCC7               Pass: 100%/6   | Total:  5h 46m | Avg: 57m 47s | Max:  1h 09m
      🟩 GCC8               Pass: 100%/6   | Total:  5h 56m | Avg: 59m 25s | Max:  1h 10m
      🟩 GCC9               Pass: 100%/6   | Total:  6h 01m | Avg:  1h 00m | Max:  1h 12m
      🟩 GCC10              Pass: 100%/4   | Total:  3h 27m | Avg: 51m 45s | Max: 54m 13s
      🟩 GCC11              Pass: 100%/7   | Total:  8h 17m | Avg:  1h 11m | Max:  1h 41m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 26m | Avg: 51m 44s | Max: 54m 09s
      🟩 GCC13              Pass: 100%/29  | Total: 15h 12m | Avg: 31m 27s | Max:  1h 22m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 39m | Avg: 53m 08s | Max: 53m 32s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 29s | Avg: 55m 29s | Max: 55m 29s | Hits:  59%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m | Hits:  59%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 04m | Avg:  1h 01m | Max:  1h 02m | Hits:  59%/2181  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 18h | Avg: 42m 50s | Max:  1h 10m
      🟩 GCC                Pass: 100%/64  | Total:  2d 02h | Avg: 47m 08s | Max:  1h 41m
      🟩 Intel              Pass: 100%/3   | Total:  2h 39m | Avg: 53m 08s | Max: 53m 32s
      🟩 MSVC               Pass: 100%/6   | Total:  6h 10m | Avg:  1h 01m | Max:  1h 09m | Hits:  59%/4362  
    🟩 gpu
      🟩 v100               Pass: 100%/132 | Total:  4d 05h | Avg: 46m 01s | Max:  1h 41m | Hits:  59%/4362  
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  3d 17h | Avg: 54m 14s | Max:  1h 41m | Hits:  59%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 22m | Avg: 17m 49s | Max: 20m 32s
      🟩 GraphCapture       Pass: 100%/8   | Total:  3h 09m | Avg: 23m 42s | Max:  1h 22m
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 15m | Avg: 16m 58s | Max: 18m 09s
      🟩 SmallGMem          Pass: 100%/1   | Total: 33m 21s | Avg: 33m 21s | Max: 33m 21s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 22m | Avg: 25m 22s | Max: 30m 58s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  4h 53m | Avg:  1h 37m | Max:  1h 41m
      🟩 90a                Pass: 100%/4   | Total:  1h 28m | Avg: 22m 13s | Max: 23m 14s
    🟩 std
      🟩 11                 Pass: 100%/34  | Total:  1d 02h | Avg: 46m 33s | Max:  1h 41m
      🟩 14                 Pass: 100%/37  | Total:  1d 04h | Avg: 46m 49s | Max:  1h 36m | Hits:  59%/2181  
      🟩 17                 Pass: 100%/37  | Total:  1d 05h | Avg: 48m 13s | Max:  1h 35m | Hits:  59%/1454  
      🟩 20                 Pass: 100%/24  | Total: 16h 14m | Avg: 40m 36s | Max:  1h 02m | Hits:  59%/727   
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 05h | Avg: 26m 59s | Max: 1h 08m | Hits: 82%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  2d 01h | Avg: 26m 56s | Max:  1h 08m | Hits:  82%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  3h 41m | Avg: 27m 42s | Max: 31m 23s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 51m | Avg: 27m 27s | Max: 51m 06s | Hits:  74%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  1h 55m | Avg: 38m 20s | Max: 43m 17s
      🟩 12.5               Pass: 100%/100 | Total:  1d 20h | Avg: 26m 35s | Max:  1h 08m | Hits:  83%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 56m 04s | Avg: 28m 02s | Max: 29m 37s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 51m | Avg: 27m 27s | Max: 51m 06s | Hits:  74%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 55m | Avg: 38m 20s | Max: 43m 17s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 19h | Avg: 26m 33s | Max:  1h 08m | Hits:  83%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 04s | Avg: 28m 02s | Max: 29m 37s
      🟩 nvcc               Pass: 100%/116 | Total:  2d 04h | Avg: 26m 58s | Max:  1h 08m | Hits:  82%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 38m | Avg: 26m 25s | Max: 29m 47s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 24m | Avg: 28m 05s | Max: 30m 13s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 51m | Avg: 27m 46s | Max: 29m 29s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 24s | Max: 30m 23s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 20s | Max: 29m 16s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 53m | Avg: 28m 26s | Max: 31m 56s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 56m | Avg: 29m 13s | Max: 33m 45s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 07s | Max: 29m 53s
      🟩 Clang17            Pass: 100%/18  | Total:  5h 53m | Avg: 19m 38s | Max: 30m 27s
      🟩 GCC6               Pass: 100%/2   | Total: 50m 57s | Avg: 25m 28s | Max: 28m 15s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 38m | Avg: 26m 28s | Max: 30m 08s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 48m | Avg: 28m 00s | Max: 31m 02s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 44m | Avg: 27m 25s | Max: 34m 42s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 55m | Avg: 28m 59s | Max: 31m 52s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 53m | Avg: 33m 17s | Max: 43m 17s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 05m | Avg: 31m 23s | Max: 35m 57s
      🟩 GCC13              Pass: 100%/20  | Total:  6h 23m | Avg: 19m 11s | Max: 31m 23s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 45m | Avg: 35m 14s | Max: 40m 35s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 06s | Avg: 51m 06s | Max: 51m 06s | Hits:  74%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 52m | Avg: 56m 04s | Max: 56m 20s | Hits:  74%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  4h 10m | Avg: 41m 44s | Max:  1h 08m | Hits:  86%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 21h 05m | Avg: 24m 48s | Max: 33m 45s
      🟩 GCC                Pass: 100%/55  | Total: 23h 20m | Avg: 25m 28s | Max: 43m 17s
      🟩 Intel              Pass: 100%/3   | Total:  1h 45m | Avg: 35m 14s | Max: 40m 35s
      🟩 MSVC               Pass: 100%/9   | Total:  6h 53m | Avg: 45m 58s | Max:  1h 08m | Hits:  82%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 05h | Avg: 26m 59s | Max:  1h 08m | Hits:  82%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 01h | Avg: 29m 49s | Max:  1h 08m | Hits:  74%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 05m | Avg: 11m 25s | Max: 23m 53s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 47m | Avg: 13m 26s | Max: 15m 52s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 55m | Avg: 38m 20s | Max: 43m 17s
      🟩 90a                Pass: 100%/4   | Total:  1h 07m | Avg: 16m 59s | Max: 18m 15s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 10h 57m | Avg: 21m 54s | Max: 30m 35s
      🟩 14                 Pass: 100%/34  | Total: 16h 13m | Avg: 28m 37s | Max: 55m 49s | Hits:  80%/8924  
      🟩 17                 Pass: 100%/33  | Total: 16h 11m | Avg: 29m 26s | Max: 59m 01s | Hits:  82%/6693  
      🟩 20                 Pass: 100%/21  | Total:  9h 43m | Avg: 27m 47s | Max:  1h 08m | Hits:  86%/4462  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

# Runner
178 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

🟨 CI finished in 4h 37m: Pass: 99%/251 | Total: 5d 21h | Avg: 33m 49s | Max: 1h 20m | Hits: 87%/24441
  • 🟨 cub: Pass: 99%/132 | Total: 3d 16h | Avg: 40m 03s | Max: 1h 20m | Hits: 88%/4362

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/124 | Total:  3d 09h | Avg: 39m 33s | Max:  1h 20m | Hits:  88%/4362  
      🟩 arm64              Pass: 100%/8   | Total:  6h 22m | Avg: 47m 49s | Max: 48m 56s
    🔍 ctk: 12.5 🔍
      🟩 11.1               Pass: 100%/15  | Total: 12h 34m | Avg: 50m 18s | Max: 53m 11s | Hits:  88%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 20m
      🔍 12.5               Pass:  99%/114 | Total:  2d 23h | Avg: 37m 43s | Max: 59m 54s | Hits:  88%/3635  
    🔍 cudacxx: nvcc12.5 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 29m 16s | Avg: 14m 38s | Max: 14m 53s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 12h 34m | Avg: 50m 18s | Max: 53m 11s | Hits:  88%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 20m
      🔍 nvcc12.5           Pass:  99%/112 | Total:  2d 23h | Avg: 38m 07s | Max: 59m 54s | Hits:  88%/3635  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 29m 16s | Avg: 14m 38s | Max: 14m 53s
      🔍 nvcc               Pass:  99%/130 | Total:  3d 15h | Avg: 40m 27s | Max:  1h 20m | Hits:  88%/4362  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  4h 45m | Avg: 47m 31s | Max: 52m 03s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 07m | Avg: 42m 20s | Max: 43m 26s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 55m | Avg: 43m 59s | Max: 47m 31s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 02m | Avg: 45m 42s | Max: 47m 58s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 55m | Avg: 43m 52s | Max: 47m 47s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 51m | Avg: 42m 51s | Max: 43m 40s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 49m | Avg: 42m 25s | Max: 44m 58s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 50m | Avg: 42m 31s | Max: 42m 53s
      🟩 Clang17            Pass: 100%/26  | Total: 12h 48m | Avg: 29m 32s | Max: 50m 33s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 32s | Max: 51m 41s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 40m | Avg: 46m 41s | Max: 52m 57s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 45m | Avg: 47m 35s | Max: 53m 11s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 41m | Avg: 46m 59s | Max: 53m 04s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 59m | Avg: 44m 45s | Max: 47m 19s
      🟩 GCC11              Pass: 100%/7   | Total:  7h 01m | Avg:  1h 00m | Max:  1h 20m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 40m | Avg: 40m 13s | Max: 41m 49s
      🔍 GCC13              Pass:  96%/29  | Total: 14h 45m | Avg: 30m 31s | Max: 49m 26s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 18m | Avg: 46m 00s | Max: 48m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 47s | Avg: 46m 47s | Max: 46m 47s | Hits:  88%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 56s | Max: 59m 20s | Hits:  88%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 50m | Avg: 56m 42s | Max: 58m 57s | Hits:  88%/2181  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/59  | Total:  1d 13h | Avg: 37m 43s | Max: 52m 03s
      🔍 GCC                Pass:  98%/64  | Total:  1d 19h | Avg: 40m 33s | Max:  1h 20m
      🟩 Intel              Pass: 100%/3   | Total:  2h 18m | Avg: 46m 00s | Max: 48m 09s
      🟩 MSVC               Pass: 100%/6   | Total:  5h 28m | Avg: 54m 47s | Max: 59m 20s | Hits:  88%/4362  
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/99  | Total:  3d 02h | Avg: 45m 04s | Max:  1h 20m | Hits:  88%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 46m | Avg: 20m 49s | Max: 24m 00s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 32m | Avg: 19m 07s | Max: 38m 11s
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 05m | Avg: 23m 13s | Max: 43m 21s
      🟩 SmallGMem          Pass: 100%/1   | Total: 40m 03s | Avg: 40m 03s | Max: 40m 03s
      🔍 TestGPU            Pass:  87%/8   | Total:  4h 40m | Avg: 35m 01s | Max: 50m 33s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/34  | Total: 22h 10m | Avg: 39m 08s | Max:  1h 19m
      🟩 14                 Pass: 100%/37  | Total:  1d 00h | Avg: 40m 28s | Max:  1h 14m | Hits:  88%/2181  
      🟩 17                 Pass: 100%/37  | Total:  1d 01h | Avg: 41m 31s | Max:  1h 20m | Hits:  88%/1454  
      🔍 20                 Pass:  95%/24  | Total: 15h 24m | Avg: 38m 30s | Max: 59m 54s | Hits:  88%/727   
    🟨 gpu
      🟨 v100               Pass:  99%/132 | Total:  3d 16h | Avg: 40m 03s | Max:  1h 20m | Hits:  88%/4362  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 53m | Avg:  1h 17m | Max:  1h 20m
      🟩 90a                Pass: 100%/4   | Total:  1h 20m | Avg: 20m 00s | Max: 20m 10s
    
  • 🟩 thrust: Pass: 100%/118 | Total: 2d 05h | Avg: 26m 59s | Max: 1h 02m | Hits: 86%/20079

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  2d 01h | Avg: 26m 58s | Max:  1h 02m | Hits:  86%/20079 
      🟩 arm64              Pass: 100%/8   | Total:  3h 37m | Avg: 27m 07s | Max: 30m 45s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 19m | Avg: 25m 19s | Max: 49m 45s | Hits:  80%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  1h 42m | Avg: 34m 15s | Max: 38m 42s
      🟩 12.5               Pass: 100%/100 | Total:  1d 21h | Avg: 27m 00s | Max:  1h 02m | Hits:  87%/17848 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 51m 43s | Avg: 25m 51s | Max: 26m 48s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 19m | Avg: 25m 19s | Max: 49m 45s | Hits:  80%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 42m | Avg: 34m 15s | Max: 38m 42s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  1d 20h | Avg: 27m 02s | Max:  1h 02m | Hits:  87%/17848 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 43s | Avg: 25m 51s | Max: 26m 48s
      🟩 nvcc               Pass: 100%/116 | Total:  2d 04h | Avg: 27m 00s | Max:  1h 02m | Hits:  86%/20079 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 34m | Avg: 25m 49s | Max: 28m 55s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 32m | Avg: 30m 52s | Max: 32m 28s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 55m | Avg: 28m 50s | Max: 32m 31s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 53m | Avg: 28m 15s | Max: 33m 18s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 56m | Avg: 29m 02s | Max: 31m 09s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 00s | Max: 33m 59s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 53m | Avg: 28m 16s | Max: 31m 36s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 04s | Max: 29m 35s
      🟩 Clang17            Pass: 100%/18  | Total:  6h 02m | Avg: 20m 09s | Max: 30m 45s
      🟩 GCC6               Pass: 100%/2   | Total: 43m 12s | Avg: 21m 36s | Max: 23m 29s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 30m | Avg: 25m 05s | Max: 30m 41s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 34m | Avg: 25m 46s | Max: 28m 59s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 42m | Avg: 27m 09s | Max: 32m 25s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 03m | Avg: 30m 45s | Max: 35m 40s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 44m | Avg: 32m 04s | Max: 38m 42s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 53m | Avg: 28m 15s | Max: 29m 44s
      🟩 GCC13              Pass: 100%/20  | Total:  6h 40m | Avg: 20m 00s | Max: 33m 26s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 44m | Avg: 34m 59s | Max: 37m 43s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 49m 45s | Avg: 49m 45s | Max: 49m 45s | Hits:  80%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 54s | Max:  1h 01m | Hits:  80%/4462  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 59m | Avg: 39m 54s | Max:  1h 02m | Hits:  89%/13386 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 21h 44m | Avg: 25m 34s | Max: 33m 59s
      🟩 GCC                Pass: 100%/55  | Total: 22h 51m | Avg: 24m 56s | Max: 38m 42s
      🟩 Intel              Pass: 100%/3   | Total:  1h 44m | Avg: 34m 59s | Max: 37m 43s
      🟩 MSVC               Pass: 100%/9   | Total:  6h 43m | Avg: 44m 46s | Max:  1h 02m | Hits:  86%/20079 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  2d 05h | Avg: 26m 59s | Max:  1h 02m | Hits:  86%/20079 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 00h | Avg: 29m 29s | Max:  1h 02m | Hits:  80%/13386 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 06m | Avg: 11m 31s | Max: 24m 23s | Hits:  99%/6693  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 17m | Avg: 17m 12s | Max: 21m 03s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 42m | Avg: 34m 15s | Max: 38m 42s
      🟩 90a                Pass: 100%/4   | Total:  1h 11m | Avg: 17m 48s | Max: 20m 03s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 11h 12m | Avg: 22m 24s | Max: 30m 20s
      🟩 14                 Pass: 100%/34  | Total: 16h 03m | Avg: 28m 20s | Max: 52m 46s | Hits:  85%/8924  
      🟩 17                 Pass: 100%/33  | Total: 16h 01m | Avg: 29m 07s | Max:  1h 01m | Hits:  86%/6693  
      🟩 20                 Pass: 100%/21  | Total:  9h 46m | Avg: 27m 57s | Max:  1h 02m | Hits:  89%/4462  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

# Runner
178 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Contributor

🟨 CI finished in 5h 27m: Pass: 96%/251 | Total: 5d 14h | Avg: 32m 10s | Max: 1h 30m | Hits: 81%/15567
  • 🟨 cub: Pass: 96%/132 | Total: 3d 10h | Avg: 37m 35s | Max: 1h 30m | Hits: 87%/2181

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/124 | Total:  3d 04h | Avg: 36m 59s | Max:  1h 30m | Hits:  87%/2181  
      🟩 arm64              Pass: 100%/8   | Total:  6h 14m | Avg: 46m 51s | Max: 48m 05s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 29m 21s | Avg: 14m 40s | Max: 15m 13s
      🔍 nvcc               Pass:  96%/130 | Total:  3d 10h | Avg: 37m 56s | Max:  1h 30m | Hits:  87%/2181  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/99  | Total:  2d 23h | Avg: 43m 28s | Max:  1h 30m | Hits:  87%/2181  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 31m | Avg: 18m 54s | Max: 22m 32s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 03m | Avg: 15m 29s | Max: 16m 43s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 24m | Avg: 18m 02s | Max: 18m 55s
      🟩 SmallGMem          Pass: 100%/1   | Total: 35m 30s | Avg: 35m 30s | Max: 35m 30s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 24m | Avg: 25m 32s | Max: 30m 03s
    🔍 sm: 60;70;80;90 🔍
      🔍 60;70;80;90        Pass:  66%/3   | Total:  4h 00m | Avg:  1h 20m | Max:  1h 30m
      🟩 90a                Pass: 100%/4   | Total:  1h 12m | Avg: 18m 00s | Max: 18m 41s
    🟨 ctk
      🟩 11.1               Pass: 100%/15  | Total: 12h 38m | Avg: 50m 35s | Max: 53m 53s | Hits:  87%/727   
      🟨 11.8               Pass:  66%/3   | Total:  4h 00m | Avg:  1h 20m | Max:  1h 30m
      🟨 12.5               Pass:  97%/114 | Total:  2d 18h | Avg: 34m 45s | Max:  1h 00m | Hits:  87%/1454  
    🟨 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 29m 21s | Avg: 14m 40s | Max: 15m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 12h 38m | Avg: 50m 35s | Max: 53m 53s | Hits:  87%/727   
      🟨 nvcc11.8           Pass:  66%/3   | Total:  4h 00m | Avg:  1h 20m | Max:  1h 30m
      🟨 nvcc12.5           Pass:  97%/112 | Total:  2d 17h | Avg: 35m 07s | Max:  1h 00m | Hits:  87%/1454  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 40m | Avg: 46m 45s | Max: 53m 12s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 05m | Avg: 41m 49s | Max: 45m 16s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 46m | Avg: 41m 30s | Max: 44m 56s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 43m | Avg: 40m 48s | Max: 44m 52s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 53s | Max: 40m 43s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 43m | Avg: 40m 46s | Max: 42m 38s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 44m | Avg: 41m 14s | Max: 45m 06s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 49m | Avg: 42m 26s | Max: 45m 24s
      🟩 Clang17            Pass: 100%/26  | Total: 11h 49m | Avg: 27m 17s | Max: 47m 08s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 44m | Avg: 52m 26s | Max: 53m 30s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 29m | Avg: 44m 54s | Max: 49m 57s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 25m | Avg: 44m 11s | Max: 49m 09s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 34m | Avg: 45m 45s | Max: 53m 53s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 42m | Avg: 40m 38s | Max: 43m 07s
      🟨 GCC11              Pass:  85%/7   | Total:  6h 43m | Avg: 57m 40s | Max:  1h 30m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 45m | Avg: 41m 15s | Max: 43m 08s
      🟩 GCC13              Pass: 100%/29  | Total: 12h 35m | Avg: 26m 02s | Max: 48m 05s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 07m | Avg: 42m 21s | Max: 42m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 53m 01s | Avg: 53m 01s | Max: 53m 01s | Hits:  87%/727   
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 47m | Avg: 53m 52s | Max: 54m 01s | Hits:  87%/727   
      🟨 MSVC14.39          Pass:  33%/3   | Total:  2h 52m | Avg: 57m 26s | Max:  1h 00m | Hits:  87%/727   
    🟨 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 11h | Avg: 35m 37s | Max: 53m 12s
      🟨 GCC                Pass:  98%/64  | Total:  1d 16h | Avg: 37m 30s | Max:  1h 30m
      🟩 Intel              Pass: 100%/3   | Total:  2h 07m | Avg: 42m 21s | Max: 42m 24s
      🟨 MSVC               Pass:  50%/6   | Total:  5h 33m | Avg: 55m 30s | Max:  1h 00m | Hits:  87%/2181  
    🟨 std
      🟩 11                 Pass: 100%/34  | Total: 21h 32m | Avg: 38m 01s | Max:  1h 18m
      🟨 14                 Pass:  97%/37  | Total:  1d 00h | Avg: 39m 28s | Max:  1h 30m | Hits:  87%/2181  
      🟨 17                 Pass:  94%/37  | Total: 23h 30m | Avg: 38m 06s | Max:  1h 12m
      🟨 20                 Pass:  95%/24  | Total: 13h 18m | Avg: 33m 17s | Max:  1h 00m
    🟨 gpu
      🟨 v100               Pass:  96%/132 | Total:  3d 10h | Avg: 37m 35s | Max:  1h 30m | Hits:  87%/2181  
    
  • 🟨 thrust: Pass: 96%/118 | Total: 2d 03h | Avg: 26m 14s | Max: 58m 12s | Hits: 80%/13386

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/110 | Total:  2d 00h | Avg: 26m 15s | Max: 58m 12s | Hits:  80%/13386 
      🟩 arm64              Pass: 100%/8   | Total:  3h 28m | Avg: 26m 03s | Max: 29m 22s
    🔍 ctk: 12.5 🔍
      🟩 11.1               Pass: 100%/15  | Total:  6h 17m | Avg: 25m 09s | Max: 51m 58s | Hits:  80%/2231  
      🟩 11.8               Pass: 100%/3   | Total:  1h 44m | Avg: 34m 43s | Max: 37m 00s
      🔍 12.5               Pass:  96%/100 | Total:  1d 19h | Avg: 26m 09s | Max: 58m 12s | Hits:  80%/11155 
    🔍 cudacxx: nvcc12.5 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 51m 08s | Avg: 25m 34s | Max: 27m 46s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 17m | Avg: 25m 09s | Max: 51m 58s | Hits:  80%/2231  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 44m | Avg: 34m 43s | Max: 37m 00s
      🔍 nvcc12.5           Pass:  95%/98  | Total:  1d 18h | Avg: 26m 10s | Max: 58m 12s | Hits:  80%/11155 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 08s | Avg: 25m 34s | Max: 27m 46s
      🔍 nvcc               Pass:  96%/116 | Total:  2d 02h | Avg: 26m 15s | Max: 58m 12s | Hits:  80%/13386 
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 32m | Avg: 25m 24s | Max: 31m 46s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 27s | Max: 28m 34s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 04s | Max: 29m 38s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 44m | Avg: 26m 01s | Max: 28m 25s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 42m | Avg: 25m 39s | Max: 28m 02s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 53m | Avg: 28m 15s | Max: 30m 05s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 17s | Max: 31m 17s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 21s | Max: 31m 02s
      🟩 Clang17            Pass: 100%/18  | Total:  5h 55m | Avg: 19m 45s | Max: 31m 20s
      🟩 GCC6               Pass: 100%/2   | Total: 45m 13s | Avg: 22m 36s | Max: 25m 23s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 27m | Avg: 24m 36s | Max: 30m 28s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 32m | Avg: 25m 20s | Max: 31m 17s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 31m | Avg: 25m 18s | Max: 29m 33s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 57m | Avg: 29m 24s | Max: 31m 28s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 36m | Avg: 30m 59s | Max: 37m 00s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 55m | Avg: 28m 52s | Max: 30m 58s
      🟨 GCC13              Pass:  95%/20  | Total:  6h 34m | Avg: 19m 44s | Max: 34m 56s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 40m | Avg: 33m 39s | Max: 39m 19s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 58s | Avg: 51m 58s | Max: 51m 58s | Hits:  80%/2231  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 00s | Max: 58m 12s | Hits:  80%/4462  
      🟨 MSVC14.39          Pass:  50%/6   | Total:  4h 22m | Avg: 43m 49s | Max: 56m 50s | Hits:  80%/6693  
    🟨 cxx_family
      🟩 Clang              Pass: 100%/51  | Total: 20h 25m | Avg: 24m 02s | Max: 31m 46s
      🟨 GCC                Pass:  98%/55  | Total: 22h 21m | Avg: 24m 23s | Max: 37m 00s
      🟩 Intel              Pass: 100%/3   | Total:  1h 40m | Avg: 33m 39s | Max: 39m 19s
      🟨 MSVC               Pass:  66%/9   | Total:  7h 08m | Avg: 47m 39s | Max: 58m 12s | Hits:  80%/13386 
    🟨 jobs
      🟩 Build              Pass: 100%/99  | Total:  1d 22h | Avg: 28m 19s | Max: 58m 12s | Hits:  80%/13386 
      🟨 TestCPU            Pass:  72%/11  | Total:  2h 31m | Avg: 13m 47s | Max: 33m 50s
      🟨 TestGPU            Pass:  87%/8   | Total:  2h 21m | Avg: 17m 42s | Max: 34m 56s
    🟨 std
      🟩 11                 Pass: 100%/30  | Total: 10h 49m | Avg: 21m 38s | Max: 34m 56s
      🟨 14                 Pass:  94%/34  | Total: 16h 04m | Avg: 28m 22s | Max: 58m 12s | Hits:  80%/6693  
      🟨 17                 Pass:  96%/33  | Total: 15h 26m | Avg: 28m 04s | Max: 56m 36s | Hits:  80%/4462  
      🟨 20                 Pass:  95%/21  | Total:  9h 17m | Avg: 26m 31s | Max: 56m 41s | Hits:  80%/2231  
    🟨 gpu
      🟨 v100               Pass:  96%/118 | Total:  2d 03h | Avg: 26m 14s | Max: 58m 12s | Hits:  80%/13386 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 44m | Avg: 34m 43s | Max: 37m 00s
      🟩 90a                Pass: 100%/4   | Total:  1h 00m | Avg: 15m 13s | Max: 16m 29s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 251)

# Runner
178 linux-amd64-cpu16
42 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Contributor

🟩 CI finished in 1h 27m: Pass: 100%/208 | Total: 4d 18h | Avg: 33m 02s | Max: 1h 02m | Hits: 84%/14058
  • 🟩 cub: Pass: 100%/104 | Total: 2d 19h | Avg: 39m 01s | Max: 58m 49s | Hits: 87%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  2d 13h | Avg: 38m 24s | Max: 58m 49s | Hits:  87%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  6h 12m | Avg: 46m 33s | Max: 47m 15s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 50m | Avg: 35m 23s | Max: 45m 41s | Hits:  87%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 43m | Avg: 54m 36s | Max: 58m 49s
      🟩 12.6               Pass: 100%/86  | Total:  2d 08h | Avg: 39m 07s | Max: 58m 05s | Hits:  87%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 51m | Avg: 55m 56s | Max: 58m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 50m | Avg: 35m 23s | Max: 45m 41s | Hits:  87%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 43m | Avg: 54m 36s | Max: 58m 49s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 06h | Avg: 38m 43s | Max: 53m 40s | Hits:  87%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 56s | Max: 58m 05s
      🟩 nvcc               Pass: 100%/102 | Total:  2d 17h | Avg: 38m 41s | Max: 58m 49s | Hits:  87%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 44m | Avg: 37m 28s | Max: 40m 45s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 05m | Avg: 41m 44s | Max: 42m 45s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 49s | Max: 40m 00s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 40m | Avg: 40m 05s | Max: 41m 10s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 49s | Max: 40m 12s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 43m | Avg: 40m 59s | Max: 42m 39s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 38m | Avg: 39m 40s | Max: 40m 07s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 41m | Avg: 40m 25s | Max: 42m 09s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 51s | Max: 40m 15s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 18m | Avg: 42m 04s | Max: 58m 05s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 15m | Avg: 37m 35s | Max: 41m 09s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 40m | Avg: 36m 47s | Max: 40m 01s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 45m | Avg: 37m 36s | Max: 41m 51s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 48m | Avg: 38m 02s | Max: 45m 38s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 49m | Avg: 42m 15s | Max: 44m 11s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 31m | Avg: 47m 25s | Max: 58m 49s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 47m | Avg: 41m 56s | Max: 45m 09s
      🟩 GCC13              Pass: 100%/16  | Total:  7h 39m | Avg: 28m 43s | Max: 47m 15s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 06m | Avg: 42m 11s | Max: 43m 03s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 45m 41s | Avg: 45m 41s | Max: 45m 41s | Hits:  87%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 56s | Max: 52m 04s | Hits:  87%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 53m 40s | Avg: 53m 40s | Max: 53m 40s | Hits:  87%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 06h | Avg: 40m 14s | Max: 58m 05s
      🟩 GCC                Pass: 100%/51  | Total:  1d 07h | Avg: 36m 49s | Max: 58m 49s
      🟩 Intel              Pass: 100%/3   | Total:  2h 06m | Avg: 42m 11s | Max: 43m 03s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 23m | Avg: 50m 48s | Max: 53m 40s | Hits:  87%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  2d 19h | Avg: 39m 01s | Max: 58m 49s | Hits:  87%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 16h | Avg: 40m 36s | Max: 58m 49s | Hits:  87%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 56s | Avg: 19m 56s | Max: 19m 56s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 46s | Avg: 15m 46s | Max: 15m 46s
      🟩 HostLaunch         Pass: 100%/3   | Total: 54m 34s | Avg: 18m 11s | Max: 19m 14s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 28s | Max: 25m 24s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 43m | Avg: 54m 36s | Max: 58m 49s
      🟩 90a                Pass: 100%/4   | Total:  1h 10m | Avg: 17m 33s | Max: 17m 49s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 18h 05m | Avg: 38m 46s | Max: 58m 49s
      🟩 14                 Pass: 100%/27  | Total: 17h 56m | Avg: 39m 53s | Max: 52m 35s | Hits:  87%/1454  
      🟩 17                 Pass: 100%/26  | Total: 17h 49m | Avg: 41m 08s | Max: 58m 05s | Hits:  87%/727   
      🟩 20                 Pass: 100%/23  | Total: 13h 46m | Avg: 35m 57s | Max: 53m 48s | Hits:  87%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 1d 22h | Avg: 27m 09s | Max: 1h 02m | Hits: 84%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  1d 19h | Avg: 27m 10s | Max:  1h 02m | Hits:  84%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  3h 35m | Avg: 26m 57s | Max: 31m 28s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 36m | Avg: 26m 25s | Max: 52m 15s | Hits:  80%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 44m | Avg: 34m 44s | Max: 37m 15s
      🟩 12.6               Pass: 100%/85  | Total:  1d 14h | Avg: 27m 01s | Max:  1h 02m | Hits:  85%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 44s | Avg: 21m 22s | Max: 21m 53s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 36m | Avg: 26m 25s | Max: 52m 15s | Hits:  80%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 44m | Avg: 34m 44s | Max: 37m 15s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 13h | Avg: 27m 09s | Max:  1h 02m | Hits:  85%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 44s | Avg: 21m 22s | Max: 21m 53s
      🟩 nvcc               Pass: 100%/101 | Total:  1d 21h | Avg: 27m 16s | Max:  1h 02m | Hits:  84%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 35m | Avg: 25m 53s | Max: 31m 31s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 21m | Avg: 27m 11s | Max: 29m 30s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 15s | Max: 28m 15s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 11s | Max: 30m 34s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 44m | Avg: 26m 14s | Max: 28m 10s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 33s | Max: 28m 13s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 47m | Avg: 26m 55s | Max: 28m 57s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 02s | Max: 32m 43s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 32s | Max: 31m 51s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 11m | Avg: 21m 18s | Max: 27m 58s
      🟩 GCC6               Pass: 100%/2   | Total: 47m 35s | Avg: 23m 47s | Max: 24m 29s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 28m | Avg: 24m 44s | Max: 28m 33s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 40m | Avg: 26m 44s | Max: 34m 21s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 41m | Avg: 26m 54s | Max: 30m 11s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 55m | Avg: 28m 53s | Max: 34m 42s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 40m | Avg: 31m 34s | Max: 37m 15s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 00m | Avg: 30m 14s | Max: 35m 51s
      🟩 GCC13              Pass: 100%/14  | Total:  4h 34m | Avg: 19m 37s | Max: 31m 28s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 43m | Avg: 34m 27s | Max: 37m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 52m 15s | Avg: 52m 15s | Max: 52m 15s | Hits:  80%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  80%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 49s | Max: 58m 17s | Hits:  90%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 19h 47m | Avg: 25m 49s | Max: 32m 43s
      🟩 GCC                Pass: 100%/49  | Total: 20h 50m | Avg: 25m 30s | Max: 37m 15s
      🟩 Intel              Pass: 100%/3   | Total:  1h 43m | Avg: 34m 27s | Max: 37m 09s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 15m | Avg: 51m 09s | Max:  1h 02m | Hits:  84%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  1d 22h | Avg: 27m 09s | Max:  1h 02m | Hits:  84%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  1d 21h | Avg: 28m 15s | Max:  1h 02m | Hits:  80%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 27s | Avg: 11m 21s | Max: 23m 22s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 57s | Avg: 12m 59s | Max: 14m 00s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 44m | Avg: 34m 44s | Max: 37m 15s
      🟩 90a                Pass: 100%/4   | Total:  1h 05m | Avg: 16m 15s | Max: 17m 41s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 10h 19m | Avg: 22m 08s | Max: 31m 04s
      🟩 14                 Pass: 100%/27  | Total: 13h 36m | Avg: 30m 14s | Max: 59m 20s | Hits:  80%/4460  
      🟩 17                 Pass: 100%/26  | Total: 13h 01m | Avg: 30m 03s | Max:  1h 02m | Hits:  80%/2230  
      🟩 20                 Pass: 100%/22  | Total:  9h 39m | Avg: 26m 20s | Max: 58m 17s | Hits:  90%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 04s | Avg: 15m 04s | Max: 15m 04s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

@elstehle elstehle requested a review from a team as a code owner October 1, 2024 17:18
Contributor

github-actions bot commented Oct 1, 2024

🟩 CI finished in 1h 59m: Pass: 100%/208 | Total: 5d 18h | Avg: 39m 49s | Max: 1h 11m | Hits: 74%/14058
  • 🟩 cub: Pass: 100%/104 | Total: 3d 14h | Avg: 50m 05s | Max: 1h 11m | Hits: 58%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 07h | Avg: 49m 26s | Max:  1h 11m | Hits:  58%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  7h 43m | Avg: 57m 56s | Max:  1h 05m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 11h 36m | Avg: 46m 27s | Max: 55m 06s | Hits:  58%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 23m | Avg:  1h 07m | Max:  1h 11m
      🟩 12.6               Pass: 100%/86  | Total:  2d 23h | Avg: 50m 06s | Max:  1h 07m | Hits:  58%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 47m | Avg: 53m 54s | Max: 54m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 11h 36m | Avg: 46m 27s | Max: 55m 06s | Hits:  58%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 23m | Avg:  1h 07m | Max:  1h 11m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 22h | Avg: 50m 00s | Max:  1h 07m | Hits:  58%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 54s | Max: 54m 05s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 13h | Avg: 50m 00s | Max:  1h 11m | Hits:  58%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 56m | Avg: 49m 27s | Max: 55m 49s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 35m | Avg: 51m 54s | Max: 56m 43s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 27m | Avg: 51m 51s | Max: 54m 53s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 38m | Avg: 54m 43s | Max: 57m 38s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 33s | Max: 56m 32s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 13s | Max: 56m 03s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 32m | Avg: 53m 10s | Max: 54m 29s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 36s | Max: 56m 48s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 41s | Max: 55m 44s
      🟩 Clang18            Pass: 100%/9   | Total:  7h 02m | Avg: 46m 55s | Max:  1h 05m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 32m | Avg: 46m 04s | Max: 48m 39s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 51m | Avg: 48m 38s | Max: 55m 57s
      🟩 GCC8               Pass: 100%/6   | Total:  5h 01m | Avg: 50m 12s | Max: 54m 21s
      🟩 GCC9               Pass: 100%/6   | Total:  5h 01m | Avg: 50m 12s | Max: 56m 52s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 35m | Avg: 53m 54s | Max: 56m 27s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 55m | Avg: 59m 23s | Max:  1h 11m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 35m | Avg: 53m 50s | Max: 56m 36s
      🟩 GCC13              Pass: 100%/16  | Total:  9h 54m | Avg: 37m 08s | Max:  1h 02m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 43m | Avg: 54m 29s | Max: 55m 28s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 06s | Avg: 55m 06s | Max: 55m 06s | Hits:  58%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 07m | Hits:  58%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m | Hits:  58%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 15h | Avg: 51m 20s | Max:  1h 05m
      🟩 GCC                Pass: 100%/51  | Total:  1d 16h | Avg: 47m 35s | Max:  1h 11m
      🟩 Intel              Pass: 100%/3   | Total:  2h 43m | Avg: 54m 29s | Max: 55m 28s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 16m | Avg:  1h 04m | Max:  1h 07m | Hits:  58%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 14h | Avg: 50m 05s | Max:  1h 11m | Hits:  58%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 11h | Avg: 52m 11s | Max:  1h 11m | Hits:  58%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 19s | Avg: 16m 19s | Max: 16m 19s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 04s | Avg: 14m 04s | Max: 14m 04s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 20m | Avg: 26m 52s | Max: 44m 48s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 26m | Avg: 28m 57s | Max: 45m 43s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 23m | Avg:  1h 07m | Max:  1h 11m
      🟩 90a                Pass: 100%/4   | Total:  1h 34m | Avg: 23m 31s | Max: 24m 41s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 22h 31m | Avg: 48m 16s | Max:  1h 11m
      🟩 14                 Pass: 100%/27  | Total: 23h 49m | Avg: 52m 56s | Max:  1h 07m | Hits:  58%/1454  
      🟩 17                 Pass: 100%/26  | Total: 22h 40m | Avg: 52m 19s | Max:  1h 07m | Hits:  58%/727   
      🟩 20                 Pass: 100%/23  | Total: 17h 47m | Avg: 46m 24s | Max:  1h 06m | Hits:  58%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 02h | Avg: 29m 41s | Max: 1h 03m | Hits: 79%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  1d 23h | Avg: 29m 49s | Max:  1h 03m | Hits:  79%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  3h 44m | Avg: 28m 02s | Max: 34m 48s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 22m | Avg: 29m 29s | Max: 57m 09s | Hits:  74%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 51m | Avg: 37m 01s | Max: 39m 37s
      🟩 12.6               Pass: 100%/85  | Total:  1d 17h | Avg: 29m 27s | Max:  1h 03m | Hits:  80%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 36s | Avg: 23m 18s | Max: 23m 33s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 22m | Avg: 29m 29s | Max: 57m 09s | Hits:  74%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 51m | Avg: 37m 01s | Max: 39m 37s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 16h | Avg: 29m 36s | Max:  1h 03m | Hits:  80%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 36s | Avg: 23m 18s | Max: 23m 33s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 02h | Avg: 29m 48s | Max:  1h 03m | Hits:  79%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 47m | Avg: 27m 51s | Max: 30m 22s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 34m | Avg: 31m 29s | Max: 34m 32s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 53s | Max: 32m 25s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 55s | Max: 33m 24s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 31s | Max: 33m 22s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 58s | Max: 33m 17s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 02m | Avg: 30m 37s | Max: 34m 23s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 12s | Max: 29m 58s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 47s | Max: 34m 01s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 20m | Avg: 22m 17s | Max: 29m 33s
      🟩 GCC6               Pass: 100%/2   | Total: 53m 15s | Avg: 26m 37s | Max: 29m 31s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 51m | Avg: 28m 32s | Max: 32m 30s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 51m | Avg: 28m 39s | Max: 31m 26s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 57m | Avg: 29m 38s | Max: 32m 30s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 10m | Avg: 32m 33s | Max: 35m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 03m | Avg: 34m 48s | Max: 39m 37s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 10m | Avg: 32m 34s | Max: 35m 31s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 04m | Avg: 21m 47s | Max: 36m 20s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 56m | Avg: 38m 40s | Max: 41m 28s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 09s | Avg: 57m 09s | Max: 57m 09s | Hits:  74%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 48s | Max:  1h 01m | Hits:  74%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 25m | Avg: 42m 49s | Max:  1h 03m | Hits:  86%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 21h 41m | Avg: 28m 18s | Max: 34m 32s
      🟩 GCC                Pass: 100%/49  | Total: 23h 03m | Avg: 28m 14s | Max: 39m 37s
      🟩 Intel              Pass: 100%/3   | Total:  1h 56m | Avg: 38m 40s | Max: 41m 28s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 16m | Avg: 51m 16s | Max:  1h 03m | Hits:  79%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 02h | Avg: 29m 41s | Max:  1h 03m | Hits:  79%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 01h | Avg: 30m 57s | Max:  1h 03m | Hits:  74%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 43s | Avg: 11m 25s | Max: 22m 08s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total: 40m 44s | Avg: 13m 34s | Max: 14m 20s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 51m | Avg: 37m 01s | Max: 39m 37s
      🟩 90a                Pass: 100%/4   | Total:  1h 16m | Avg: 19m 06s | Max: 23m 57s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 11h 23m | Avg: 24m 24s | Max: 33m 40s
      🟩 14                 Pass: 100%/27  | Total: 14h 48m | Avg: 32m 54s | Max: 57m 09s | Hits:  74%/4460  
      🟩 17                 Pass: 100%/26  | Total: 14h 14m | Avg: 32m 52s | Max:  1h 01m | Hits:  74%/2230  
      🟩 20                 Pass: 100%/22  | Total: 10h 31m | Avg: 28m 41s | Max:  1h 03m | Hits:  86%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

Contributor

github-actions bot commented Oct 3, 2024

🟩 CI finished in 1h 40m: Pass: 100%/208 | Total: 4d 20h | Avg: 33m 31s | Max: 1h 00m | Hits: 84%/14058
  • 🟩 cub: Pass: 100%/104 | Total: 2d 21h | Avg: 40m 10s | Max: 53m 52s | Hits: 87%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  2d 15h | Avg: 39m 38s | Max: 53m 52s | Hits:  87%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  6h 12m | Avg: 46m 33s | Max: 48m 09s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 18m | Avg: 37m 15s | Max: 46m 29s | Hits:  87%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 39m | Avg: 53m 01s | Max: 53m 52s
      🟩 12.6               Pass: 100%/86  | Total:  2d 09h | Avg: 40m 13s | Max: 53m 17s | Hits:  87%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 44m | Avg: 52m 18s | Max: 53m 04s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 18m | Avg: 37m 15s | Max: 46m 29s | Hits:  87%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 39m | Avg: 53m 01s | Max: 53m 52s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 07h | Avg: 39m 56s | Max: 53m 17s | Hits:  87%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 18s | Max: 53m 04s
      🟩 nvcc               Pass: 100%/102 | Total:  2d 19h | Avg: 39m 55s | Max: 53m 52s | Hits:  87%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 55m | Avg: 39m 16s | Max: 41m 50s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 06m | Avg: 42m 02s | Max: 44m 29s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 46m | Avg: 41m 43s | Max: 44m 10s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 45m | Avg: 41m 25s | Max: 43m 48s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 49m | Avg: 42m 22s | Max: 45m 07s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 51m | Avg: 42m 49s | Max: 45m 03s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 43m | Avg: 40m 50s | Max: 43m 19s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 41m | Avg: 40m 16s | Max: 41m 29s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 48s | Max: 40m 18s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 22m | Avg: 42m 27s | Max: 53m 04s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 35m 37s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 47m | Avg: 37m 52s | Max: 40m 38s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 51m | Avg: 38m 39s | Max: 42m 04s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 59m | Avg: 39m 53s | Max: 49m 21s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 52m | Avg: 43m 12s | Max: 45m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 26m | Avg: 46m 42s | Max: 53m 52s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 41m | Avg: 40m 15s | Max: 40m 32s
      🟩 GCC13              Pass: 100%/16  | Total:  8h 27m | Avg: 31m 42s | Max: 48m 09s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 15m | Avg: 45m 13s | Max: 47m 11s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 29s | Avg: 46m 29s | Max: 46m 29s | Hits:  87%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 47s | Max: 51m 50s | Hits:  87%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 53m 17s | Avg: 53m 17s | Max: 53m 17s | Hits:  87%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 07h | Avg: 41m 19s | Max: 53m 04s
      🟩 GCC                Pass: 100%/51  | Total:  1d 08h | Avg: 37m 59s | Max: 53m 52s
      🟩 Intel              Pass: 100%/3   | Total:  2h 15m | Avg: 45m 13s | Max: 47m 11s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 23m | Avg: 50m 50s | Max: 53m 17s | Hits:  87%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  2d 21h | Avg: 40m 10s | Max: 53m 52s | Hits:  87%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 18h | Avg: 41m 15s | Max: 53m 52s | Hits:  87%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 30m 43s | Avg: 30m 43s | Max: 30m 43s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 22s | Avg: 24m 22s | Max: 24m 22s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 11s | Max: 24m 47s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 29m | Avg: 29m 46s | Max: 34m 06s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 39m | Avg: 53m 01s | Max: 53m 52s
      🟩 90a                Pass: 100%/4   | Total:  1h 10m | Avg: 17m 43s | Max: 19m 23s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 18h 32m | Avg: 39m 43s | Max: 52m 40s
      🟩 14                 Pass: 100%/27  | Total: 18h 24m | Avg: 40m 53s | Max: 52m 33s | Hits:  87%/1454  
      🟩 17                 Pass: 100%/26  | Total: 17h 57m | Avg: 41m 26s | Max: 53m 52s | Hits:  87%/727   
      🟩 20                 Pass: 100%/23  | Total: 14h 43m | Avg: 38m 24s | Max: 53m 17s | Hits:  87%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 1d 22h | Avg: 27m 01s | Max: 1h 00m | Hits: 84%/11150

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  1d 18h | Avg: 27m 06s | Max:  1h 00m | Hits:  84%/11150 
      🟩 arm64              Pass: 100%/8   | Total:  3h 28m | Avg: 26m 03s | Max: 30m 38s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 27m | Avg: 25m 48s | Max: 50m 28s | Hits:  80%/2230  
      🟩 11.8               Pass: 100%/3   | Total:  1h 41m | Avg: 33m 48s | Max: 36m 32s
      🟩 12.6               Pass: 100%/85  | Total:  1d 14h | Avg: 26m 59s | Max:  1h 00m | Hits:  85%/8920  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 35s | Avg: 25m 17s | Max: 25m 36s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 27m | Avg: 25m 48s | Max: 50m 28s | Hits:  80%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 41m | Avg: 33m 48s | Max: 36m 32s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 13h | Avg: 27m 02s | Max:  1h 00m | Hits:  85%/8920  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 35s | Avg: 25m 17s | Max: 25m 36s
      🟩 nvcc               Pass: 100%/101 | Total:  1d 21h | Avg: 27m 03s | Max:  1h 00m | Hits:  84%/11150 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 27m | Avg: 24m 37s | Max: 28m 50s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 19m | Avg: 26m 27s | Max: 28m 53s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 25s | Max: 31m 05s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 42m | Avg: 25m 37s | Max: 27m 13s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 29s | Max: 31m 33s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 47m | Avg: 26m 45s | Max: 29m 46s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 33s | Max: 29m 35s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 27s | Max: 30m 37s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 43m | Avg: 25m 45s | Max: 27m 42s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 31m | Avg: 23m 30s | Max: 30m 18s
      🟩 GCC6               Pass: 100%/2   | Total: 45m 11s | Avg: 22m 35s | Max: 25m 08s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 30m | Avg: 25m 07s | Max: 28m 11s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 36m | Avg: 26m 01s | Max: 29m 05s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 41m | Avg: 26m 54s | Max: 31m 00s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 48m | Avg: 27m 03s | Max: 30m 16s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 36m | Avg: 30m 54s | Max: 36m 32s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 02m | Avg: 30m 42s | Max: 38m 56s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 03m | Avg: 21m 41s | Max: 30m 38s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 43m | Avg: 34m 35s | Max: 37m 38s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 50m 28s | Avg: 50m 28s | Max: 50m 28s | Hits:  80%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 40m | Avg: 50m 09s | Max: 51m 40s | Hits:  80%/4460  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 27s | Max:  1h 00m | Hits:  90%/4460  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 19h 38m | Avg: 25m 37s | Max: 31m 33s
      🟩 GCC                Pass: 100%/49  | Total: 21h 04m | Avg: 25m 48s | Max: 38m 56s
      🟩 Intel              Pass: 100%/3   | Total:  1h 43m | Avg: 34m 35s | Max: 37m 38s
      🟩 MSVC               Pass: 100%/5   | Total:  3h 55m | Avg: 47m 08s | Max:  1h 00m | Hits:  84%/11150 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  1d 22h | Avg: 27m 01s | Max:  1h 00m | Hits:  84%/11150 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  1d 20h | Avg: 27m 38s | Max:  1h 00m | Hits:  80%/8920  
      🟩 TestCPU            Pass: 100%/4   | Total: 51m 35s | Avg: 12m 53s | Max: 24m 36s | Hits:  99%/2230  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 18m | Avg: 26m 12s | Max: 28m 54s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 41m | Avg: 33m 48s | Max: 36m 32s
      🟩 90a                Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 18m 56s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 10h 24m | Avg: 22m 18s | Max: 31m 09s
      🟩 14                 Pass: 100%/27  | Total: 13h 03m | Avg: 29m 01s | Max: 50m 28s | Hits:  80%/4460  
      🟩 17                 Pass: 100%/26  | Total: 12h 38m | Avg: 29m 10s | Max: 51m 40s | Hits:  80%/2230  
      🟩 20                 Pass: 100%/22  | Total: 10h 16m | Avg: 28m 01s | Max:  1h 00m | Hits:  90%/4460  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
    


cub/cub/agent/agent_select_if.cuh (resolved)
cub/cub/agent/agent_select_if.cuh (resolved)
cub/cub/detail/choose_offset.cuh (outdated, resolved)
cub/cub/detail/choose_offset.cuh (outdated, resolved)
cub/cub/device/dispatch/dispatch_select_if.cuh (outdated, resolved)
cub/cub/device/dispatch/dispatch_select_if.cuh (outdated, resolved)
cub/cub/device/dispatch/dispatch_select_if.cuh (outdated, resolved)
cub/cub/device/device_select.cuh (outdated, resolved)
thrust/thrust/system/cuda/detail/copy_if.h (outdated, resolved)
Contributor

github-actions bot commented Oct 8, 2024

🟩 CI finished in 2h 19m: Pass: 100%/208 | Total: 5d 17h | Avg: 39m 39s | Max: 1h 27m | Hits: 53%/16003
  • 🟩 cub: Pass: 100%/104 | Total: 3d 12h | Avg: 48m 39s | Max: 1h 27m | Hits: 59%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  3d 05h | Avg: 48m 16s | Max:  1h 27m | Hits:  59%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  7h 06m | Avg: 53m 18s | Max: 55m 38s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 10h 57m | Avg: 43m 48s | Max: 53m 22s | Hits:  59%/727   
      🟩 11.8               Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 12.6               Pass: 100%/86  | Total:  2d 21h | Avg: 48m 46s | Max:  1h 27m | Hits:  59%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 46m | Avg: 53m 29s | Max: 53m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 10h 57m | Avg: 43m 48s | Max: 53m 22s | Hits:  59%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 20h | Avg: 48m 39s | Max:  1h 27m | Hits:  59%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 29s | Max: 53m 38s
      🟩 nvcc               Pass: 100%/102 | Total:  3d 10h | Avg: 48m 34s | Max:  1h 27m | Hits:  59%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 41m | Avg: 46m 53s | Max: 49m 26s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 28m | Avg: 49m 39s | Max: 50m 00s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 18m | Avg: 49m 43s | Max: 51m 58s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 15s | Max: 56m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 19m | Avg: 49m 49s | Max: 51m 18s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 05s | Max: 55m 26s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 16m | Avg: 49m 09s | Max: 49m 33s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 19m | Avg: 49m 51s | Max: 52m 56s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 36s | Max: 56m 47s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 55m | Avg: 46m 06s | Max: 56m 04s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 24m | Avg: 42m 25s | Max: 42m 27s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 35m | Avg: 45m 57s | Max: 49m 52s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 40m | Avg: 46m 48s | Max: 54m 25s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 47m | Avg: 47m 55s | Max: 56m 16s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 28m | Avg: 52m 08s | Max: 57m 40s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 52m | Avg: 58m 58s | Max:  1h 13m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 27m | Avg: 51m 49s | Max: 54m 27s
      🟩 GCC13              Pass: 100%/16  | Total: 10h 44m | Avg: 40m 15s | Max:  1h 27m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 48m | Avg: 56m 12s | Max:  1h 01m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 53m 22s | Avg: 53m 22s | Max: 53m 22s | Hits:  59%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 39s | Max: 59m 10s | Hits:  59%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  59%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 13h | Avg: 49m 07s | Max: 56m 47s
      🟩 GCC                Pass: 100%/51  | Total:  1d 16h | Avg: 47m 05s | Max:  1h 27m
      🟩 Intel              Pass: 100%/3   | Total:  2h 48m | Avg: 56m 12s | Max:  1h 01m
      🟩 MSVC               Pass: 100%/4   | Total:  3h 51m | Avg: 57m 48s | Max:  1h 00m | Hits:  59%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  3d 12h | Avg: 48m 39s | Max:  1h 27m | Hits:  59%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  3d 07h | Avg: 49m 50s | Max:  1h 13m | Hits:  59%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 13s | Avg: 17m 13s | Max: 17m 13s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 35s | Avg: 14m 35s | Max: 14m 35s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 46m | Avg: 35m 30s | Max:  1h 15m
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 17m | Avg: 45m 46s | Max:  1h 27m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 28m | Avg:  1h 09m | Max:  1h 13m
      🟩 90a                Pass: 100%/4   | Total:  1h 25m | Avg: 21m 25s | Max: 21m 52s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 23h 38m | Avg: 50m 40s | Max:  1h 27m
      🟩 14                 Pass: 100%/27  | Total: 22h 56m | Avg: 50m 58s | Max:  1h 13m | Hits:  59%/1454  
      🟩 17                 Pass: 100%/26  | Total: 21h 19m | Avg: 49m 12s | Max:  1h 11m | Hits:  59%/727   
      🟩 20                 Pass: 100%/23  | Total: 16h 26m | Avg: 42m 54s | Max:  1h 00m | Hits:  59%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 04h | Avg: 30m 48s | Max: 1h 05m | Hits: 52%/13095

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  2d 00h | Avg: 30m 50s | Max:  1h 05m | Hits:  52%/13095 
      🟩 arm64              Pass: 100%/8   | Total:  4h 04m | Avg: 30m 32s | Max: 34m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 41m | Avg: 30m 47s | Max: 59m 38s | Hits:  25%/2619  
      🟩 11.8               Pass: 100%/3   | Total:  1h 58m | Avg: 39m 23s | Max: 44m 43s
      🟩 12.6               Pass: 100%/85  | Total:  1d 19h | Avg: 30m 30s | Max:  1h 05m | Hits:  59%/10476 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 26s | Avg: 25m 13s | Max: 25m 45s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 41m | Avg: 30m 47s | Max: 59m 38s | Hits:  25%/2619  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 58m | Avg: 39m 23s | Max: 44m 43s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 18h | Avg: 30m 38s | Max:  1h 05m | Hits:  59%/10476 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 26s | Avg: 25m 13s | Max: 25m 45s
      🟩 nvcc               Pass: 100%/101 | Total:  2d 04h | Avg: 30m 55s | Max:  1h 05m | Hits:  52%/13095 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 01m | Avg: 30m 10s | Max: 35m 46s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 58s | Max: 35m 57s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 07m | Avg: 31m 53s | Max: 34m 23s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 09s | Max: 35m 56s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 49s | Max: 32m 07s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 09s | Max: 32m 39s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 22s | Max: 33m 37s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 11s | Max: 35m 00s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 51s | Max: 33m 21s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 40m | Avg: 24m 26s | Max: 31m 47s
      🟩 GCC6               Pass: 100%/2   | Total: 55m 36s | Avg: 27m 48s | Max: 29m 25s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 57m | Avg: 29m 33s | Max: 33m 33s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 02m | Avg: 30m 28s | Max: 33m 32s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 01m | Avg: 30m 10s | Max: 35m 04s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 07m | Avg: 31m 56s | Max: 35m 46s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 08m | Avg: 35m 28s | Max: 44m 43s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 12m | Avg: 33m 11s | Max: 37m 01s
      🟩 GCC13              Pass: 100%/14  | Total:  5h 10m | Avg: 22m 12s | Max: 39m 04s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 06m | Avg: 42m 07s | Max: 46m 06s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 38s | Avg: 59m 38s | Max: 59m 38s | Hits:  25%/2619  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 05m | Hits:  30%/5238  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 05s | Max: 57m 59s | Hits:  88%/5238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 22h 46m | Avg: 29m 42s | Max: 35m 57s
      🟩 GCC                Pass: 100%/49  | Total: 23h 36m | Avg: 28m 54s | Max: 44m 43s
      🟩 Intel              Pass: 100%/3   | Total:  2h 06m | Avg: 42m 07s | Max: 46m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 24m | Avg: 52m 50s | Max:  1h 05m | Hits:  52%/13095 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 04h | Avg: 30m 48s | Max:  1h 05m | Hits:  52%/13095 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 03h | Avg: 32m 08s | Max:  1h 05m | Hits:  40%/10476 
      🟩 TestCPU            Pass: 100%/4   | Total: 49m 32s | Avg: 12m 23s | Max: 26m 11s | Hits:  99%/2619  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 19s | Avg: 12m 46s | Max: 13m 12s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 58m | Avg: 39m 23s | Max: 44m 43s
      🟩 90a                Pass: 100%/4   | Total:  1h 18m | Avg: 19m 39s | Max: 22m 49s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 11h 50m | Avg: 25m 22s | Max: 36m 38s
      🟩 14                 Pass: 100%/27  | Total: 15h 14m | Avg: 33m 53s | Max: 59m 38s | Hits:  28%/5238  
      🟩 17                 Pass: 100%/26  | Total: 14h 58m | Avg: 34m 33s | Max:  1h 05m | Hits:  30%/2619  
      🟩 20                 Pass: 100%/22  | Total: 10h 49m | Avg: 29m 31s | Max: 57m 59s | Hits:  88%/5238  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 00s | Avg: 15m 00s | Max: 15m 00s
    


Contributor

github-actions bot commented Oct 8, 2024

🟩 CI finished in 2h 06m: Pass: 100%/208 | Total: 4d 23h | Avg: 34m 30s | Max: 1h 17m | Hits: 86%/16003
  • 🟩 cub: Pass: 100%/104 | Total: 2d 23h | Avg: 41m 02s | Max: 1h 17m | Hits: 87%/2908

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total:  2d 16h | Avg: 40m 28s | Max:  1h 17m | Hits:  87%/2908  
      🟩 arm64              Pass: 100%/8   | Total:  6h 22m | Avg: 47m 51s | Max: 52m 33s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 41m | Avg: 38m 45s | Max: 51m 12s | Hits:  87%/727   
      🟩 11.8               Pass: 100%/3   | Total:  2h 44m | Avg: 54m 53s | Max: 56m 49s
      🟩 12.6               Pass: 100%/86  | Total:  2d 10h | Avg: 40m 57s | Max:  1h 17m | Hits:  87%/2181  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 35s | Max: 58m 40s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 41m | Avg: 38m 45s | Max: 51m 12s | Hits:  87%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 44m | Avg: 54m 53s | Max: 56m 49s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 08h | Avg: 40m 32s | Max:  1h 17m | Hits:  87%/2181  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 35s | Max: 58m 40s
      🟩 nvcc               Pass: 100%/102 | Total:  2d 21h | Avg: 40m 41s | Max:  1h 17m | Hits:  87%/2908  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 53m | Avg: 38m 59s | Max: 42m 00s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 08m | Avg: 42m 51s | Max: 45m 28s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 42m | Avg: 40m 31s | Max: 41m 56s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 46m | Avg: 41m 41s | Max: 46m 07s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 50m | Avg: 42m 43s | Max: 45m 19s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 41m | Avg: 40m 29s | Max: 41m 36s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 45m | Avg: 41m 23s | Max: 45m 27s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 56m | Avg: 44m 00s | Max: 45m 56s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 53m | Avg: 43m 26s | Max: 45m 24s
      🟩 Clang18            Pass: 100%/9   | Total:  6h 30m | Avg: 43m 26s | Max: 58m 40s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 49s | Max: 38m 56s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 57m | Avg: 39m 32s | Max: 41m 54s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 06m | Avg: 41m 07s | Max: 45m 03s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 53m | Avg: 38m 56s | Max: 43m 33s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 54m | Avg: 43m 32s | Max: 45m 30s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 41m | Avg: 48m 43s | Max: 56m 49s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 47m | Avg: 41m 59s | Max: 46m 19s
      🟩 GCC13              Pass: 100%/16  | Total:  8h 40m | Avg: 32m 31s | Max:  1h 17m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 05m | Avg: 41m 57s | Max: 42m 53s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 12s | Avg: 51m 12s | Max: 51m 12s | Hits:  87%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 26s | Max: 57m 40s | Hits:  87%/1454  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 53m 08s | Avg: 53m 08s | Max: 53m 08s | Hits:  87%/727   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  1d 08h | Avg: 41m 58s | Max: 58m 40s
      🟩 GCC                Pass: 100%/51  | Total:  1d 09h | Avg: 39m 11s | Max:  1h 17m
      🟩 Intel              Pass: 100%/3   | Total:  2h 05m | Avg: 41m 57s | Max: 42m 53s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 33m | Avg: 53m 18s | Max: 57m 40s | Hits:  87%/2908  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total:  2d 23h | Avg: 41m 02s | Max:  1h 17m | Hits:  87%/2908  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  2d 19h | Avg: 42m 17s | Max: 58m 40s | Hits:  87%/2908  
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 10s | Avg: 15m 10s | Max: 15m 10s
      🟩 HostLaunch         Pass: 100%/3   | Total: 50m 33s | Avg: 16m 51s | Max: 17m 37s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 45s | Max: 22m 45s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 44m | Avg: 54m 53s | Max: 56m 49s
      🟩 90a                Pass: 100%/4   | Total:  1h 11m | Avg: 17m 53s | Max: 19m 44s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 18h 27m | Avg: 39m 33s | Max: 52m 31s
      🟩 14                 Pass: 100%/27  | Total: 18h 55m | Avg: 42m 03s | Max: 57m 40s | Hits:  87%/1454  
      🟩 17                 Pass: 100%/26  | Total: 18h 24m | Avg: 42m 28s | Max: 58m 31s | Hits:  87%/727   
      🟩 20                 Pass: 100%/23  | Total: 15h 20m | Avg: 40m 02s | Max:  1h 17m | Hits:  87%/727   
    
  • 🟩 thrust: Pass: 100%/103 | Total: 2d 00h | Avg: 28m 06s | Max: 1h 01m | Hits: 86%/13095

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total:  1d 20h | Avg: 28m 16s | Max:  1h 01m | Hits:  86%/13095 
      🟩 arm64              Pass: 100%/8   | Total:  3h 29m | Avg: 26m 13s | Max: 29m 48s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 10m | Avg: 28m 40s | Max: 53m 45s | Hits:  82%/2619  
      🟩 11.8               Pass: 100%/3   | Total:  1h 50m | Avg: 36m 42s | Max: 41m 53s
      🟩 12.6               Pass: 100%/85  | Total:  1d 15h | Avg: 27m 42s | Max:  1h 01m | Hits:  86%/10476 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 47m 14s | Avg: 23m 37s | Max: 24m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 10m | Avg: 28m 40s | Max: 53m 45s | Hits:  82%/2619  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 50m | Avg: 36m 42s | Max: 41m 53s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  1d 14h | Avg: 27m 48s | Max:  1h 01m | Hits:  86%/10476 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 47m 14s | Avg: 23m 37s | Max: 24m 38s
      🟩 nvcc               Pass: 100%/101 | Total:  1d 23h | Avg: 28m 11s | Max:  1h 01m | Hits:  86%/13095 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 49m | Avg: 28m 14s | Max: 33m 05s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 28m | Avg: 29m 34s | Max: 33m 07s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 39s | Max: 31m 25s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 22s | Max: 29m 13s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 55m | Avg: 28m 55s | Max: 31m 52s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 12s | Max: 29m 47s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 53m | Avg: 28m 21s | Max: 32m 26s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 57m | Avg: 29m 23s | Max: 34m 09s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 09s | Max: 28m 23s
      🟩 Clang18            Pass: 100%/9   | Total:  3h 26m | Avg: 22m 54s | Max: 37m 27s
      🟩 GCC6               Pass: 100%/2   | Total: 52m 33s | Avg: 26m 16s | Max: 29m 25s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 38m | Avg: 26m 24s | Max: 30m 17s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 50m | Avg: 28m 25s | Max: 31m 47s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 51m | Avg: 28m 32s | Max: 38m 33s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 54m | Avg: 28m 31s | Max: 31m 41s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 50m | Avg: 32m 51s | Max: 41m 53s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 00m | Avg: 30m 09s | Max: 34m 33s
      🟩 GCC13              Pass: 100%/14  | Total:  4h 37m | Avg: 19m 50s | Max: 32m 16s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 39m | Avg: 33m 11s | Max: 36m 23s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 53m 45s | Avg: 53m 45s | Max: 53m 45s | Hits:  82%/2619  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 31s | Max:  1h 01m | Hits:  82%/5238  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 26s | Max: 54m 07s | Hits:  91%/5238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total: 20h 48m | Avg: 27m 08s | Max: 37m 27s
      🟩 GCC                Pass: 100%/49  | Total: 21h 35m | Avg: 26m 26s | Max: 41m 53s
      🟩 Intel              Pass: 100%/3   | Total:  1h 39m | Avg: 33m 11s | Max: 36m 23s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 11m | Avg: 50m 19s | Max:  1h 01m | Hits:  86%/13095 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total:  2d 00h | Avg: 28m 06s | Max:  1h 01m | Hits:  86%/13095 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  1d 22h | Avg: 29m 13s | Max:  1h 01m | Hits:  82%/10476 
      🟩 TestCPU            Pass: 100%/4   | Total: 51m 03s | Avg: 12m 45s | Max: 26m 45s | Hits:  99%/2619  
      🟩 TestGPU            Pass: 100%/3   | Total: 38m 48s | Avg: 12m 56s | Max: 14m 12s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 50m | Avg: 36m 42s | Max: 41m 53s
      🟩 90a                Pass: 100%/4   | Total:  1h 07m | Avg: 16m 49s | Max: 18m 42s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total: 10h 37m | Avg: 22m 46s | Max: 33m 04s
      🟩 14                 Pass: 100%/27  | Total: 14h 02m | Avg: 31m 11s | Max: 55m 37s | Hits:  82%/5238  
      🟩 17                 Pass: 100%/26  | Total: 13h 34m | Avg: 31m 20s | Max:  1h 01m | Hits:  82%/2619  
      🟩 20                 Pass: 100%/22  | Total: 10h 00m | Avg: 27m 17s | Max: 54m 07s | Hits:  91%/5238  
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 13m 40s | Avg: 13m 40s | Max: 13m 40s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 208)

# Runner
171 linux-amd64-cpu16
16 linux-arm64-cpu16
12 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

@elstehle elstehle merged commit 16f9a1a into NVIDIA:main Oct 8, 2024
224 checks passed
@bhack

bhack commented Nov 22, 2024

Is this going in the 2.7.0 release?

@bernhardmgruber
Contributor

bernhardmgruber commented Nov 22, 2024

$ git tag --contains 16f9a1a
v9.9.9

There is no (normal) tag yet containing the commit made by this PR, so we will ship it in the next release, CCCL 2.8.
