Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try to work around issue with NVHPC in conjunction with older CTK versions #2889

Merged
merged 1 commit into from
Nov 21, 2024

Conversation

miscco
Copy link
Collaborator

@miscco miscco commented Nov 19, 2024

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those

@miscco miscco requested review from a team as code owners November 19, 2024 20:11
@miscco miscco changed the title Try to work around issue with NVHPC in conjunction of older CTK versions Try to work around issue with NVHPC in conjunction with older CTK versions Nov 19, 2024
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
cub/cub/thread/thread_operators.cuh Outdated Show resolved Hide resolved
@miscco miscco force-pushed the fix_extended_floating_point_nvhpc branch from 4b92514 to 06b9ca9 Compare November 19, 2024 21:30
Copy link
Contributor

🟩 CI finished in 3h 08m: Pass: 100%/224 | Total: 3d 01h | Avg: 19m 40s | Max: 1h 12m | Hits: 65%/12224
  • 🟩 thrust: Pass: 100%/111 | Total: 1d 09h | Avg: 18m 13s | Max: 1h 02m | Hits: 73%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 46m 24s | Avg: 23m 12s | Max: 33m 27s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total:  1d 07h | Avg: 18m 32s | Max:  1h 02m | Hits:  73%/9260  
      🟩 arm64              Pass: 100%/8   | Total:  1h 52m | Avg: 14m 04s | Max: 37m 46s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  4h 07m | Avg: 16m 29s | Max: 57m 58s | Hits:  67%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 35m 13s | Avg: 11m 44s | Max: 12m 18s
      🟩 12.5               Pass: 100%/4   | Total:  3h 50m | Avg: 57m 43s | Max:  1h 02m
      🟩 12.6               Pass: 100%/89  | Total:  1d 01h | Avg: 16m 57s | Max: 59m 43s | Hits:  75%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 38m 37s | Avg:  9m 39s | Max: 10m 22s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 07m | Avg: 16m 29s | Max: 57m 58s | Hits:  67%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 35m 13s | Avg: 11m 44s | Max: 12m 18s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 50m | Avg: 57m 43s | Max:  1h 02m
      🟩 nvcc12.6           Pass: 100%/85  | Total:  1d 00h | Avg: 17m 17s | Max: 59m 43s | Hits:  75%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 38m 37s | Avg:  9m 39s | Max: 10m 22s
      🟩 nvcc               Pass: 100%/107 | Total:  1d 09h | Avg: 18m 32s | Max:  1h 02m | Hits:  73%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 59m | Avg: 29m 57s | Max: 33m 19s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 42m | Avg: 34m 17s | Max: 37m 44s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 46s | Max: 36m 46s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 54s | Max: 42m 23s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 58s | Max: 33m 27s
      🟩 Clang14            Pass: 100%/4   | Total: 37m 13s | Avg:  9m 18s | Max:  9m 54s
      🟩 Clang15            Pass: 100%/4   | Total: 36m 26s | Avg:  9m 06s | Max:  9m 30s
      🟩 Clang16            Pass: 100%/4   | Total: 41m 10s | Avg: 10m 17s | Max: 11m 49s
      🟩 Clang17            Pass: 100%/4   | Total: 43m 07s | Avg: 10m 46s | Max: 12m 12s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 54m | Avg: 10m 27s | Max: 18m 03s
      🟩 GCC6               Pass: 100%/2   | Total: 17m 52s | Avg:  8m 56s | Max:  8m 59s
      🟩 GCC7               Pass: 100%/6   | Total: 57m 46s | Avg:  9m 37s | Max: 12m 29s
      🟩 GCC8               Pass: 100%/6   | Total: 56m 58s | Avg:  9m 29s | Max: 10m 04s
      🟩 GCC9               Pass: 100%/6   | Total: 58m 59s | Avg:  9m 49s | Max: 10m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 40m 39s | Avg: 10m 09s | Max: 10m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 16m | Avg: 10m 56s | Max: 12m 44s
      🟩 GCC12              Pass: 100%/4   | Total: 40m 58s | Avg: 10m 14s | Max: 10m 29s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 29m | Avg: 13m 05s | Max: 37m 46s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 34m 03s | Avg: 11m 21s | Max: 12m 26s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 58s | Avg: 57m 58s | Max: 57m 58s | Hits:  67%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 05s | Max: 58m 15s | Hits:  67%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 59s | Max: 59m 43s | Hits:  83%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 50m | Avg: 57m 43s | Max:  1h 02m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 15h 42m | Avg: 19m 37s | Max: 42m 23s
      🟩 GCC                Pass: 100%/51  | Total:  9h 19m | Avg: 10m 58s | Max: 37m 46s
      🟩 Intel              Pass: 100%/3   | Total: 34m 03s | Avg: 11m 21s | Max: 12m 26s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 16m | Avg: 51m 13s | Max: 59m 43s | Hits:  73%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 50m | Avg: 57m 43s | Max:  1h 02m
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total:  1d 09h | Avg: 18m 13s | Max:  1h 02m | Hits:  73%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 07h | Avg: 18m 34s | Max:  1h 02m | Hits:  67%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 01s | Avg: 11m 30s | Max: 22m 15s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 03m | Avg: 15m 50s | Max: 18m 37s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 35m 13s | Avg: 11m 44s | Max: 12m 18s
      🟩 90a                Pass: 100%/4   | Total: 22m 44s | Avg:  5m 41s | Max:  6m 19s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  7h 17m | Avg: 14m 35s | Max: 50m 47s
      🟩 14                 Pass: 100%/29  | Total:  9h 27m | Avg: 19m 33s | Max: 58m 48s | Hits:  67%/3704  
      🟩 17                 Pass: 100%/27  | Total:  8h 56m | Avg: 19m 52s | Max: 58m 50s | Hits:  67%/1852  
      🟩 20                 Pass: 100%/23  | Total:  7h 14m | Avg: 18m 54s | Max:  1h 02m | Hits:  83%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 1d 15h | Avg: 21m 24s | Max: 1h 12m | Hits: 41%/2964

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  1d 14h | Avg: 22m 42s | Max:  1h 12m | Hits:  41%/2964  
      🟩 arm64              Pass: 100%/8   | Total: 38m 38s | Avg:  4m 49s | Max:  6m 23s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  5h 06m | Avg: 20m 24s | Max:  1h 03m | Hits:  41%/741   
      🟩 11.8               Pass: 100%/3   | Total: 16m 48s | Avg:  5m 36s | Max:  5m 57s
      🟩 12.5               Pass: 100%/4   | Total:  4h 21m | Avg:  1h 05m | Max:  1h 09m
      🟩 12.6               Pass: 100%/88  | Total:  1d 05h | Avg: 20m 06s | Max:  1h 12m | Hits:  41%/2223  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 20s | Avg:  4m 05s | Max:  4m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  5h 06m | Avg: 20m 24s | Max:  1h 03m | Hits:  41%/741   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 48s | Avg:  5m 36s | Max:  5m 57s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 21m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  1d 05h | Avg: 20m 52s | Max:  1h 12m | Hits:  41%/2223  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 20s | Avg:  4m 05s | Max:  4m 13s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 14h | Avg: 22m 03s | Max:  1h 12m | Hits:  41%/2964  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 24m | Avg: 54m 06s | Max:  1h 03m
      🟩 Clang10            Pass: 100%/3   | Total:  2h 39m | Avg: 53m 13s | Max: 53m 30s
      🟩 Clang11            Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 12m
      🟩 Clang12            Pass: 100%/4   | Total:  3h 57m | Avg: 59m 18s | Max:  1h 11m
      🟩 Clang13            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 55m 54s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 07s | Avg:  5m 16s | Max:  5m 36s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 48s | Avg:  5m 27s | Max:  5m 50s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 54s | Avg:  5m 13s | Max:  5m 19s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 40s | Avg:  5m 10s | Max:  5m 24s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 38m | Avg:  8m 54s | Max: 37m 14s
      🟩 GCC6               Pass: 100%/2   | Total: 53m 05s | Avg: 26m 32s | Max: 48m 24s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 59s | Avg:  4m 39s | Max:  5m 49s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 21s | Avg:  4m 33s | Max:  5m 07s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 55s | Avg:  4m 49s | Max:  5m 41s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 11m | Avg: 17m 48s | Max: 55m 37s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 01s | Avg:  5m 34s | Max:  5m 57s
      🟩 GCC12              Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  5m 41s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 14m | Avg: 12m 10s | Max: 36m 33s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 02s | Avg:  6m 00s | Max:  6m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 04s | Avg: 55m 04s | Max: 55m 04s | Hits:  41%/741   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  41%/1482  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m | Hits:  41%/741   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 21m | Avg:  1h 05m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 22h 45m | Avg: 28m 27s | Max:  1h 12m
      🟩 GCC                Pass: 100%/51  | Total:  7h 43m | Avg:  9m 05s | Max: 55m 37s
      🟩 Intel              Pass: 100%/3   | Total: 18m 02s | Avg:  6m 00s | Max:  6m 09s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 09m | Hits:  41%/2964  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 21m | Avg:  1h 05m | Max:  1h 09m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  1d 15h | Avg: 21m 24s | Max:  1h 12m | Hits:  41%/2964  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  1d 11h | Avg: 21m 04s | Max:  1h 12m | Hits:  41%/2964  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 08s | Avg: 24m 08s | Max: 24m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 25s | Avg: 18m 25s | Max: 18m 25s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 47s | Max: 21m 25s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 40m | Avg: 33m 21s | Max: 37m 14s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 48s | Avg:  5m 36s | Max:  5m 57s
      🟩 90a                Pass: 100%/4   | Total: 16m 09s | Avg:  4m 02s | Max:  4m 26s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  9h 03m | Avg: 18m 06s | Max:  1h 01m
      🟩 14                 Pass: 100%/29  | Total: 10h 53m | Avg: 22m 32s | Max:  1h 09m | Hits:  41%/1482  
      🟩 17                 Pass: 100%/27  | Total:  9h 40m | Avg: 21m 30s | Max:  1h 11m | Hits:  41%/741   
      🟩 20                 Pass: 100%/24  | Total:  9h 36m | Avg: 24m 02s | Max:  1h 12m | Hits:  41%/741   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 39s | Avg: 5m 49s | Max: 9m 40s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  9m 40s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟩 Test               Pass: 100%/1   | Total:  9m 40s | Avg:  9m 40s | Max:  9m 40s
    
  • 🟩 python: Pass: 100%/1 | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 224)

# Runner
185 linux-amd64-cpu16
16 linux-arm64-cpu16
14 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those
@miscco miscco force-pushed the fix_extended_floating_point_nvhpc branch from 06b9ca9 to 9d6e83e Compare November 21, 2024 08:26
Copy link
Contributor

🟩 CI finished in 1h 37m: Pass: 100%/224 | Total: 2d 17h | Avg: 17m 37s | Max: 1h 11m | Hits: 60%/12288
  • 🟩 thrust: Pass: 100%/111 | Total: 1d 03h | Avg: 15m 02s | Max: 1h 07m | Hits: 70%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 11s | Avg: 11m 35s | Max: 16m 50s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total:  1d 03h | Avg: 15m 50s | Max:  1h 07m | Hits:  70%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 38m 19s | Avg:  4m 47s | Max:  5m 27s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 10m | Avg: 12m 43s | Max: 56m 03s | Hits:  62%/1852  
      🟩 11.8               Pass: 100%/3   | Total:  1h 02m | Avg: 20m 41s | Max: 51m 36s
      🟩 12.5               Pass: 100%/4   | Total:  3h 51m | Avg: 57m 55s | Max:  1h 04m
      🟩 12.6               Pass: 100%/89  | Total: 19h 44m | Avg: 13m 18s | Max:  1h 07m | Hits:  71%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  5m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 10m | Avg: 12m 43s | Max: 56m 03s | Hits:  62%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 02m | Avg: 20m 41s | Max: 51m 36s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 51m | Avg: 57m 55s | Max:  1h 04m
      🟩 nvcc12.6           Pass: 100%/85  | Total: 19h 25m | Avg: 13m 42s | Max:  1h 07m | Hits:  71%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  5m 05s
      🟩 nvcc               Pass: 100%/107 | Total:  1d 03h | Avg: 15m 25s | Max:  1h 07m | Hits:  70%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 03m | Avg: 30m 31s | Max: 34m 49s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 36m | Avg: 32m 18s | Max: 36m 03s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 55s | Max: 32m 51s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 46m 15s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 19s | Max: 34m 12s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 21s
      🟩 Clang15            Pass: 100%/4   | Total: 22m 06s | Avg:  5m 31s | Max:  6m 49s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 13s | Avg:  5m 03s | Max:  5m 32s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 16s | Avg:  5m 04s | Max:  5m 25s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 08m | Avg:  6m 14s | Max: 16m 56s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 47s
      🟩 GCC7               Pass: 100%/6   | Total: 29m 18s | Avg:  4m 53s | Max:  5m 42s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 18s | Avg:  4m 43s | Max:  5m 19s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 34s | Avg:  4m 45s | Max:  5m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 46s | Avg:  5m 26s | Max:  5m 49s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 24m | Avg: 12m 01s | Max: 51m 36s
      🟩 GCC12              Pass: 100%/4   | Total: 21m 53s | Avg:  5m 28s | Max:  6m 07s
      🟩 GCC13              Pass: 100%/16  | Total:  1h 53m | Avg:  7m 05s | Max: 16m 50s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 34s | Avg:  6m 51s | Max:  7m 05s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 56m 03s | Avg: 56m 03s | Max: 56m 03s | Hits:  62%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  62%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 30m | Avg: 45m 25s | Max:  1h 07m | Hits:  81%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 51m | Avg: 57m 55s | Max:  1h 04m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 13h 33m | Avg: 16m 57s | Max: 46m 15s
      🟩 GCC                Pass: 100%/51  | Total:  5h 36m | Avg:  6m 35s | Max: 51m 36s
      🟩 Intel              Pass: 100%/3   | Total: 20m 34s | Avg:  6m 51s | Max:  7m 05s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 26m | Avg: 53m 23s | Max:  1h 07m | Hits:  70%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 51m | Avg: 57m 55s | Max:  1h 04m
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total:  1d 03h | Avg: 15m 02s | Max:  1h 07m | Hits:  70%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 02h | Avg: 15m 10s | Max:  1h 07m | Hits:  62%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 36s | Avg: 11m 39s | Max: 23m 08s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total: 59m 43s | Avg: 14m 55s | Max: 16m 56s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 02m | Avg: 20m 41s | Max: 51m 36s
      🟩 90a                Pass: 100%/4   | Total: 17m 44s | Avg:  4m 26s | Max:  4m 36s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  5h 26m | Avg: 10m 52s | Max: 50m 05s
      🟩 14                 Pass: 100%/29  | Total:  8h 49m | Avg: 18m 15s | Max: 59m 50s | Hits:  62%/3704  
      🟩 17                 Pass: 100%/27  | Total:  7h 00m | Avg: 15m 35s | Max:  1h 02m | Hits:  62%/1852  
      🟩 20                 Pass: 100%/23  | Total:  6h 09m | Avg: 16m 04s | Max:  1h 07m | Hits:  81%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 1d 13h | Avg: 20m 29s | Max: 1h 11m | Hits: 31%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  1d 12h | Avg: 21m 43s | Max:  1h 11m | Hits:  31%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 38m 09s | Avg:  4m 46s | Max:  5m 08s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  4h 25m | Avg: 17m 43s | Max: 59m 07s | Hits:  32%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 29s | Avg:  5m 29s | Max:  6m 03s
      🟩 12.5               Pass: 100%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 09m
      🟩 12.6               Pass: 100%/88  | Total:  1d 04h | Avg: 19m 22s | Max:  1h 11m | Hits:  31%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 55s | Avg:  4m 13s | Max:  4m 26s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 25m | Avg: 17m 43s | Max: 59m 07s | Hits:  32%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 29s | Avg:  5m 29s | Max:  6m 03s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  1d 04h | Avg: 20m 05s | Max:  1h 11m | Hits:  31%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 55s | Avg:  4m 13s | Max:  4m 26s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 13h | Avg: 21m 06s | Max:  1h 11m | Hits:  31%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 30m | Avg: 55m 07s | Max: 59m 07s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 49m | Avg: 56m 20s | Max: 59m 15s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 49s | Max: 57m 12s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 45m | Avg: 56m 27s | Max:  1h 00m
      🟩 Clang13            Pass: 100%/4   | Total:  3h 45m | Avg: 56m 16s | Max: 57m 59s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  5m 31s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 16s | Avg:  5m 19s | Max:  5m 38s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 35s | Avg:  5m 23s | Max:  5m 33s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 11s | Avg:  5m 17s | Max:  5m 27s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 33m | Avg:  8m 30s | Max: 28m 41s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 52s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 06s | Avg:  4m 41s | Max:  5m 16s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 51s | Avg:  4m 58s | Max:  6m 08s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 56s | Avg:  4m 49s | Max:  5m 49s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 29s | Avg:  5m 22s | Max:  5m 39s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 11s | Avg:  5m 35s | Max:  6m 03s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 44s | Avg:  5m 41s | Max:  5m 53s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 56m | Avg: 11m 00s | Max: 27m 22s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 13s | Avg:  6m 24s | Max:  6m 41s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 07s | Avg: 59m 07s | Max: 59m 07s | Hits:  32%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits:  31%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:  31%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 22h 28m | Avg: 28m 05s | Max:  1h 00m
      🟩 GCC                Pass: 100%/51  | Total:  5h 55m | Avg:  6m 58s | Max: 27m 22s
      🟩 Intel              Pass: 100%/3   | Total: 19m 13s | Avg:  6m 24s | Max:  6m 41s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 23m | Avg:  1h 05m | Max:  1h 11m | Hits:  31%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 09m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  1d 13h | Avg: 20m 29s | Max:  1h 11m | Hits:  31%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  1d 10h | Avg: 20m 19s | Max:  1h 11m | Hits:  31%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 47s | Avg: 18m 47s | Max: 18m 47s
      🟩 GraphCapture       Pass: 100%/1   | Total: 22m 24s | Avg: 22m 24s | Max: 22m 24s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 40s | Max: 24m 19s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 17m | Avg: 25m 43s | Max: 28m 41s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 29s | Avg:  5m 29s | Max:  6m 03s
      🟩 90a                Pass: 100%/4   | Total: 17m 09s | Avg:  4m 17s | Max:  4m 34s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  9h 13m | Avg: 18m 27s | Max:  1h 09m
      🟩 14                 Pass: 100%/29  | Total: 10h 23m | Avg: 21m 29s | Max:  1h 08m | Hits:  32%/1514  
      🟩 17                 Pass: 100%/27  | Total:  9h 21m | Avg: 20m 48s | Max:  1h 07m | Hits:  31%/757   
      🟩 20                 Pass: 100%/24  | Total:  8h 35m | Avg: 21m 28s | Max:  1h 11m | Hits:  31%/757   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 57s | Avg: 5m 28s | Max: 8m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  8m 39s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
      🟩 Test               Pass: 100%/1   | Total:  8m 39s | Avg:  8m 39s | Max:  8m 39s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 224)

# Runner
185 linux-amd64-cpu16
16 linux-arm64-cpu16
14 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

@miscco miscco merged commit 1b8151c into NVIDIA:main Nov 21, 2024
239 checks passed
@miscco miscco deleted the fix_extended_floating_point_nvhpc branch November 21, 2024 15:05
davebayer pushed a commit to davebayer/cccl that referenced this pull request Nov 21, 2024
…ons (NVIDIA#2889)

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Nov 22, 2024
…ons (NVIDIA#2889)

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those
davebayer pushed a commit to davebayer/cccl that referenced this pull request Nov 22, 2024
…ons (NVIDIA#2889)

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those
trxcllnt pushed a commit to trxcllnt/cccl that referenced this pull request Nov 23, 2024
…ons (NVIDIA#2889)

NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants