Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regenerate cuda::ptx headers/docs and run format #2937

Merged
merged 3 commits into from
Nov 25, 2024

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Nov 22, 2024

Overwrites all generated PTX source and documentation files and runs pre-commit run --all-files

Companion PR for tests: #2953

Copy link
Contributor

🟨 CI finished in 1h 50m: Pass: 97%/396 | Total: 8d 18h | Avg: 31m 54s | Max: 1h 21m | Hits: 10%/22058
  • 🟨 libcudacxx: Pass: 93%/118 | Total: 1d 08h | Avg: 16m 17s | Max: 56m 29s | Hits: 4%/9524

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/110 | Total:  1d 06h | Avg: 16m 28s | Max: 56m 29s | Hits:   4%/9524  
      🟩 arm64              Pass: 100%/8   | Total:  1h 50m | Avg: 13m 49s | Max: 18m 05s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  4h 17m | Avg: 17m 09s | Max: 44m 07s | Hits:   4%/2187  
      🟩 11.8               Pass: 100%/3   | Total:  1h 08m | Avg: 22m 41s | Max: 25m 50s
      🟩 12.5               Pass: 100%/4   | Total:  2h 46m | Avg: 41m 37s | Max: 51m 02s
      🔍 12.6               Pass:  91%/96  | Total: 23h 51m | Avg: 14m 54s | Max: 56m 29s | Hits:   3%/7337  
    🔍 cudacxx: ClangCUDA18 🔍
      🔍 ClangCUDA18        Pass:  33%/12  | Total:  2h 26m | Avg: 12m 13s | Max: 19m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 17m | Avg: 17m 09s | Max: 44m 07s | Hits:   4%/2187  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 08m | Avg: 22m 41s | Max: 25m 50s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  2h 46m | Avg: 41m 37s | Max: 51m 02s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 21h 24m | Avg: 15m 17s | Max: 56m 29s | Hits:   3%/7337  
    🔍 cudacxx_family: ClangCUDA 🔍
      🔍 ClangCUDA          Pass:  33%/12  | Total:  2h 26m | Avg: 12m 13s | Max: 19m 23s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 05h | Avg: 16m 45s | Max: 56m 29s | Hits:   4%/9524  
    🔍 cxx: Clang18 🔍
      🟩 Clang9             Pass: 100%/6   | Total:  1h 10m | Avg: 11m 47s | Max: 21m 12s
      🟩 Clang10            Pass: 100%/3   | Total: 45m 22s | Avg: 15m 07s | Max: 23m 19s
      🟩 Clang11            Pass: 100%/4   | Total: 45m 47s | Avg: 11m 26s | Max: 20m 26s
      🟩 Clang12            Pass: 100%/4   | Total: 46m 21s | Avg: 11m 35s | Max: 19m 30s
      🟩 Clang13            Pass: 100%/4   | Total: 47m 42s | Avg: 11m 55s | Max: 19m 45s
      🟩 Clang14            Pass: 100%/4   | Total: 48m 24s | Avg: 12m 06s | Max: 20m 11s
      🟩 Clang15            Pass: 100%/4   | Total: 46m 28s | Avg: 11m 37s | Max: 19m 12s
      🟩 Clang16            Pass: 100%/4   | Total: 54m 08s | Avg: 13m 32s | Max: 17m 50s
      🟩 Clang17            Pass: 100%/4   | Total: 59m 15s | Avg: 14m 48s | Max: 20m 38s
      🔍 Clang18            Pass:  55%/18  | Total:  3h 50m | Avg: 12m 48s | Max: 19m 23s
      🟩 GCC6               Pass: 100%/2   | Total: 37m 12s | Avg: 18m 36s | Max: 20m 36s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 10m | Avg: 11m 42s | Max: 20m 57s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 28m | Avg: 14m 46s | Max: 22m 15s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 27m | Avg: 14m 39s | Max: 20m 29s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 16m | Avg: 19m 11s | Max: 23m 48s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 41m | Avg: 14m 27s | Max: 25m 50s
      🟩 GCC12              Pass: 100%/4   | Total: 50m 10s | Avg: 12m 32s | Max: 22m 20s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 50m | Avg: 17m 05s | Max: 56m 29s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 23m | Avg: 27m 40s | Max: 31m 36s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 44m 07s | Avg: 44m 07s | Max: 44m 07s | Hits:   4%/2187  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 33s | Max: 44m 52s | Hits:   4%/4737  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 50m 43s | Avg: 50m 43s | Max: 50m 43s | Hits:   3%/2600  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  2h 46m | Avg: 41m 37s | Max: 51m 02s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  85%/55  | Total: 11h 34m | Avg: 12m 37s | Max: 23m 19s
      🟩 GCC                Pass: 100%/52  | Total: 13h 22m | Avg: 15m 26s | Max: 56m 29s
      🟩 Intel              Pass: 100%/3   | Total:  1h 23m | Avg: 27m 40s | Max: 31m 36s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 55m | Avg: 43m 59s | Max: 50m 43s | Hits:   4%/9524  
      🟩 NVHPC              Pass: 100%/4   | Total:  2h 46m | Avg: 41m 37s | Max: 51m 02s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  92%/110 | Total:  1d 04h | Avg: 15m 35s | Max: 51m 02s | Hits:   4%/9524  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 59m | Avg: 29m 53s | Max: 35m 05s
      🟩 Test               Pass: 100%/3   | Total:  1h 26m | Avg: 28m 43s | Max: 56m 29s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
    🟨 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 08m | Avg: 22m 41s | Max: 25m 50s
      🟥 90                 Pass:   0%/4   | Total: 40m 20s | Avg: 10m 05s | Max: 12m 03s
      🟨 90a                Pass:  50%/8   | Total:  1h 08m | Avg:  8m 34s | Max: 12m 11s
    🟨 gpu
      🟨 v100               Pass:  93%/118 | Total:  1d 08h | Avg: 16m 17s | Max: 56m 29s | Hits:   4%/9524  
    🟨 std
      🟨 11                 Pass:  93%/32  | Total:  7h 12m | Avg: 13m 31s | Max: 31m 22s
      🟨 14                 Pass:  93%/32  | Total:  8h 00m | Avg: 15m 00s | Max: 44m 07s | Hits:   4%/4477  
      🟨 17                 Pass:  93%/30  | Total:  9h 24m | Avg: 18m 49s | Max: 44m 52s | Hits:   4%/2447  
      🟨 20                 Pass:  91%/23  | Total:  7h 22m | Avg: 19m 15s | Max: 56m 29s | Hits:   3%/2600  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 2d 17h | Avg: 35m 22s | Max: 1h 21m | Hits: 20%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 48m 23s | Avg: 24m 11s | Max: 35m 42s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total:  2d 12h | Avg: 35m 31s | Max:  1h 21m | Hits:  20%/9260  
      🟩 arm64              Pass: 100%/8   | Total:  4h 26m | Avg: 33m 17s | Max: 39m 09s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 14m | Avg: 32m 59s | Max:  1h 11m | Hits:   0%/1852  
      🟩 11.8               Pass: 100%/3   | Total:  2h 10m | Avg: 43m 26s | Max: 49m 15s
      🟩 12.5               Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 17m
      🟩 12.6               Pass: 100%/89  | Total:  2d 02h | Avg: 33m 47s | Max:  1h 21m | Hits:  25%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 51m | Avg: 27m 52s | Max: 30m 45s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 14m | Avg: 32m 59s | Max:  1h 11m | Hits:   0%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 10m | Avg: 43m 26s | Max: 49m 15s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 17m
      🟩 nvcc12.6           Pass: 100%/85  | Total:  2d 00h | Avg: 34m 03s | Max:  1h 21m | Hits:  25%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 51m | Avg: 27m 52s | Max: 30m 45s
      🟩 nvcc               Pass: 100%/107 | Total:  2d 15h | Avg: 35m 38s | Max:  1h 21m | Hits:  20%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 13m | Avg: 32m 17s | Max: 37m 51s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 46m | Avg: 35m 22s | Max: 39m 32s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 02s | Max: 39m 11s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 19m | Avg: 34m 45s | Max: 37m 40s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 19m | Avg: 34m 54s | Max: 38m 23s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 28s | Max: 38m 44s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 33m | Avg: 38m 18s | Max: 47m 13s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 22m | Avg: 35m 40s | Max: 39m 31s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 15m | Avg: 33m 45s | Max: 39m 44s
      🟩 Clang18            Pass: 100%/11  | Total:  4h 55m | Avg: 26m 52s | Max: 36m 31s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 15s | Max: 35m 48s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 11m | Avg: 31m 58s | Max: 36m 45s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 11m | Avg: 31m 54s | Max: 39m 36s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 17m | Avg: 32m 57s | Max: 39m 29s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 26m | Avg: 36m 40s | Max: 40m 52s
      🟩 GCC11              Pass: 100%/7   | Total:  4h 34m | Avg: 39m 12s | Max: 49m 15s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 29m | Avg: 37m 17s | Max: 42m 56s
      🟩 GCC13              Pass: 100%/16  | Total:  6h 26m | Avg: 24m 09s | Max: 45m 44s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 05m | Avg: 41m 58s | Max: 46m 37s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:   0%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 20m | Hits:   0%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 45s | Max:  1h 21m | Hits:  49%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 17m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 02h | Avg: 32m 58s | Max: 47m 13s
      🟩 GCC                Pass: 100%/51  | Total:  1d 02h | Avg: 31m 20s | Max: 49m 15s
      🟩 Intel              Pass: 100%/3   | Total:  2h 05m | Avg: 41m 58s | Max: 46m 37s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 24m | Avg:  1h 04m | Max:  1h 21m | Hits:  20%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 17m
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total:  2d 17h | Avg: 35m 22s | Max:  1h 21m | Hits:  20%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 15h | Avg: 37m 10s | Max:  1h 21m | Hits:   0%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 48s | Avg: 11m 12s | Max: 21m 56s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 10s | Avg: 12m 47s | Max: 13m 48s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 10m | Avg: 43m 26s | Max: 49m 15s
      🟩 90a                Pass: 100%/4   | Total:  1h 21m | Avg: 20m 23s | Max: 23m 00s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 13h 58m | Avg: 27m 56s | Max:  1h 04m
      🟩 14                 Pass: 100%/29  | Total: 18h 52m | Avg: 39m 02s | Max:  1h 17m | Hits:   0%/3704  
      🟩 17                 Pass: 100%/27  | Total: 18h 05m | Avg: 40m 11s | Max:  1h 20m | Hits:   0%/1852  
      🟩 20                 Pass: 100%/23  | Total: 13h 41m | Avg: 35m 43s | Max:  1h 21m | Hits:  49%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 4d 04h | Avg: 54m 37s | Max: 1h 16m | Hits: 2%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  3d 20h | Avg: 54m 11s | Max:  1h 16m | Hits:   2%/3028  
      🟩 arm64              Pass: 100%/8   | Total:  8h 00m | Avg:  1h 00m | Max:  1h 04m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 12h 46m | Avg: 51m 07s | Max: 57m 26s | Hits:   2%/757   
      🟩 11.8               Pass: 100%/3   | Total:  3h 42m | Avg:  1h 14m | Max:  1h 16m
      🟩 12.5               Pass: 100%/4   | Total:  4h 47m | Avg:  1h 11m | Max:  1h 14m
      🟩 12.6               Pass: 100%/88  | Total:  3d 06h | Avg: 53m 46s | Max:  1h 08m | Hits:   2%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 03m
      🟩 nvcc11.1           Pass: 100%/15  | Total: 12h 46m | Avg: 51m 07s | Max: 57m 26s | Hits:   2%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 42m | Avg:  1h 14m | Max:  1h 16m
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 47m | Avg:  1h 11m | Max:  1h 14m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  3d 02h | Avg: 53m 25s | Max:  1h 08m | Hits:   2%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 03m
      🟩 nvcc               Pass: 100%/106 | Total:  4d 00h | Avg: 54m 22s | Max:  1h 16m | Hits:   2%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 20m | Avg: 53m 22s | Max:  1h 00m
      🟩 Clang10            Pass: 100%/3   | Total:  2h 54m | Avg: 58m 00s | Max:  1h 02m
      🟩 Clang11            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 30s | Max: 57m 13s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 52m | Avg: 58m 00s | Max:  1h 03m
      🟩 Clang13            Pass: 100%/4   | Total:  3h 47m | Avg: 56m 58s | Max:  1h 02m
      🟩 Clang14            Pass: 100%/4   | Total:  3h 52m | Avg: 58m 01s | Max:  1h 01m
      🟩 Clang15            Pass: 100%/4   | Total:  3h 49m | Avg: 57m 25s | Max:  1h 00m
      🟩 Clang16            Pass: 100%/4   | Total:  3h 51m | Avg: 57m 50s | Max: 59m 53s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 49m | Avg: 57m 18s | Max:  1h 02m
      🟩 Clang18            Pass: 100%/11  | Total:  9h 37m | Avg: 52m 27s | Max:  1h 04m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 43m | Avg: 51m 43s | Max: 53m 24s
      🟩 GCC7               Pass: 100%/6   | Total:  5h 21m | Avg: 53m 31s | Max: 59m 43s
      🟩 GCC8               Pass: 100%/6   | Total:  5h 17m | Avg: 52m 50s | Max:  1h 00m
      🟩 GCC9               Pass: 100%/6   | Total:  5h 32m | Avg: 55m 20s | Max:  1h 02m
      🟩 GCC10              Pass: 100%/4   | Total:  4h 00m | Avg:  1h 00m | Max:  1h 02m
      🟩 GCC11              Pass: 100%/7   | Total:  7h 39m | Avg:  1h 05m | Max:  1h 16m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 55m | Avg: 58m 52s | Max:  1h 06m
      🟩 GCC13              Pass: 100%/16  | Total:  9h 46m | Avg: 36m 40s | Max:  1h 03m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  3h 09m | Avg:  1h 03m | Max:  1h 07m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 26s | Avg: 57m 26s | Max: 57m 26s | Hits:   2%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits:   2%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:   2%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 47m | Avg:  1h 11m | Max:  1h 14m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 20h | Avg: 55m 44s | Max:  1h 04m
      🟩 GCC                Pass: 100%/51  | Total:  1d 19h | Avg: 50m 53s | Max:  1h 16m
      🟩 Intel              Pass: 100%/3   | Total:  3h 09m | Avg:  1h 03m | Max:  1h 07m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 19m | Avg:  1h 04m | Max:  1h 08m | Hits:   2%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 47m | Avg:  1h 11m | Max:  1h 14m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  4d 04h | Avg: 54m 37s | Max:  1h 16m | Hits:   2%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  4d 01h | Avg: 57m 18s | Max:  1h 16m | Hits:   2%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 08s | Avg: 19m 08s | Max: 19m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 52s | Avg: 17m 52s | Max: 17m 52s
      🟩 HostLaunch         Pass: 100%/3   | Total: 51m 14s | Avg: 17m 04s | Max: 19m 10s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 14m | Avg: 24m 49s | Max: 27m 08s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 42m | Avg:  1h 14m | Max:  1h 16m
      🟩 90a                Pass: 100%/4   | Total:  1h 44m | Avg: 26m 08s | Max: 28m 05s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  1d 01h | Avg: 51m 05s | Max:  1h 10m
      🟩 14                 Pass: 100%/29  | Total:  1d 04h | Avg: 58m 31s | Max:  1h 16m | Hits:   2%/1514  
      🟩 17                 Pass: 100%/27  | Total:  1d 02h | Avg: 58m 39s | Max:  1h 15m | Hits:   2%/757   
      🟩 20                 Pass: 100%/24  | Total: 19h 53m | Avg: 49m 44s | Max:  1h 13m | Hits:   2%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 12h 36m | Avg: 14m 00s | Max: 18m 12s | Hits: 11%/246

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total: 11h 41m | Avg: 14m 02s | Max: 18m 12s | Hits:  11%/246   
      🟩 arm64              Pass: 100%/4   | Total: 54m 13s | Avg: 13m 33s | Max: 14m 09s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  4h 15m | Avg: 13m 25s | Max: 15m 49s | Hits:  11%/123   
      🟩 12.5               Pass: 100%/2   | Total: 21m 36s | Avg: 10m 48s | Max: 10m 52s
      🟩 12.6               Pass: 100%/33  | Total:  7h 59m | Avg: 14m 31s | Max: 18m 12s | Hits:  11%/123   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  4h 15m | Avg: 13m 25s | Max: 15m 49s | Hits:  11%/123   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 21m 36s | Avg: 10m 48s | Max: 10m 52s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  7h 59m | Avg: 14m 31s | Max: 18m 12s | Hits:  11%/123   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total: 12h 36m | Avg: 14m 00s | Max: 18m 12s | Hits:  11%/246   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total: 26m 14s | Avg: 13m 07s | Max: 13m 44s
      🟩 Clang10            Pass: 100%/2   | Total: 28m 23s | Avg: 14m 11s | Max: 14m 13s
      🟩 Clang11            Pass: 100%/4   | Total: 54m 39s | Avg: 13m 39s | Max: 14m 29s
      🟩 Clang12            Pass: 100%/4   | Total: 55m 32s | Avg: 13m 53s | Max: 14m 54s
      🟩 Clang13            Pass: 100%/4   | Total: 53m 06s | Avg: 13m 16s | Max: 13m 55s
      🟩 Clang14            Pass: 100%/4   | Total: 54m 46s | Avg: 13m 41s | Max: 14m 06s
      🟩 Clang15            Pass: 100%/2   | Total: 31m 03s | Avg: 15m 31s | Max: 16m 22s
      🟩 Clang16            Pass: 100%/4   | Total: 56m 58s | Avg: 14m 14s | Max: 15m 17s
      🟩 Clang17            Pass: 100%/2   | Total: 29m 05s | Avg: 14m 32s | Max: 14m 56s
      🟩 Clang18            Pass: 100%/2   | Total: 33m 09s | Avg: 16m 34s | Max: 17m 23s
      🟩 GCC9               Pass: 100%/2   | Total: 27m 41s | Avg: 13m 50s | Max: 14m 30s
      🟩 GCC10              Pass: 100%/4   | Total: 57m 13s | Avg: 14m 18s | Max: 15m 10s
      🟩 GCC11              Pass: 100%/4   | Total: 59m 05s | Avg: 14m 46s | Max: 16m 27s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 48m | Avg: 15m 26s | Max: 18m 12s
      🟩 GCC13              Pass: 100%/3   | Total: 38m 14s | Avg: 12m 44s | Max: 14m 05s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 52s | Avg: 10m 52s | Max: 10m 52s | Hits:  11%/123   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 24s | Avg: 10m 24s | Max: 10m 24s | Hits:  11%/123   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 21m 36s | Avg: 10m 48s | Max: 10m 52s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  7h 02m | Avg: 14m 05s | Max: 17m 23s
      🟩 GCC                Pass: 100%/20  | Total:  4h 50m | Avg: 14m 30s | Max: 18m 12s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 16s | Avg: 10m 38s | Max: 10m 52s | Hits:  11%/246   
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 36s | Avg: 10m 48s | Max: 10m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total: 12h 36m | Avg: 14m 00s | Max: 18m 12s | Hits:  11%/246   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total: 11h 15m | Avg: 13m 47s | Max: 18m 01s | Hits:  11%/246   
      🟩 Test               Pass: 100%/5   | Total:  1h 20m | Avg: 16m 01s | Max: 18m 12s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 10m 36s | Avg: 10m 36s | Max: 10m 36s
      🟩 90a                Pass: 100%/1   | Total: 11m 16s | Avg: 11m 16s | Max: 11m 16s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  6h 38m | Avg: 13m 43s | Max: 18m 12s
      🟩 20                 Pass: 100%/25  | Total:  5h 57m | Avg: 14m 18s | Max: 18m 01s | Hits:  11%/246   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 08s | Avg: 5m 04s | Max: 7m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  7m 47s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s
      🟩 Test               Pass: 100%/1   | Total:  7m 47s | Avg:  7m 47s | Max:  7m 47s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 40s | Avg: 15m 40s | Max: 15m 40s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber changed the title Regenerate PTX files and format Regenerate cuda::ptx files and run format Nov 22, 2024
@bernhardmgruber
Copy link
Contributor Author

@miscco We added the header guard to the public headers. Do you have any other wishes? I would then split the PR to propose the changes to tests and sources separately.

@miscco
Copy link
Collaborator

miscco commented Nov 25, 2024

LGTM

@bernhardmgruber
Copy link
Contributor Author

Because I changed the file extension from *.inc to *.h, the generated files are now checked by the header test, which they fail, because they are not standalone :S

Overwrites all generated PTX header and documentation files and runs `pre-commit run --all-files`
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner November 25, 2024 16:22
@bernhardmgruber bernhardmgruber changed the title Regenerate cuda::ptx files and run format Regenerate cuda::ptx headers/docs and run format Nov 25, 2024
@bernhardmgruber
Copy link
Contributor Author

Because I changed the file extension from *.inc to *.h, the generated files are now checked by the header test, which they fail, because they are not standalone :S

I excluded the generated headers from the header check.

Copy link
Contributor

🟩 CI finished in 1h 49m: Pass: 100%/396 | Total: 2d 19h | Avg: 10m 12s | Max: 1h 16m | Hits: 49%/22058
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 19h 15m | Avg: 9m 47s | Max: 48m 35s | Hits: 30%/9524

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 18h 39m | Avg: 10m 10s | Max: 48m 35s | Hits:  30%/9524  
      🟩 arm64              Pass: 100%/8   | Total: 36m 34s | Avg:  4m 34s | Max:  8m 27s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 25m | Avg:  9m 41s | Max: 38m 04s | Hits:  33%/2187  
      🟩 11.8               Pass: 100%/3   | Total: 29m 48s | Avg:  9m 56s | Max: 23m 10s
      🟩 12.5               Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 40m 40s
      🟩 12.6               Pass: 100%/96  | Total: 14h 31m | Avg:  9m 04s | Max: 48m 35s | Hits:  29%/7337  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 34m | Avg: 12m 51s | Max: 21m 04s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 25m | Avg:  9m 41s | Max: 38m 04s | Hits:  33%/2187  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 29m 48s | Avg:  9m 56s | Max: 23m 10s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 40m 40s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 56m | Avg:  8m 32s | Max: 48m 35s | Hits:  29%/7337  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 34m | Avg: 12m 51s | Max: 21m 04s
      🟩 nvcc               Pass: 100%/106 | Total: 16h 41m | Avg:  9m 26s | Max: 48m 35s | Hits:  30%/9524  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 46m 13s | Avg:  7m 42s | Max: 20m 28s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 49s | Avg:  6m 16s | Max:  7m 23s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 16s | Avg:  5m 04s | Max:  5m 57s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 47s | Avg:  4m 41s | Max:  5m 27s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 03s | Avg:  5m 00s | Max:  5m 36s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  6m 56s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  6m 01s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  6m 17s
      🟩 Clang17            Pass: 100%/4   | Total: 34m 57s | Avg:  8m 44s | Max: 20m 09s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 23m | Avg: 11m 19s | Max: 22m 45s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 35s | Avg:  2m 47s | Max:  2m 50s
      🟩 GCC7               Pass: 100%/6   | Total: 49m 50s | Avg:  8m 18s | Max: 18m 32s
      🟩 GCC8               Pass: 100%/6   | Total: 48m 53s | Avg:  8m 08s | Max: 17m 41s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 29s | Avg:  3m 44s | Max:  4m 29s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 23s | Avg:  4m 35s | Max:  5m 37s
      🟩 GCC11              Pass: 100%/7   | Total: 49m 13s | Avg:  7m 01s | Max: 23m 10s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 21s | Avg:  4m 35s | Max:  5m 41s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 47m | Avg:  9m 52s | Max: 22m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 20m | Avg: 26m 51s | Max: 32m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 38m 04s | Avg: 38m 04s | Max: 38m 04s | Hits:  33%/2187  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 41m 10s | Hits:  30%/4737  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 35s | Avg: 48m 35s | Max: 48m 35s | Hits:  27%/2600  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 40m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  7h 04m | Avg:  7m 42s | Max: 22m 45s
      🟩 GCC                Pass: 100%/52  | Total:  6h 20m | Avg:  7m 19s | Max: 23m 10s
      🟩 Intel              Pass: 100%/3   | Total:  1h 20m | Avg: 26m 51s | Max: 32m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 41m | Avg: 40m 19s | Max: 48m 35s | Hits:  30%/9524  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 49m | Avg: 27m 19s | Max: 40m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 19h 15m | Avg:  9m 47s | Max: 48m 35s | Hits:  30%/9524  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 16h 55m | Avg:  9m 13s | Max: 48m 35s | Hits:  30%/9524  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 21m | Avg: 20m 17s | Max: 22m 36s
      🟩 Test               Pass: 100%/3   | Total: 57m 43s | Avg: 19m 14s | Max: 22m 45s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 29m 48s | Avg:  9m 56s | Max: 23m 10s
      🟩 90                 Pass: 100%/4   | Total: 42m 02s | Avg: 10m 30s | Max: 12m 59s
      🟩 90a                Pass: 100%/8   | Total:  1h 09m | Avg:  8m 42s | Max: 13m 23s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  3h 57m | Avg:  7m 24s | Max: 27m 00s
      🟩 14                 Pass: 100%/32  | Total:  5h 14m | Avg:  9m 50s | Max: 38m 04s | Hits:  32%/4477  
      🟩 17                 Pass: 100%/30  | Total:  5h 38m | Avg: 11m 16s | Max: 41m 10s | Hits:  29%/2447  
      🟩 20                 Pass: 100%/23  | Total:  4h 23m | Avg: 11m 27s | Max: 48m 35s | Hits:  27%/2600  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 19h 39m | Avg: 10m 37s | Max: 1h 14m | Hits: 69%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max: 12m 10s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 19h 01m | Avg: 11m 05s | Max:  1h 14m | Hits:  69%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 38m 10s | Avg:  4m 46s | Max:  5m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 57m | Avg:  7m 50s | Max: 57m 23s | Hits:  63%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 16m 37s | Avg:  5m 32s | Max:  5m 51s
      🟩 12.5               Pass: 100%/4   | Total:  3h 44m | Avg: 56m 08s | Max:  1h 06m
      🟩 12.6               Pass: 100%/89  | Total: 13h 41m | Avg:  9m 13s | Max:  1h 14m | Hits:  70%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 30s | Avg:  4m 52s | Max:  4m 59s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 57m | Avg:  7m 50s | Max: 57m 23s | Hits:  63%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 37s | Avg:  5m 32s | Max:  5m 51s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 44m | Avg: 56m 08s | Max:  1h 06m
      🟩 nvcc12.6           Pass: 100%/85  | Total: 13h 21m | Avg:  9m 25s | Max:  1h 14m | Hits:  70%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 30s | Avg:  4m 52s | Max:  4m 59s
      🟩 nvcc               Pass: 100%/107 | Total: 19h 20m | Avg: 10m 50s | Max:  1h 14m | Hits:  69%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 43s | Avg:  5m 27s | Max:  6m 53s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 04s | Avg:  6m 21s | Max:  6m 49s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 46s | Avg:  5m 11s | Max:  5m 29s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 36s | Avg:  5m 09s | Max:  5m 26s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 26s | Avg:  5m 21s | Max:  5m 33s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 16s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  5m 48s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 31s | Avg:  5m 22s | Max:  5m 46s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 21s | Avg:  5m 20s | Max:  5m 36s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 05m | Avg:  5m 58s | Max: 16m 25s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 37s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 46s | Avg:  4m 37s | Max:  5m 05s
      🟩 GCC8               Pass: 100%/6   | Total: 26m 44s | Avg:  4m 27s | Max:  5m 01s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 33s | Avg:  4m 55s | Max:  5m 54s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 25s | Avg:  5m 36s | Max:  5m 52s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 56s | Avg:  5m 25s | Max:  5m 51s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 44s | Avg:  5m 41s | Max:  6m 00s
      🟩 GCC13              Pass: 100%/16  | Total:  1h 51m | Avg:  6m 57s | Max: 14m 46s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 08m | Avg: 42m 54s | Max: 49m 45s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 57m 23s | Avg: 57m 23s | Max: 57m 23s | Hits:  63%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  63%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 38m | Avg: 49m 00s | Max:  1h 14m | Hits:  78%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 44m | Avg: 56m 08s | Max:  1h 06m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 24m | Avg:  5m 30s | Max: 16m 25s
      🟩 GCC                Pass: 100%/51  | Total:  4h 46m | Avg:  5m 37s | Max: 14m 46s
      🟩 Intel              Pass: 100%/3   | Total:  2h 08m | Avg: 42m 54s | Max: 49m 45s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 36m | Avg: 55m 12s | Max:  1h 14m | Hits:  69%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 44m | Avg: 56m 08s | Max:  1h 06m
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 19h 39m | Avg: 10m 37s | Max:  1h 14m | Hits:  69%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 17h 57m | Avg: 10m 27s | Max:  1h 14m | Hits:  61%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 15s | Avg: 11m 33s | Max: 23m 54s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total: 56m 12s | Avg: 14m 03s | Max: 16m 25s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 37s | Avg:  5m 32s | Max:  5m 51s
      🟩 90a                Pass: 100%/4   | Total: 18m 13s | Avg:  4m 33s | Max:  4m 43s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 43m | Avg:  7m 26s | Max: 42m 51s
      🟩 14                 Pass: 100%/29  | Total:  5h 50m | Avg: 12m 04s | Max:  1h 01m | Hits:  63%/3704  
      🟩 17                 Pass: 100%/27  | Total:  5h 04m | Avg: 11m 15s | Max:  1h 06m | Hits:  63%/1852  
      🟩 20                 Pass: 100%/23  | Total:  4h 44m | Avg: 12m 22s | Max:  1h 14m | Hits:  78%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 23h 47m | Avg: 12m 58s | Max: 1h 16m | Hits: 50%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 23h 10m | Avg: 13m 38s | Max:  1h 16m | Hits:  50%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 37m 04s | Avg:  4m 38s | Max:  4m 56s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 58m | Avg:  7m 53s | Max: 55m 56s | Hits:  65%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 18s | Avg:  5m 26s | Max:  5m 28s
      🟩 12.5               Pass: 100%/4   | Total:  4h 32m | Avg:  1h 08m | Max:  1h 16m
      🟩 12.6               Pass: 100%/88  | Total: 17h 00m | Avg: 11m 36s | Max:  1h 11m | Hits:  44%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 39s | Avg:  4m 09s | Max:  4m 33s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 58m | Avg:  7m 53s | Max: 55m 56s | Hits:  65%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 18s | Avg:  5m 26s | Max:  5m 28s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 32m | Avg:  1h 08m | Max:  1h 16m
      🟩 nvcc12.6           Pass: 100%/84  | Total: 16h 44m | Avg: 11m 57s | Max:  1h 11m | Hits:  44%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 39s | Avg:  4m 09s | Max:  4m 33s
      🟩 nvcc               Pass: 100%/106 | Total: 23h 31m | Avg: 13m 18s | Max:  1h 16m | Hits:  50%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 50s | Avg:  5m 28s | Max:  6m 31s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 18s | Avg:  6m 26s | Max:  6m 41s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 20s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 48s | Avg:  5m 27s | Max:  5m 58s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 25s | Avg:  5m 21s | Max:  5m 32s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 41s | Avg:  5m 25s | Max:  5m 40s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 48s | Avg:  5m 27s | Max:  5m 42s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  5m 39s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 19m | Avg:  7m 15s | Max: 19m 59s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 49s | Avg:  4m 24s | Max:  4m 38s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 35s | Avg:  4m 45s | Max:  5m 30s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 51s | Avg:  4m 58s | Max:  5m 35s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 16m | Avg: 12m 48s | Max: 53m 44s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 27s | Avg:  5m 21s | Max:  5m 32s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 56s | Avg:  5m 25s | Max:  5m 37s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 25s | Avg:  5m 36s | Max:  5m 47s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 35m | Avg: 13m 29s | Max: 27m 37s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 59m | Avg: 59m 49s | Max:  1h 01m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 55m 56s | Avg: 55m 56s | Max: 55m 56s | Hits:  65%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 06m | Hits:  65%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:   3%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 32m | Avg:  1h 08m | Max:  1h 16m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 41m | Avg:  5m 52s | Max: 19m 59s
      🟩 GCC                Pass: 100%/51  | Total:  7h 21m | Avg:  8m 39s | Max: 53m 44s
      🟩 Intel              Pass: 100%/3   | Total:  2h 59m | Avg: 59m 49s | Max:  1h 01m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 12m | Avg:  1h 03m | Max:  1h 11m | Hits:  50%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 32m | Avg:  1h 08m | Max:  1h 16m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 23h 47m | Avg: 12m 58s | Max:  1h 16m | Hits:  50%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 20h 41m | Avg: 12m 10s | Max:  1h 16m | Hits:  50%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 26m 01s | Avg: 26m 01s | Max: 26m 01s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 05s | Max: 27m 09s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 11m | Avg: 23m 55s | Max: 27m 37s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 18s | Avg:  5m 26s | Max:  5m 28s
      🟩 90a                Pass: 100%/4   | Total: 38m 33s | Avg:  9m 38s | Max: 25m 53s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  4h 58m | Avg:  9m 56s | Max: 59m 41s
      🟩 14                 Pass: 100%/29  | Total:  6h 14m | Avg: 12m 54s | Max:  1h 06m | Hits:  65%/1514  
      🟩 17                 Pass: 100%/27  | Total:  6h 06m | Avg: 13m 34s | Max:  1h 10m | Hits:  65%/757   
      🟩 20                 Pass: 100%/24  | Total:  6h 28m | Avg: 16m 10s | Max:  1h 16m | Hits:   3%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 13m | Avg: 4m 41s | Max: 17m 41s | Hits: 60%/246

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 03m | Avg:  4m 52s | Max: 17m 41s | Hits:  60%/246   
      🟩 arm64              Pass: 100%/4   | Total: 10m 11s | Avg:  2m 32s | Max:  2m 38s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 29m | Avg:  4m 41s | Max: 17m 27s | Hits:  60%/123   
      🟩 12.5               Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 01s
      🟩 12.6               Pass: 100%/33  | Total:  2h 26m | Avg:  4m 26s | Max: 17m 41s | Hits:  60%/123   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 29m | Avg:  4m 41s | Max: 17m 27s | Hits:  60%/123   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 01s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 26m | Avg:  4m 26s | Max: 17m 41s | Hits:  60%/123   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 13m | Avg:  4m 41s | Max: 17m 41s | Hits:  60%/246   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 32s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 42s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 41s | Avg:  2m 55s | Max:  3m 12s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 46s | Avg:  2m 56s | Max:  3m 01s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 13s | Avg:  3m 03s | Max:  3m 07s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 14s | Avg:  6m 18s | Max: 16m 00s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 05s | Avg:  3m 02s | Max:  3m 07s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 15s | Avg:  2m 48s | Max:  3m 08s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 37s | Avg:  3m 18s | Max:  3m 27s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 37s | Avg:  9m 48s | Max: 16m 21s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 27s | Avg:  2m 43s | Max:  2m 49s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 46s | Avg:  2m 56s | Max:  3m 07s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 54s | Avg:  2m 58s | Max:  3m 06s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 02m | Avg:  8m 56s | Max: 17m 41s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 39s | Avg:  2m 33s | Max:  2m 34s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 20s | Avg:  9m 20s | Max:  9m 20s | Hits:  60%/123   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 12s | Avg:  9m 12s | Max:  9m 12s | Hits:  60%/123   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 58m | Avg:  3m 56s | Max: 16m 21s
      🟩 GCC                Pass: 100%/20  | Total:  1h 39m | Avg:  4m 57s | Max: 17m 41s
      🟩 MSVC               Pass: 100%/2   | Total: 18m 32s | Avg:  9m 16s | Max:  9m 20s | Hits:  60%/246   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 13m | Avg:  4m 41s | Max: 17m 41s | Hits:  60%/246   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 50m | Avg:  3m 29s | Max:  9m 20s | Hits:  60%/246   
      🟩 Test               Pass: 100%/5   | Total:  1h 23m | Avg: 16m 37s | Max: 17m 41s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s
      🟩 90a                Pass: 100%/1   | Total:  2m 33s | Avg:  2m 33s | Max:  2m 33s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  1h 59m | Avg:  4m 07s | Max: 17m 27s
      🟩 20                 Pass: 100%/25  | Total:  2h 14m | Avg:  5m 22s | Max: 17m 41s | Hits:  60%/246   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 27s | Avg: 4m 43s | Max: 7m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  7m 24s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit ace320b into NVIDIA:main Nov 25, 2024
412 checks passed
@bernhardmgruber bernhardmgruber deleted the ptx_gen_overwrite branch November 25, 2024 18:39
davebayer pushed a commit to davebayer/cccl that referenced this pull request Nov 25, 2024
Overwrites all generated PTX header and documentation files and runs `pre-commit run --all-files`. Also exclude generated PTX headers from header check.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants