Drop C++11 and C++14 support for all of cccl #3417

Merged
merged 7 commits into from
Jan 21, 2025

Conversation

miscco
Contributor

@miscco miscco commented Jan 16, 2025

We already dropped support for C++11 and C++14 in CUB and Thrust.

This PR also removes support for those standard dialects in libcu++, so C++17 becomes the minimum dialect for all of CCCL.
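For readers who want to see what a minimum-dialect requirement looks like in practice, here is a minimal sketch of a compile-time guard. It is not the check CCCL itself uses: `dialect_supported` and `EXAMPLE_CPLUSPLUS` are hypothetical names introduced only for this illustration. The sketch assumes the standard `__cplusplus` values (201703L for C++17) and uses `_MSVC_LANG` where it is defined, because MSVC reports 199711L in `__cplusplus` unless `/Zc:__cplusplus` is passed.

```cpp
// Hypothetical helper (not a CCCL API): true when the given value of the
// __cplusplus macro denotes C++17 or newer.
constexpr bool dialect_supported(long cplusplus_value)
{
  return cplusplus_value >= 201703L;
}

// MSVC keeps __cplusplus at 199711L unless /Zc:__cplusplus is passed,
// so read _MSVC_LANG when it is available.
#if defined(_MSVC_LANG)
#  define EXAMPLE_CPLUSPLUS _MSVC_LANG
#else
#  define EXAMPLE_CPLUSPLUS __cplusplus
#endif

// Stops the build with a readable message under C++11 or C++14.
static_assert(dialect_supported(EXAMPLE_CPLUSPLUS),
              "This example requires at least C++17.");
```

Compiling this with `-std=c++14` should stop at the `static_assert`, while `-std=c++17` or newer compiles cleanly.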

@miscco miscco requested review from a team as code owners January 16, 2025 08:01
@miscco miscco force-pushed the drop_old_dialects branch 3 times, most recently from c5a635d to ee0b495 Compare January 16, 2025 08:07
@miscco miscco added the 2.8.0 (target for 2.8.0 release), breaking (Breaking change), and 3.0 (Targeted for 3.0 release) labels and removed the 2.8.0 (target for 2.8.0 release) label Jan 16, 2025
@miscco miscco force-pushed the drop_old_dialects branch 3 times, most recently from 083de07 to 1d37da4 Compare January 16, 2025 10:28
Contributor

🟨 CI finished in 1h 28m: Pass: 99%/139 | Total: 1d 02h | Avg: 11m 34s | Max: 46m 14s | Hits: 505%/23384
  • 🟨 cccl: Pass: 75%/4 | Total: 17m 47s | Avg: 4m 26s | Max: 4m 42s

    🔍 ctk: 12.0 🔍
      🔍 12.0               Pass:  50%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 19s
      🟩 12.6               Pass: 100%/2   | Total:  9m 22s | Avg:  4m 41s | Max:  4m 42s
    🔍 cudacxx: nvcc12.0 🔍
      🔍 nvcc12.0           Pass:  50%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 19s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 22s | Avg:  4m 41s | Max:  4m 42s
    🚨 cxx: Clang14 🚨
      🔥 Clang14            Pass:   0%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  50%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 40s
      🟩 GCC                Pass: 100%/2   | Total:  8m 48s | Avg:  4m 24s | Max:  4m 42s
    🟨 cpu
      🟨 amd64              Pass:  75%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 42s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  75%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 42s
    🟨 gpu
      🟨 v100               Pass:  75%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 42s
    🟨 jobs
      🟨 Infra              Pass:  75%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 42s
    
  • 🟩 cub: Pass: 100%/38 | Total: 8h 25m | Avg: 13m 18s | Max: 46m 14s | Hits: 539%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  8h 16m | Avg: 13m 46s | Max: 46m 14s | Hits: 539%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  9m 38s | Avg:  4m 49s | Max:  4m 59s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 56m 33s | Avg: 11m 18s | Max: 35m 47s | Hits: 539%/885   
      🟩 12.5               Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 35s
      🟩 12.6               Pass: 100%/31  | Total:  7h 10m | Avg: 13m 53s | Max: 46m 14s | Hits: 539%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 42s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 56m 33s | Avg: 11m 18s | Max: 35m 47s | Hits: 539%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 35s
      🟩 nvcc12.6           Pass: 100%/29  | Total:  7h 01m | Avg: 14m 31s | Max: 46m 14s | Hits: 539%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 42s
      🟩 nvcc               Pass: 100%/36  | Total:  8h 16m | Avg: 13m 47s | Max: 46m 14s | Hits: 539%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  5m 36s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 25m | Avg: 12m 09s | Max: 32m 30s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 28s | Avg:  5m 14s | Max:  5m 22s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 04s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 GCC12              Pass: 100%/3   | Total: 29m 53s | Avg:  9m 57s | Max: 19m 35s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 15m | Avg: 16m 58s | Max: 46m 14s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 18m | Avg: 39m 26s | Max: 43m 05s | Hits: 539%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 25s | Max: 45m 54s | Hits: 539%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 35s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  2h 03m | Avg:  8m 47s | Max: 32m 30s
      🟩 GCC                Pass: 100%/18  | Total:  3h 24m | Avg: 11m 21s | Max: 46m 14s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 39m | Avg: 39m 55s | Max: 45m 54s | Hits: 539%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 35s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 19m 35s
      🟩 v100               Pass: 100%/36  | Total:  8h 01m | Avg: 13m 22s | Max: 46m 14s | Hits: 539%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  5h 11m | Avg: 10m 01s | Max: 45m 54s | Hits: 539%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 29m 15s | Avg: 29m 15s | Max: 29m 15s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 00s | Avg: 18m 00s | Max: 18m 00s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 56s | Max: 27m 42s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 18m | Avg: 39m 22s | Max: 46m 14s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 19m 35s
      🟩 90a                Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  3h 07m | Avg: 13m 25s | Max: 45m 54s | Hits: 539%/2655  
      🟩 20                 Pass: 100%/24  | Total:  5h 17m | Avg: 13m 14s | Max: 46m 14s | Hits: 539%/885   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 8h 35m | Avg: 13m 56s | Max: 35m 20s | Hits: 651%/10102

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 11m | Avg: 14m 01s | Max: 35m 20s | Hits: 651%/10102 
      🟩 arm64              Pass: 100%/2   | Total: 24m 56s | Avg: 12m 28s | Max: 20m 51s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 54m 54s | Avg: 10m 58s | Max: 25m 10s | Hits: 683%/2481  
      🟩 12.5               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 20s
      🟩 12.6               Pass: 100%/30  | Total:  6h 35m | Avg: 13m 11s | Max: 33m 49s | Hits: 640%/7621  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 08m | Avg: 17m 03s | Max: 21m 59s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 54m 54s | Avg: 10m 58s | Max: 25m 10s | Hits: 683%/2481  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 20s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  5h 27m | Avg: 12m 36s | Max: 33m 49s | Hits: 640%/7621  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 08m | Avg: 17m 03s | Max: 21m 59s
      🟩 nvcc               Pass: 100%/33  | Total:  7h 27m | Avg: 13m 34s | Max: 35m 20s | Hits: 651%/10102 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 34m 20s | Avg:  8m 35s | Max: 22m 49s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 00m | Avg: 15m 06s | Max: 22m 32s
      🟩 GCC7               Pass: 100%/2   | Total: 22m 43s | Avg: 11m 21s | Max: 18m 15s
      🟩 GCC8               Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 GCC9               Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 33s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC12              Pass: 100%/1   | Total: 22m 19s | Avg: 22m 19s | Max: 22m 19s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 43m | Avg: 12m 52s | Max: 33m 16s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 42s | Avg: 25m 21s | Max: 25m 32s | Hits: 683%/4972  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 34s | Max: 33m 49s | Hits: 619%/5130  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  2h 48m | Avg: 11m 13s | Max: 22m 49s
      🟩 GCC                Pass: 100%/16  | Total:  2h 48m | Avg: 10m 32s | Max: 33m 16s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 53m | Avg: 28m 27s | Max: 33m 49s | Hits: 651%/10102 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 35s | Max: 35m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  8h 35m | Avg: 13m 56s | Max: 35m 20s | Hits: 651%/10102 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  6h 47m | Avg: 12m 43s | Max: 35m 20s | Hits: 651%/10102 
      🟩 NVRTC              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 58s | Max: 33m 16s
      🟩 Test               Pass: 100%/2   | Total: 40m 49s | Avg: 20m 24s | Max: 22m 32s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 43s | Avg: 13m 43s | Max: 13m 43s
      🟩 90a                Pass: 100%/2   | Total: 16m 59s | Avg:  8m 29s | Max: 12m 59s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  3h 36m | Avg: 14m 26s | Max: 33m 16s | Hits: 661%/7463  
      🟩 20                 Pass: 100%/21  | Total:  4h 57m | Avg: 14m 09s | Max: 35m 20s | Hits: 621%/2639  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 6h 53m | Avg: 11m 11s | Max: 41m 39s | Hits: 338%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 27m 43s | Avg: 13m 51s | Max: 22m 07s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  6h 44m | Avg: 11m 33s | Max: 41m 39s | Hits: 338%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 32s | Avg:  4m 46s | Max:  4m 57s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 51m 32s | Avg: 10m 18s | Max: 31m 15s | Hits: 341%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 29m 27s | Avg: 14m 43s | Max: 15m 32s
      🟩 12.6               Pass: 100%/30  | Total:  5h 32m | Avg: 11m 05s | Max: 41m 39s | Hits: 338%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 23s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 51m 32s | Avg: 10m 18s | Max: 31m 15s | Hits: 341%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 27s | Avg: 14m 43s | Max: 15m 32s
      🟩 nvcc12.6           Pass: 100%/28  | Total:  5h 22m | Avg: 11m 30s | Max: 41m 39s | Hits: 338%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 23s
      🟩 nvcc               Pass: 100%/35  | Total:  6h 43m | Avg: 11m 31s | Max: 41m 39s | Hits: 338%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  5m 31s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 58s | Avg:  5m 58s | Max:  5m 58s
      🟩 Clang18            Pass: 100%/7   | Total: 53m 18s | Avg:  7m 36s | Max: 19m 01s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  5m 42s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 08s | Avg:  5m 08s | Max:  5m 08s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 29s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 43s | Avg:  5m 43s | Max:  5m 43s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 50s | Avg:  5m 50s | Max:  5m 50s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 26s | Max: 22m 07s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 03s | Max: 31m 15s | Hits: 337%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 50m | Avg: 36m 56s | Max: 41m 39s | Hits: 339%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 27s | Avg: 14m 43s | Max: 15m 32s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 32m | Avg:  6m 34s | Max: 19m 01s
      🟩 GCC                Pass: 100%/16  | Total:  1h 59m | Avg:  7m 27s | Max: 22m 07s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 52m | Avg: 34m 35s | Max: 41m 39s | Hits: 338%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 27s | Avg: 14m 43s | Max: 15m 32s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  6h 53m | Avg: 11m 11s | Max: 41m 39s | Hits: 338%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  5h 00m | Avg:  9m 42s | Max: 41m 39s | Hits: 331%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 08s | Avg: 17m 42s | Max: 37m 14s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total: 59m 58s | Avg: 19m 59s | Max: 22m 07s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  2h 42m | Avg: 11m 34s | Max: 31m 57s | Hits: 336%/5532  
      🟩 20                 Pass: 100%/21  | Total:  3h 44m | Avg: 10m 40s | Max: 41m 39s | Hits: 341%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 54m | Avg: 5m 43s | Max: 19m 39s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 44m | Avg:  6m 30s | Max: 19m 39s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 24s | Avg:  2m 36s | Max:  2m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 51s | Avg: 10m 51s | Max: 10m 51s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 20s
      🟩 12.6               Pass: 100%/17  | Total:  1h 33m | Avg:  5m 28s | Max: 19m 39s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 51s | Avg: 10m 51s | Max: 10m 51s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 20s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 33m | Avg:  5m 28s | Max: 19m 39s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 54m | Avg:  5m 43s | Max: 19m 39s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 Clang18            Pass: 100%/4   | Total: 28m 22s | Avg:  7m 05s | Max: 19m 39s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 09s | Avg: 11m 04s | Max: 18m 47s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 36s | Avg:  2m 39s | Max:  2m 51s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 51s | Avg: 10m 51s | Max: 10m 51s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 14s | Avg: 12m 14s | Max: 12m 14s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 41m 41s | Avg:  5m 12s | Max: 19m 39s
      🟩 GCC                Pass: 100%/8   | Total: 39m 08s | Avg:  4m 53s | Max: 18m 47s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 05s | Avg: 11m 32s | Max: 12m 14s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 54m | Avg:  5m 43s | Max: 19m 39s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 16m | Avg:  4m 13s | Max: 12m 14s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 38m 26s | Avg: 19m 13s | Max: 19m 39s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
      🟩 90a                Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 14s | Avg:  3m 18s | Max:  5m 17s
      🟩 20                 Pass: 100%/16  | Total:  1h 41m | Avg:  6m 19s | Max: 19m 39s | Hits: 388%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 25s | Avg: 5m 42s | Max: 9m 21s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  9m 21s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 Test               Pass: 100%/1   | Total:  9m 21s | Avg:  9m 21s | Max:  9m 21s
    
  • 🟩 python: Pass: 100%/1 | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 30m 20s | Avg: 30m 20s | Max: 30m 20s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 139)

# Runner
92 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing
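As a quick sanity check on the runner table, the per-runner counts should add up to the reported total of 139 jobs. The sketch below does that arithmetic at compile time; `runner_counts` and `total_jobs` are hypothetical names for this illustration, and the numbers are copied from the table above.

```cpp
#include <cstddef>

// Runner counts copied from the CI comment: linux-amd64-cpu16,
// linux-amd64-gpu-v100-latest-1, windows-amd64-cpu16, linux-arm64-cpu16,
// linux-amd64-gpu-h100-latest-1-testing.
constexpr int runner_counts[] = {92, 21, 15, 10, 1};

// Hypothetical helper: sums the first n entries of counts.
constexpr int total_jobs(const int* counts, std::size_t n)
{
  int total = 0;
  for (std::size_t i = 0; i < n; ++i)
  {
    total += counts[i];
  }
  return total;
}

// 92 + 21 + 15 + 10 + 1 == 139, matching "total jobs: 139".
static_assert(total_jobs(runner_counts, 5) == 139,
              "runner counts should add up to the reported total");
```

The loop inside a `constexpr` function needs C++14 or newer, which fits the C++17 minimum this PR establishes.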

Contributor

🟨 CI finished in 1h 44m: Pass: 99%/139 | Total: 1d 10h | Avg: 14m 46s | Max: 1h 20m | Hits: 236%/23384
  • 🟨 cccl: Pass: 75%/4 | Total: 18m 27s | Avg: 4m 36s | Max: 5m 12s

    🔍 ctk: 12.0 🔍
      🔍 12.0               Pass:  50%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 46s
      🟩 12.6               Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  5m 12s
    🔍 cudacxx: nvcc12.0 🔍
      🔍 nvcc12.0           Pass:  50%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 46s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  5m 12s
    🚨 cxx: Clang14 🚨
      🔥 Clang14            Pass:   0%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  50%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 12s
      🟩 GCC                Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 23s
    🟨 cpu
      🟨 amd64              Pass:  75%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 12s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  75%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 12s
    🟨 gpu
      🟨 v100               Pass:  75%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 12s
    🟨 jobs
      🟨 Infra              Pass:  75%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  5m 12s
    
  • 🟩 cub: Pass: 100%/38 | Total: 11h 57m | Avg: 18m 52s | Max: 1h 14m | Hits: 41%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total: 11h 47m | Avg: 19m 39s | Max:  1h 14m | Hits:  41%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 01s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 22m | Avg: 16m 33s | Max:  1h 02m | Hits:  41%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 14m
      🟩 12.6               Pass: 100%/31  | Total:  8h 14m | Avg: 15m 56s | Max:  1h 11m | Hits:  41%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 23s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 22m | Avg: 16m 33s | Max:  1h 02m | Hits:  41%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 14m
      🟩 nvcc12.6           Pass: 100%/29  | Total:  8h 05m | Avg: 16m 44s | Max:  1h 11m | Hits:  41%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 23s
      🟩 nvcc               Pass: 100%/36  | Total: 11h 48m | Avg: 19m 40s | Max:  1h 14m | Hits:  41%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  5m 35s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 20m | Avg: 11m 33s | Max: 29m 14s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 28s | Avg:  5m 14s | Max:  5m 23s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  5m 34s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 47s | Avg:  5m 47s | Max:  5m 47s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 GCC12              Pass: 100%/3   | Total: 31m 43s | Avg: 10m 34s | Max: 20m 50s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 00m | Avg: 15m 04s | Max: 29m 33s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  41%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 11m | Hits:  41%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 14m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 59m | Avg:  8m 32s | Max: 29m 14s
      🟩 GCC                Pass: 100%/18  | Total:  3h 10m | Avg: 10m 34s | Max: 29m 33s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 11m | Hits:  41%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 14m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 25m 11s | Avg: 12m 35s | Max: 20m 50s
      🟩 v100               Pass: 100%/36  | Total: 11h 32m | Avg: 19m 13s | Max:  1h 14m | Hits:  41%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  9h 00m | Avg: 17m 25s | Max:  1h 14m | Hits:  41%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 26m 48s | Avg: 26m 48s | Max: 26m 48s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 44s | Avg: 19m 44s | Max: 19m 44s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 54s | Max: 27m 08s
      🟩 TestGPU            Pass: 100%/2   | Total: 58m 47s | Avg: 29m 23s | Max: 29m 33s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 25m 11s | Avg: 12m 35s | Max: 20m 50s
      🟩 90a                Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  5h 13m | Avg: 22m 25s | Max:  1h 07m | Hits:  41%/2655  
      🟩 20                 Pass: 100%/24  | Total:  6h 43m | Avg: 16m 48s | Max:  1h 14m | Hits:  41%/885   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 8h 35m | Avg: 13m 56s | Max: 39m 52s | Hits: 397%/10102

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 28m | Avg: 14m 32s | Max: 39m 52s | Hits: 397%/10102 
      🟩 arm64              Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 31s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 48m 43s | Avg:  9m 44s | Max: 34m 14s | Hits: 396%/2481  
      🟩 12.5               Pass: 100%/2   | Total:  1h 16m | Avg: 38m 04s | Max: 39m 28s
      🟩 12.6               Pass: 100%/30  | Total:  6h 31m | Avg: 13m 02s | Max: 39m 52s | Hits: 398%/7621  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 04m | Avg: 16m 12s | Max: 22m 10s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 48m 43s | Avg:  9m 44s | Max: 34m 14s | Hits: 396%/2481  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 16m | Avg: 38m 04s | Max: 39m 28s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  5h 26m | Avg: 12m 32s | Max: 39m 52s | Hits: 398%/7621  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 12s | Max: 22m 10s
      🟩 nvcc               Pass: 100%/33  | Total:  7h 31m | Avg: 13m 40s | Max: 39m 52s | Hits: 397%/10102 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 33m 30s | Avg:  8m 22s | Max: 21m 22s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 36m | Avg: 12m 05s | Max: 22m 10s
      🟩 GCC7               Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  5m 06s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 09s | Avg:  3m 34s | Max:  3m 55s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 GCC11              Pass: 100%/1   | Total: 23m 05s | Avg: 23m 05s | Max: 23m 05s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 34m | Avg: 11m 52s | Max: 29m 54s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 07s | Max: 38m 01s | Hits: 397%/4972  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 19m | Avg: 39m 51s | Max: 39m 52s | Hits: 397%/5130  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 16m | Avg: 38m 04s | Max: 39m 28s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  2h 23m | Avg:  9m 32s | Max: 22m 10s
      🟩 GCC                Pass: 100%/16  | Total:  2h 24m | Avg:  9m 02s | Max: 29m 54s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 31m | Avg: 37m 59s | Max: 39m 52s | Hits: 397%/10102 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 16m | Avg: 38m 04s | Max: 39m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  8h 35m | Avg: 13m 56s | Max: 39m 52s | Hits: 397%/10102 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  6h 55m | Avg: 12m 59s | Max: 39m 52s | Hits: 397%/10102 
      🟩 NVRTC              Pass: 100%/2   | Total: 59m 29s | Avg: 29m 44s | Max: 29m 54s
      🟩 Test               Pass: 100%/2   | Total: 38m 28s | Avg: 19m 14s | Max: 19m 42s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 25s | Avg: 12m 25s | Max: 12m 25s
      🟩 90a                Pass: 100%/2   | Total: 16m 07s | Avg:  8m 03s | Max: 12m 33s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  4h 08m | Avg: 16m 33s | Max: 39m 50s | Hits: 398%/7463  
      🟩 20                 Pass: 100%/21  | Total:  4h 25m | Avg: 12m 38s | Max: 39m 52s | Hits: 395%/2639  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 10h 41m | Avg: 17m 20s | Max: 1h 20m | Hits: 144%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 41s | Avg: 10m 50s | Max: 15m 44s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 10h 32m | Avg: 18m 04s | Max:  1h 20m | Hits: 144%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  5m 01s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 22m | Avg: 16m 28s | Max:  1h 01m | Hits:  80%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  2h 39m | Avg:  1h 19m | Max:  1h 20m
      🟩 12.6               Pass: 100%/30  | Total:  6h 39m | Avg: 13m 19s | Max:  1h 04m | Hits: 160%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 22m | Avg: 16m 28s | Max:  1h 01m | Hits:  80%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 39m | Avg:  1h 19m | Max:  1h 20m
      🟩 nvcc12.6           Pass: 100%/28  | Total:  6h 28m | Avg: 13m 53s | Max:  1h 04m | Hits: 160%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 33s
      🟩 nvcc               Pass: 100%/35  | Total: 10h 30m | Avg: 18m 01s | Max:  1h 20m | Hits: 144%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 30s | Avg:  5m 22s | Max:  5m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 Clang18            Pass: 100%/7   | Total: 47m 57s | Avg:  6m 51s | Max: 12m 40s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 19s | Avg:  5m 09s | Max:  5m 11s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 54s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 04m | Avg:  8m 00s | Max: 15m 44s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  98%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 44m | Avg: 54m 42s | Max:  1h 04m | Hits: 175%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 39m | Avg:  1h 19m | Max:  1h 20m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 26m | Avg:  6m 09s | Max: 12m 40s
      🟩 GCC                Pass: 100%/16  | Total:  1h 48m | Avg:  6m 47s | Max: 15m 44s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 47m | Avg: 57m 29s | Max:  1h 04m | Hits: 144%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 39m | Avg:  1h 19m | Max:  1h 20m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 10h 41m | Avg: 17m 20s | Max:  1h 20m | Hits: 144%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  9h 07m | Avg: 17m 39s | Max:  1h 20m | Hits:  89%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 48s | Avg: 17m 56s | Max: 37m 33s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total: 40m 49s | Avg: 13m 36s | Max: 15m 44s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  5h 23m | Avg: 23m 06s | Max:  1h 20m | Hits:  92%/5532  
      🟩 20                 Pass: 100%/21  | Total:  4h 56m | Avg: 14m 07s | Max:  1h 18m | Hits: 222%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 03m | Avg: 6m 11s | Max: 20m 01s | Hits: 81%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 53m | Avg:  7m 04s | Max: 20m 01s | Hits:  81%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 41s | Avg:  2m 40s | Max:  2m 46s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 13m 36s | Avg: 13m 36s | Max: 13m 36s | Hits:  81%/261   
      🟩 12.5               Pass: 100%/2   | Total: 17m 45s | Avg:  8m 52s | Max:  8m 55s
      🟩 12.6               Pass: 100%/17  | Total:  1h 32m | Avg:  5m 26s | Max: 20m 01s | Hits:  81%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 13m 36s | Avg: 13m 36s | Max: 13m 36s | Hits:  81%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 45s | Avg:  8m 52s | Max:  8m 55s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 32m | Avg:  5m 26s | Max: 20m 01s | Hits:  81%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 03m | Avg:  6m 11s | Max: 20m 01s | Hits:  81%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 45s | Avg:  6m 41s | Max: 17m 48s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 GCC12              Pass: 100%/2   | Total: 23m 14s | Avg: 11m 37s | Max: 20m 01s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 41s | Avg:  2m 40s | Max:  2m 50s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 13m 36s | Avg: 13m 36s | Max: 13m 36s | Hits:  81%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s | Hits:  81%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 45s | Avg:  8m 52s | Max:  8m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 39m 36s | Avg:  4m 57s | Max: 17m 48s
      🟩 GCC                Pass: 100%/8   | Total: 40m 14s | Avg:  5m 01s | Max: 20m 01s
      🟩 MSVC               Pass: 100%/2   | Total: 26m 12s | Avg: 13m 06s | Max: 13m 36s | Hits:  81%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 45s | Avg:  8m 52s | Max:  8m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 03m | Avg:  6m 11s | Max: 20m 01s | Hits:  81%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 25m | Avg:  4m 46s | Max: 13m 36s | Hits:  81%/522   
      🟩 Test               Pass: 100%/2   | Total: 37m 49s | Avg: 18m 54s | Max: 20m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 39s | Avg:  2m 39s | Max:  2m 39s
      🟩 90a                Pass: 100%/1   | Total:  2m 50s | Avg:  2m 50s | Max:  2m 50s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 16m 44s | Avg:  4m 11s | Max:  8m 50s
      🟩 20                 Pass: 100%/16  | Total:  1h 47m | Avg:  6m 41s | Max: 20m 01s | Hits:  81%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 26s | Avg: 5m 13s | Max: 8m 25s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  8m 25s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
      🟩 Test               Pass: 100%/1   | Total:  8m 25s | Avg:  8m 25s | Max:  8m 25s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 47s | Avg: 26m 47s | Max: 26m 47s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃 Runner counts (total jobs: 139)

# Runner
92 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 16, 2025 15:28
@bernhardmgruber
Contributor

I rebased and added a formatting commit.

@bernhardmgruber
Contributor

Rebased so we can have the fix from #3423

Contributor

🟩 CI finished in 1h 48m: Pass: 100%/139 | Total: 1d 07h | Avg: 13m 27s | Max: 1h 04m | Hits: 463%/23384
  • 🟩 cub: Pass: 100%/38 | Total: 10h 10m | Avg: 16m 04s | Max: 1h 04m | Hits: 439%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total: 10h 01m | Avg: 16m 41s | Max:  1h 04m | Hits: 439%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  4m 52s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 15m | Avg: 15m 01s | Max: 53m 56s | Hits: 439%/885   
      🟩 12.5               Pass: 100%/2   | Total:  1h 30m | Avg: 45m 17s | Max: 46m 31s
      🟩 12.6               Pass: 100%/31  | Total:  7h 25m | Avg: 14m 21s | Max:  1h 04m | Hits: 439%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 01s | Avg:  4m 30s | Max:  4m 38s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 15m | Avg: 15m 01s | Max: 53m 56s | Hits: 439%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 30m | Avg: 45m 17s | Max: 46m 31s
      🟩 nvcc12.6           Pass: 100%/29  | Total:  7h 16m | Avg: 15m 02s | Max:  1h 04m | Hits: 439%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 01s | Avg:  4m 30s | Max:  4m 38s
      🟩 nvcc               Pass: 100%/36  | Total: 10h 01m | Avg: 16m 42s | Max:  1h 04m | Hits: 439%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 47s | Avg:  5m 26s | Max:  5m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 18m | Avg: 11m 15s | Max: 32m 22s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 59s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 42s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 02s | Avg:  6m 02s | Max:  6m 02s
      🟩 GCC12              Pass: 100%/3   | Total: 29m 56s | Avg:  9m 58s | Max: 19m 24s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 40m | Avg: 12m 31s | Max: 23m 46s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 54s | Max: 55m 52s | Hits: 443%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 04m | Hits: 435%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 30m | Avg: 45m 17s | Max: 46m 31s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 57m | Avg:  8m 25s | Max: 32m 22s
      🟩 GCC                Pass: 100%/18  | Total:  2h 49m | Avg:  9m 25s | Max: 23m 46s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 52m | Avg: 58m 08s | Max:  1h 04m | Hits: 439%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 30m | Avg: 45m 17s | Max: 46m 31s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 45s | Avg: 11m 52s | Max: 19m 24s
      🟩 v100               Pass: 100%/36  | Total:  9h 47m | Avg: 16m 18s | Max:  1h 04m | Hits: 439%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  7h 38m | Avg: 14m 46s | Max:  1h 04m | Hits: 439%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 20s | Avg: 19m 20s | Max: 19m 20s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 50s | Avg: 14m 50s | Max: 14m 50s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 48s | Max: 21m 46s
      🟩 TestGPU            Pass: 100%/2   | Total: 56m 08s | Avg: 28m 04s | Max: 32m 22s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 45s | Avg: 11m 52s | Max: 19m 24s
      🟩 90a                Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  4h 28m | Avg: 19m 11s | Max: 58m 00s | Hits: 440%/2655  
      🟩 20                 Pass: 100%/24  | Total:  5h 42m | Avg: 14m 15s | Max:  1h 04m | Hits: 434%/885   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 8h 25m | Avg: 13m 40s | Max: 36m 57s | Hits: 620%/10102

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 17m | Avg: 14m 13s | Max: 36m 57s | Hits: 620%/10102 
      🟩 arm64              Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  4m 10s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 09m | Avg: 13m 55s | Max: 25m 41s | Hits: 663%/2481  
      🟩 12.5               Pass: 100%/2   | Total: 44m 12s | Avg: 22m 06s | Max: 32m 01s
      🟩 12.6               Pass: 100%/30  | Total:  6h 31m | Avg: 13m 03s | Max: 36m 57s | Hits: 607%/7621  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 06m | Avg: 16m 41s | Max: 20m 22s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 09m | Avg: 13m 55s | Max: 25m 41s | Hits: 663%/2481  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 44m 12s | Avg: 22m 06s | Max: 32m 01s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  5h 25m | Avg: 12m 30s | Max: 36m 57s | Hits: 607%/7621  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 06m | Avg: 16m 41s | Max: 20m 22s
      🟩 nvcc               Pass: 100%/33  | Total:  7h 19m | Avg: 13m 18s | Max: 36m 57s | Hits: 620%/10102 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 32m 37s | Avg:  8m 09s | Max: 17m 11s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 Clang16            Pass: 100%/1   | Total: 21m 50s | Avg: 21m 50s | Max: 21m 50s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 44s | Avg:  4m 44s | Max:  4m 44s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 36m | Avg: 12m 05s | Max: 20m 22s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 29s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 GCC9               Pass: 100%/2   | Total: 41m 40s | Avg: 20m 50s | Max: 22m 18s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 GCC12              Pass: 100%/1   | Total: 23m 31s | Avg: 23m 31s | Max: 23m 31s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 36m | Avg: 12m 06s | Max: 22m 11s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 38s | Avg: 27m 49s | Max: 29m 57s | Hits: 644%/4972  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 42s | Max: 36m 57s | Hits: 598%/5130  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 44m 12s | Avg: 22m 06s | Max: 32m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  2h 40m | Avg: 10m 41s | Max: 21m 50s
      🟩 GCC                Pass: 100%/16  | Total:  3h 00m | Avg: 11m 15s | Max: 23m 31s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 01m | Avg: 30m 15s | Max: 36m 57s | Hits: 620%/10102 
      🟩 NVHPC              Pass: 100%/2   | Total: 44m 12s | Avg: 22m 06s | Max: 32m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  8h 25m | Avg: 13m 40s | Max: 36m 57s | Hits: 620%/10102 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  7h 06m | Avg: 13m 19s | Max: 36m 57s | Hits: 620%/10102 
      🟩 NVRTC              Pass: 100%/2   | Total: 42m 17s | Avg: 21m 08s | Max: 21m 13s
      🟩 Test               Pass: 100%/2   | Total: 35m 04s | Avg: 17m 32s | Max: 17m 40s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 01s | Avg: 13m 01s | Max: 13m 01s
      🟩 90a                Pass: 100%/2   | Total: 17m 11s | Avg:  8m 35s | Max: 13m 17s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  3h 40m | Avg: 14m 41s | Max: 29m 57s | Hits: 636%/7463  
      🟩 20                 Pass: 100%/21  | Total:  4h 43m | Avg: 13m 29s | Max: 36m 57s | Hits: 576%/2639  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 9h 46m | Avg: 15m 51s | Max: 1h 02m | Hits: 305%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 19m 12s | Avg:  9m 36s | Max: 13m 14s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  9h 36m | Avg: 16m 28s | Max:  1h 02m | Hits: 305%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  5m 17s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 07m | Avg: 13m 27s | Max: 46m 40s | Hits: 290%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 26s | Max: 57m 06s
      🟩 12.6               Pass: 100%/30  | Total:  6h 48m | Avg: 13m 36s | Max:  1h 02m | Hits: 308%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 28s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 07m | Avg: 13m 27s | Max: 46m 40s | Hits: 290%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 50m | Avg: 55m 26s | Max: 57m 06s
      🟩 nvcc12.6           Pass: 100%/28  | Total:  6h 37m | Avg: 14m 11s | Max:  1h 02m | Hits: 308%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 28s
      🟩 nvcc               Pass: 100%/35  | Total:  9h 35m | Avg: 16m 26s | Max:  1h 02m | Hits: 305%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 14s | Avg:  5m 18s | Max:  5m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang18            Pass: 100%/7   | Total: 48m 14s | Avg:  6m 53s | Max: 13m 40s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  5m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 36s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 31m | Avg: 11m 29s | Max: 38m 30s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 32m | Avg: 46m 08s | Max: 46m 40s | Hits: 294%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 39m | Avg: 53m 18s | Max:  1h 02m | Hits: 311%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 26s | Max: 57m 06s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 26m | Avg:  6m 11s | Max: 13m 40s
      🟩 GCC                Pass: 100%/16  | Total:  2h 16m | Avg:  8m 32s | Max: 38m 30s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 12m | Avg: 50m 26s | Max:  1h 02m | Hits: 305%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 50m | Avg: 55m 26s | Max: 57m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  9h 46m | Avg: 15m 51s | Max:  1h 02m | Hits: 305%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  8h 14m | Avg: 15m 57s | Max:  1h 02m | Hits: 289%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 54m 15s | Avg: 18m 05s | Max: 38m 17s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 38s | Avg: 12m 32s | Max: 13m 40s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  4h 23m | Avg: 18m 50s | Max: 58m 49s | Hits: 291%/5532  
      🟩 20                 Pass: 100%/21  | Total:  5h 03m | Avg: 14m 27s | Max:  1h 02m | Hits: 325%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 50m | Avg: 5m 30s | Max: 18m 54s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 39m | Avg:  6m 14s | Max: 18m 54s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 28s | Avg:  2m 37s | Max:  2m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 55s | Avg: 10m 55s | Max: 10m 55s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 22s
      🟩 12.6               Pass: 100%/17  | Total:  1h 29m | Avg:  5m 14s | Max: 18m 54s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 55s | Avg: 10m 55s | Max: 10m 55s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 22s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 29m | Avg:  5m 14s | Max: 18m 54s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 50m | Avg:  5m 30s | Max: 18m 54s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 Clang18            Pass: 100%/4   | Total: 25m 41s | Avg:  6m 25s | Max: 17m 13s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 GCC12              Pass: 100%/2   | Total: 21m 58s | Avg: 10m 59s | Max: 18m 54s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 52s | Avg:  2m 43s | Max:  2m 55s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 55s | Avg: 10m 55s | Max: 10m 55s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 09s | Avg: 11m 09s | Max: 11m 09s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 38m 54s | Avg:  4m 51s | Max: 17m 13s
      🟩 GCC                Pass: 100%/8   | Total: 39m 00s | Avg:  4m 52s | Max: 18m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 04s | Avg: 11m 02s | Max: 11m 09s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 50m | Avg:  5m 30s | Max: 18m 54s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 14m | Avg:  4m 07s | Max: 11m 09s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 36m 07s | Avg: 18m 03s | Max: 18m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s
      🟩 90a                Pass: 100%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 02s | Avg:  3m 15s | Max:  4m 58s
      🟩 20                 Pass: 100%/16  | Total:  1h 37m | Avg:  6m 04s | Max: 18m 54s | Hits: 388%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 19m 31s | Avg: 4m 52s | Max: 5m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  5m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  5m 03s
      🟩 12.6               Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 44s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  5m 03s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  5m 44s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  5m 03s | Avg:  5m 03s | Max:  5m 03s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 10m 47s | Avg:  5m 23s | Max:  5m 44s
      🟩 GCC                Pass: 100%/2   | Total:  8m 44s | Avg:  4m 22s | Max:  4m 43s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  5m 44s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  5m 44s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 57s | Avg: 4m 58s | Max: 7m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 53s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 Test               Pass: 100%/1   | Total:  7m 53s | Avg:  7m 53s | Max:  7m 53s
    
  • 🟩 python: Pass: 100%/1 | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 28m 05s | Avg: 28m 05s | Max: 28m 05s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃 Runner counts (total jobs: 139)

# Runner
92 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@miscco miscco enabled auto-merge (squash) January 17, 2025 17:31
Contributor

🟩 CI finished in 1h 58m: Pass: 100%/139 | Total: 1d 10h | Avg: 14m 55s | Max: 1h 09m | Hits: 378%/23388
  • 🟩 cub: Pass: 100%/38 | Total: 13h 30m | Avg: 21m 20s | Max: 1h 08m | Hits: 43%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total: 13h 19m | Avg: 22m 12s | Max:  1h 08m | Hits:  43%/3540  
      🟩 arm64              Pass: 100%/2   | Total: 10m 59s | Avg:  5m 29s | Max:  6m 01s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 29m | Avg: 17m 52s | Max:  1h 06m | Hits:  43%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
      🟩 12.6               Pass: 100%/31  | Total:  9h 47m | Avg: 18m 57s | Max:  1h 08m | Hits:  43%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 32s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 29m | Avg: 17m 52s | Max:  1h 06m | Hits:  43%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
      🟩 nvcc12.6           Pass: 100%/29  | Total:  9h 36m | Avg: 19m 53s | Max:  1h 08m | Hits:  43%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 32s
      🟩 nvcc               Pass: 100%/36  | Total: 13h 19m | Avg: 22m 12s | Max:  1h 08m | Hits:  43%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 25m 06s | Avg:  6m 16s | Max:  6m 31s
      🟩 Clang15            Pass: 100%/1   | Total:  6m 48s | Avg:  6m 48s | Max:  6m 48s
      🟩 Clang16            Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
      🟩 Clang17            Pass: 100%/1   | Total:  6m 43s | Avg:  6m 43s | Max:  6m 43s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 28m | Avg: 12m 36s | Max: 30m 37s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 20s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 50s | Avg:  5m 25s | Max:  5m 25s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC12              Pass: 100%/3   | Total: 29m 09s | Avg:  9m 43s | Max: 19m 19s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 33m | Avg: 26m 37s | Max: 52m 59s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m | Hits:  43%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits:  43%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  2h 13m | Avg:  9m 32s | Max: 30m 37s
      🟩 GCC                Pass: 100%/18  | Total:  4h 40m | Avg: 15m 34s | Max: 52m 59s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 22m | Avg:  1h 05m | Max:  1h 08m | Hits:  43%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 24s | Avg: 11m 42s | Max: 19m 19s
      🟩 v100               Pass: 100%/36  | Total: 13h 07m | Avg: 21m 52s | Max:  1h 08m | Hits:  43%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  9h 48m | Avg: 18m 59s | Max:  1h 08m | Hits:  43%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 35m 17s | Avg: 35m 17s | Max: 35m 17s
      🟩 GraphCapture       Pass: 100%/1   | Total: 27m 42s | Avg: 27m 42s | Max: 27m 42s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 20m | Avg: 26m 55s | Max: 33m 49s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 18m | Avg: 39m 15s | Max: 47m 54s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 24s | Avg: 11m 42s | Max: 19m 19s
      🟩 90a                Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  5h 19m | Avg: 22m 49s | Max:  1h 07m | Hits:  43%/2655  
      🟩 20                 Pass: 100%/24  | Total:  8h 11m | Avg: 20m 27s | Max:  1h 08m | Hits:  43%/885   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 7h 33m | Avg: 12m 15s | Max: 35m 53s | Hits: 681%/10146

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  7h 23m | Avg: 12m 40s | Max: 35m 53s | Hits: 681%/10146 
      🟩 arm64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  6m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 44m 13s | Avg:  8m 50s | Max: 24m 52s | Hits: 681%/2491  
      🟩 12.5               Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 35m 53s
      🟩 12.6               Pass: 100%/30  | Total:  6h 00m | Avg: 12m 00s | Max: 34m 32s | Hits: 681%/7655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 55s | Max: 20m 27s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 44m 13s | Avg:  8m 50s | Max: 24m 52s | Hits: 681%/2491  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 35m 53s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  4h 52m | Avg: 11m 15s | Max: 34m 32s | Hits: 681%/7655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 55s | Max: 20m 27s
      🟩 nvcc               Pass: 100%/33  | Total:  6h 26m | Avg: 11m 42s | Max: 35m 53s | Hits: 681%/10146 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 28m 50s | Avg:  7m 12s | Max:  8m 03s
      🟩 Clang15            Pass: 100%/1   | Total:  8m 50s | Avg:  8m 50s | Max:  8m 50s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s
      🟩 Clang17            Pass: 100%/1   | Total:  8m 41s | Avg:  8m 41s | Max:  8m 41s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 56m | Avg: 14m 30s | Max: 30m 20s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 57s | Avg:  3m 28s | Max:  3m 45s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 25s | Avg:  3m 25s | Max:  3m 25s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 01s | Avg:  3m 30s | Max:  3m 45s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 45m | Avg: 13m 10s | Max: 34m 32s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 48m 51s | Avg: 24m 25s | Max: 24m 52s | Hits: 681%/4992  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 54m 37s | Avg: 27m 18s | Max: 27m 42s | Hits: 681%/5154  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 35m 53s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  2h 46m | Avg: 11m 07s | Max: 30m 20s
      🟩 GCC                Pass: 100%/16  | Total:  2h 14m | Avg:  8m 22s | Max: 34m 32s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 43m | Avg: 25m 52s | Max: 27m 42s | Hits: 681%/10146 
      🟩 NVHPC              Pass: 100%/2   | Total: 49m 24s | Avg: 24m 42s | Max: 35m 53s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  7h 33m | Avg: 12m 15s | Max: 35m 53s | Hits: 681%/10146 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  5h 32m | Avg: 10m 23s | Max: 35m 53s | Hits: 681%/10146 
      🟩 NVRTC              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 16s | Max: 34m 32s
      🟩 Test               Pass: 100%/2   | Total: 50m 39s | Avg: 25m 19s | Max: 30m 20s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 45s | Avg: 13m 45s | Max: 13m 45s
      🟩 90a                Pass: 100%/2   | Total: 17m 45s | Avg:  8m 52s | Max: 14m 11s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  3h 07m | Avg: 12m 29s | Max: 34m 32s | Hits: 681%/7493  
      🟩 20                 Pass: 100%/21  | Total:  4h 24m | Avg: 12m 35s | Max: 35m 53s | Hits: 680%/2653  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 9h 53m | Avg: 16m 02s | Max: 1h 09m | Hits: 176%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 30s | Avg: 11m 45s | Max: 17m 08s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  9h 44m | Avg: 16m 41s | Max:  1h 09m | Hits: 176%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 07s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 13m | Avg: 14m 37s | Max: 53m 08s | Hits: 121%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
      🟩 12.6               Pass: 100%/30  | Total:  6h 22m | Avg: 12m 45s | Max:  1h 04m | Hits: 189%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 24s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 13m | Avg: 14m 37s | Max: 53m 08s | Hits: 121%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/28  | Total:  6h 12m | Avg: 13m 17s | Max:  1h 04m | Hits: 189%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 44s | Avg:  5m 22s | Max:  5m 24s
      🟩 nvcc               Pass: 100%/35  | Total:  9h 43m | Avg: 16m 39s | Max:  1h 09m | Hits: 176%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 16s | Avg:  5m 19s | Max:  5m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang18            Pass: 100%/7   | Total: 46m 20s | Avg:  6m 37s | Max: 12m 09s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 06s | Avg:  5m 03s | Max:  5m 14s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 22s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 03s | Avg:  6m 03s | Max:  6m 03s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 03m | Avg:  7m 56s | Max: 17m 08s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 17s | Max: 55m 26s | Hits: 136%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 36m | Avg: 52m 04s | Max:  1h 04m | Hits: 202%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 24m | Avg:  6m 00s | Max: 12m 09s
      🟩 GCC                Pass: 100%/16  | Total:  1h 46m | Avg:  6m 41s | Max: 17m 08s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 24m | Avg: 52m 57s | Max:  1h 04m | Hits: 176%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  9h 53m | Avg: 16m 02s | Max:  1h 09m | Hits: 176%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  8h 22m | Avg: 16m 12s | Max:  1h 09m | Hits: 129%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 32s | Avg: 17m 10s | Max: 35m 58s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 40m 02s | Avg: 13m 20s | Max: 17m 08s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  4h 47m | Avg: 20m 30s | Max:  1h 09m | Hits: 131%/5508  
      🟩 20                 Pass: 100%/21  | Total:  4h 43m | Avg: 13m 29s | Max:  1h 07m | Hits: 243%/3672  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 17m | Avg: 6m 53s | Max: 24m 25s | Hits: 300%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  2h 04m | Avg:  7m 45s | Max: 24m 25s | Hits: 300%/522   
      🟩 arm64              Pass: 100%/4   | Total: 13m 52s | Avg:  3m 28s | Max:  3m 33s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 12m 08s | Avg: 12m 08s | Max: 12m 08s | Hits: 300%/261   
      🟩 12.5               Pass: 100%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 45s
      🟩 12.6               Pass: 100%/17  | Total:  1h 52m | Avg:  6m 37s | Max: 24m 25s | Hits: 301%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 12m 08s | Avg: 12m 08s | Max: 12m 08s | Hits: 300%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 45s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 52m | Avg:  6m 37s | Max: 24m 25s | Hits: 301%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 17m | Avg:  6m 53s | Max: 24m 25s | Hits: 300%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang18            Pass: 100%/4   | Total: 35m 19s | Avg:  8m 49s | Max: 24m 07s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC12              Pass: 100%/2   | Total: 28m 14s | Avg: 14m 07s | Max: 24m 25s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 19s | Avg:  3m 19s | Max:  3m 33s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 12m 08s | Avg: 12m 08s | Max: 12m 08s | Hits: 300%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 27s | Avg: 11m 27s | Max: 11m 27s | Hits: 301%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 45s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 51m 41s | Avg:  6m 27s | Max: 24m 07s
      🟩 GCC                Pass: 100%/8   | Total: 49m 22s | Avg:  6m 10s | Max: 24m 25s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 35s | Avg: 11m 47s | Max: 12m 08s | Hits: 300%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 17m | Avg:  6m 53s | Max: 24m 25s | Hits: 300%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 29m | Avg:  4m 58s | Max: 12m 08s | Hits: 300%/522   
      🟩 Test               Pass: 100%/2   | Total: 48m 32s | Avg: 24m 16s | Max: 24m 25s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 21s | Avg:  3m 21s | Max:  3m 21s
      🟩 90a                Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  6m 36s
      🟩 20                 Pass: 100%/16  | Total:  2h 01m | Avg:  7m 34s | Max: 24m 25s | Hits: 300%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 24m 44s | Avg: 6m 11s | Max: 6m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 24m 44s | Avg:  6m 11s | Max:  6m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 12m 10s | Avg:  6m 05s | Max:  6m 16s
      🟩 12.6               Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 12m 10s | Avg:  6m 05s | Max:  6m 16s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 24m 44s | Avg:  6m 11s | Max:  6m 39s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
      🟩 Clang18            Pass: 100%/1   | Total:  6m 39s | Avg:  6m 39s | Max:  6m 39s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🟩 GCC13              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 12m 55s | Avg:  6m 27s | Max:  6m 39s
      🟩 GCC                Pass: 100%/2   | Total: 11m 49s | Avg:  5m 54s | Max:  5m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 24m 44s | Avg:  6m 11s | Max:  6m 39s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 24m 44s | Avg:  6m 11s | Max:  6m 39s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 56s | Avg: 4m 28s | Max: 6m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  6m 53s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  6m 53s | Avg:  6m 53s | Max:  6m 53s
    
  • 🟩 python: Pass: 100%/1 | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 139)

# Runner
92 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@jrhemstad jrhemstad requested a review from alliepiper January 21, 2025 16:01
@miscco miscco merged commit df1f722 into NVIDIA:main Jan 21, 2025
171 of 174 checks passed
@miscco miscco deleted the drop_old_dialects branch January 21, 2025 16:04
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 22, 2025
* Drop C++11 and C++14 support for all of cccl

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>
davebayer added a commit to davebayer/cccl that referenced this pull request Jan 22, 2025
update docs

update docs

add `memcmp`, `memmove` and `memchr` implementations

implement tests

Use cuda::std::min/max in Thrust (NVIDIA#3364)

Implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16` (NVIDIA#3361)

* implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16`

Cleanup util_arch (NVIDIA#2773)

Deprecate thrust::null_type (NVIDIA#3367)

Deprecate cub::DeviceSpmv (NVIDIA#3320)

Fixes: NVIDIA#896

Improves `DeviceSegmentedSort` test run time for large number of items and segments (NVIDIA#3246)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* fixes spelling

* adds tests for large number of segments

* fixes narrowing conversion in tests

* addresses review comments

* fixes includes

Compile basic infra test with C++17 (NVIDIA#3377)

Adds support for large number of items and large number of segments to `DeviceSegmentedSort` (NVIDIA#3308)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* addresses review comments

* introduces segment offset type

* adds tests for large number of segments

* adds support for large number of segments

* drops segment offset type

* fixes thrust namespace

* removes about-to-be-deprecated cub iterators

* no exec specifier on defaulted ctor

* fixes gcc7 linker error

* uses local_segment_index_t throughout

* determine offset type based on type returned by segment iterator begin/end iterators

* minor style improvements

Exit with error when RAPIDS CI fails. (NVIDIA#3385)

cuda.parallel: Support structured types as algorithm inputs (NVIDIA#3218)

* Introduce gpu_struct decorator and typing

* Enable `reduce` to accept arrays of structs as inputs

* Add test for reducing arrays-of-struct

* Update documentation

* Use a numpy array rather than ctypes object

* Change zeros -> empty for output array and temp storage

* Add a TODO for typing GpuStruct

* Documentation updates

* Remove test_reduce_struct_type from test_reduce.py

* Revert to `to_cccl_value()` accepting ndarray + GpuStruct

* Bump copyrights

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Deprecate thrust::async (NVIDIA#3324)

Fixes: NVIDIA#100

Review/Deprecate CUB `util.ptx` for CCCL 2.x (NVIDIA#3342)

Fix broken `_CCCL_BUILTIN_ASSUME` macro (NVIDIA#3314)

* add compiler-specific path
* fix device code path
* add _CCC_ASSUME

Deprecate thrust::numeric_limits (NVIDIA#3366)

Replace `typedef` with `using` in libcu++ (NVIDIA#3368)

Deprecate thrust::optional (NVIDIA#3307)

Fixes: NVIDIA#3306

Upgrade to Catch2 3.8  (NVIDIA#3310)

Fixes: NVIDIA#1724

refactor `<cuda/std/cstdint>` (NVIDIA#3325)

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Update CODEOWNERS (NVIDIA#3331)

* Update CODEOWNERS

* Update CODEOWNERS

* Update CODEOWNERS

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Fix sign-compare warning (NVIDIA#3408)

Implement more cmath functions to be usable on host and device (NVIDIA#3382)

* Implement more cmath functions to be usable on host and device

* Implement math roots functions

* Implement exponential functions

Redefine and deprecate thrust::remove_cvref (NVIDIA#3394)

* Redefine and deprecate thrust::remove_cvref

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Fix assert definition for NVHPC due to constexpr issues (NVIDIA#3418)

NVHPC cannot decide at compile time where the code would run so _CCCL_ASSERT within a constexpr function breaks it.

Fix this by always using the host definition which should also work on device.

Fixes NVIDIA#3411

Extend CUB reduce benchmarks (NVIDIA#3401)

* Rename max.cu to custom.cu, since it uses a custom operator
* Extend types covered by min.cu to all fundamental types
* Add some notes on how to collect tuning parameters

Fixes: NVIDIA#3283

Update upload-pages-artifact to v3 (NVIDIA#3423)

* Update upload-pages-artifact to v3

* Empty commit

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Replace and deprecate thrust::cuda_cub::terminate (NVIDIA#3421)

`std::linalg` accessors and `transposed_layout` (NVIDIA#2962)

Add round up/down to multiple (NVIDIA#3234)

[FEA]: Introduce Python module with CCCL headers (NVIDIA#3201)

* Add cccl/python/cuda_cccl directory and use from cuda_parallel, cuda_cooperative

* Run `copy_cccl_headers_to_aude_include()` before `setup()`

* Create python/cuda_cccl/cuda/_include/__init__.py, then simply import cuda._include to find the include path.

* Add cuda.cccl._version exactly as for cuda.cooperative and cuda.parallel

* Bug fix: cuda/_include only exists after shutil.copytree() ran.

* Use `f"cuda-cccl @ file://{cccl_path}/python/cuda_cccl"` in setup.py

* Remove CustomBuildCommand, CustomWheelBuild in cuda_parallel/setup.py (they are equivalent to the default functions)

* Replace := operator (needs Python 3.8+)

* Fix oversights: remove `pip3 install ./cuda_cccl` lines from README.md

* Restore original README.md: `pip3 install -e` now works on first pass.

* cuda_cccl/README.md: FOR INTERNAL USE ONLY

* Remove `$pymajor.$pyminor.` prefix in cuda_cccl _version.py (as suggested under NVIDIA#3201 (comment))

Command used: ci/update_version.sh 2 8 0

* Modernize pyproject.toml, setup.py

Trigger for this change:

* NVIDIA#3201 (comment)

* NVIDIA#3201 (comment)

* Install CCCL headers under cuda.cccl.include

Trigger for this change:

* NVIDIA#3201 (comment)

Unexpected accidental discovery: cuda.cooperative unit tests pass without CCCL headers entirely.

* Factor out cuda_cccl/cuda/cccl/include_paths.py

* Reuse cuda_cccl/cuda/cccl/include_paths.py from cuda_cooperative

* Add missing Copyright notice.

* Add missing __init__.py (cuda.cccl)

* Add `"cuda.cccl"` to `autodoc.mock_imports`

* Move cuda.cccl.include_paths into function where it is used. (Attempt to resolve Build and Verify Docs failure.)

* Add # TODO: move this to a module-level import

* Modernize cuda_cooperative/pyproject.toml, setup.py

* Convert cuda_cooperative to use hatchling as build backend.

* Revert "Convert cuda_cooperative to use hatchling as build backend."

This reverts commit 61637d6.

* Move numpy from [build-system] requires -> [project] dependencies

* Move pyproject.toml [project] dependencies -> setup.py install_requires, to be able to use CCCL_PATH

* Remove copy_license() and use license_files=["../../LICENSE"] instead.

* Further modernize cuda_cccl/setup.py to use pathlib

* Trivial simplifications in cuda_cccl/pyproject.toml

* Further simplify cuda_cccl/pyproject.toml, setup.py: remove inconsequential code

* Make cuda_cooperative/pyproject.toml more similar to cuda_cccl/pyproject.toml

* Add taplo-pre-commit to .pre-commit-config.yaml

* taplo-pre-commit auto-fixes

* Use pathlib in cuda_cooperative/setup.py

* CCCL_PYTHON_PATH in cuda_cooperative/setup.py

* Modernize cuda_parallel/pyproject.toml, setup.py

* Use pathlib in cuda_parallel/setup.py

* Add `# TOML lint & format` comment.

* Replace MANIFEST.in with `[tool.setuptools.package-data]` section in pyproject.toml

* Use pathlib in cuda/cccl/include_paths.py

* pre-commit autoupdate (EXCEPT clang-format, which was manually restored)

* Fixes after git merge main

* Resolve warning: AttributeError: '_Reduce' object has no attribute 'build_result'

```
=========================================================================== warnings summary ===========================================================================
tests/test_reduce.py::test_reduce_non_contiguous
  /home/coder/cccl/python/devenv/lib/python3.12/site-packages/_pytest/unraisableexception.py:85: PytestUnraisableExceptionWarning: Exception ignored in: <function _Reduce.__del__ at 0x7bf123139080>

  Traceback (most recent call last):
    File "/home/coder/cccl/python/cuda_parallel/cuda/parallel/experimental/algorithms/reduce.py", line 132, in __del__
      bindings.cccl_device_reduce_cleanup(ctypes.byref(self.build_result))
                                                       ^^^^^^^^^^^^^^^^^
  AttributeError: '_Reduce' object has no attribute 'build_result'

    warnings.warn(pytest.PytestUnraisableExceptionWarning(msg))

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================= 1 passed, 93 deselected, 1 warning in 0.44s ==============================================================
```

* Move `copy_cccl_headers_to_cuda_cccl_include()` functionality to `class CustomBuildPy`

* Introduce cuda_cooperative/constraints.txt

* Also add cuda_parallel/constraints.txt

* Add `--constraint constraints.txt` in ci/test_python.sh

* Update Copyright dates

* Switch to https://github.com/ComPWA/taplo-pre-commit (the other repo has been archived by the owner on Jul 1, 2024)

For completeness: The other repo took a long time to install into the pre-commit cache; so long it led to timeouts in the CCCL CI.

* Remove unused cuda_parallel jinja2 dependency (noticed by chance).

* Remove constraints.txt files, advertise running `pip install cuda-cccl` first instead.

* Make cuda_cooperative, cuda_parallel testing completely independent.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Fix sign-compare warning (NVIDIA#3408) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]"

This reverts commit ea33a21.

Error message: NVIDIA#3201 (comment)

* Try using A100 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Also show cuda-cooperative site-packages, cuda-parallel site-packages (after pip install) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using l4 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Restore original ci/matrix.yaml [skip-rapids]

* Use for loop in test_python.sh to avoid code duplication.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]

* Comment out taplo-lint in pre-commit config [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]"

This reverts commit ec206fd.

* Implement suggestion by @shwina (NVIDIA#3201 (review))

* Address feedback by @leofang

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

cuda.parallel: Add optional stream argument to reduce_into() (NVIDIA#3348)

* Add optional stream argument to reduce_into()

* Add tests to check for reduce_into() stream behavior

* Move protocol related utils to separate file and rework __cuda_stream__ error messages

* Fix synchronization issue in stream test and add one more invalid stream test case

* Rename cuda stream validation function after removing leading underscore

* Unpack values from __cuda_stream__ instead of indexing

* Fix linting errors

* Handle TypeError when unpacking invalid __cuda_stream__ return

* Use stream to allocate cupy memory in new stream test

Upgrade to actions/deploy-pages@v4 (from v2), as suggested by @leofang (NVIDIA#3434)

Deprecate `cub::{min, max}` and replace internal uses with those from libcu++ (NVIDIA#3419)

* Deprecate `cub::{min, max}` and replace internal uses with those from libcu++

Fixes NVIDIA#3404

Fix CI issues (NVIDIA#3443)

Remove deprecated `cub::min` (NVIDIA#3450)

* Remove deprecated `cuda::{min,max}`

* Drop unused `thrust::remove_cvref` file

Fix typo in builtin (NVIDIA#3451)

Moves agents to `detail::<algorithm_name>` namespace (NVIDIA#3435)

uses unsigned offset types in thrust's scan dispatch (NVIDIA#3436)

Default transform_iterator's copy ctor (NVIDIA#3395)

Fixes: NVIDIA#2393

Turn C++ dialect warning into error (NVIDIA#3453)

Uses unsigned offset types in thrust's sort algorithm calling into `DispatchMergeSort` (NVIDIA#3437)

* uses thrust's dynamic dispatch for merge_sort

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Refactor allocator handling of contiguous_storage (NVIDIA#3050)

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Drop thrust::detail::integer_traits (NVIDIA#3391)

Add cuda::is_floating_point supporting half and bfloat (NVIDIA#3379)

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Improve docs of std headers (NVIDIA#3416)

Drop C++11 and C++14 support for all of cccl (NVIDIA#3417)

* Drop C++11 and C++14 support for all of cccl

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Deprecate a few CUB macros (NVIDIA#3456)

Deprecate thrust universal iterator categories (NVIDIA#3461)

Fix launch args order (NVIDIA#3465)

Add `--extended-lambda` to the list of removed clangd flags (NVIDIA#3432)

add `_CCCL_HAS_NVFP8` macro (NVIDIA#3429)

Add `_CCCL_BUILTIN_PREFETCH` (NVIDIA#3433)

Drop universal iterator categories (NVIDIA#3474)

Ensure that headers in `<cuda/*>` can be built with a C++ only compiler (NVIDIA#3472)

Specialize __is_extended_floating_point for FP8 types (NVIDIA#3470)

Also ensure that we actually can enable FP8 due to FP16 and BF16 requirements

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Moves CUB kernel entry points to a detail namespace (NVIDIA#3468)

* moves emptykernel to detail ns

* second batch

* third batch

* fourth batch

* fixes cuda parallel

* concatenates nested namespaces

Deprecate block/warp algo specializations (NVIDIA#3455)

Fixes: NVIDIA#3409

Refactor CUB's util_debug (NVIDIA#3345)
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 29, 2025
* Drop C++11 and C++14 support for all of cccl

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>
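
As a rough illustration of what dropping C++11 and C++14 means for code that includes these headers, a header can reject older dialects at compile time. The sketch below is an assumption-laden example, not CCCL's actual implementation: `EXAMPLE_CPLUSPLUS` and `example_dialect_is_supported` are hypothetical names introduced here. It relies only on the standard `__cplusplus` macro and on `_MSVC_LANG`, which MSVC provides because it reports an outdated `__cplusplus` value unless `/Zc:__cplusplus` is passed.

```cpp
// Hypothetical dialect guard; names are illustrative and not taken from CCCL.

// MSVC keeps __cplusplus at 199711L unless /Zc:__cplusplus is set,
// so prefer _MSVC_LANG when it is available.
#if defined(_MSVC_LANG)
#  define EXAMPLE_CPLUSPLUS _MSVC_LANG
#else
#  define EXAMPLE_CPLUSPLUS __cplusplus
#endif

// True when the given dialect value is C++17 (201703L) or newer.
constexpr bool example_dialect_is_supported(long dialect)
{
  return dialect >= 201703L;
}

// Fail the build early instead of only emitting a warning.
static_assert(example_dialect_is_supported(EXAMPLE_CPLUSPLUS),
              "This example requires at least C++17.");
```

With GCC or Clang, building this with `-std=c++14` should stop at the `static_assert`, while `-std=c++17` or newer compiles cleanly.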
Labels
3.0 Targeted for 3.0 release breaking Breaking change