Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

new type-erased memory resources #2824

Merged
merged 48 commits into from
Dec 17, 2024

Conversation

ericniebler
Copy link
Collaborator

@ericniebler ericniebler commented Nov 15, 2024

Description

this PR reimplements cudax::mr::any_resource and adds a new cudax::mr::resource_ref. they both pass their existing regressions tests without modification. the new types also provide a try_get_property function that can retrieve a property that was "sliced off" during an interface-narrowing conversion.

neither cudax::mr::any_resource nor cudax::mr::resource_ref need to store the vtable entries for the properties in-situ, which results in a space savings proportional to the number of properties.

there is extra code to make cudax::mr::any_resource efficiently convertible to the existing cuda::mr::resource_ref.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@ericniebler ericniebler requested review from a team as code owners November 15, 2024 02:20
@ericniebler ericniebler marked this pull request as draft November 15, 2024 02:20
@ericniebler ericniebler force-pushed the cudax-any-resource-redux branch from cedd671 to a278c34 Compare November 15, 2024 03:33
Copy link
Contributor

🟨 CI finished in 51m 18s: Pass: 83%/54 | Total: 4h 04m | Avg: 4m 31s | Max: 17m 30s
  • 🟨 cudax: Pass: 83%/54 | Total: 4h 04m | Avg: 4m 31s | Max: 17m 30s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  82%/50  | Total:  3h 53m | Avg:  4m 40s | Max: 17m 30s
      🟩 arm64              Pass: 100%/4   | Total: 10m 27s | Avg:  2m 36s | Max:  2m 46s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  81%/49  | Total:  2h 42m | Avg:  3m 18s | Max:  7m 42s
      🟩 Test               Pass: 100%/5   | Total:  1h 21m | Avg: 16m 23s | Max: 17m 30s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 23s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 41s | Avg:  3m 20s | Max:  3m 42s
      🟨 Clang11            Pass:  75%/4   | Total: 12m 24s | Avg:  3m 06s | Max:  3m 21s
      🟨 Clang12            Pass:  75%/4   | Total: 11m 55s | Avg:  2m 58s | Max:  3m 12s
      🟨 Clang13            Pass:  75%/4   | Total: 12m 19s | Avg:  3m 04s | Max:  3m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 34s | Avg:  6m 23s | Max: 15m 59s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 35s | Avg:  3m 17s | Max:  3m 26s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 57s | Avg:  2m 59s | Max:  3m 28s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 35s | Avg:  3m 17s | Max:  3m 19s
      🟩 Clang18            Pass: 100%/2   | Total: 20m 36s | Avg: 10m 18s | Max: 17m 30s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 39s | Avg:  3m 19s | Max:  3m 45s
      🟨 GCC10              Pass:  75%/4   | Total: 11m 37s | Avg:  2m 54s | Max:  2m 57s
      🟨 GCC11              Pass:  75%/4   | Total: 12m 16s | Avg:  3m 04s | Max:  3m 23s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 00m | Avg:  8m 37s | Max: 16m 56s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 04s | Avg:  2m 41s | Max:  2m 50s
      🟥 MSVC14.36          Pass:   0%/1   | Total:  6m 48s | Avg:  6m 48s | Max:  6m 48s
      🟥 MSVC14.39          Pass:   0%/1   | Total:  7m 42s | Avg:  7m 42s | Max:  7m 42s
      🟥 NVHPC24.7          Pass:   0%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 49s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  83%/54  | Total:  4h 04m | Avg:  4m 31s | Max: 17m 30s
    🟨 gpu
      🟨 v100               Pass:  83%/54  | Total:  4h 04m | Avg:  4m 31s | Max: 17m 30s
    🟨 ctk
      🟨 12.0               Pass:  68%/19  | Total:  1h 25m | Avg:  4m 30s | Max: 15m 59s
      🟥 12.5               Pass:   0%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 49s
      🟨 12.6               Pass:  96%/33  | Total:  2h 29m | Avg:  4m 31s | Max: 17m 30s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  68%/19  | Total:  1h 25m | Avg:  4m 30s | Max: 15m 59s
      🟥 nvcc12.5           Pass:   0%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 49s
      🟨 nvcc12.6           Pass:  96%/33  | Total:  2h 29m | Avg:  4m 31s | Max: 17m 30s
    🟨 cxx_family
      🟨 Clang              Pass:  90%/30  | Total:  2h 01m | Avg:  4m 02s | Max: 17m 30s
      🟨 GCC                Pass:  90%/20  | Total:  1h 39m | Avg:  4m 57s | Max: 16m 56s
      🟥 MSVC               Pass:   0%/2   | Total: 14m 30s | Avg:  7m 15s | Max:  7m 42s
      🟥 NVHPC              Pass:   0%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 90a                Pass: 100%/1   | Total:  2m 50s | Avg:  2m 50s | Max:  2m 50s
    🟨 std
      🟨 17                 Pass:  96%/29  | Total:  1h 57m | Avg:  4m 03s | Max: 16m 56s
      🟨 20                 Pass:  68%/25  | Total:  2h 06m | Avg:  5m 04s | Max: 17m 30s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 21m 24s: Pass: 92%/54 | Total: 4h 09m | Avg: 4m 36s | Max: 17m 09s
  • 🟨 cudax: Pass: 92%/54 | Total: 4h 09m | Avg: 4m 36s | Max: 17m 09s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/50  | Total:  3h 57m | Avg:  4m 44s | Max: 17m 09s
      🟩 arm64              Pass: 100%/4   | Total: 11m 59s | Avg:  2m 59s | Max:  3m 13s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  91%/49  | Total:  2h 48m | Avg:  3m 26s | Max: 11m 19s
      🟩 Test               Pass: 100%/5   | Total:  1h 20m | Avg: 16m 03s | Max: 17m 09s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 27s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 54s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 22s | Avg:  3m 05s | Max:  3m 39s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 15s
      🟩 Clang13            Pass: 100%/4   | Total: 11m 56s | Avg:  2m 59s | Max:  3m 16s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 35s | Avg:  6m 23s | Max: 16m 17s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  3m 13s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 01s | Avg:  3m 00s | Max:  3m 16s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 25s | Avg:  3m 12s | Max:  3m 19s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 27s | Avg:  9m 43s | Max: 16m 15s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 47s | Avg:  2m 53s | Max:  2m 56s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 34s | Avg:  2m 53s | Max:  3m 05s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 06s | Avg:  3m 01s | Max:  3m 17s
      🟩 GCC12              Pass: 100%/7   | Total: 59m 55s | Avg:  8m 33s | Max: 17m 09s
      🟩 GCC13              Pass: 100%/3   | Total:  9m 11s | Avg:  3m 03s | Max:  3m 13s
      🟥 MSVC14.36          Pass:   0%/1   | Total:  8m 38s | Avg:  8m 38s | Max:  8m 38s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 11m 19s | Avg: 11m 19s | Max: 11m 19s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🟨 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 59m | Avg:  3m 59s | Max: 16m 17s
      🟩 GCC                Pass: 100%/20  | Total:  1h 38m | Avg:  4m 55s | Max: 17m 09s
      🟥 MSVC               Pass:   0%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 11m 19s
      🟥 NVHPC              Pass:   0%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  92%/54  | Total:  4h 09m | Avg:  4m 36s | Max: 17m 09s
    🟨 gpu
      🟨 v100               Pass:  92%/54  | Total:  4h 09m | Avg:  4m 36s | Max: 17m 09s
    🟨 ctk
      🟨 12.0               Pass:  94%/19  | Total:  1h 26m | Avg:  4m 32s | Max: 16m 17s
      🟥 12.5               Pass:   0%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🟨 12.6               Pass:  96%/33  | Total:  2h 31m | Avg:  4m 36s | Max: 17m 09s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  94%/19  | Total:  1h 26m | Avg:  4m 32s | Max: 16m 17s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🟨 nvcc12.6           Pass:  96%/33  | Total:  2h 31m | Avg:  4m 36s | Max: 17m 09s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 26s | Avg:  2m 26s | Max:  2m 26s
      🟩 90a                Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
    🟨 std
      🟨 17                 Pass:  96%/29  | Total:  1h 59m | Avg:  4m 06s | Max: 17m 09s
      🟨 20                 Pass:  88%/25  | Total:  2h 10m | Avg:  5m 12s | Max: 16m 17s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 56m 46s: Pass: 92%/54 | Total: 3h 47m | Avg: 4m 12s | Max: 17m 50s | Hits: 87%/254
  • 🟨 cudax: Pass: 92%/54 | Total: 3h 47m | Avg: 4m 12s | Max: 17m 50s | Hits: 87%/254

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/50  | Total:  3h 35m | Avg:  4m 18s | Max: 17m 50s | Hits:  87%/254   
      🟩 arm64              Pass: 100%/4   | Total: 12m 05s | Avg:  3m 01s | Max:  3m 47s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 20s | Avg:  3m 10s | Max:  3m 17s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 40s | Avg:  3m 20s | Max:  3m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 13s | Avg:  2m 48s | Max:  2m 54s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 49s | Avg:  2m 57s | Max:  3m 01s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 15s | Avg:  3m 03s | Max:  3m 21s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 42s | Avg:  6m 25s | Max: 16m 29s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 10s | Avg:  3m 05s | Max:  3m 15s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 12s | Avg:  3m 03s | Max:  3m 17s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 09s | Avg:  3m 04s | Max:  3m 08s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 23s | Avg:  9m 41s | Max: 16m 24s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 30s | Avg:  2m 45s | Max:  2m 52s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 46s | Avg:  2m 56s | Max:  3m 09s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 51s | Avg:  2m 57s | Max:  3m 05s
      🟨 GCC12              Pass:  71%/7   | Total: 42m 49s | Avg:  6m 07s | Max: 17m 50s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 52s | Avg:  2m 57s | Max:  3m 47s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  6m 46s | Avg:  6m 46s | Max:  6m 46s | Hits:  87%/127   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 40s | Avg:  9m 40s | Max:  9m 40s | Hits:  87%/127   
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 12s
    🟨 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 57m | Avg:  3m 55s | Max: 16m 29s
      🟨 GCC                Pass:  90%/20  | Total:  1h 20m | Avg:  4m 02s | Max: 17m 50s
      🟩 MSVC               Pass: 100%/2   | Total: 16m 26s | Avg:  8m 13s | Max:  9m 40s | Hits:  87%/254   
      🟥 NVHPC              Pass:   0%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 12s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  92%/54  | Total:  3h 47m | Avg:  4m 12s | Max: 17m 50s | Hits:  87%/254   
    🟨 gpu
      🟨 v100               Pass:  92%/54  | Total:  3h 47m | Avg:  4m 12s | Max: 17m 50s | Hits:  87%/254   
    🟨 ctk
      🟨 12.0               Pass:  94%/19  | Total:  1h 14m | Avg:  3m 56s | Max: 16m 29s | Hits:  87%/127   
      🟥 12.5               Pass:   0%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 12s
      🟨 12.6               Pass:  96%/33  | Total:  2h 20m | Avg:  4m 15s | Max: 17m 50s | Hits:  87%/127   
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  94%/19  | Total:  1h 14m | Avg:  3m 56s | Max: 16m 29s | Hits:  87%/127   
      🟥 nvcc12.5           Pass:   0%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 12s
      🟨 nvcc12.6           Pass:  96%/33  | Total:  2h 20m | Avg:  4m 15s | Max: 17m 50s | Hits:  87%/127   
    🟨 jobs
      🟨 Build              Pass:  95%/49  | Total:  2h 43m | Avg:  3m 19s | Max:  9m 40s | Hits:  87%/254   
      🟨 Test               Pass:  60%/5   | Total:  1h 04m | Avg: 12m 53s | Max: 17m 50s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s
      🟩 90a                Pass: 100%/1   | Total:  2m 29s | Avg:  2m 29s | Max:  2m 29s
    🟨 std
      🟨 17                 Pass:  93%/29  | Total:  1h 48m | Avg:  3m 44s | Max: 17m 50s
      🟨 20                 Pass:  92%/25  | Total:  1h 59m | Avg:  4m 46s | Max: 16m 29s | Hits:  87%/254   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 48m 30s: Pass: 96%/54 | Total: 4h 15m | Avg: 4m 44s | Max: 17m 35s | Hits: 75%/296
  • 🟨 cudax: Pass: 96%/54 | Total: 4h 15m | Avg: 4m 44s | Max: 17m 35s | Hits: 75%/296

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/50  | Total:  4h 03m | Avg:  4m 51s | Max: 17m 35s | Hits:  75%/296   
      🟩 arm64              Pass: 100%/4   | Total: 12m 30s | Avg:  3m 07s | Max:  3m 19s
    🚨 ctk: 12.5 🚨
      🟩 12.0               Pass: 100%/19  | Total:  1h 28m | Avg:  4m 40s | Max: 17m 19s | Hits:  75%/148   
      🔥 12.5               Pass:   0%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 56s
      🟩 12.6               Pass: 100%/33  | Total:  2h 33m | Avg:  4m 38s | Max: 17m 35s | Hits:  75%/148   
    🚨 cudacxx: nvcc12.5 🚨
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 28m | Avg:  4m 40s | Max: 17m 19s | Hits:  75%/148   
      🔥 nvcc12.5           Pass:   0%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 56s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 33m | Avg:  4m 38s | Max: 17m 35s | Hits:  75%/148   
    🚨 cxx: NVHPC24.7 🚨
      🟩 Clang9             Pass: 100%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 27s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 33s | Avg:  3m 46s | Max:  4m 09s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 54s | Avg:  2m 58s | Max:  3m 11s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 50s | Avg:  3m 12s | Max:  3m 20s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 13s | Avg:  3m 03s | Max:  3m 16s
      🟩 Clang14            Pass: 100%/4   | Total: 24m 58s | Avg:  6m 14s | Max: 15m 48s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 35s | Avg:  3m 17s | Max:  3m 18s
      🟩 Clang16            Pass: 100%/4   | Total: 13m 18s | Avg:  3m 19s | Max:  3m 27s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 45s | Avg:  3m 22s | Max:  3m 34s
      🟩 Clang18            Pass: 100%/2   | Total: 20m 38s | Avg: 10m 19s | Max: 17m 21s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 41s | Avg:  2m 50s | Max:  2m 56s
      🟩 GCC10              Pass: 100%/4   | Total: 13m 17s | Avg:  3m 19s | Max:  3m 31s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 40s | Avg:  2m 55s | Max:  3m 02s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 04m | Avg:  9m 12s | Max: 17m 35s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 30s | Avg:  2m 50s | Max:  3m 01s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 18s | Avg:  7m 18s | Max:  7m 18s | Hits:  75%/148   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 04s | Avg:  8m 04s | Max:  8m 04s | Hits:  75%/148   
      🔥 NVHPC24.7          Pass:   0%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 56s
    🚨 cxx_family: NVHPC 🚨
      🟩 Clang              Pass: 100%/30  | Total:  2h 03m | Avg:  4m 06s | Max: 17m 21s
      🟩 GCC                Pass: 100%/20  | Total:  1h 43m | Avg:  5m 10s | Max: 17m 35s
      🟩 MSVC               Pass: 100%/2   | Total: 15m 22s | Avg:  7m 41s | Max:  8m 04s | Hits:  75%/296   
      🔥 NVHPC              Pass:   0%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 56s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/49  | Total:  2h 50m | Avg:  3m 29s | Max:  8m 04s | Hits:  75%/296   
      🟩 Test               Pass: 100%/5   | Total:  1h 24m | Avg: 16m 57s | Max: 17m 35s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  96%/54  | Total:  4h 15m | Avg:  4m 44s | Max: 17m 35s | Hits:  75%/296   
    🟨 gpu
      🟨 v100               Pass:  96%/54  | Total:  4h 15m | Avg:  4m 44s | Max: 17m 35s | Hits:  75%/296   
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 53s | Avg:  2m 53s | Max:  2m 53s
      🟩 90a                Pass: 100%/1   | Total:  2m 38s | Avg:  2m 38s | Max:  2m 38s
    🟨 std
      🟨 17                 Pass:  96%/29  | Total:  2h 02m | Avg:  4m 14s | Max: 17m 19s
      🟨 20                 Pass:  96%/25  | Total:  2h 12m | Avg:  5m 18s | Max: 17m 35s | Hits:  75%/296   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 20m 21s: Pass: 98%/54 | Total: 4h 01m | Avg: 4m 28s | Max: 15m 46s | Hits: 85%/296
  • 🟨 cudax: Pass: 98%/54 | Total: 4h 01m | Avg: 4m 28s | Max: 15m 46s | Hits: 85%/296

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/50  | Total:  3h 51m | Avg:  4m 37s | Max: 15m 46s | Hits:  85%/296   
      🟩 arm64              Pass: 100%/4   | Total: 10m 20s | Avg:  2m 35s | Max:  2m 39s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/19  | Total:  1h 28m | Avg:  4m 38s | Max: 15m 08s | Hits:  85%/148   
      🟩 12.5               Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🔍 12.6               Pass:  96%/33  | Total:  2h 22m | Avg:  4m 19s | Max: 15m 46s | Hits:  85%/148   
    🔍 cudacxx: nvcc12.6 🔍
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 28m | Avg:  4m 38s | Max: 15m 08s | Hits:  85%/148   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🔍 nvcc12.6           Pass:  96%/33  | Total:  2h 22m | Avg:  4m 19s | Max: 15m 46s | Hits:  85%/148   
    🔍 cxx: GCC12 🔍
      🟩 Clang9             Pass: 100%/2   | Total:  7m 16s | Avg:  3m 38s | Max:  3m 45s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 52s | Avg:  3m 26s | Max:  3m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 58s | Avg:  2m 59s | Max:  3m 09s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 32s | Avg:  3m 08s | Max:  3m 34s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 16s | Avg:  3m 04s | Max:  3m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 24m 48s | Avg:  6m 12s | Max: 15m 08s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  3m 12s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 47s | Avg:  2m 56s | Max:  3m 18s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 49s | Avg:  3m 24s | Max:  3m 25s
      🟩 Clang18            Pass: 100%/2   | Total: 18m 51s | Avg:  9m 25s | Max: 15m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 14s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 08s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 31s | Avg:  3m 07s | Max:  3m 29s
      🔍 GCC12              Pass:  85%/7   | Total: 49m 53s | Avg:  7m 07s | Max: 15m 46s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 55s | Avg:  2m 38s | Max:  2m 51s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 09s | Avg: 10m 09s | Max: 10m 09s | Hits:  85%/148   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 48s | Avg: 12m 48s | Max: 12m 48s | Hits:  85%/148   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/30  | Total:  1h 59m | Avg:  3m 59s | Max: 15m 36s
      🔍 GCC                Pass:  95%/20  | Total:  1h 28m | Avg:  4m 24s | Max: 15m 46s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 57s | Avg: 11m 28s | Max: 12m 48s | Hits:  85%/296   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/49  | Total:  2h 53m | Avg:  3m 32s | Max: 12m 48s | Hits:  85%/296   
      🔍 Test               Pass:  80%/5   | Total:  1h 07m | Avg: 13m 33s | Max: 15m 46s
    🔍 std: 17 🔍
      🔍 17                 Pass:  96%/29  | Total:  1h 47m | Avg:  3m 42s | Max: 14m 49s
      🟩 20                 Pass: 100%/25  | Total:  2h 13m | Avg:  5m 21s | Max: 15m 46s | Hits:  85%/296   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  98%/54  | Total:  4h 01m | Avg:  4m 28s | Max: 15m 46s | Hits:  85%/296   
    🟨 gpu
      🟨 v100               Pass:  98%/54  | Total:  4h 01m | Avg:  4m 28s | Max: 15m 46s | Hits:  85%/296   
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
      🟩 90a                Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 59m 09s: Pass: 100%/54 | Total: 4h 09m | Avg: 4m 37s | Max: 15m 46s | Hits: 85%/296
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 09m | Avg: 4m 37s | Max: 15m 46s | Hits: 85%/296

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  3h 59m | Avg:  4m 46s | Max: 15m 46s | Hits:  85%/296   
      🟩 arm64              Pass: 100%/4   | Total: 10m 20s | Avg:  2m 35s | Max:  2m 39s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 28m | Avg:  4m 38s | Max: 15m 08s | Hits:  85%/148   
      🟩 12.5               Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🟩 12.6               Pass: 100%/33  | Total:  2h 30m | Avg:  4m 33s | Max: 15m 46s | Hits:  85%/148   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 28m | Avg:  4m 38s | Max: 15m 08s | Hits:  85%/148   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 30m | Avg:  4m 33s | Max: 15m 46s | Hits:  85%/148   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 09m | Avg:  4m 37s | Max: 15m 46s | Hits:  85%/296   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 16s | Avg:  3m 38s | Max:  3m 45s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 52s | Avg:  3m 26s | Max:  3m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 58s | Avg:  2m 59s | Max:  3m 09s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 32s | Avg:  3m 08s | Max:  3m 34s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 16s | Avg:  3m 04s | Max:  3m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 24m 48s | Avg:  6m 12s | Max: 15m 08s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 22s | Avg:  3m 11s | Max:  3m 12s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 47s | Avg:  2m 56s | Max:  3m 18s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 49s | Avg:  3m 24s | Max:  3m 25s
      🟩 Clang18            Pass: 100%/2   | Total: 18m 51s | Avg:  9m 25s | Max: 15m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 14s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 08s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 31s | Avg:  3m 07s | Max:  3m 29s
      🟩 GCC12              Pass: 100%/7   | Total: 57m 49s | Avg:  8m 15s | Max: 15m 46s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 55s | Avg:  2m 38s | Max:  2m 51s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 09s | Avg: 10m 09s | Max: 10m 09s | Hits:  85%/148   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 48s | Avg: 12m 48s | Max: 12m 48s | Hits:  85%/148   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 59m | Avg:  3m 59s | Max: 15m 36s
      🟩 GCC                Pass: 100%/20  | Total:  1h 36m | Avg:  4m 48s | Max: 15m 46s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 57s | Avg: 11m 28s | Max: 12m 48s | Hits:  85%/296   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 09m | Avg:  4m 37s | Max: 15m 46s | Hits:  85%/296   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 53m | Avg:  3m 32s | Max: 12m 48s | Hits:  85%/296   
      🟩 Test               Pass: 100%/5   | Total:  1h 15m | Avg: 15m 08s | Max: 15m 46s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
      🟩 90a                Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  1h 55m | Avg:  3m 59s | Max: 14m 49s
      🟩 20                 Pass: 100%/25  | Total:  2h 13m | Avg:  5m 21s | Max: 15m 46s | Hits:  85%/296   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 30m 43s: Pass: 83%/54 | Total: 4h 33m | Avg: 5m 04s | Max: 16m 33s
  • 🟨 cudax: Pass: 83%/54 | Total: 4h 33m | Avg: 5m 04s | Max: 16m 33s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  82%/50  | Total:  4h 19m | Avg:  5m 11s | Max: 16m 33s
      🟩 arm64              Pass: 100%/4   | Total: 13m 58s | Avg:  3m 29s | Max:  3m 35s
    🟨 ctk
      🟨 12.0               Pass:  78%/19  | Total:  1h 35m | Avg:  5m 02s | Max: 16m 30s
      🟩 12.5               Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 26s
      🟨 12.6               Pass:  84%/33  | Total:  2h 46m | Avg:  5m 01s | Max: 16m 33s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  78%/19  | Total:  1h 35m | Avg:  5m 02s | Max: 16m 30s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 26s
      🟨 nvcc12.6           Pass:  84%/33  | Total:  2h 46m | Avg:  5m 01s | Max: 16m 33s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 42s
      🟩 Clang10            Pass: 100%/2   | Total:  8m 18s | Avg:  4m 09s | Max:  4m 23s
      🟩 Clang11            Pass: 100%/4   | Total: 14m 26s | Avg:  3m 36s | Max:  3m 43s
      🟩 Clang12            Pass: 100%/4   | Total: 14m 11s | Avg:  3m 32s | Max:  3m 40s
      🟩 Clang13            Pass: 100%/4   | Total: 14m 06s | Avg:  3m 31s | Max:  3m 39s
      🟨 Clang14            Pass:  75%/4   | Total: 26m 38s | Avg:  6m 39s | Max: 16m 04s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 07s
      🟩 Clang16            Pass: 100%/4   | Total: 14m 32s | Avg:  3m 38s | Max:  4m 01s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 41s | Avg:  3m 50s | Max:  4m 03s
      🟨 Clang18            Pass:  50%/2   | Total: 19m 15s | Avg:  9m 37s | Max: 15m 35s
      🟥 GCC9               Pass:   0%/2   | Total:  7m 07s | Avg:  3m 33s | Max:  3m 51s
      🟩 GCC10              Pass: 100%/4   | Total: 14m 48s | Avg:  3m 42s | Max:  4m 01s
      🟩 GCC11              Pass: 100%/4   | Total: 14m 39s | Avg:  3m 39s | Max:  4m 04s
      🟨 GCC12              Pass:  57%/7   | Total:  1h 03m | Avg:  9m 04s | Max: 16m 33s
      🟩 GCC13              Pass: 100%/3   | Total: 10m 44s | Avg:  3m 34s | Max:  3m 41s
      🟥 MSVC14.36          Pass:   0%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s
      🟥 MSVC14.39          Pass:   0%/1   | Total:  8m 25s | Avg:  8m 25s | Max:  8m 25s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 26s
    🟨 cxx_family
      🟨 Clang              Pass:  93%/30  | Total:  2h 15m | Avg:  4m 30s | Max: 16m 04s
      🟨 GCC                Pass:  75%/20  | Total:  1h 50m | Avg:  5m 32s | Max: 16m 33s
      🟥 MSVC               Pass:   0%/2   | Total: 15m 31s | Avg:  7m 45s | Max:  8m 25s
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 26s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  83%/54  | Total:  4h 33m | Avg:  5m 04s | Max: 16m 33s
    🟨 gpu
      🟨 v100               Pass:  83%/54  | Total:  4h 33m | Avg:  5m 04s | Max: 16m 33s
    🟨 jobs
      🟨 Build              Pass:  91%/49  | Total:  3h 12m | Avg:  3m 56s | Max:  8m 25s
      🟥 Test               Pass:   0%/5   | Total:  1h 20m | Avg: 16m 11s | Max: 16m 33s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
      🟩 90a                Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
    🟨 std
      🟨 17                 Pass:  86%/29  | Total:  2h 16m | Avg:  4m 41s | Max: 16m 33s
      🟨 20                 Pass:  80%/25  | Total:  2h 17m | Avg:  5m 30s | Max: 16m 17s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 1h 45m: Pass: 99%/394 | Total: 1d 20h | Avg: 6m 42s | Max: 39m 10s | Hits: 98%/25644
  • 🟨 cudax: Pass: 96%/54 | Total: 4h 22m | Avg: 4m 51s | Max: 19m 30s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/50  | Total:  4h 10m | Avg:  5m 00s | Max: 19m 30s
      🟩 arm64              Pass: 100%/4   | Total: 11m 57s | Avg:  2m 59s | Max:  3m 25s
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/30  | Total:  2h 01m | Avg:  4m 02s | Max: 16m 43s
      🟩 GCC                Pass: 100%/20  | Total:  1h 47m | Avg:  5m 21s | Max: 19m 30s
      🔥 MSVC               Pass:   0%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 31s
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 02s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/49  | Total:  2h 53m | Avg:  3m 32s | Max: 11m 31s
      🟩 Test               Pass: 100%/5   | Total:  1h 28m | Avg: 17m 41s | Max: 19m 30s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/29  | Total:  2h 03m | Avg:  4m 15s | Max: 19m 30s
      🔍 20                 Pass:  92%/25  | Total:  2h 18m | Avg:  5m 32s | Max: 18m 57s
    🟨 ctk
      🟨 12.0               Pass:  94%/19  | Total:  1h 31m | Avg:  4m 49s | Max: 16m 39s
      🟩 12.5               Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 02s
      🟨 12.6               Pass:  96%/33  | Total:  2h 38m | Avg:  4m 47s | Max: 19m 30s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  94%/19  | Total:  1h 31m | Avg:  4m 49s | Max: 16m 39s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 02s
      🟨 nvcc12.6           Pass:  96%/33  | Total:  2h 38m | Avg:  4m 47s | Max: 19m 30s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 46s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 30s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 06s | Avg:  3m 01s | Max:  3m 13s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 40s | Avg:  2m 55s | Max:  3m 01s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 19s | Avg:  3m 04s | Max:  3m 17s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 52s | Avg:  6m 28s | Max: 16m 36s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 30s | Avg:  3m 15s | Max:  3m 19s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 46s | Avg:  3m 11s | Max:  3m 22s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 27s | Avg:  3m 13s | Max:  3m 21s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 53s | Avg:  9m 56s | Max: 16m 43s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 21s | Avg:  3m 10s | Max:  3m 42s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 02s | Avg:  3m 00s | Max:  3m 14s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 37s | Avg:  3m 09s | Max:  3m 25s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 07m | Avg:  9m 39s | Max: 19m 30s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 33s | Avg:  2m 51s | Max:  3m 25s
      🟥 MSVC14.36          Pass:   0%/1   | Total: 10m 28s | Avg: 10m 28s | Max: 10m 28s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 11m 31s | Avg: 11m 31s | Max: 11m 31s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 02s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  96%/54  | Total:  4h 22m | Avg:  4m 51s | Max: 19m 30s
    🟨 gpu
      🟨 v100               Pass:  96%/54  | Total:  4h 22m | Avg:  4m 51s | Max: 19m 30s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 90a                Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
    
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 14h 02m | Avg: 7m 08s | Max: 39m 10s | Hits: 98%/9500

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 13h 34m | Avg:  7m 24s | Max: 39m 10s | Hits:  98%/9500  
      🟩 arm64              Pass: 100%/8   | Total: 28m 21s | Avg:  3m 32s | Max:  3m 54s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 04m | Avg:  4m 17s | Max: 18m 01s | Hits:  98%/2181  
      🟩 11.8               Pass: 100%/3   | Total:  1h 00m | Avg: 20m 02s | Max: 30m 55s
      🟩 12.5               Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 39m 10s
      🟩 12.6               Pass: 100%/96  | Total: 10h 53m | Avg:  6m 48s | Max: 29m 19s | Hits:  97%/7319  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 33m | Avg: 12m 46s | Max: 19m 44s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 04m | Avg:  4m 17s | Max: 18m 01s | Hits:  98%/2181  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 00m | Avg: 20m 02s | Max: 30m 55s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 39m 10s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  8h 20m | Avg:  5m 57s | Max: 29m 19s | Hits:  97%/7319  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 33m | Avg: 12m 46s | Max: 19m 44s
      🟩 nvcc               Pass: 100%/106 | Total: 11h 29m | Avg:  6m 30s | Max: 39m 10s | Hits:  98%/9500  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 23s | Avg:  4m 23s | Max:  6m 06s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 51s | Avg:  5m 37s | Max:  6m 16s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 31s | Avg:  4m 37s | Max:  5m 00s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  5m 00s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 14s | Avg:  4m 33s | Max:  5m 01s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 41s | Avg:  4m 25s | Max:  4m 52s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 29s | Avg:  4m 22s | Max:  4m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  4m 48s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 45s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 10m | Avg: 10m 35s | Max: 19m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 33s | Avg:  2m 46s | Max:  2m 53s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 05s | Avg:  3m 20s | Max:  4m 08s
      🟩 GCC8               Pass: 100%/6   | Total: 21m 05s | Avg:  3m 30s | Max:  4m 31s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 19s | Avg:  4m 03s | Max:  6m 03s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 54s | Avg:  3m 58s | Max:  4m 34s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 14m | Avg: 10m 42s | Max: 30m 55s
      🟩 GCC12              Pass: 100%/4   | Total: 15m 56s | Avg:  3m 59s | Max:  4m 30s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 43m | Avg:  9m 38s | Max: 29m 19s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 25s | Avg:  5m 48s | Max:  6m 14s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 01s | Avg: 18m 01s | Max: 18m 01s | Hits:  98%/2181  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 27m 18s | Avg: 13m 39s | Max: 14m 10s | Hits:  97%/4725  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 03s | Avg: 14m 03s | Max: 14m 03s | Hits:  97%/2594  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 39m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  5h 59m | Avg:  6m 32s | Max: 19m 44s
      🟩 GCC                Pass: 100%/52  | Total:  5h 41m | Avg:  6m 34s | Max: 30m 55s
      🟩 Intel              Pass: 100%/3   | Total: 17m 25s | Avg:  5m 48s | Max:  6m 14s
      🟩 MSVC               Pass: 100%/4   | Total: 59m 22s | Avg: 14m 50s | Max: 18m 01s | Hits:  98%/9500  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 39m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 14h 02m | Avg:  7m 08s | Max: 39m 10s | Hits:  98%/9500  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 11h 38m | Avg:  6m 21s | Max: 39m 10s | Hits:  98%/9500  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 34m | Avg: 23m 32s | Max: 29m 19s
      🟩 Test               Pass: 100%/3   | Total: 47m 40s | Avg: 15m 53s | Max: 18m 12s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 00m | Avg: 20m 02s | Max: 30m 55s
      🟩 90                 Pass: 100%/4   | Total: 42m 56s | Avg: 10m 44s | Max: 12m 10s
      🟩 90a                Pass: 100%/8   | Total:  1h 00m | Avg:  7m 35s | Max: 13m 39s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  3h 08m | Avg:  5m 54s | Max: 29m 19s
      🟩 14                 Pass: 100%/32  | Total:  3h 26m | Avg:  6m 27s | Max: 19m 33s | Hits:  98%/4465  
      🟩 17                 Pass: 100%/30  | Total:  4h 14m | Avg:  8m 29s | Max: 39m 10s | Hits:  97%/2441  
      🟩 20                 Pass: 100%/23  | Total:  3h 10m | Avg:  8m 17s | Max: 21m 13s | Hits:  97%/2594  
    
  • 🟩 cub: Pass: 100%/110 | Total: 12h 05m | Avg: 6m 35s | Max: 23m 55s | Hits: 99%/2964

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 11h 28m | Avg:  6m 44s | Max: 23m 55s | Hits:  99%/2964  
      🟩 arm64              Pass: 100%/8   | Total: 37m 31s | Avg:  4m 41s | Max:  5m 10s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 59s | Max: 14m 50s | Hits:  99%/741   
      🟩 11.8               Pass: 100%/3   | Total: 15m 45s | Avg:  5m 15s | Max:  5m 31s
      🟩 12.5               Pass: 100%/4   | Total: 35m 57s | Avg:  8m 59s | Max:  9m 33s
      🟩 12.6               Pass: 100%/88  | Total:  9h 59m | Avg:  6m 48s | Max: 23m 55s | Hits:  99%/2223  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 36s | Avg:  4m 09s | Max:  4m 15s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 59s | Max: 14m 50s | Hits:  99%/741   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 45s | Avg:  5m 15s | Max:  5m 31s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 57s | Avg:  8m 59s | Max:  9m 33s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 42m | Avg:  6m 56s | Max: 23m 55s | Hits:  99%/2223  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 36s | Avg:  4m 09s | Max:  4m 15s
      🟩 nvcc               Pass: 100%/106 | Total: 11h 49m | Avg:  6m 41s | Max: 23m 55s | Hits:  99%/2964  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 59s | Avg:  5m 09s | Max:  6m 10s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 01s | Avg:  6m 20s | Max:  6m 47s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 27s | Avg:  5m 21s | Max:  5m 42s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 19s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 56s | Avg:  4m 59s | Max:  5m 13s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 24s | Avg:  5m 06s | Max:  5m 27s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 56s | Avg:  4m 59s | Max:  5m 09s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 22s | Avg:  5m 05s | Max:  5m 22s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 58s | Avg:  5m 14s | Max:  5m 36s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 23m | Avg:  7m 35s | Max: 23m 55s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 15s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 56s | Avg:  4m 39s | Max:  5m 26s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 34s | Avg:  4m 45s | Max:  5m 13s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 00s | Avg:  4m 50s | Max:  5m 25s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 47s | Avg:  5m 26s | Max:  5m 44s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 31s | Avg:  5m 21s | Max:  5m 40s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 09s | Avg:  5m 32s | Max:  5m 57s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 48m | Avg: 10m 31s | Max: 23m 51s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 57s | Avg:  5m 59s | Max:  6m 02s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 50s | Avg: 14m 50s | Max: 14m 50s | Hits:  99%/741   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 59s | Avg: 11m 59s | Max: 12m 10s | Hits:  99%/1482  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 43s | Avg: 12m 43s | Max: 12m 43s | Hits:  99%/741   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 57s | Avg:  8m 59s | Max:  9m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 36m | Avg:  5m 46s | Max: 23m 55s
      🟩 GCC                Pass: 100%/51  | Total:  5h 43m | Avg:  6m 43s | Max: 23m 51s
      🟩 Intel              Pass: 100%/3   | Total: 17m 57s | Avg:  5m 59s | Max:  6m 02s
      🟩 MSVC               Pass: 100%/4   | Total: 51m 32s | Avg: 12m 53s | Max: 14m 50s | Hits:  99%/2964  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 57s | Avg:  8m 59s | Max:  9m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 12h 05m | Avg:  6m 35s | Max: 23m 55s | Hits:  99%/2964  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 19m | Avg:  5m 29s | Max: 14m 50s | Hits:  99%/2964  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 39s | Avg: 17m 39s | Max: 17m 39s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 52s | Avg: 19m 52s | Max: 19m 52s
      🟩 HostLaunch         Pass: 100%/3   | Total: 58m 22s | Avg: 19m 27s | Max: 21m 29s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 09m | Avg: 23m 18s | Max: 23m 55s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 45s | Avg:  5m 15s | Max:  5m 31s
      🟩 90a                Pass: 100%/4   | Total: 15m 49s | Avg:  3m 57s | Max:  4m 07s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 58m | Avg:  5m 57s | Max: 22m 08s
      🟩 14                 Pass: 100%/29  | Total:  2h 45m | Avg:  5m 43s | Max: 14m 50s | Hits:  99%/1482  
      🟩 17                 Pass: 100%/27  | Total:  2h 30m | Avg:  5m 34s | Max: 12m 10s | Hits:  99%/741   
      🟩 20                 Pass: 100%/24  | Total:  3h 50m | Avg:  9m 36s | Max: 23m 55s | Hits:  99%/741   
    
  • 🟩 thrust: Pass: 100%/109 | Total: 13h 08m | Avg: 7m 13s | Max: 24m 06s | Hits: 99%/13180

    🟩 cpu
      🟩 amd64              Pass: 100%/101 | Total: 12h 26m | Avg:  7m 23s | Max: 24m 06s | Hits:  99%/13180 
      🟩 arm64              Pass: 100%/8   | Total: 42m 08s | Avg:  5m 16s | Max:  6m 07s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 26m | Avg:  5m 47s | Max: 18m 53s | Hits:  99%/2636  
      🟩 11.8               Pass: 100%/3   | Total: 18m 58s | Avg:  6m 19s | Max:  7m 13s
      🟩 12.5               Pass: 100%/4   | Total:  1h 12m | Avg: 18m 11s | Max: 19m 44s
      🟩 12.6               Pass: 100%/87  | Total: 10h 09m | Avg:  7m 00s | Max: 24m 06s | Hits:  99%/10544 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 21m 34s | Avg:  5m 23s | Max:  5m 59s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 26m | Avg:  5m 47s | Max: 18m 53s | Hits:  99%/2636  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 18m 58s | Avg:  6m 19s | Max:  7m 13s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 12m | Avg: 18m 11s | Max: 19m 44s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  9h 48m | Avg:  7m 05s | Max: 24m 06s | Hits:  99%/10544 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 21m 34s | Avg:  5m 23s | Max:  5m 59s
      🟩 nvcc               Pass: 100%/105 | Total: 12h 46m | Avg:  7m 18s | Max: 24m 06s | Hits:  99%/13180 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 35m 13s | Avg:  5m 52s | Max:  7m 08s
      🟩 Clang10            Pass: 100%/3   | Total: 21m 57s | Avg:  7m 19s | Max:  7m 25s
      🟩 Clang11            Pass: 100%/4   | Total: 23m 22s | Avg:  5m 50s | Max:  6m 08s
      🟩 Clang12            Pass: 100%/4   | Total: 24m 16s | Avg:  6m 04s | Max:  6m 24s
      🟩 Clang13            Pass: 100%/4   | Total: 23m 33s | Avg:  5m 53s | Max:  6m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 22m 42s | Avg:  5m 40s | Max:  6m 05s
      🟩 Clang15            Pass: 100%/4   | Total: 23m 49s | Avg:  5m 57s | Max:  6m 17s
      🟩 Clang16            Pass: 100%/4   | Total: 24m 53s | Avg:  6m 13s | Max:  6m 42s
      🟩 Clang17            Pass: 100%/4   | Total: 24m 04s | Avg:  6m 01s | Max:  6m 29s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 11m | Avg:  6m 30s | Max: 15m 18s
      🟩 GCC6               Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 32s
      🟩 GCC7               Pass: 100%/6   | Total: 30m 44s | Avg:  5m 07s | Max:  6m 14s
      🟩 GCC8               Pass: 100%/6   | Total: 32m 30s | Avg:  5m 25s | Max:  6m 18s
      🟩 GCC9               Pass: 100%/6   | Total: 32m 38s | Avg:  5m 26s | Max:  6m 41s
      🟩 GCC10              Pass: 100%/4   | Total: 23m 18s | Avg:  5m 49s | Max:  6m 05s
      🟩 GCC11              Pass: 100%/7   | Total: 43m 49s | Avg:  6m 15s | Max:  7m 13s
      🟩 GCC12              Pass: 100%/4   | Total: 25m 52s | Avg:  6m 28s | Max:  7m 04s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 40m | Avg:  7m 12s | Max: 15m 32s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 21m 31s | Avg:  7m 10s | Max:  7m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 53s | Avg: 18m 53s | Max: 18m 53s | Hits:  99%/2636  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 36m 53s | Avg: 18m 26s | Max: 19m 23s | Hits:  99%/5272  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 43m 12s | Avg: 21m 36s | Max: 24m 06s | Hits:  99%/5272  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 12m | Avg: 18m 11s | Max: 19m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 55m | Avg:  6m 09s | Max: 15m 18s
      🟩 GCC                Pass: 100%/49  | Total:  4h 59m | Avg:  6m 07s | Max: 15m 32s
      🟩 Intel              Pass: 100%/3   | Total: 21m 31s | Avg:  7m 10s | Max:  7m 33s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 38m | Avg: 19m 47s | Max: 24m 06s | Hits:  99%/13180 
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 12m | Avg: 18m 11s | Max: 19m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/109 | Total: 13h 08m | Avg:  7m 13s | Max: 24m 06s | Hits:  99%/13180 
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 11h 34m | Avg:  6m 48s | Max: 19m 44s | Hits:  99%/10544 
      🟩 TestCPU            Pass: 100%/4   | Total: 50m 05s | Avg: 12m 31s | Max: 24m 06s | Hits:  99%/2636  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 40s | Avg: 14m 33s | Max: 15m 32s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 18m 58s | Avg:  6m 19s | Max:  7m 13s
      🟩 90a                Pass: 100%/4   | Total: 20m 41s | Avg:  5m 10s | Max:  5m 37s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 04m | Avg:  6m 08s | Max: 16m 55s
      🟩 14                 Pass: 100%/29  | Total:  3h 28m | Avg:  7m 10s | Max: 19m 23s | Hits:  99%/5272  
      🟩 17                 Pass: 100%/27  | Total:  3h 08m | Avg:  6m 59s | Max: 17m 53s | Hits:  99%/2636  
      🟩 20                 Pass: 100%/23  | Total:  3h 26m | Avg:  8m 59s | Max: 24m 06s | Hits:  99%/5272  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 21s | Avg: 5m 10s | Max: 8m 18s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  8m 18s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  8m 18s | Avg:  8m 18s | Max:  8m 18s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 52s | Avg: 15m 52s | Max: 15m 52s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 394)

# Runner
326 linux-amd64-cpu16
28 linux-arm64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟨 CI finished in 2h 14m: Pass: 98%/396 | Total: 2d 05h | Avg: 8m 02s | Max: 57m 49s | Hits: 84%/22084
  • 🟨 thrust: Pass: 98%/111 | Total: 12h 16m | Avg: 6m 38s | Max: 27m 42s | Hits: 99%/9260

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/103 | Total: 11h 38m | Avg:  6m 47s | Max: 27m 42s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 37m 36s | Avg:  4m 42s | Max:  5m 08s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 17m | Avg:  5m 09s | Max: 17m 20s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 12.5               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
      🔍 12.6               Pass:  97%/89  | Total:  9h 41m | Avg:  6m 32s | Max: 27m 42s | Hits:  99%/7408  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 17m | Avg:  5m 09s | Max: 17m 20s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
      🔍 nvcc12.6           Pass:  97%/85  | Total:  9h 22m | Avg:  6m 36s | Max: 27m 42s | Hits:  99%/7408  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 13s
      🔍 nvcc               Pass:  98%/107 | Total: 11h 56m | Avg:  6m 41s | Max: 27m 42s | Hits:  99%/9260  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 30m 47s | Avg:  5m 07s | Max:  6m 47s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 20s | Avg:  6m 46s | Max:  7m 56s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 35s | Avg:  5m 23s | Max:  5m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 35s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 30s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 34s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 36s | Avg:  5m 24s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 31s | Avg:  5m 07s | Max:  5m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 47s | Avg:  5m 11s | Max:  5m 27s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 19m | Avg:  7m 10s | Max: 27m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 23s
      🟩 GCC7               Pass: 100%/6   | Total: 29m 18s | Avg:  4m 53s | Max:  5m 38s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 53s | Avg:  4m 38s | Max:  5m 22s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 46s | Avg:  4m 47s | Max:  5m 32s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 13s | Avg:  5m 18s | Max:  5m 41s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 59s | Avg:  5m 25s | Max:  5m 45s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 00s | Avg:  5m 30s | Max:  5m 44s
      🔍 GCC13              Pass:  87%/16  | Total:  1h 45m | Avg:  6m 36s | Max: 13m 21s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 23s | Avg:  6m 47s | Max:  7m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 38m 17s | Avg: 19m 08s | Max: 21m 27s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 40m 32s | Avg: 20m 16s | Max: 22m 48s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/48  | Total:  4h 37m | Avg:  5m 46s | Max: 27m 42s
      🔍 GCC                Pass:  96%/51  | Total:  4h 41m | Avg:  5m 31s | Max: 13m 21s
      🟩 Intel              Pass: 100%/3   | Total: 20m 23s | Avg:  6m 47s | Max:  7m 12s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 13s | Max: 22m 48s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/103 | Total: 10h 26m | Avg:  6m 05s | Max: 21m 27s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 19s | Avg: 11m 34s | Max: 22m 48s | Hits:  99%/1852  
      🔍 TestGPU            Pass:  50%/4   | Total:  1h 03m | Avg: 15m 53s | Max: 27m 42s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/30  | Total:  2h 44m | Avg:  5m 29s | Max: 13m 58s
      🟩 14                 Pass: 100%/29  | Total:  3h 02m | Avg:  6m 18s | Max: 17m 20s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 55m | Avg:  6m 30s | Max: 21m 27s | Hits:  99%/1852  
      🔍 20                 Pass:  95%/23  | Total:  3h 14m | Avg:  8m 27s | Max: 27m 42s | Hits:  99%/3704  
    🟨 cmake_options
      🟨 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:  50%/2   | Total: 19m 04s | Avg:  9m 32s | Max: 13m 21s
    🟨 gpu
      🟨 v100               Pass:  98%/111 | Total: 12h 16m | Avg:  6m 38s | Max: 27m 42s | Hits:  99%/9260  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 90a                Pass: 100%/4   | Total: 18m 11s | Avg:  4m 32s | Max:  4m 52s
    
  • 🟨 libcudacxx: Pass: 99%/118 | Total: 21h 28m | Avg: 10m 55s | Max: 44m 57s | Hits: 65%/9504

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/110 | Total: 20h 44m | Avg: 11m 18s | Max: 44m 57s | Hits:  65%/9504  
      🟩 arm64              Pass: 100%/8   | Total: 43m 58s | Avg:  5m 29s | Max: 18m 19s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  2h 57m | Avg: 11m 51s | Max: 37m 00s | Hits:  34%/2182  
      🟩 11.8               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 12.5               Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
      🔍 12.6               Pass:  98%/96  | Total: 15h 37m | Avg:  9m 45s | Max: 43m 13s | Hits:  74%/7322  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 32m | Avg: 12m 43s | Max: 25m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 57m | Avg: 11m 51s | Max: 37m 00s | Hits:  34%/2182  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
      🔍 nvcc12.6           Pass:  98%/84  | Total: 13h 04m | Avg:  9m 20s | Max: 43m 13s | Hits:  74%/7322  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 32m | Avg: 12m 43s | Max: 25m 38s
      🔍 nvcc               Pass:  99%/106 | Total: 18h 56m | Avg: 10m 43s | Max: 44m 57s | Hits:  65%/9504  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 26m 58s | Avg:  4m 29s | Max:  6m 26s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 25s | Avg:  5m 48s | Max:  6m 09s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  4m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 47s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  4m 40s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  5m 10s
      🟩 Clang15            Pass: 100%/4   | Total: 32m 43s | Avg:  8m 10s | Max: 18m 59s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 20s | Avg:  5m 05s | Max:  6m 14s
      🟩 Clang17            Pass: 100%/4   | Total: 26m 49s | Avg:  6m 42s | Max: 13m 23s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 13m | Avg: 10m 46s | Max: 25m 38s
      🟩 GCC6               Pass: 100%/2   | Total: 31m 12s | Avg: 15m 36s | Max: 27m 13s
      🟩 GCC7               Pass: 100%/6   | Total: 56m 16s | Avg:  9m 22s | Max: 23m 51s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 15m | Avg: 12m 35s | Max: 19m 01s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 06m | Avg: 11m 03s | Max: 26m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 26m 03s | Avg:  6m 30s | Max: 13m 20s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 26m | Avg: 20m 52s | Max: 31m 16s
      🟩 GCC12              Pass: 100%/4   | Total: 37m 05s | Avg:  9m 16s | Max: 24m 55s
      🔍 GCC13              Pass:  94%/17  | Total:  3h 13m | Avg: 11m 23s | Max: 30m 06s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 04m | Avg: 21m 28s | Max: 32m 04s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 37m 00s | Avg: 37m 00s | Max: 37m 00s | Hits:  34%/2182  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 32s | Avg: 12m 16s | Max: 12m 47s | Hits:  99%/4727  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s | Hits:  29%/2595  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/55  | Total:  6h 29m | Avg:  7m 05s | Max: 25m 38s
      🔍 GCC                Pass:  98%/52  | Total: 10h 32m | Avg: 12m 09s | Max: 31m 16s
      🟩 Intel              Pass: 100%/3   | Total:  1h 04m | Avg: 21m 28s | Max: 32m 04s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 44m | Avg: 26m 11s | Max: 43m 13s | Hits:  65%/9504  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
    🔍 jobs: NVRTC 🔍
      🟩 Build              Pass: 100%/110 | Total: 19h 00m | Avg: 10m 21s | Max: 44m 57s | Hits:  65%/9504  
      🔍 NVRTC              Pass:  75%/4   | Total:  1h 35m | Avg: 23m 55s | Max: 30m 06s
      🟩 Test               Pass: 100%/3   | Total: 50m 56s | Avg: 16m 58s | Max: 21m 02s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/32  | Total:  4h 46m | Avg:  8m 57s | Max: 27m 13s
      🟩 14                 Pass: 100%/32  | Total:  6h 02m | Avg: 11m 19s | Max: 37m 00s | Hits:  67%/4467  
      🟩 17                 Pass: 100%/30  | Total:  5h 36m | Avg: 11m 13s | Max: 32m 04s | Hits:  99%/2442  
      🔍 20                 Pass:  95%/23  | Total:  5h 00m | Avg: 13m 03s | Max: 44m 57s | Hits:  29%/2595  
    🟨 gpu
      🟨 v100               Pass:  99%/118 | Total: 21h 28m | Avg: 10m 55s | Max: 44m 57s | Hits:  65%/9504  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 90                 Pass: 100%/4   | Total: 42m 49s | Avg: 10m 42s | Max: 12m 51s
      🟩 90a                Pass: 100%/8   | Total: 56m 35s | Avg:  7m 04s | Max: 12m 50s
    
  • 🟨 cub: Pass: 99%/110 | Total: 14h 33m | Avg: 7m 56s | Max: 57m 49s | Hits: 99%/3028

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/102 | Total: 13h 55m | Avg:  8m 11s | Max: 57m 49s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 38m 09s | Avg:  4m 46s | Max:  5m 37s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 59m | Avg:  7m 58s | Max: 48m 57s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 12.5               Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
      🔍 12.6               Pass:  98%/88  | Total: 11h 40m | Avg:  7m 57s | Max: 57m 49s | Hits:  99%/2271  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 22s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 59m | Avg:  7m 58s | Max: 48m 57s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
      🔍 nvcc12.6           Pass:  98%/84  | Total: 11h 23m | Avg:  8m 08s | Max: 57m 49s | Hits:  99%/2271  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 22s
      🔍 nvcc               Pass:  99%/106 | Total: 14h 17m | Avg:  8m 05s | Max: 57m 49s | Hits:  99%/3028  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 31m 18s | Avg:  5m 13s | Max:  6m 40s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 23s | Avg:  6m 27s | Max:  7m 18s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 42s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 32s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 18s | Avg:  5m 19s | Max:  5m 27s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 13s | Avg:  5m 03s | Max:  5m 12s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 24s | Avg:  5m 21s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  6m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 48s | Avg:  5m 12s | Max:  5m 29s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 48m | Avg:  9m 49s | Max: 46m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 33s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 13s | Max: 48m 57s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 50s | Avg:  4m 48s | Max:  5m 18s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 33s | Avg:  4m 55s | Max:  5m 37s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 18s | Avg:  5m 19s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 06s | Avg:  5m 26s | Max:  5m 39s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 14m | Avg: 18m 33s | Max: 57m 49s
      🔍 GCC13              Pass:  93%/16  | Total:  3h 05m | Avg: 11m 34s | Max: 40m 59s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 13s | Avg:  6m 04s | Max:  6m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 13m 46s | Avg: 13m 46s | Max: 13m 46s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 58s | Avg: 12m 29s | Max: 12m 29s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 42s | Avg: 13m 42s | Max: 13m 42s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/48  | Total:  5h 06m | Avg:  6m 23s | Max: 46m 42s
      🔍 GCC                Pass:  98%/51  | Total:  7h 39m | Avg:  9m 00s | Max: 57m 49s
      🟩 Intel              Pass: 100%/3   | Total: 18m 13s | Avg:  6m 04s | Max:  6m 12s
      🟩 MSVC               Pass: 100%/4   | Total: 52m 26s | Avg: 13m 06s | Max: 13m 46s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
    🚨 jobs: GraphCapture 🚨
      🟩 Build              Pass: 100%/102 | Total: 11h 08m | Avg:  6m 33s | Max: 57m 49s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 27m 36s | Avg: 27m 36s | Max: 27m 36s
      🔥 GraphCapture       Pass:   0%/1   | Total: 12m 07s | Avg: 12m 07s | Max: 12m 07s
      🟩 HostLaunch         Pass: 100%/3   | Total: 57m 16s | Avg: 19m 05s | Max: 20m 08s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 48m | Avg: 36m 02s | Max: 46m 42s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/30  | Total:  3h 01m | Avg:  6m 03s | Max: 20m 26s
      🟩 14                 Pass: 100%/29  | Total:  2h 48m | Avg:  5m 48s | Max: 13m 46s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  4h 11m | Avg:  9m 19s | Max: 57m 49s | Hits:  99%/757   
      🔍 20                 Pass:  95%/24  | Total:  4h 32m | Avg: 11m 20s | Max: 46m 42s | Hits:  99%/757   
    🟨 gpu
      🟨 v100               Pass:  99%/110 | Total: 14h 33m | Avg:  7m 56s | Max: 57m 49s | Hits:  99%/3028  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 90a                Pass: 100%/4   | Total: 16m 36s | Avg:  4m 09s | Max:  4m 14s
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 19m | Avg: 4m 48s | Max: 21m 04s | Hits: 92%/292

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 09m | Avg:  4m 59s | Max: 21m 04s | Hits:  92%/292   
      🟩 arm64              Pass: 100%/4   | Total: 10m 28s | Avg:  2m 37s | Max:  2m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 20m 49s | Hits:  91%/146   
      🟩 12.5               Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
      🟩 12.6               Pass: 100%/33  | Total:  2h 32m | Avg:  4m 37s | Max: 21m 04s | Hits:  92%/146   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 20m 49s | Hits:  91%/146   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 32m | Avg:  4m 37s | Max: 21m 04s | Hits:  92%/146   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 19m | Avg:  4m 48s | Max: 21m 04s | Hits:  92%/292   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 39s | Avg:  3m 19s | Max:  3m 23s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 03s | Avg:  3m 31s | Max:  3m 47s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 52s | Avg:  2m 58s | Max:  3m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 00s | Avg:  3m 00s | Max:  3m 17s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 29s | Avg:  3m 07s | Max:  3m 23s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 51s | Avg:  7m 27s | Max: 20m 49s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 42s | Avg:  3m 21s | Max:  3m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 45s | Avg:  2m 56s | Max:  3m 14s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 19s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 37s | Avg:  9m 48s | Max: 16m 32s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 56s | Avg:  2m 58s | Max:  3m 01s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 29s | Avg:  3m 07s | Max:  3m 30s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 19s | Avg:  2m 49s | Max:  2m 52s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 11m | Avg: 10m 12s | Max: 21m 04s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 50s | Avg:  2m 36s | Max:  2m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 48s | Avg:  7m 48s | Max:  7m 48s | Hits:  91%/146   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  7m 59s | Avg:  7m 59s | Max:  7m 59s | Hits:  92%/146   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 04m | Avg:  4m 08s | Max: 20m 49s
      🟩 GCC                Pass: 100%/20  | Total:  1h 49m | Avg:  5m 27s | Max: 21m 04s
      🟩 MSVC               Pass: 100%/2   | Total: 15m 47s | Avg:  7m 53s | Max:  7m 59s | Hits:  92%/292   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 19m | Avg:  4m 48s | Max: 21m 04s | Hits:  92%/292   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 42m | Avg:  3m 19s | Max:  7m 59s | Hits:  92%/292   
      🟩 Test               Pass: 100%/5   | Total:  1h 37m | Avg: 19m 28s | Max: 21m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 90a                Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 06m | Avg:  4m 22s | Max: 21m 04s
      🟩 20                 Pass: 100%/25  | Total:  2h 13m | Avg:  5m 19s | Max: 20m 49s | Hits:  92%/292   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 54s | Avg: 4m 57s | Max: 7m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 3h 38m: Pass: 100%/396 | Total: 2d 05h | Avg: 8m 02s | Max: 57m 49s | Hits: 84%/22084
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 21h 18m | Avg: 10m 50s | Max: 44m 57s | Hits: 65%/9504

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 20h 34m | Avg: 11m 13s | Max: 44m 57s | Hits:  65%/9504  
      🟩 arm64              Pass: 100%/8   | Total: 43m 58s | Avg:  5m 29s | Max: 18m 19s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 57m | Avg: 11m 51s | Max: 37m 00s | Hits:  34%/2182  
      🟩 11.8               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 12.5               Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
      🟩 12.6               Pass: 100%/96  | Total: 15h 27m | Avg:  9m 39s | Max: 43m 13s | Hits:  74%/7322  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 32m | Avg: 12m 43s | Max: 25m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 57m | Avg: 11m 51s | Max: 37m 00s | Hits:  34%/2182  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 12h 54m | Avg:  9m 13s | Max: 43m 13s | Hits:  74%/7322  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 32m | Avg: 12m 43s | Max: 25m 38s
      🟩 nvcc               Pass: 100%/106 | Total: 18h 45m | Avg: 10m 37s | Max: 44m 57s | Hits:  65%/9504  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 58s | Avg:  4m 29s | Max:  6m 26s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 25s | Avg:  5m 48s | Max:  6m 09s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  4m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 47s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  4m 40s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  5m 10s
      🟩 Clang15            Pass: 100%/4   | Total: 32m 43s | Avg:  8m 10s | Max: 18m 59s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 20s | Avg:  5m 05s | Max:  6m 14s
      🟩 Clang17            Pass: 100%/4   | Total: 26m 49s | Avg:  6m 42s | Max: 13m 23s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 13m | Avg: 10m 46s | Max: 25m 38s
      🟩 GCC6               Pass: 100%/2   | Total: 31m 12s | Avg: 15m 36s | Max: 27m 13s
      🟩 GCC7               Pass: 100%/6   | Total: 56m 16s | Avg:  9m 22s | Max: 23m 51s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 15m | Avg: 12m 35s | Max: 19m 01s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 06m | Avg: 11m 03s | Max: 26m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 26m 03s | Avg:  6m 30s | Max: 13m 20s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 26m | Avg: 20m 52s | Max: 31m 16s
      🟩 GCC12              Pass: 100%/4   | Total: 37m 05s | Avg:  9m 16s | Max: 24m 55s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 03m | Avg: 10m 47s | Max: 27m 04s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 04m | Avg: 21m 28s | Max: 32m 04s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 37m 00s | Avg: 37m 00s | Max: 37m 00s | Hits:  34%/2182  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 32s | Avg: 12m 16s | Max: 12m 47s | Hits:  99%/4727  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 43m 13s | Avg: 43m 13s | Max: 43m 13s | Hits:  29%/2595  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  6h 29m | Avg:  7m 05s | Max: 25m 38s
      🟩 GCC                Pass: 100%/52  | Total: 10h 22m | Avg: 11m 57s | Max: 31m 16s
      🟩 Intel              Pass: 100%/3   | Total:  1h 04m | Avg: 21m 28s | Max: 32m 04s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 44m | Avg: 26m 11s | Max: 43m 13s | Hits:  65%/9504  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 37m | Avg: 24m 16s | Max: 44m 57s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 21h 18m | Avg: 10m 50s | Max: 44m 57s | Hits:  65%/9504  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 19h 00m | Avg: 10m 21s | Max: 44m 57s | Hits:  65%/9504  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 25m | Avg: 21m 21s | Max: 27m 04s
      🟩 Test               Pass: 100%/3   | Total: 50m 56s | Avg: 16m 58s | Max: 21m 02s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 16m | Avg: 25m 25s | Max: 31m 16s
      🟩 90                 Pass: 100%/4   | Total: 42m 49s | Avg: 10m 42s | Max: 12m 51s
      🟩 90a                Pass: 100%/8   | Total: 56m 35s | Avg:  7m 04s | Max: 12m 50s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  4h 46m | Avg:  8m 57s | Max: 27m 13s
      🟩 14                 Pass: 100%/32  | Total:  6h 02m | Avg: 11m 19s | Max: 37m 00s | Hits:  67%/4467  
      🟩 17                 Pass: 100%/30  | Total:  5h 36m | Avg: 11m 13s | Max: 32m 04s | Hits:  99%/2442  
      🟩 20                 Pass: 100%/23  | Total:  4h 50m | Avg: 12m 36s | Max: 44m 57s | Hits:  29%/2595  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 22m | Avg: 6m 41s | Max: 27m 42s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 14m 32s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 11h 45m | Avg:  6m 50s | Max: 27m 42s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 37m 36s | Avg:  4m 42s | Max:  5m 08s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 17m | Avg:  5m 09s | Max: 17m 20s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 12.5               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
      🟩 12.6               Pass: 100%/89  | Total:  9h 48m | Avg:  6m 36s | Max: 27m 42s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 13s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 17m | Avg:  5m 09s | Max: 17m 20s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 28m | Avg:  6m 41s | Max: 27m 42s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 13s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 03m | Avg:  6m 45s | Max: 27m 42s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 47s | Avg:  5m 07s | Max:  6m 47s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 20s | Avg:  6m 46s | Max:  7m 56s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 35s | Avg:  5m 23s | Max:  5m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 35s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 30s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 34s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 36s | Avg:  5m 24s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 31s | Avg:  5m 07s | Max:  5m 16s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 47s | Avg:  5m 11s | Max:  5m 27s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 19m | Avg:  7m 10s | Max: 27m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 23s
      🟩 GCC7               Pass: 100%/6   | Total: 29m 18s | Avg:  4m 53s | Max:  5m 38s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 53s | Avg:  4m 38s | Max:  5m 22s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 46s | Avg:  4m 47s | Max:  5m 32s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 13s | Avg:  5m 18s | Max:  5m 41s
      🟩 GCC11              Pass: 100%/7   | Total: 37m 59s | Avg:  5m 25s | Max:  5m 45s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 00s | Avg:  5m 30s | Max:  5m 44s
      🟩 GCC13              Pass: 100%/16  | Total:  1h 52m | Avg:  7m 00s | Max: 14m 58s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 23s | Avg:  6m 47s | Max:  7m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 20s | Avg: 17m 20s | Max: 17m 20s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 38m 17s | Avg: 19m 08s | Max: 21m 27s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 40m 32s | Avg: 20m 16s | Max: 22m 48s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 37m | Avg:  5m 46s | Max: 27m 42s
      🟩 GCC                Pass: 100%/51  | Total:  4h 48m | Avg:  5m 38s | Max: 14m 58s
      🟩 Intel              Pass: 100%/3   | Total: 20m 23s | Avg:  6m 47s | Max:  7m 12s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 13s | Max: 22m 48s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 00m | Avg: 15m 14s | Max: 16m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 22m | Avg:  6m 41s | Max: 27m 42s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 26m | Avg:  6m 05s | Max: 21m 27s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 19s | Avg: 11m 34s | Max: 22m 48s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 10m | Avg: 17m 31s | Max: 27m 42s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  5m 45s
      🟩 90a                Pass: 100%/4   | Total: 18m 11s | Avg:  4m 32s | Max:  4m 52s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 44m | Avg:  5m 29s | Max: 13m 58s
      🟩 14                 Pass: 100%/29  | Total:  3h 02m | Avg:  6m 18s | Max: 17m 20s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 55m | Avg:  6m 30s | Max: 21m 27s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 19m | Avg:  8m 40s | Max: 27m 42s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 14h 38m | Avg: 7m 58s | Max: 57m 49s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 13h 59m | Avg:  8m 14s | Max: 57m 49s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 38m 09s | Avg:  4m 46s | Max:  5m 37s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 59m | Avg:  7m 58s | Max: 48m 57s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 12.5               Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
      🟩 12.6               Pass: 100%/88  | Total: 11h 44m | Avg:  8m 00s | Max: 57m 49s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 22s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 59m | Avg:  7m 58s | Max: 48m 57s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 27m | Avg:  8m 11s | Max: 57m 49s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 22s
      🟩 nvcc               Pass: 100%/106 | Total: 14h 21m | Avg:  8m 07s | Max: 57m 49s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 18s | Avg:  5m 13s | Max:  6m 40s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 23s | Avg:  6m 27s | Max:  7m 18s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 42s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 32s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 18s | Avg:  5m 19s | Max:  5m 27s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 13s | Avg:  5m 03s | Max:  5m 12s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 24s | Avg:  5m 21s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  6m 00s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 48s | Avg:  5m 12s | Max:  5m 29s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 48m | Avg:  9m 49s | Max: 46m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 33s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 13s | Max: 48m 57s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 50s | Avg:  4m 48s | Max:  5m 18s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 33s | Avg:  4m 55s | Max:  5m 37s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 18s | Avg:  5m 19s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 06s | Avg:  5m 26s | Max:  5m 39s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 14m | Avg: 18m 33s | Max: 57m 49s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 09m | Avg: 11m 50s | Max: 40m 59s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 13s | Avg:  6m 04s | Max:  6m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 13m 46s | Avg: 13m 46s | Max: 13m 46s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 58s | Avg: 12m 29s | Max: 12m 29s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 42s | Avg: 13m 42s | Max: 13m 42s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  5h 06m | Avg:  6m 23s | Max: 46m 42s
      🟩 GCC                Pass: 100%/51  | Total:  7h 43m | Avg:  9m 05s | Max: 57m 49s
      🟩 Intel              Pass: 100%/3   | Total: 18m 13s | Avg:  6m 04s | Max:  6m 12s
      🟩 MSVC               Pass: 100%/4   | Total: 52m 26s | Avg: 13m 06s | Max: 13m 46s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 37m 11s | Avg:  9m 17s | Max:  9m 41s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 14h 38m | Avg:  7m 58s | Max: 57m 49s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 11h 08m | Avg:  6m 33s | Max: 57m 49s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 27m 36s | Avg: 27m 36s | Max: 27m 36s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 18s | Avg: 16m 18s | Max: 16m 18s
      🟩 HostLaunch         Pass: 100%/3   | Total: 57m 16s | Avg: 19m 05s | Max: 20m 08s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 48m | Avg: 36m 02s | Max: 46m 42s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 38s | Avg:  5m 32s | Max:  5m 39s
      🟩 90a                Pass: 100%/4   | Total: 16m 36s | Avg:  4m 09s | Max:  4m 14s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 01m | Avg:  6m 03s | Max: 20m 26s
      🟩 14                 Pass: 100%/29  | Total:  2h 48m | Avg:  5m 48s | Max: 13m 46s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  4h 11m | Avg:  9m 19s | Max: 57m 49s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  4h 36m | Avg: 11m 30s | Max: 46m 42s | Hits:  99%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 19m | Avg: 4m 48s | Max: 21m 04s | Hits: 92%/292

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 09m | Avg:  4m 59s | Max: 21m 04s | Hits:  92%/292   
      🟩 arm64              Pass: 100%/4   | Total: 10m 28s | Avg:  2m 37s | Max:  2m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 20m 49s | Hits:  91%/146   
      🟩 12.5               Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
      🟩 12.6               Pass: 100%/33  | Total:  2h 32m | Avg:  4m 37s | Max: 21m 04s | Hits:  92%/146   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 20m 49s | Hits:  91%/146   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 32m | Avg:  4m 37s | Max: 21m 04s | Hits:  92%/146   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 19m | Avg:  4m 48s | Max: 21m 04s | Hits:  92%/292   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 39s | Avg:  3m 19s | Max:  3m 23s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 03s | Avg:  3m 31s | Max:  3m 47s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 52s | Avg:  2m 58s | Max:  3m 16s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 00s | Avg:  3m 00s | Max:  3m 17s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 29s | Avg:  3m 07s | Max:  3m 23s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 51s | Avg:  7m 27s | Max: 20m 49s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 42s | Avg:  3m 21s | Max:  3m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 45s | Avg:  2m 56s | Max:  3m 14s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 19s
      🟩 Clang18            Pass: 100%/2   | Total: 19m 37s | Avg:  9m 48s | Max: 16m 32s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 56s | Avg:  2m 58s | Max:  3m 01s
      🟩 GCC10              Pass: 100%/4   | Total: 12m 29s | Avg:  3m 07s | Max:  3m 30s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 19s | Avg:  2m 49s | Max:  2m 52s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 11m | Avg: 10m 12s | Max: 21m 04s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 50s | Avg:  2m 36s | Max:  2m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 48s | Avg:  7m 48s | Max:  7m 48s | Hits:  91%/146   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  7m 59s | Avg:  7m 59s | Max:  7m 59s | Hits:  92%/146   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 04m | Avg:  4m 08s | Max: 20m 49s
      🟩 GCC                Pass: 100%/20  | Total:  1h 49m | Avg:  5m 27s | Max: 21m 04s
      🟩 MSVC               Pass: 100%/2   | Total: 15m 47s | Avg:  7m 53s | Max:  7m 59s | Hits:  92%/292   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 19m | Avg:  4m 48s | Max: 21m 04s | Hits:  92%/292   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 42m | Avg:  3m 19s | Max:  7m 59s | Hits:  92%/292   
      🟩 Test               Pass: 100%/5   | Total:  1h 37m | Avg: 19m 28s | Max: 21m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 90a                Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 06m | Avg:  4m 22s | Max: 21m 04s
      🟩 20                 Pass: 100%/25  | Total:  2h 13m | Avg:  5m 19s | Max: 20m 49s | Hits:  92%/292   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 54s | Avg: 4m 57s | Max: 7m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 06s | Avg: 15m 06s | Max: 15m 06s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@ericniebler ericniebler marked this pull request as ready for review November 22, 2024 19:36
@ericniebler ericniebler requested a review from a team as a code owner November 22, 2024 19:36
@ericniebler ericniebler changed the title [WIP] new type-erased memory resources new type-erased memory resources Nov 22, 2024
Copy link
Contributor

🟩 CI finished in 2h 08m: Pass: 100%/396 | Total: 7d 14h | Avg: 27m 37s | Max: 1h 11m | Hits: 61%/22104
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 1d 04h | Avg: 14m 33s | Max: 58m 56s | Hits: 52%/9524

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 03h | Avg: 14m 49s | Max: 58m 56s | Hits:  52%/9524  
      🟩 arm64              Pass: 100%/8   | Total:  1h 27m | Avg: 10m 55s | Max: 17m 05s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 33m | Avg: 10m 14s | Max: 36m 39s | Hits:  34%/2187  
      🟩 11.8               Pass: 100%/3   | Total:  1h 06m | Avg: 22m 04s | Max: 28m 38s
      🟩 12.5               Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 37m 30s
      🟩 12.6               Pass: 100%/96  | Total: 22h 45m | Avg: 14m 13s | Max: 58m 56s | Hits:  57%/7337  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 29m | Avg: 12m 25s | Max: 19m 30s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 33m | Avg: 10m 14s | Max: 36m 39s | Hits:  34%/2187  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 06m | Avg: 22m 04s | Max: 28m 38s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 37m 30s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 20h 16m | Avg: 14m 28s | Max: 58m 56s | Hits:  57%/7337  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 29m | Avg: 12m 25s | Max: 19m 30s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 02h | Avg: 14m 48s | Max: 58m 56s | Hits:  52%/9524  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 11m | Avg: 11m 56s | Max: 21m 07s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 35s | Avg:  5m 51s | Max:  6m 24s
      🟩 Clang11            Pass: 100%/4   | Total: 42m 20s | Avg: 10m 35s | Max: 17m 28s
      🟩 Clang12            Pass: 100%/4   | Total: 46m 10s | Avg: 11m 32s | Max: 19m 07s
      🟩 Clang13            Pass: 100%/4   | Total: 38m 30s | Avg:  9m 37s | Max: 14m 26s
      🟩 Clang14            Pass: 100%/4   | Total: 46m 35s | Avg: 11m 38s | Max: 19m 39s
      🟩 Clang15            Pass: 100%/4   | Total: 44m 12s | Avg: 11m 03s | Max: 18m 32s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 09m | Avg: 17m 19s | Max: 19m 58s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 06m | Avg: 16m 44s | Max: 19m 29s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 46m | Avg: 12m 35s | Max: 21m 39s
      🟩 GCC6               Pass: 100%/2   | Total: 33m 39s | Avg: 16m 49s | Max: 22m 24s
      🟩 GCC7               Pass: 100%/6   | Total: 32m 38s | Avg:  5m 26s | Max: 14m 40s
      🟩 GCC8               Pass: 100%/6   | Total: 58m 23s | Avg:  9m 43s | Max: 17m 40s
      🟩 GCC9               Pass: 100%/6   | Total: 40m 34s | Avg:  6m 45s | Max: 18m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 31m 14s | Avg:  7m 48s | Max: 18m 38s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 03m | Avg: 17m 35s | Max: 28m 38s
      🟩 GCC12              Pass: 100%/4   | Total: 58m 31s | Avg: 14m 37s | Max: 21m 11s
      🟩 GCC13              Pass: 100%/17  | Total:  6h 00m | Avg: 21m 12s | Max: 58m 56s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 55m 25s | Avg: 18m 28s | Max: 26m 08s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 36m 39s | Avg: 36m 39s | Max: 36m 39s | Hits:  34%/2187  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 10m | Avg: 35m 26s | Max: 36m 41s | Hits:  35%/4737  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 07s | Avg: 14m 07s | Max: 14m 07s | Hits:  97%/2600  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 37m 30s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total: 11h 09m | Avg: 12m 10s | Max: 21m 39s
      🟩 GCC                Pass: 100%/52  | Total: 12h 18m | Avg: 14m 12s | Max: 58m 56s
      🟩 Intel              Pass: 100%/3   | Total: 55m 25s | Avg: 18m 28s | Max: 26m 08s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 01m | Avg: 30m 24s | Max: 36m 41s | Hits:  52%/9524  
      🟩 NVHPC              Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 37m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  1d 04h | Avg: 14m 33s | Max: 58m 56s | Hits:  52%/9524  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 23h 33m | Avg: 12m 50s | Max: 37m 30s | Hits:  52%/9524  
      🟩 NVRTC              Pass: 100%/4   | Total:  3h 34m | Avg: 53m 33s | Max: 58m 02s
      🟩 Test               Pass: 100%/3   | Total:  1h 29m | Avg: 29m 42s | Max: 58m 56s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 49s | Avg:  1m 49s | Max:  1m 49s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 06m | Avg: 22m 04s | Max: 28m 38s
      🟩 90                 Pass: 100%/4   | Total: 40m 03s | Avg: 10m 00s | Max: 11m 45s
      🟩 90a                Pass: 100%/8   | Total: 59m 13s | Avg:  7m 24s | Max: 13m 12s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  6h 06m | Avg: 11m 26s | Max: 44m 16s
      🟩 14                 Pass: 100%/32  | Total:  7h 32m | Avg: 14m 08s | Max: 53m 56s | Hits:  37%/4477  
      🟩 17                 Pass: 100%/30  | Total:  8h 09m | Avg: 16m 18s | Max: 57m 59s | Hits:  30%/2447  
      🟩 20                 Pass: 100%/23  | Total:  6h 48m | Avg: 17m 45s | Max: 58m 56s | Hits:  97%/2600  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 2d 11h | Avg: 31m 53s | Max: 1h 01m | Hits: 70%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 52m 25s | Avg: 26m 12s | Max: 31m 17s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total:  2d 06h | Avg: 31m 58s | Max:  1h 01m | Hits:  70%/9260  
      🟩 arm64              Pass: 100%/8   | Total:  4h 07m | Avg: 30m 52s | Max: 37m 24s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  6h 27m | Avg: 25m 49s | Max: 54m 42s | Hits:  63%/1852  
      🟩 11.8               Pass: 100%/3   | Total:  1h 35m | Avg: 31m 46s | Max: 45m 37s
      🟩 12.5               Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 56m 39s
      🟩 12.6               Pass: 100%/89  | Total:  1d 23h | Avg: 31m 54s | Max:  1h 01m | Hits:  72%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 46m | Avg: 26m 42s | Max: 32m 31s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  6h 27m | Avg: 25m 49s | Max: 54m 42s | Hits:  63%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 35m | Avg: 31m 46s | Max: 45m 37s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 56m 39s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  1d 21h | Avg: 32m 09s | Max:  1h 01m | Hits:  72%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 46m | Avg: 26m 42s | Max: 32m 31s
      🟩 nvcc               Pass: 100%/107 | Total:  2d 09h | Avg: 32m 05s | Max:  1h 01m | Hits:  70%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 06m | Avg: 31m 04s | Max: 35m 02s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 42m | Avg: 34m 01s | Max: 37m 28s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 16s | Max: 36m 20s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 14s | Max: 37m 32s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 08s | Max: 34m 58s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 12m | Avg: 33m 02s | Max: 36m 23s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 56s | Max: 35m 02s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 25m | Avg: 36m 27s | Max: 46m 19s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 44s | Max: 36m 36s
      🟩 Clang18            Pass: 100%/11  | Total:  4h 42m | Avg: 25m 41s | Max: 35m 03s
      🟩 GCC6               Pass: 100%/2   | Total: 34m 45s | Avg: 17m 22s | Max: 30m 59s
      🟩 GCC7               Pass: 100%/6   | Total:  2h 45m | Avg: 27m 36s | Max: 35m 03s
      🟩 GCC8               Pass: 100%/6   | Total:  2h 47m | Avg: 27m 59s | Max: 37m 05s
      🟩 GCC9               Pass: 100%/6   | Total:  2h 46m | Avg: 27m 48s | Max: 36m 48s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 11m | Avg: 32m 47s | Max: 34m 57s
      🟩 GCC11              Pass: 100%/7   | Total:  3h 54m | Avg: 33m 34s | Max: 45m 37s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 21m | Avg: 35m 21s | Max: 38m 29s
      🟩 GCC13              Pass: 100%/16  | Total:  6h 33m | Avg: 24m 36s | Max: 40m 24s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 58m | Avg: 39m 34s | Max: 42m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 54m 42s | Avg: 54m 42s | Max: 54m 42s | Hits:  63%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 03s | Max: 57m 48s | Hits:  63%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 19s | Max:  1h 01m | Hits:  81%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 56m 39s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 01h | Avg: 31m 33s | Max: 46m 19s
      🟩 GCC                Pass: 100%/51  | Total: 23h 56m | Avg: 28m 10s | Max: 45m 37s
      🟩 Intel              Pass: 100%/3   | Total:  1h 58m | Avg: 39m 34s | Max: 42m 33s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 13m | Avg: 50m 41s | Max:  1h 01m | Hits:  70%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 37m | Avg: 54m 20s | Max: 56m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total:  2d 11h | Avg: 31m 53s | Max:  1h 01m | Hits:  70%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  2d 08h | Avg: 33m 08s | Max:  1h 01m | Hits:  63%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 12s | Avg: 11m 18s | Max: 23m 00s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 21m | Avg: 20m 21s | Max: 27m 03s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 35m | Avg: 31m 46s | Max: 45m 37s
      🟩 90a                Pass: 100%/4   | Total:  1h 22m | Avg: 20m 39s | Max: 22m 37s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 11h 52m | Avg: 23m 44s | Max: 48m 03s
      🟩 14                 Pass: 100%/29  | Total: 17h 22m | Avg: 35m 56s | Max: 56m 39s | Hits:  63%/3704  
      🟩 17                 Pass: 100%/27  | Total: 16h 32m | Avg: 36m 45s | Max: 57m 48s | Hits:  63%/1852  
      🟩 20                 Pass: 100%/23  | Total: 12h 20m | Avg: 32m 12s | Max:  1h 01m | Hits:  81%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 3d 17h | Avg: 48m 47s | Max: 1h 11m | Hits: 65%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  3d 10h | Avg: 48m 20s | Max:  1h 11m | Hits:  65%/3028  
      🟩 arm64              Pass: 100%/8   | Total:  7h 16m | Avg: 54m 31s | Max: 57m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  8h 42m | Avg: 34m 49s | Max: 52m 07s | Hits:  65%/757   
      🟩 11.8               Pass: 100%/3   | Total:  2h 32m | Avg: 50m 59s | Max:  1h 11m
      🟩 12.5               Pass: 100%/4   | Total:  4h 20m | Avg:  1h 05m | Max:  1h 09m
      🟩 12.6               Pass: 100%/88  | Total:  3d 01h | Avg: 50m 22s | Max:  1h 09m | Hits:  65%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  3h 52m | Avg: 58m 03s | Max:  1h 03m
      🟩 nvcc11.1           Pass: 100%/15  | Total:  8h 42m | Avg: 34m 49s | Max: 52m 07s | Hits:  65%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 32m | Avg: 50m 59s | Max:  1h 11m
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 20m | Avg:  1h 05m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 22h | Avg: 50m 00s | Max:  1h 09m | Hits:  65%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  3h 52m | Avg: 58m 03s | Max:  1h 03m
      🟩 nvcc               Pass: 100%/106 | Total:  3d 13h | Avg: 48m 26s | Max:  1h 11m | Hits:  65%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 53m | Avg: 48m 58s | Max: 55m 50s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 45m | Avg: 55m 06s | Max: 58m 13s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 54s | Max: 58m 54s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 05s | Max: 51m 36s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 17s | Max: 57m 14s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 59s | Max: 55m 54s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 48s | Max: 56m 26s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 33m | Avg: 53m 28s | Max: 58m 43s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 53m | Avg: 58m 22s | Max:  1h 09m
      🟩 Clang18            Pass: 100%/11  | Total:  9h 03m | Avg: 49m 22s | Max:  1h 03m
      🟩 GCC6               Pass: 100%/2   | Total: 50m 20s | Avg: 25m 10s | Max: 45m 56s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 11m | Avg: 41m 56s | Max: 55m 33s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 14m | Avg: 42m 22s | Max: 54m 31s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 22m | Avg: 43m 49s | Max: 57m 35s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 46m | Avg: 56m 36s | Max: 58m 41s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 03m | Avg: 51m 58s | Max:  1h 11m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 45m | Avg: 56m 25s | Max: 58m 34s
      🟩 GCC13              Pass: 100%/16  | Total:  9h 08m | Avg: 34m 15s | Max: 57m 49s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 55m | Avg: 58m 34s | Max:  1h 00m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 52m 07s | Avg: 52m 07s | Max: 52m 07s | Hits:  65%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 02s | Max: 57m 20s | Hits:  65%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  65%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 20m | Avg:  1h 05m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 18h | Avg: 52m 32s | Max:  1h 09m
      🟩 GCC                Pass: 100%/51  | Total:  1d 12h | Avg: 42m 48s | Max:  1h 11m
      🟩 Intel              Pass: 100%/3   | Total:  2h 55m | Avg: 58m 34s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  3h 46m | Avg: 56m 43s | Max:  1h 00m | Hits:  65%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 20m | Avg:  1h 05m | Max:  1h 09m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  3d 17h | Avg: 48m 47s | Max:  1h 11m | Hits:  65%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  3d 14h | Avg: 50m 59s | Max:  1h 11m | Hits:  65%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 48s | Avg: 19m 48s | Max: 19m 48s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 13s | Avg: 16m 13s | Max: 16m 13s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 34s | Avg: 18m 51s | Max: 19m 48s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 13m | Avg: 24m 29s | Max: 26m 47s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 32m | Avg: 50m 59s | Max:  1h 11m
      🟩 90a                Pass: 100%/4   | Total:  1h 32m | Avg: 23m 07s | Max: 24m 42s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 21h 02m | Avg: 42m 05s | Max:  1h 02m
      🟩 14                 Pass: 100%/29  | Total:  1d 01h | Avg: 53m 03s | Max:  1h 10m | Hits:  65%/1514  
      🟩 17                 Pass: 100%/27  | Total:  1d 00h | Avg: 53m 32s | Max:  1h 11m | Hits:  65%/757   
      🟩 20                 Pass: 100%/24  | Total: 18h 40m | Avg: 46m 41s | Max:  1h 09m | Hits:  65%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 45m | Avg: 5m 17s | Max: 18m 04s | Hits: 13%/292

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 30m | Avg:  5m 24s | Max: 18m 04s | Hits:  13%/292   
      🟩 arm64              Pass: 100%/4   | Total: 15m 11s | Avg:  3m 47s | Max:  4m 03s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 41m | Avg:  5m 21s | Max: 18m 04s | Hits:  13%/146   
      🟩 12.5               Pass: 100%/2   | Total: 12m 46s | Avg:  6m 23s | Max:  6m 33s
      🟩 12.6               Pass: 100%/33  | Total:  2h 51m | Avg:  5m 11s | Max: 16m 19s | Hits:  13%/146   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 41m | Avg:  5m 21s | Max: 18m 04s | Hits:  13%/146   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 46s | Avg:  6m 23s | Max:  6m 33s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 51m | Avg:  5m 11s | Max: 16m 19s | Hits:  13%/146   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 45m | Avg:  5m 17s | Max: 18m 04s | Hits:  13%/292   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 44s | Avg:  3m 52s | Max:  3m 57s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 52s | Avg:  3m 56s | Max:  4m 22s
      🟩 Clang11            Pass: 100%/4   | Total: 14m 37s | Avg:  3m 39s | Max:  3m 56s
      🟩 Clang12            Pass: 100%/4   | Total: 14m 28s | Avg:  3m 37s | Max:  3m 58s
      🟩 Clang13            Pass: 100%/4   | Total: 14m 49s | Avg:  3m 42s | Max:  4m 00s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 01s | Avg:  7m 15s | Max: 18m 04s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 50s
      🟩 Clang16            Pass: 100%/4   | Total: 15m 30s | Avg:  3m 52s | Max:  4m 20s
      🟩 Clang17            Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 16s
      🟩 Clang18            Pass: 100%/2   | Total: 20m 08s | Avg: 10m 04s | Max: 16m 19s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 14m 29s | Avg:  3m 37s | Max:  4m 03s
      🟩 GCC11              Pass: 100%/4   | Total: 14m 42s | Avg:  3m 40s | Max:  3m 50s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 03m | Avg:  9m 02s | Max: 16m 30s
      🟩 GCC13              Pass: 100%/3   | Total: 10m 30s | Avg:  3m 30s | Max:  3m 50s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 47s | Avg: 10m 47s | Max: 10m 47s | Hits:  13%/146   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 43s | Avg: 11m 43s | Max: 11m 43s | Hits:  13%/146   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 46s | Avg:  6m 23s | Max:  6m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 20m | Avg:  4m 40s | Max: 18m 04s
      🟩 GCC                Pass: 100%/20  | Total:  1h 50m | Avg:  5m 31s | Max: 16m 30s
      🟩 MSVC               Pass: 100%/2   | Total: 22m 30s | Avg: 11m 15s | Max: 11m 43s | Hits:  13%/292   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 46s | Avg:  6m 23s | Max:  6m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 45m | Avg:  5m 17s | Max: 18m 04s | Hits:  13%/292   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  3h 23m | Avg:  4m 09s | Max: 11m 43s | Hits:  13%/292   
      🟩 Test               Pass: 100%/5   | Total:  1h 22m | Avg: 16m 30s | Max: 18m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 01s | Avg:  3m 01s | Max:  3m 01s
      🟩 90a                Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 15m | Avg:  4m 39s | Max: 16m 30s
      🟩 20                 Pass: 100%/25  | Total:  2h 30m | Avg:  6m 01s | Max: 18m 04s | Hits:  13%/292   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 57s | Avg: 4m 58s | Max: 7m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 59s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
      🟩 Test               Pass: 100%/1   | Total:  7m 59s | Avg:  7m 59s | Max:  7m 59s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

@pciolkosz pciolkosz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The memory resource part looks good.
I will try to review the basic_any part in its PR too

cudax/test/memory_resource/any_resource.cu Outdated Show resolved Hide resolved
Copy link
Contributor

🟨 CI finished in 4h 15m: Pass: 99%/396 | Total: 3d 01h | Avg: 11m 11s | Max: 1h 41m | Hits: 69%/21812
  • 🟨 cudax: Pass: 96%/54 | Total: 5h 29m | Avg: 6m 05s | Max: 19m 13s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  96%/50  | Total:  5h 12m | Avg:  6m 15s | Max: 19m 13s
      🟩 arm64              Pass: 100%/4   | Total: 16m 37s | Avg:  4m 09s | Max:  4m 22s
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/30  | Total:  2h 41m | Avg:  5m 23s | Max: 17m 48s
      🟩 GCC                Pass: 100%/20  | Total:  2h 08m | Avg:  6m 25s | Max: 19m 13s
      🔥 MSVC               Pass:   0%/2   | Total: 24m 03s | Avg: 12m 01s | Max: 12m 22s
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 57s | Avg:  7m 28s | Max:  7m 36s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/49  | Total:  4h 00m | Avg:  4m 54s | Max: 12m 22s
      🟩 Test               Pass: 100%/5   | Total:  1h 28m | Avg: 17m 42s | Max: 19m 13s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/29  | Total:  2h 39m | Avg:  5m 29s | Max: 19m 13s
      🔍 20                 Pass:  92%/25  | Total:  2h 50m | Avg:  6m 48s | Max: 18m 02s
    🟨 ctk
      🟨 12.0               Pass:  94%/19  | Total:  1h 57m | Avg:  6m 11s | Max: 19m 13s
      🟩 12.5               Pass: 100%/2   | Total: 14m 57s | Avg:  7m 28s | Max:  7m 36s
      🟨 12.6               Pass:  96%/33  | Total:  3h 16m | Avg:  5m 57s | Max: 18m 02s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  94%/19  | Total:  1h 57m | Avg:  6m 11s | Max: 19m 13s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 14m 57s | Avg:  7m 28s | Max:  7m 36s
      🟨 nvcc12.6           Pass:  96%/33  | Total:  3h 16m | Avg:  5m 57s | Max: 18m 02s
    🟨 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  9m 13s | Avg:  4m 36s | Max:  4m 56s
      🟩 Clang10            Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 53s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 28s | Avg:  4m 22s | Max:  4m 50s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 43s | Avg:  4m 40s | Max:  5m 00s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 16s | Avg:  4m 34s | Max:  5m 03s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 38s | Avg:  7m 24s | Max: 16m 32s
      🟩 Clang15            Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  4m 54s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 01s
      🟩 Clang17            Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 44s
      🟩 Clang18            Pass: 100%/2   | Total: 22m 27s | Avg: 11m 13s | Max: 17m 48s
      🟩 GCC9               Pass: 100%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  4m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 27s | Avg:  4m 21s | Max:  4m 28s
      🟩 GCC11              Pass: 100%/4   | Total: 18m 16s | Avg:  4m 34s | Max:  4m 50s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 11m | Avg: 10m 14s | Max: 19m 13s
      🟩 GCC13              Pass: 100%/3   | Total: 11m 59s | Avg:  3m 59s | Max:  4m 22s
      🟥 MSVC14.36          Pass:   0%/1   | Total: 12m 22s | Avg: 12m 22s | Max: 12m 22s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 11m 41s | Avg: 11m 41s | Max: 11m 41s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 14m 57s | Avg:  7m 28s | Max:  7m 36s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  96%/54  | Total:  5h 29m | Avg:  6m 05s | Max: 19m 13s
    🟨 gpu
      🟨 v100               Pass:  96%/54  | Total:  5h 29m | Avg:  6m 05s | Max: 19m 13s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 90a                Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s
    
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 1d 17h | Avg: 21m 06s | Max: 1h 41m | Hits: 31%/9524

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 15h | Avg: 21m 21s | Max:  1h 41m | Hits:  31%/9524  
      🟩 arm64              Pass: 100%/8   | Total:  2h 21m | Avg: 17m 39s | Max: 25m 16s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  5h 15m | Avg: 21m 01s | Max: 41m 16s | Hits:  34%/2187  
      🟩 11.8               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 21s | Max: 31m 03s
      🟩 12.5               Pass: 100%/4   | Total:  2h 42m | Avg: 40m 36s | Max: 50m 36s
      🟩 12.6               Pass: 100%/96  | Total:  1d 08h | Avg: 20m 10s | Max:  1h 41m | Hits:  31%/7337  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 30m | Avg: 12m 34s | Max: 19m 03s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  5h 15m | Avg: 21m 01s | Max: 41m 16s | Hits:  34%/2187  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 16m | Avg: 25m 21s | Max: 31m 03s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  2h 42m | Avg: 40m 36s | Max: 50m 36s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  1d 05h | Avg: 21m 15s | Max:  1h 41m | Hits:  31%/7337  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 30m | Avg: 12m 34s | Max: 19m 03s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 14h | Avg: 22m 04s | Max:  1h 41m | Hits:  31%/9524  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  2h 04m | Avg: 20m 46s | Max: 30m 14s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 12m | Avg: 24m 10s | Max: 28m 25s
      🟩 Clang11            Pass: 100%/4   | Total: 55m 05s | Avg: 13m 46s | Max: 25m 25s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 34m | Avg: 23m 33s | Max: 27m 12s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 17m | Avg: 19m 25s | Max: 26m 04s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 36m | Avg: 24m 05s | Max: 29m 25s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 36m | Avg: 24m 01s | Max: 29m 32s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 17m | Avg: 19m 21s | Max: 29m 50s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 33m | Avg: 23m 26s | Max: 28m 22s
      🟩 Clang18            Pass: 100%/18  | Total:  5h 02m | Avg: 16m 48s | Max:  1h 15m
      🟩 GCC6               Pass: 100%/2   | Total: 41m 28s | Avg: 20m 44s | Max: 22m 50s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 37m | Avg: 16m 11s | Max: 27m 53s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 31m | Avg: 15m 18s | Max: 28m 02s
      🟩 GCC9               Pass: 100%/6   | Total: 58m 51s | Avg:  9m 48s | Max: 25m 11s
      🟩 GCC10              Pass: 100%/4   | Total:  1h 33m | Avg: 23m 21s | Max: 29m 09s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 28m | Avg: 21m 08s | Max: 31m 03s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 21m | Avg: 20m 25s | Max: 27m 59s
      🟩 GCC13              Pass: 100%/17  | Total:  6h 16m | Avg: 22m 07s | Max:  1h 41m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 16m | Avg: 25m 37s | Max: 31m 19s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 41m 16s | Avg: 41m 16s | Max: 41m 16s | Hits:  34%/2187  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 22m | Avg: 41m 21s | Max: 42m 18s | Hits:  31%/4737  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 48m 21s | Avg: 48m 21s | Max: 48m 21s | Hits:  29%/2600  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  2h 42m | Avg: 40m 36s | Max: 50m 36s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total: 18h 10m | Avg: 19m 49s | Max:  1h 15m
      🟩 GCC                Pass: 100%/52  | Total: 16h 28m | Avg: 19m 00s | Max:  1h 41m
      🟩 Intel              Pass: 100%/3   | Total:  1h 16m | Avg: 25m 37s | Max: 31m 19s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 52m | Avg: 43m 04s | Max: 48m 21s | Hits:  31%/9524  
      🟩 NVHPC              Pass: 100%/4   | Total:  2h 42m | Avg: 40m 36s | Max: 50m 36s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  1d 17h | Avg: 21m 06s | Max:  1h 41m | Hits:  31%/9524  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total:  1d 12h | Avg: 19m 42s | Max: 50m 36s | Hits:  31%/9524  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 35m | Avg: 23m 54s | Max: 26m 31s
      🟩 Test               Pass: 100%/3   | Total:  3h 44m | Avg:  1h 14m | Max:  1h 41m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 16m | Avg: 25m 21s | Max: 31m 03s
      🟩 90                 Pass: 100%/4   | Total: 42m 32s | Avg: 10m 38s | Max: 12m 36s
      🟩 90a                Pass: 100%/8   | Total:  1h 09m | Avg:  8m 41s | Max: 12m 06s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  8h 26m | Avg: 15m 48s | Max: 47m 07s
      🟩 14                 Pass: 100%/32  | Total: 10h 24m | Avg: 19m 30s | Max: 42m 18s | Hits:  33%/4477  
      🟩 17                 Pass: 100%/30  | Total: 11h 07m | Avg: 22m 14s | Max: 49m 00s | Hits:  30%/2447  
      🟩 20                 Pass: 100%/23  | Total: 11h 30m | Avg: 30m 02s | Max:  1h 41m | Hits:  29%/2600  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 25m | Avg: 6m 42s | Max: 26m 00s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 16s | Avg:  9m 08s | Max: 12m 12s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 11h 47m | Avg:  6m 52s | Max: 26m 00s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 37m 33s | Avg:  4m 41s | Max:  5m 04s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 23m | Avg:  5m 34s | Max: 22m 22s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 15m 30s | Avg:  5m 10s | Max:  5m 27s
      🟩 12.5               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 16m 58s
      🟩 12.6               Pass: 100%/89  | Total:  9h 42m | Avg:  6m 33s | Max: 26m 00s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 20m 06s | Avg:  5m 01s | Max:  5m 21s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 23m | Avg:  5m 34s | Max: 22m 22s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 30s | Avg:  5m 10s | Max:  5m 27s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 16m 58s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 22m | Avg:  6m 37s | Max: 26m 00s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 20m 06s | Avg:  5m 01s | Max:  5m 21s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 05m | Avg:  6m 46s | Max: 26m 00s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 56s | Avg:  5m 09s | Max:  6m 35s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 54s | Avg:  6m 38s | Max:  7m 12s
      🟩 Clang11            Pass: 100%/4   | Total: 24m 31s | Avg:  6m 07s | Max:  9m 01s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 09s | Avg:  5m 17s | Max:  5m 43s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 26s | Avg:  5m 21s | Max:  6m 03s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 44s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 13s | Avg:  5m 18s | Max:  5m 45s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 06s | Avg:  5m 16s | Max:  5m 34s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 36s | Avg:  5m 09s | Max:  5m 34s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 03m | Avg:  5m 46s | Max: 11m 57s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 27s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 36s | Avg:  4m 36s | Max:  5m 22s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 03s | Avg:  4m 50s | Max:  5m 31s
      🟩 GCC9               Pass: 100%/6   | Total: 31m 09s | Avg:  5m 11s | Max:  6m 05s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 06s | Avg:  5m 31s | Max:  5m 56s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 08s | Avg:  5m 26s | Max:  6m 16s
      🟩 GCC12              Pass: 100%/4   | Total: 23m 34s | Avg:  5m 53s | Max:  6m 07s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 01m | Avg:  7m 36s | Max: 26m 00s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 21m 13s | Avg:  7m 04s | Max:  7m 36s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 22m 22s | Avg: 22m 22s | Max: 22m 22s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 32m 08s | Avg: 16m 04s | Max: 16m 14s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 39m 18s | Avg: 19m 39s | Max: 23m 25s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 16m 58s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 25m | Avg:  5m 31s | Max: 11m 57s
      🟩 GCC                Pass: 100%/51  | Total:  5h 01m | Avg:  5m 54s | Max: 26m 00s
      🟩 Intel              Pass: 100%/3   | Total: 21m 13s | Avg:  7m 04s | Max:  7m 36s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 33m | Avg: 18m 45s | Max: 23m 25s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 16m 58s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 25m | Avg:  6m 42s | Max: 26m 00s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 37m | Avg:  6m 11s | Max: 22m 22s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 57s | Avg: 11m 29s | Max: 23m 25s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 02m | Avg: 15m 32s | Max: 26m 00s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 30s | Avg:  5m 10s | Max:  5m 27s
      🟩 90a                Pass: 100%/4   | Total: 18m 49s | Avg:  4m 42s | Max:  4m 56s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 57m | Avg:  5m 54s | Max: 26m 00s
      🟩 14                 Pass: 100%/29  | Total:  3h 10m | Avg:  6m 33s | Max: 22m 22s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 52m | Avg:  6m 23s | Max: 16m 58s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 06m | Avg:  8m 07s | Max: 23m 25s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 14h 02m | Avg: 7m 39s | Max: 53m 50s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 13h 25m | Avg:  7m 53s | Max: 53m 50s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 37m 13s | Avg:  4m 39s | Max:  4m 53s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 14m 14s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 17m 09s | Avg:  5m 43s | Max:  6m 10s
      🟩 12.5               Pass: 100%/4   | Total: 35m 18s | Avg:  8m 49s | Max:  9m 11s
      🟩 12.6               Pass: 100%/88  | Total: 11h 55m | Avg:  8m 07s | Max: 53m 50s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 17m 11s | Avg:  4m 17s | Max:  4m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 14m 14s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 17m 09s | Avg:  5m 43s | Max:  6m 10s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 18s | Avg:  8m 49s | Max:  9m 11s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 38m | Avg:  8m 18s | Max: 53m 50s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 17m 11s | Avg:  4m 17s | Max:  4m 23s
      🟩 nvcc               Pass: 100%/106 | Total: 13h 45m | Avg:  7m 47s | Max: 53m 50s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 53s | Avg:  5m 18s | Max:  6m 51s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 55s | Avg:  6m 18s | Max:  6m 28s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 24s | Avg:  5m 21s | Max:  5m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 08s | Avg:  5m 17s | Max:  5m 27s
      🟩 Clang13            Pass: 100%/4   | Total: 22m 00s | Avg:  5m 30s | Max:  5m 41s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 48s | Avg:  5m 27s | Max:  5m 57s
      🟩 Clang15            Pass: 100%/4   | Total: 22m 02s | Avg:  5m 30s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  5m 46s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 53s | Avg:  5m 28s | Max:  5m 45s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 43m | Avg:  9m 25s | Max: 43m 01s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 36s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 02s | Avg:  4m 40s | Max:  5m 19s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 12s | Avg:  4m 42s | Max:  5m 27s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 20s | Avg:  4m 53s | Max:  5m 46s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 42s | Avg:  5m 25s | Max:  5m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 27m | Avg: 12m 33s | Max: 53m 50s
      🟩 GCC12              Pass: 100%/4   | Total: 23m 05s | Avg:  5m 46s | Max:  6m 05s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 20m | Avg: 12m 33s | Max: 43m 51s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 08s | Avg:  6m 22s | Max:  6m 31s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 14s | Avg: 14m 14s | Max: 14m 14s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 52s | Avg: 13m 26s | Max: 14m 00s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 40s | Avg: 12m 40s | Max: 12m 40s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 18s | Avg:  8m 49s | Max:  9m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  5h 06m | Avg:  6m 23s | Max: 43m 01s
      🟩 GCC                Pass: 100%/51  | Total:  7h 08m | Avg:  8m 23s | Max: 53m 50s
      🟩 Intel              Pass: 100%/3   | Total: 19m 08s | Avg:  6m 22s | Max:  6m 31s
      🟩 MSVC               Pass: 100%/4   | Total: 53m 46s | Avg: 13m 26s | Max: 14m 14s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 18s | Avg:  8m 49s | Max:  9m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 14h 02m | Avg:  7m 39s | Max: 53m 50s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 10h 26m | Avg:  6m 08s | Max: 53m 50s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 11s | Avg: 20m 11s | Max: 20m 11s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 08s | Avg: 24m 08s | Max: 24m 08s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 00m | Avg: 20m 13s | Max: 23m 30s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 51m | Avg: 37m 07s | Max: 43m 51s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 17m 09s | Avg:  5m 43s | Max:  6m 10s
      🟩 90a                Pass: 100%/4   | Total: 17m 09s | Avg:  4m 17s | Max:  4m 52s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 30m | Avg:  7m 01s | Max: 43m 51s
      🟩 14                 Pass: 100%/29  | Total:  2h 52m | Avg:  5m 57s | Max: 14m 14s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  2h 36m | Avg:  5m 47s | Max: 12m 52s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  5h 03m | Avg: 12m 37s | Max: 53m 50s | Hits:  99%/757   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 49s | Avg: 5m 24s | Max: 8m 49s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  8m 49s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
      🟩 Test               Pass: 100%/1   | Total:  8m 49s | Avg:  8m 49s | Max:  8m 49s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 17s | Avg: 14m 17s | Max: 14m 17s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 3h 02m: Pass: 100%/396 | Total: 2d 01h | Avg: 7m 31s | Max: 41m 58s | Hits: 91%/22136
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 18h 58m | Avg: 9m 38s | Max: 41m 58s | Hits: 81%/9546

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 18h 11m | Avg:  9m 55s | Max: 41m 58s | Hits:  81%/9546  
      🟩 arm64              Pass: 100%/8   | Total: 46m 30s | Avg:  5m 48s | Max: 18m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 42m | Avg:  6m 48s | Max: 22m 27s | Hits:  98%/2199  
      🟩 11.8               Pass: 100%/3   | Total:  1h 03m | Avg: 21m 04s | Max: 29m 03s
      🟩 12.5               Pass: 100%/4   | Total: 57m 52s | Avg: 14m 28s | Max: 27m 55s
      🟩 12.6               Pass: 100%/96  | Total: 15h 15m | Avg:  9m 31s | Max: 41m 58s | Hits:  75%/7347  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 23m | Avg: 11m 56s | Max: 18m 52s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 42m | Avg:  6m 48s | Max: 22m 27s | Hits:  98%/2199  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 03m | Avg: 21m 04s | Max: 29m 03s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 57m 52s | Avg: 14m 28s | Max: 27m 55s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 12h 51m | Avg:  9m 11s | Max: 41m 58s | Hits:  75%/7347  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 23m | Avg: 11m 56s | Max: 18m 52s
      🟩 nvcc               Pass: 100%/106 | Total: 16h 35m | Avg:  9m 23s | Max: 41m 58s | Hits:  81%/9546  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 06m | Avg: 11m 08s | Max: 27m 54s
      🟩 Clang10            Pass: 100%/3   | Total: 29m 08s | Avg:  9m 42s | Max: 18m 55s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 09s | Avg:  4m 17s | Max:  5m 00s
      🟩 Clang12            Pass: 100%/4   | Total: 40m 37s | Avg: 10m 09s | Max: 27m 31s
      🟩 Clang13            Pass: 100%/4   | Total: 54m 29s | Avg: 13m 37s | Max: 29m 34s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 29s | Avg:  4m 22s | Max:  4m 50s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 35s | Avg:  4m 53s | Max:  5m 58s
      🟩 Clang16            Pass: 100%/4   | Total: 47m 35s | Avg: 11m 53s | Max: 20m 17s
      🟩 Clang17            Pass: 100%/4   | Total: 33m 40s | Avg:  8m 25s | Max: 20m 32s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 03m | Avg: 10m 12s | Max: 19m 16s
      🟩 GCC6               Pass: 100%/2   | Total: 26m 15s | Avg: 13m 07s | Max: 22m 25s
      🟩 GCC7               Pass: 100%/6   | Total: 20m 00s | Avg:  3m 20s | Max:  4m 04s
      🟩 GCC8               Pass: 100%/6   | Total: 41m 34s | Avg:  6m 55s | Max: 24m 10s
      🟩 GCC9               Pass: 100%/6   | Total: 37m 27s | Avg:  6m 14s | Max: 18m 30s
      🟩 GCC10              Pass: 100%/4   | Total: 37m 02s | Avg:  9m 15s | Max: 24m 42s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 19m | Avg: 11m 17s | Max: 29m 03s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 05s | Avg:  4m 16s | Max:  4m 22s
      🟩 GCC13              Pass: 100%/17  | Total:  2h 58m | Avg: 10m 30s | Max: 25m 24s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 52m 21s | Avg: 17m 27s | Max: 41m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 38s | Avg: 17m 38s | Max: 17m 38s | Hits:  98%/2199  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 48m 31s | Avg: 24m 15s | Max: 36m 25s | Hits:  63%/4743  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 37s | Avg: 14m 37s | Max: 14m 37s | Hits:  98%/2604  
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 57m 52s | Avg: 14m 28s | Max: 27m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  8h 30m | Avg:  9m 16s | Max: 29m 34s
      🟩 GCC                Pass: 100%/52  | Total:  7h 17m | Avg:  8m 24s | Max: 29m 03s
      🟩 Intel              Pass: 100%/3   | Total: 52m 21s | Avg: 17m 27s | Max: 41m 58s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 20m | Avg: 20m 11s | Max: 36m 25s | Hits:  81%/9546  
      🟩 NVHPC              Pass: 100%/4   | Total: 57m 52s | Avg: 14m 28s | Max: 27m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 18h 58m | Avg:  9m 38s | Max: 41m 58s | Hits:  81%/9546  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 16h 33m | Avg:  9m 02s | Max: 41m 58s | Hits:  81%/9546  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 32m | Avg: 23m 12s | Max: 25m 24s
      🟩 Test               Pass: 100%/3   | Total: 49m 55s | Avg: 16m 38s | Max: 19m 43s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 03m | Avg: 21m 04s | Max: 29m 03s
      🟩 90                 Pass: 100%/4   | Total: 39m 58s | Avg:  9m 59s | Max: 11m 44s
      🟩 90a                Pass: 100%/8   | Total: 58m 01s | Avg:  7m 15s | Max: 13m 23s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  4h 51m | Avg:  9m 07s | Max: 27m 55s
      🟩 14                 Pass: 100%/32  | Total:  4h 18m | Avg:  8m 05s | Max: 25m 24s | Hits:  98%/4492  
      🟩 17                 Pass: 100%/30  | Total:  6h 04m | Avg: 12m 08s | Max: 41m 58s | Hits:  31%/2450  
      🟩 20                 Pass: 100%/23  | Total:  3h 41m | Avg:  9m 38s | Max: 27m 31s | Hits:  98%/2604  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 55m | Avg: 6m 59s | Max: 39m 05s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 59s | Avg: 11m 59s | Max: 18m 29s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 12h 17m | Avg:  7m 09s | Max: 39m 05s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 37m 42s | Avg:  4m 42s | Max:  5m 09s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 19m | Avg:  5m 18s | Max: 17m 35s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 17m 27s | Avg:  5m 49s | Max:  6m 33s
      🟩 12.5               Pass: 100%/4   | Total:  1h 01m | Avg: 15m 23s | Max: 16m 44s
      🟩 12.6               Pass: 100%/89  | Total: 10h 17m | Avg:  6m 55s | Max: 39m 05s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 22s | Avg:  4m 50s | Max:  5m 10s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 19m | Avg:  5m 18s | Max: 17m 35s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 17m 27s | Avg:  5m 49s | Max:  6m 33s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 01m | Avg: 15m 23s | Max: 16m 44s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 57m | Avg:  7m 01s | Max: 39m 05s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 22s | Avg:  4m 50s | Max:  5m 10s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 36m | Avg:  7m 04s | Max: 39m 05s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 08s | Avg:  5m 11s | Max:  6m 11s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 10s | Avg:  6m 03s | Max:  6m 19s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 14s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 08s | Avg:  5m 17s | Max:  5m 44s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 51s | Avg:  5m 12s | Max:  5m 52s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 58s | Avg:  5m 14s | Max:  5m 29s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 33s | Avg:  5m 23s | Max:  5m 42s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 06s | Avg:  5m 16s | Max:  5m 41s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 36s | Avg:  5m 24s | Max:  5m 48s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 09m | Avg:  6m 19s | Max: 19m 08s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 30s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 27s | Avg:  4m 44s | Max:  6m 02s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 11s | Avg:  4m 51s | Max:  5m 18s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 16s | Avg:  4m 52s | Max:  5m 27s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 29s | Avg:  5m 22s | Max:  5m 43s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 07s | Avg:  5m 35s | Max:  6m 33s
      🟩 GCC12              Pass: 100%/4   | Total: 23m 06s | Avg:  5m 46s | Max:  6m 25s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 35m | Avg:  9m 42s | Max: 39m 05s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 21m 14s | Avg:  7m 04s | Max:  7m 38s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 35s | Avg: 17m 35s | Max: 17m 35s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 31m 43s | Avg: 15m 51s | Max: 16m 27s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 43m 13s | Avg: 21m 36s | Max: 25m 10s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 01m | Avg: 15m 23s | Max: 16m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 25m | Avg:  5m 32s | Max: 19m 08s
      🟩 GCC                Pass: 100%/51  | Total:  5h 34m | Avg:  6m 33s | Max: 39m 05s
      🟩 Intel              Pass: 100%/3   | Total: 21m 14s | Avg:  7m 04s | Max:  7m 38s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 32m | Avg: 18m 30s | Max: 25m 10s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 01m | Avg: 15m 23s | Max: 16m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 55m | Avg:  6m 59s | Max: 39m 05s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 24m | Avg:  6m 03s | Max: 18m 03s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total:  1h 19m | Avg: 19m 47s | Max: 39m 05s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 12m | Avg: 18m 04s | Max: 20m 24s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 17m 27s | Avg:  5m 49s | Max:  6m 33s
      🟩 90a                Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max:  5m 36s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 23m | Avg:  6m 46s | Max: 39m 05s
      🟩 14                 Pass: 100%/29  | Total:  3h 02m | Avg:  6m 18s | Max: 17m 35s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 50m | Avg:  6m 18s | Max: 16m 27s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 14m | Avg:  8m 28s | Max: 25m 10s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 12h 25m | Avg: 6m 46s | Max: 27m 13s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 11h 46m | Avg:  6m 55s | Max: 27m 13s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 39m 04s | Avg:  4m 53s | Max:  5m 26s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 14m 20s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 49s | Avg:  5m 36s | Max:  5m 41s
      🟩 12.5               Pass: 100%/4   | Total: 35m 44s | Avg:  8m 56s | Max:  9m 22s
      🟩 12.6               Pass: 100%/88  | Total: 10h 18m | Avg:  7m 01s | Max: 27m 13s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 17m 06s | Avg:  4m 16s | Max:  4m 28s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 14m | Avg:  4m 58s | Max: 14m 20s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 49s | Avg:  5m 36s | Max:  5m 41s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 44s | Avg:  8m 56s | Max:  9m 22s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 01m | Avg:  7m 09s | Max: 27m 13s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 17m 06s | Avg:  4m 16s | Max:  4m 28s
      🟩 nvcc               Pass: 100%/106 | Total: 12h 08m | Avg:  6m 52s | Max: 27m 13s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 09s | Avg:  5m 11s | Max:  6m 09s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 48s | Avg:  6m 16s | Max:  6m 56s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 04s | Avg:  5m 01s | Max:  5m 07s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 21s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 36s | Avg:  5m 09s | Max:  5m 35s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 46s | Avg:  5m 26s | Max:  5m 35s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 43s | Avg:  5m 10s | Max:  5m 20s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 37s | Avg:  5m 09s | Max:  5m 20s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 37s | Avg:  5m 24s | Max:  5m 44s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 31m | Avg:  8m 16s | Max: 27m 13s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 05s | Avg:  4m 32s | Max:  4m 53s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 53s | Avg:  4m 38s | Max:  5m 23s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 34s | Avg:  4m 45s | Max:  5m 37s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 27s | Avg:  4m 44s | Max:  5m 22s
      🟩 GCC10              Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 31s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 22s | Avg:  5m 28s | Max:  5m 49s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 23s | Avg:  5m 35s | Max:  5m 51s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 53m | Avg: 10m 48s | Max: 24m 50s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 17s | Avg:  6m 25s | Max:  6m 34s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 20s | Avg: 14m 20s | Max: 14m 20s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 25s | Avg: 12m 12s | Max: 12m 42s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 16m 19s | Avg: 16m 19s | Max: 16m 19s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 44s | Avg:  8m 56s | Max:  9m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 46m | Avg:  5m 58s | Max: 27m 13s
      🟩 GCC                Pass: 100%/51  | Total:  5h 48m | Avg:  6m 49s | Max: 24m 50s
      🟩 Intel              Pass: 100%/3   | Total: 19m 17s | Avg:  6m 25s | Max:  6m 34s
      🟩 MSVC               Pass: 100%/4   | Total: 55m 04s | Avg: 13m 46s | Max: 16m 19s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 44s | Avg:  8m 56s | Max:  9m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 12h 25m | Avg:  6m 46s | Max: 27m 13s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 30m | Avg:  5m 35s | Max: 16m 19s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 05s | Avg: 21m 05s | Max: 21m 05s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 28s | Avg: 17m 28s | Max: 17m 28s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 05m | Avg: 21m 54s | Max: 27m 13s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 35s | Max: 24m 50s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 49s | Avg:  5m 36s | Max:  5m 41s
      🟩 90a                Pass: 100%/4   | Total: 16m 46s | Avg:  4m 11s | Max:  4m 23s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 04m | Avg:  6m 09s | Max: 23m 52s
      🟩 14                 Pass: 100%/29  | Total:  2h 47m | Avg:  5m 47s | Max: 14m 20s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  2h 32m | Avg:  5m 38s | Max: 11m 43s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  4h 00m | Avg: 10m 01s | Max: 27m 13s | Hits:  99%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 53m | Avg: 5m 26s | Max: 21m 01s | Hits: 80%/302

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 39m | Avg:  5m 35s | Max: 21m 01s | Hits:  80%/302   
      🟩 arm64              Pass: 100%/4   | Total: 14m 29s | Avg:  3m 37s | Max:  3m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 40m | Avg:  5m 17s | Max: 17m 44s | Hits:  80%/151   
      🟩 12.5               Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 22s
      🟩 12.6               Pass: 100%/33  | Total:  3h 00m | Avg:  5m 28s | Max: 21m 01s | Hits:  80%/151   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 40m | Avg:  5m 17s | Max: 17m 44s | Hits:  80%/151   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 22s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  3h 00m | Avg:  5m 28s | Max: 21m 01s | Hits:  80%/151   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 53m | Avg:  5m 26s | Max: 21m 01s | Hits:  80%/302   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  4m 16s
      🟩 Clang10            Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 34s
      🟩 Clang11            Pass: 100%/4   | Total: 14m 15s | Avg:  3m 33s | Max:  3m 55s
      🟩 Clang12            Pass: 100%/4   | Total: 14m 15s | Avg:  3m 33s | Max:  3m 53s
      🟩 Clang13            Pass: 100%/4   | Total: 15m 26s | Avg:  3m 51s | Max:  4m 02s
      🟩 Clang14            Pass: 100%/4   | Total: 28m 29s | Avg:  7m 07s | Max: 17m 44s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 40s | Avg:  3m 50s | Max:  3m 57s
      🟩 Clang16            Pass: 100%/4   | Total: 15m 20s | Avg:  3m 50s | Max:  4m 09s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 16s
      🟩 Clang18            Pass: 100%/2   | Total: 24m 40s | Avg: 12m 20s | Max: 21m 01s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 50s | Avg:  3m 55s | Max:  4m 13s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 13s | Avg:  3m 48s | Max:  4m 32s
      🟩 GCC11              Pass: 100%/4   | Total: 14m 36s | Avg:  3m 39s | Max:  4m 04s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 08m | Avg:  9m 46s | Max: 18m 28s
      🟩 GCC13              Pass: 100%/3   | Total: 10m 20s | Avg:  3m 26s | Max:  3m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 02s | Avg: 10m 02s | Max: 10m 02s | Hits:  80%/151   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 53s | Avg: 10m 53s | Max: 10m 53s | Hits:  80%/151   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 23m | Avg:  4m 47s | Max: 21m 01s
      🟩 GCC                Pass: 100%/20  | Total:  1h 56m | Avg:  5m 49s | Max: 18m 28s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 55s | Avg: 10m 27s | Max: 10m 53s | Hits:  80%/302   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 37s | Avg:  6m 18s | Max:  6m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 53m | Avg:  5m 26s | Max: 21m 01s | Hits:  80%/302   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  3h 20m | Avg:  4m 05s | Max: 10m 53s | Hits:  80%/302   
      🟩 Test               Pass: 100%/5   | Total:  1h 33m | Avg: 18m 36s | Max: 21m 01s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 90a                Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 17m | Avg:  4m 44s | Max: 18m 28s
      🟩 20                 Pass: 100%/25  | Total:  2h 36m | Avg:  6m 14s | Max: 21m 01s | Hits:  80%/302   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 07s | Avg: 5m 03s | Max: 8m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 06s | Avg:  2m 06s | Max:  2m 06s
      🟩 Test               Pass: 100%/1   | Total:  8m 01s | Avg:  8m 01s | Max:  8m 01s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 16s | Avg: 15m 16s | Max: 15m 16s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 2h 41m: Pass: 100%/396 | Total: 1d 22h | Avg: 7m 02s | Max: 1h 26m | Hits: 98%/22136
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 16h 10m | Avg: 8m 13s | Max: 1h 26m | Hits: 98%/9546

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 15h 30m | Avg:  8m 27s | Max:  1h 26m | Hits:  98%/9546  
      🟩 arm64              Pass: 100%/8   | Total: 39m 56s | Avg:  4m 59s | Max: 13m 28s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 21m | Avg:  5m 25s | Max: 21m 59s | Hits:  98%/2199  
      🟩 11.8               Pass: 100%/3   | Total:  9m 24s | Avg:  3m 08s | Max:  3m 26s
      🟩 12.5               Pass: 100%/4   | Total: 35m 51s | Avg:  8m 57s | Max:  9m 56s
      🟩 12.6               Pass: 100%/96  | Total: 14h 03m | Avg:  8m 47s | Max:  1h 26m | Hits:  98%/7347  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 31m | Avg: 12m 39s | Max: 26m 09s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 21m | Avg:  5m 25s | Max: 21m 59s | Hits:  98%/2199  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  9m 24s | Avg:  3m 08s | Max:  3m 26s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 51s | Avg:  8m 57s | Max:  9m 56s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 31m | Avg:  8m 14s | Max:  1h 26m | Hits:  98%/7347  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 31m | Avg: 12m 39s | Max: 26m 09s
      🟩 nvcc               Pass: 100%/106 | Total: 13h 38m | Avg:  7m 43s | Max:  1h 26m | Hits:  98%/9546  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 24m 10s | Avg:  4m 01s | Max:  5m 04s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 04s | Avg:  5m 01s | Max:  6m 03s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 19s | Avg:  4m 19s | Max:  5m 18s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 22s | Avg:  4m 05s | Max:  4m 23s
      🟩 Clang13            Pass: 100%/4   | Total: 37m 40s | Avg:  9m 25s | Max: 25m 35s
      🟩 Clang14            Pass: 100%/4   | Total: 54m 09s | Avg: 13m 32s | Max: 25m 53s
      🟩 Clang15            Pass: 100%/4   | Total: 17m 44s | Avg:  4m 26s | Max:  4m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  5m 30s
      🟩 Clang17            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 34s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 20m | Avg: 11m 07s | Max: 26m 09s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 25s | Avg:  2m 42s | Max:  2m 57s
      🟩 GCC7               Pass: 100%/6   | Total: 35m 58s | Avg:  5m 59s | Max: 20m 40s
      🟩 GCC8               Pass: 100%/6   | Total: 39m 31s | Avg:  6m 35s | Max: 21m 59s
      🟩 GCC9               Pass: 100%/6   | Total: 21m 19s | Avg:  3m 33s | Max:  4m 13s
      🟩 GCC10              Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
      🟩 GCC11              Pass: 100%/7   | Total: 49m 42s | Avg:  7m 06s | Max: 28m 24s
      🟩 GCC12              Pass: 100%/4   | Total: 15m 39s | Avg:  3m 54s | Max:  4m 21s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 10m | Avg: 14m 43s | Max:  1h 26m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 08s | Avg:  6m 22s | Max:  7m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 40s | Avg: 19m 40s | Max: 19m 40s | Hits:  98%/2199  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 14m 39s | Hits:  98%/4743  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 26s | Avg: 15m 26s | Max: 15m 26s | Hits:  98%/2604  
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 51s | Avg:  8m 57s | Max:  9m 56s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  6h 57m | Avg:  7m 35s | Max: 26m 09s
      🟩 GCC                Pass: 100%/52  | Total:  7h 14m | Avg:  8m 21s | Max:  1h 26m
      🟩 Intel              Pass: 100%/3   | Total: 19m 08s | Avg:  6m 22s | Max:  7m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 19m 40s | Hits:  98%/9546  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 51s | Avg:  8m 57s | Max:  9m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 16h 10m | Avg:  8m 13s | Max:  1h 26m | Hits:  98%/9546  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 12h 19m | Avg:  6m 43s | Max: 28m 24s | Hits:  98%/9546  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 52m | Avg: 28m 05s | Max: 33m 20s
      🟩 Test               Pass: 100%/3   | Total:  1h 56m | Avg: 38m 51s | Max:  1h 26m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  9m 24s | Avg:  3m 08s | Max:  3m 26s
      🟩 90                 Pass: 100%/4   | Total: 41m 14s | Avg: 10m 18s | Max: 11m 35s
      🟩 90a                Pass: 100%/8   | Total: 56m 26s | Avg:  7m 03s | Max: 11m 32s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  3h 22m | Avg:  6m 18s | Max: 27m 24s
      🟩 14                 Pass: 100%/32  | Total:  3h 31m | Avg:  6m 36s | Max: 20m 40s | Hits:  98%/4492  
      🟩 17                 Pass: 100%/30  | Total:  3h 56m | Avg:  7m 52s | Max: 33m 20s | Hits:  98%/2450  
      🟩 20                 Pass: 100%/23  | Total:  5h 19m | Avg: 13m 52s | Max:  1h 26m | Hits:  98%/2604  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 20m | Avg: 6m 40s | Max: 21m 42s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 17s | Avg: 11m 38s | Max: 17m 36s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 11h 42m | Avg:  6m 48s | Max: 21m 42s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 38m 39s | Avg:  4m 49s | Max:  5m 29s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 17m | Avg:  5m 10s | Max: 18m 22s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 39s
      🟩 12.5               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 53s | Max: 17m 44s
      🟩 12.6               Pass: 100%/89  | Total:  9h 44m | Avg:  6m 33s | Max: 21m 42s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 42s | Avg:  4m 55s | Max:  5m 12s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 17m | Avg:  5m 10s | Max: 18m 22s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 39s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 03m | Avg: 15m 53s | Max: 17m 44s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 24m | Avg:  6m 38s | Max: 21m 42s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 42s | Avg:  4m 55s | Max:  5m 12s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 01m | Avg:  6m 44s | Max: 21m 42s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 14s | Avg:  5m 12s | Max:  6m 43s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 12s | Avg:  6m 44s | Max:  7m 15s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 29s | Avg:  5m 07s | Max:  5m 28s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 49s | Avg:  5m 12s | Max:  5m 36s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  5m 27s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 25s | Avg:  5m 06s | Max:  5m 33s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 37s | Avg:  5m 24s | Max:  5m 45s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 57s | Avg:  5m 29s | Max:  6m 02s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 17s | Avg:  5m 19s | Max:  5m 49s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 04m | Avg:  5m 51s | Max: 13m 47s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 28s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 18s | Avg:  4m 43s | Max:  5m 42s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 27s | Avg:  4m 44s | Max:  5m 55s
      🟩 GCC9               Pass: 100%/6   | Total: 30m 35s | Avg:  5m 05s | Max:  6m 51s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 56s | Avg:  5m 29s | Max:  5m 50s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 13s | Avg:  5m 27s | Max:  6m 23s
      🟩 GCC12              Pass: 100%/4   | Total: 23m 43s | Avg:  5m 55s | Max:  6m 35s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 02m | Avg:  7m 40s | Max: 18m 17s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 39s | Avg:  6m 53s | Max:  7m 23s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 22s | Avg: 18m 22s | Max: 18m 22s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 32m 21s | Avg: 16m 10s | Max: 16m 21s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 39m 43s | Avg: 19m 51s | Max: 21m 42s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 53s | Max: 17m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 23m | Avg:  5m 29s | Max: 13m 47s
      🟩 GCC                Pass: 100%/51  | Total:  5h 02m | Avg:  5m 56s | Max: 18m 17s
      🟩 Intel              Pass: 100%/3   | Total: 20m 39s | Avg:  6m 53s | Max:  7m 23s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 30m | Avg: 18m 05s | Max: 21m 42s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 03m | Avg: 15m 53s | Max: 17m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 20m | Avg:  6m 40s | Max: 21m 42s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 30m | Avg:  6m 07s | Max: 18m 22s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 21s | Avg: 11m 05s | Max: 21m 42s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 06m | Avg: 16m 34s | Max: 18m 17s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 39s
      🟩 90a                Pass: 100%/4   | Total: 18m 16s | Avg:  4m 34s | Max:  4m 52s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 48m | Avg:  5m 37s | Max: 16m 39s
      🟩 14                 Pass: 100%/29  | Total:  3h 06m | Avg:  6m 26s | Max: 18m 22s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 50m | Avg:  6m 19s | Max: 16m 32s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 11m | Avg:  8m 19s | Max: 21m 42s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 13h 03m | Avg: 7m 07s | Max: 39m 18s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 12h 25m | Avg:  7m 18s | Max: 39m 18s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 38m 26s | Avg:  4m 48s | Max:  5m 32s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 13m | Avg:  4m 52s | Max: 14m 13s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 17m 04s | Avg:  5m 41s | Max:  5m 49s
      🟩 12.5               Pass: 100%/4   | Total: 36m 29s | Avg:  9m 07s | Max:  9m 48s
      🟩 12.6               Pass: 100%/88  | Total: 10h 56m | Avg:  7m 27s | Max: 39m 18s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 13m | Avg:  4m 52s | Max: 14m 13s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 17m 04s | Avg:  5m 41s | Max:  5m 49s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 36m 29s | Avg:  9m 07s | Max:  9m 48s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 40m | Avg:  7m 37s | Max: 39m 18s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 35s | Avg:  4m 08s | Max:  4m 23s
      🟩 nvcc               Pass: 100%/106 | Total: 12h 47m | Avg:  7m 14s | Max: 39m 18s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 30m 11s | Avg:  5m 01s | Max:  5m 56s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 24s | Avg:  6m 28s | Max:  6m 48s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 25s | Avg:  5m 06s | Max:  5m 12s
      🟩 Clang12            Pass: 100%/4   | Total: 22m 33s | Avg:  5m 38s | Max:  6m 44s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 46s | Avg:  5m 11s | Max:  5m 22s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 37s | Avg:  5m 24s | Max:  6m 00s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 13s | Avg:  5m 18s | Max:  5m 43s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 52s | Avg:  5m 28s | Max:  5m 43s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 44s | Avg:  5m 11s | Max:  5m 33s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 35m | Avg:  8m 38s | Max: 29m 31s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 28s | Avg:  4m 14s | Max:  4m 24s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 41s | Avg:  4m 36s | Max:  5m 22s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 28s | Avg:  4m 44s | Max:  5m 29s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 03s | Avg:  4m 50s | Max:  5m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 27s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 32s | Avg:  5m 30s | Max:  5m 49s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 34s | Avg:  5m 38s | Max:  6m 01s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 26m | Avg: 12m 53s | Max: 39m 18s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 26s | Avg:  6m 28s | Max:  6m 49s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 13s | Avg: 14m 13s | Max: 14m 13s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 22s | Avg: 12m 11s | Max: 12m 11s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 49s | Avg: 12m 49s | Max: 12m 49s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 36m 29s | Avg:  9m 07s | Max:  9m 48s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 53m | Avg:  6m 07s | Max: 29m 31s
      🟩 GCC                Pass: 100%/51  | Total:  6h 22m | Avg:  7m 29s | Max: 39m 18s
      🟩 Intel              Pass: 100%/3   | Total: 19m 26s | Avg:  6m 28s | Max:  6m 49s
      🟩 MSVC               Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 14m 13s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 36m 29s | Avg:  9m 07s | Max:  9m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 13h 03m | Avg:  7m 07s | Max: 39m 18s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 31m | Avg:  5m 36s | Max: 14m 13s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 21s | Avg: 21m 21s | Max: 21m 21s
      🟩 GraphCapture       Pass: 100%/1   | Total: 21m 33s | Avg: 21m 33s | Max: 21m 33s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 41s | Max: 24m 24s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 40m | Avg: 33m 38s | Max: 39m 18s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 17m 04s | Avg:  5m 41s | Max:  5m 49s
      🟩 90a                Pass: 100%/4   | Total: 16m 49s | Avg:  4m 12s | Max:  4m 16s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 24m | Avg:  6m 49s | Max: 39m 18s
      🟩 14                 Pass: 100%/29  | Total:  2h 48m | Avg:  5m 49s | Max: 14m 13s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  2h 32m | Avg:  5m 38s | Max: 12m 11s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  4h 17m | Avg: 10m 44s | Max: 32m 06s | Hits:  99%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 26m | Avg: 4m 56s | Max: 24m 49s | Hits: 80%/302

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 14m | Avg:  5m 05s | Max: 24m 49s | Hits:  80%/302   
      🟩 arm64              Pass: 100%/4   | Total: 11m 34s | Avg:  2m 53s | Max:  3m 21s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 32m | Avg:  4m 52s | Max: 18m 53s | Hits:  80%/151   
      🟩 12.5               Pass: 100%/2   | Total: 13m 49s | Avg:  6m 54s | Max:  7m 17s
      🟩 12.6               Pass: 100%/33  | Total:  2h 40m | Avg:  4m 50s | Max: 24m 49s | Hits:  80%/151   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 32m | Avg:  4m 52s | Max: 18m 53s | Hits:  80%/151   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 49s | Avg:  6m 54s | Max:  7m 17s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 40m | Avg:  4m 50s | Max: 24m 49s | Hits:  80%/151   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 26m | Avg:  4m 56s | Max: 24m 49s | Hits:  80%/302   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 08s | Avg:  3m 04s | Max:  3m 21s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 13s | Avg:  3m 36s | Max:  3m 50s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 01s | Avg:  3m 00s | Max:  3m 15s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 11s | Avg:  3m 02s | Max:  3m 24s
      🟩 Clang13            Pass: 100%/4   | Total: 11m 41s | Avg:  2m 55s | Max:  2m 57s
      🟩 Clang14            Pass: 100%/4   | Total: 27m 11s | Avg:  6m 47s | Max: 18m 19s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 20s
      🟩 Clang16            Pass: 100%/4   | Total: 12m 31s | Avg:  3m 07s | Max:  3m 21s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 03s | Avg:  3m 01s | Max:  3m 04s
      🟩 Clang18            Pass: 100%/2   | Total: 21m 24s | Avg: 10m 42s | Max: 17m 58s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 03s | Avg:  3m 01s | Max:  3m 11s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 36s | Avg:  2m 54s | Max:  3m 07s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 53s | Avg:  2m 58s | Max:  3m 12s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 14m | Avg: 10m 34s | Max: 24m 49s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 39s | Avg:  2m 33s | Max:  2m 34s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 11s | Avg:  9m 11s | Max:  9m 11s | Hits:  80%/151   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 31s | Avg:  9m 31s | Max:  9m 31s | Hits:  80%/151   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 49s | Avg:  6m 54s | Max:  7m 17s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 02m | Avg:  4m 05s | Max: 18m 19s
      🟩 GCC                Pass: 100%/20  | Total:  1h 51m | Avg:  5m 33s | Max: 24m 49s
      🟩 MSVC               Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 31s | Hits:  80%/302   
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 49s | Avg:  6m 54s | Max:  7m 17s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 26m | Avg:  4m 56s | Max: 24m 49s | Hits:  80%/302   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 49m | Avg:  3m 27s | Max:  9m 31s | Hits:  80%/302   
      🟩 Test               Pass: 100%/5   | Total:  1h 37m | Avg: 19m 29s | Max: 24m 49s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 35s | Avg:  2m 35s | Max:  2m 35s
      🟩 90a                Pass: 100%/1   | Total:  2m 33s | Avg:  2m 33s | Max:  2m 33s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 03m | Avg:  4m 14s | Max: 18m 53s
      🟩 20                 Pass: 100%/25  | Total:  2h 23m | Avg:  5m 44s | Max: 24m 49s | Hits:  80%/302   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 48s | Avg: 4m 54s | Max: 7m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  7m 47s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
      🟩 Test               Pass: 100%/1   | Total:  7m 47s | Avg:  7m 47s | Max:  7m 47s
    
  • 🟩 python: Pass: 100%/1 | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

Copy link

copy-pr-bot bot commented Dec 16, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ericniebler
Copy link
Collaborator Author

/ok to test

@ericniebler
Copy link
Collaborator Author

/ok to test

@ericniebler
Copy link
Collaborator Author

/ok to test

@ericniebler
Copy link
Collaborator Author

/ok to test

@ericniebler
Copy link
Collaborator Author

/ok to test

Copy link
Contributor

🟨 CI finished in 1h 17m: Pass: 98%/170 | Total: 1d 02h | Avg: 9m 26s | Max: 52m 44s | Hits: 99%/22190
  • 🟨 cudax: Pass: 92%/26 | Total: 2h 21m | Avg: 5m 27s | Max: 19m 44s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/22  | Total:  2h 08m | Avg:  5m 49s | Max: 19m 44s
      🟩 arm64              Pass: 100%/4   | Total: 13m 43s | Avg:  3m 25s | Max:  3m 34s
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/13  | Total:  1h 01m | Avg:  4m 43s | Max: 16m 02s
      🟩 GCC                Pass: 100%/9   | Total: 47m 54s | Avg:  5m 19s | Max: 19m 44s
      🔥 MSVC               Pass:   0%/2   | Total: 19m 12s | Avg:  9m 36s | Max: 10m 31s
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 11s | Avg:  6m 35s | Max:  6m 39s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  91%/24  | Total:  1h 46m | Avg:  4m 25s | Max: 10m 31s
      🟩 Test               Pass: 100%/2   | Total: 35m 46s | Avg: 17m 53s | Max: 19m 44s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/6   | Total: 23m 30s | Avg:  3m 55s | Max:  6m 39s
      🔍 20                 Pass:  90%/20  | Total:  1h 58m | Avg:  5m 54s | Max: 19m 44s
    🟨 ctk
      🟨 12.0               Pass:  66%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  8m 41s
      🟩 12.5               Pass: 100%/2   | Total: 13m 11s | Avg:  6m 35s | Max:  6m 39s
      🟨 12.6               Pass:  95%/21  | Total:  1h 53m | Avg:  5m 23s | Max: 19m 44s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  66%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  8m 41s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 11s | Avg:  6m 35s | Max:  6m 39s
      🟨 nvcc12.6           Pass:  95%/21  | Total:  1h 53m | Avg:  5m 23s | Max: 19m 44s
    🟨 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 42s | Avg:  6m 40s | Max: 16m 02s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 GCC12              Pass: 100%/2   | Total: 23m 34s | Avg: 11m 47s | Max: 19m 44s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 42s | Avg:  3m 25s | Max:  3m 34s
      🟥 MSVC14.36          Pass:   0%/1   | Total:  8m 41s | Avg:  8m 41s | Max:  8m 41s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 10m 31s | Avg: 10m 31s | Max: 10m 31s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 11s | Avg:  6m 35s | Max:  6m 39s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  92%/26  | Total:  2h 21m | Avg:  5m 27s | Max: 19m 44s
    🟨 gpu
      🟨 v100               Pass:  92%/26  | Total:  2h 21m | Avg:  5m 27s | Max: 19m 44s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 90a                Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 9h 18m | Avg: 11m 38s | Max: 29m 05s | Hits: 99%/9806

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  9h 11m | Avg: 11m 59s | Max: 29m 05s | Hits:  99%/9806  
      🟩 arm64              Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 36s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 08m | Avg:  9m 45s | Max: 22m 44s | Hits:  98%/2237  
      🟩 12.5               Pass: 100%/2   | Total: 38m 10s | Avg: 19m 05s | Max: 29m 05s
      🟩 12.6               Pass: 100%/39  | Total:  7h 32m | Avg: 11m 35s | Max: 27m 49s | Hits:  99%/7569  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 04m | Avg: 16m 01s | Max: 20m 53s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 08m | Avg:  9m 45s | Max: 22m 44s | Hits:  98%/2237  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 38m 10s | Avg: 19m 05s | Max: 29m 05s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 28m | Avg: 11m 05s | Max: 27m 49s | Hits:  99%/7569  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 04m | Avg: 16m 01s | Max: 20m 53s
      🟩 nvcc               Pass: 100%/44  | Total:  8h 14m | Avg: 11m 14s | Max: 29m 05s | Hits:  99%/9806  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 41m 54s | Avg: 10m 28s | Max: 18m 51s
      🟩 Clang10            Pass: 100%/1   | Total: 21m 39s | Avg: 21m 39s | Max: 21m 39s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 04s | Avg:  4m 04s | Max:  4m 04s
      🟩 Clang13            Pass: 100%/1   | Total: 14m 18s | Avg: 14m 18s | Max: 14m 18s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
      🟩 Clang17            Pass: 100%/1   | Total: 20m 32s | Avg: 20m 32s | Max: 20m 32s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 38m | Avg: 12m 16s | Max: 20m 53s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 39s
      🟩 GCC7               Pass: 100%/2   | Total: 13m 14s | Avg:  6m 37s | Max: 10m 02s
      🟩 GCC8               Pass: 100%/1   | Total: 18m 07s | Avg: 18m 07s | Max: 18m 07s
      🟩 GCC9               Pass: 100%/3   | Total: 29m 27s | Avg:  9m 49s | Max: 22m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s
      🟩 GCC11              Pass: 100%/1   | Total: 22m 28s | Avg: 22m 28s | Max: 22m 28s
      🟩 GCC12              Pass: 100%/1   | Total:  8m 22s | Avg:  8m 22s | Max:  8m 22s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 16m | Avg: 13m 41s | Max: 27m 49s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 25s | Avg: 18m 25s | Max: 18m 25s | Hits:  98%/2237  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 51s | Avg: 12m 51s | Max: 12m 51s | Hits:  99%/2474  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 28m 58s | Avg: 14m 29s | Max: 15m 32s | Hits:  98%/5095  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 38m 10s | Avg: 19m 05s | Max: 29m 05s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  3h 37m | Avg: 10m 51s | Max: 21m 39s
      🟩 GCC                Pass: 100%/21  | Total:  3h 57m | Avg: 11m 19s | Max: 27m 49s
      🟩 Intel              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 03s | Max: 18m 25s | Hits:  99%/9806  
      🟩 NVHPC              Pass: 100%/2   | Total: 38m 10s | Avg: 19m 05s | Max: 29m 05s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total:  9h 18m | Avg: 11m 38s | Max: 29m 05s | Hits:  99%/9806  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  7h 03m | Avg: 10m 19s | Max: 29m 05s | Hits:  99%/9806  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 38m | Avg: 24m 40s | Max: 27m 49s
      🟩 Test               Pass: 100%/2   | Total: 34m 54s | Avg: 17m 27s | Max: 17m 30s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 33s | Avg: 12m 33s | Max: 12m 33s
      🟩 90a                Pass: 100%/2   | Total: 16m 31s | Avg:  8m 15s | Max: 12m 25s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 01m | Avg: 10m 18s | Max: 26m 13s
      🟩 14                 Pass: 100%/5   | Total:  1h 17m | Avg: 15m 33s | Max: 27m 49s | Hits:  98%/2237  
      🟩 17                 Pass: 100%/13  | Total:  2h 55m | Avg: 13m 28s | Max: 29m 05s | Hits:  99%/4948  
      🟩 20                 Pass: 100%/23  | Total:  4h 02m | Avg: 10m 31s | Max: 22m 28s | Hits:  98%/2621  
    
  • 🟩 cub: Pass: 100%/47 | Total: 8h 02m | Avg: 10m 15s | Max: 52m 44s | Hits: 99%/3124

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  7h 52m | Avg: 10m 30s | Max: 52m 44s | Hits:  99%/3124  
      🟩 arm64              Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  4m 57s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 41m 28s | Avg:  5m 55s | Max: 14m 52s | Hits:  99%/781   
      🟩 12.5               Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 53s
      🟩 12.6               Pass: 100%/38  | Total:  7h 03m | Avg: 11m 08s | Max: 52m 44s | Hits:  99%/2343  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 21s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 41m 28s | Avg:  5m 55s | Max: 14m 52s | Hits:  99%/781   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 53s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  6h 54m | Avg: 11m 30s | Max: 52m 44s | Hits:  99%/2343  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 21s
      🟩 nvcc               Pass: 100%/45  | Total:  7h 53m | Avg: 10m 31s | Max: 52m 44s | Hits:  99%/3124  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 01s | Avg:  5m 15s | Max:  5m 49s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 52s | Avg:  6m 52s | Max:  6m 52s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 06s | Avg:  5m 06s | Max:  5m 06s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 08s | Avg:  5m 08s | Max:  5m 08s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 12m | Avg: 10m 17s | Max: 30m 30s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 45s | Avg:  4m 22s | Max:  4m 35s
      🟩 GCC7               Pass: 100%/2   | Total: 57m 55s | Avg: 28m 57s | Max: 52m 44s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 10s | Avg:  4m 43s | Max:  5m 46s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC12              Pass: 100%/3   | Total: 26m 08s | Avg:  8m 42s | Max: 16m 02s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 21m | Avg: 17m 40s | Max: 38m 36s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 49s | Avg:  6m 49s | Max:  6m 49s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s | Hits:  99%/781   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 24s | Avg: 12m 24s | Max: 12m 24s | Hits:  99%/781   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 28m 11s | Avg: 14m 05s | Max: 15m 21s | Hits:  99%/1562  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 53s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 17m | Avg:  7m 14s | Max: 30m 30s
      🟩 GCC                Pass: 100%/21  | Total:  4h 24m | Avg: 12m 36s | Max: 52m 44s
      🟩 Intel              Pass: 100%/1   | Total:  6m 49s | Avg:  6m 49s | Max:  6m 49s
      🟩 MSVC               Pass: 100%/4   | Total: 55m 27s | Avg: 13m 51s | Max: 15m 21s | Hits:  99%/3124  
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 53s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 27s | Avg: 10m 13s | Max: 16m 02s
      🟩 v100               Pass: 100%/45  | Total:  7h 42m | Avg: 10m 16s | Max: 52m 44s | Hits:  99%/3124  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 58m | Avg:  7m 27s | Max: 52m 44s | Hits:  99%/3124  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 31m 37s | Avg: 31m 37s | Max: 31m 37s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 22s | Avg: 19m 22s | Max: 19m 22s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 04m | Avg: 21m 24s | Max: 30m 36s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 33s | Max: 38m 36s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 27s | Avg: 10m 13s | Max: 16m 02s
      🟩 90a                Pass: 100%/1   | Total:  4m 24s | Avg:  4m 24s | Max:  4m 24s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 11m | Avg: 14m 12s | Max: 52m 44s
      🟩 14                 Pass: 100%/4   | Total: 30m 23s | Avg:  7m 35s | Max: 14m 52s | Hits:  99%/781   
      🟩 17                 Pass: 100%/12  | Total:  1h 23m | Avg:  6m 59s | Max: 12m 50s | Hits:  99%/1562  
      🟩 20                 Pass: 100%/26  | Total:  4h 57m | Avg: 11m 25s | Max: 38m 36s | Hits:  99%/781   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 25m | Avg: 8m 22s | Max: 24m 14s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 24m 47s | Avg: 12m 23s | Max: 19m 09s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 15m | Avg:  8m 32s | Max: 24m 14s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 58s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 45m 06s | Avg:  6m 26s | Max: 19m 31s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 29m 55s | Avg: 14m 57s | Max: 15m 36s
      🟩 12.6               Pass: 100%/37  | Total:  5h 10m | Avg:  8m 23s | Max: 24m 14s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 02s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 45m 06s | Avg:  6m 26s | Max: 19m 31s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 55s | Avg: 14m 57s | Max: 15m 36s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  5h 00m | Avg:  8m 35s | Max: 24m 14s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 02s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 15m | Avg:  8m 31s | Max: 24m 14s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  6m 06s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 36s | Avg:  6m 36s | Max:  6m 36s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 06s | Avg:  5m 06s | Max:  5m 06s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 07s | Avg:  5m 07s | Max:  5m 07s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🟩 Clang18            Pass: 100%/7   | Total: 56m 42s | Avg:  8m 06s | Max: 23m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 23s | Avg:  4m 11s | Max:  4m 21s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  5m 10s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 29s | Avg:  5m 29s | Max:  5m 29s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 04s | Avg:  4m 41s | Max:  5m 55s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 26s | Max: 21m 38s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 31s | Avg: 19m 31s | Max: 19m 31s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 16m 53s | Avg: 16m 53s | Max: 16m 53s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 00m | Avg: 20m 05s | Max: 24m 14s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 55s | Avg: 14m 57s | Max: 15m 36s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 01m | Avg:  6m 25s | Max: 23m 42s
      🟩 GCC                Pass: 100%/19  | Total:  2h 09m | Avg:  6m 49s | Max: 21m 38s
      🟩 Intel              Pass: 100%/1   | Total:  7m 06s | Avg:  7m 06s | Max:  7m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 19s | Max: 24m 14s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 55s | Avg: 14m 57s | Max: 15m 36s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 25m | Avg:  8m 22s | Max: 24m 14s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 40m | Avg:  7m 01s | Max: 19m 42s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 40m 03s | Avg: 13m 21s | Max: 24m 14s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 29s | Max: 23m 42s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 21m 39s | Avg:  4m 19s | Max:  5m 24s
      🟩 14                 Pass: 100%/4   | Total: 35m 08s | Avg:  8m 47s | Max: 19m 31s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 38m | Avg:  8m 11s | Max: 16m 53s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 25m | Avg:  8m 56s | Max: 24m 14s | Hits:  99%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 11s | Avg: 5m 05s | Max: 8m 06s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  8m 06s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
      🟩 Test               Pass: 100%/1   | Total:  8m 06s | Avg:  8m 06s | Max:  8m 06s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 19s | Avg: 27m 19s | Max: 27m 19s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@ericniebler
Copy link
Collaborator Author

/ok to test

Copy link
Contributor

🟨 CI finished in 55m 15s: Pass: 99%/170 | Total: 1d 02h | Avg: 9m 29s | Max: 49m 04s | Hits: 77%/22346
  • 🟨 cudax: Pass: 96%/26 | Total: 2h 25m | Avg: 5m 35s | Max: 24m 24s | Hits: 74%/156

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/22  | Total:  2h 11m | Avg:  5m 59s | Max: 24m 24s | Hits:  74%/156   
      🟩 arm64              Pass: 100%/4   | Total: 13m 34s | Avg:  3m 23s | Max:  3m 37s
    🔍 ctk: 12.0 🔍
      🔍 12.0               Pass:  66%/3   | Total: 16m 00s | Avg:  5m 20s | Max:  8m 46s
      🟩 12.5               Pass: 100%/2   | Total: 12m 21s | Avg:  6m 10s | Max:  6m 23s
      🟩 12.6               Pass: 100%/21  | Total:  1h 57m | Avg:  5m 34s | Max: 24m 24s | Hits:  74%/156   
    🔍 cudacxx: nvcc12.0 🔍
      🔍 nvcc12.0           Pass:  66%/3   | Total: 16m 00s | Avg:  5m 20s | Max:  8m 46s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 21s | Avg:  6m 10s | Max:  6m 23s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 57m | Avg:  5m 34s | Max: 24m 24s | Hits:  74%/156   
    🚨 cxx: MSVC14.36 🚨
      🟩 Clang9             Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 30s | Avg:  6m 37s | Max: 16m 06s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s
      🟩 GCC12              Pass: 100%/2   | Total: 28m 22s | Avg: 14m 11s | Max: 24m 24s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 10s | Avg:  3m 17s | Max:  3m 37s
      🔥 MSVC14.36          Pass:   0%/1   | Total:  8m 46s | Avg:  8m 46s | Max:  8m 46s
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 24s | Avg: 10m 24s | Max: 10m 24s | Hits:  74%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 21s | Avg:  6m 10s | Max:  6m 23s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/13  | Total:  1h 01m | Avg:  4m 41s | Max: 16m 06s
      🟩 GCC                Pass: 100%/9   | Total: 52m 59s | Avg:  5m 53s | Max: 24m 24s
      🔍 MSVC               Pass:  50%/2   | Total: 19m 10s | Avg:  9m 35s | Max: 10m 24s | Hits:  74%/156   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 21s | Avg:  6m 10s | Max:  6m 23s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  95%/24  | Total:  1h 45m | Avg:  4m 22s | Max: 10m 24s | Hits:  74%/156   
      🟩 Test               Pass: 100%/2   | Total: 40m 30s | Avg: 20m 15s | Max: 24m 24s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/6   | Total: 22m 58s | Avg:  3m 49s | Max:  5m 58s
      🔍 20                 Pass:  95%/20  | Total:  2h 02m | Avg:  6m 07s | Max: 24m 24s | Hits:  74%/156   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  96%/26  | Total:  2h 25m | Avg:  5m 35s | Max: 24m 24s | Hits:  74%/156   
    🟨 gpu
      🟨 v100               Pass:  96%/26  | Total:  2h 25m | Avg:  5m 35s | Max: 24m 24s | Hits:  74%/156   
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 90a                Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 10h 50m | Avg: 13m 32s | Max: 49m 04s | Hits: 49%/9806

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 10h 32m | Avg: 13m 45s | Max: 49m 04s | Hits:  49%/9806  
      🟩 arm64              Pass: 100%/2   | Total: 17m 37s | Avg:  8m 48s | Max: 14m 19s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 09m | Avg:  9m 51s | Max: 30m 16s | Hits:  34%/2237  
      🟩 12.5               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 07s | Max: 32m 14s
      🟩 12.6               Pass: 100%/39  | Total:  8h 37m | Avg: 13m 15s | Max: 49m 04s | Hits:  54%/7569  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 03m | Avg: 15m 52s | Max: 19m 45s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 09m | Avg:  9m 51s | Max: 30m 16s | Hits:  34%/2237  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 04m | Avg: 32m 07s | Max: 32m 14s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  7h 33m | Avg: 12m 57s | Max: 49m 04s | Hits:  54%/7569  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 52s | Max: 19m 45s
      🟩 nvcc               Pass: 100%/44  | Total:  9h 46m | Avg: 13m 20s | Max: 49m 04s | Hits:  49%/9806  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 28m 34s | Avg:  7m 08s | Max: 16m 47s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 20s | Avg:  5m 20s | Max:  5m 20s
      🟩 Clang11            Pass: 100%/1   | Total:  7m 56s | Avg:  7m 56s | Max:  7m 56s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
      🟩 Clang13            Pass: 100%/1   | Total: 19m 43s | Avg: 19m 43s | Max: 19m 43s
      🟩 Clang14            Pass: 100%/1   | Total:  8m 36s | Avg:  8m 36s | Max:  8m 36s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 Clang16            Pass: 100%/1   | Total: 22m 19s | Avg: 22m 19s | Max: 22m 19s
      🟩 Clang17            Pass: 100%/1   | Total:  8m 11s | Avg:  8m 11s | Max:  8m 11s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 15m | Avg: 16m 56s | Max: 49m 04s
      🟩 GCC6               Pass: 100%/2   | Total: 13m 24s | Avg:  6m 42s | Max: 11m 05s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 38s | Avg:  3m 19s | Max:  3m 34s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 GCC9               Pass: 100%/3   | Total: 25m 49s | Avg:  8m 36s | Max: 20m 18s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC12              Pass: 100%/1   | Total: 22m 52s | Avg: 22m 52s | Max: 22m 52s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 23m | Avg: 14m 21s | Max: 32m 36s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 30m 16s | Avg: 30m 16s | Max: 30m 16s | Hits:  34%/2237  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 32m 45s | Avg: 32m 45s | Max: 32m 45s | Hits:  30%/2474  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 49m 44s | Avg: 24m 52s | Max: 36m 11s | Hits:  65%/5095  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 07s | Max: 32m 14s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  4h 04m | Avg: 12m 13s | Max: 49m 04s
      🟩 GCC                Pass: 100%/21  | Total:  3h 42m | Avg: 10m 37s | Max: 32m 36s
      🟩 Intel              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 52m | Avg: 28m 11s | Max: 36m 11s | Hits:  49%/9806  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 07s | Max: 32m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 10h 50m | Avg: 13m 32s | Max: 49m 04s | Hits:  49%/9806  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  7h 52m | Avg: 11m 30s | Max: 36m 11s | Hits:  49%/9806  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 47m | Avg: 26m 55s | Max: 32m 36s
      🟩 Test               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 22s | Max: 49m 04s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 52s | Avg:  1m 52s | Max:  1m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 22s | Avg: 13m 22s | Max: 13m 22s
      🟩 90a                Pass: 100%/2   | Total: 15m 51s | Avg:  7m 55s | Max: 12m 14s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total: 44m 44s | Avg:  7m 27s | Max: 29m 36s
      🟩 14                 Pass: 100%/5   | Total:  1h 22m | Avg: 16m 26s | Max: 32m 36s | Hits:  34%/2237  
      🟩 17                 Pass: 100%/13  | Total:  3h 26m | Avg: 15m 52s | Max: 36m 11s | Hits:  30%/4948  
      🟩 20                 Pass: 100%/23  | Total:  5h 15m | Avg: 13m 42s | Max: 49m 04s | Hits:  98%/2621  
    
  • 🟩 cub: Pass: 100%/47 | Total: 6h 53m | Avg: 8m 47s | Max: 37m 51s | Hits: 99%/3124

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  6h 43m | Avg:  8m 58s | Max: 37m 51s | Hits:  99%/3124  
      🟩 arm64              Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  4m 59s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 42m 46s | Avg:  6m 06s | Max: 15m 45s | Hits:  99%/781   
      🟩 12.5               Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 47s
      🟩 12.6               Pass: 100%/38  | Total:  5h 52m | Avg:  9m 15s | Max: 37m 51s | Hits:  99%/2343  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  4m 35s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 42m 46s | Avg:  6m 06s | Max: 15m 45s | Hits:  99%/781   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 47s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  5h 43m | Avg:  9m 32s | Max: 37m 51s | Hits:  99%/2343  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  4m 35s
      🟩 nvcc               Pass: 100%/45  | Total:  6h 44m | Avg:  8m 59s | Max: 37m 51s | Hits:  99%/3124  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 36s | Avg:  5m 09s | Max:  5m 48s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 43s | Avg:  6m 43s | Max:  6m 43s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 36s | Avg:  5m 36s | Max:  5m 36s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 28m | Avg: 12m 40s | Max: 37m 51s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  4m 45s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  5m 06s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 42s | Avg:  4m 54s | Max:  5m 43s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 GCC12              Pass: 100%/3   | Total: 25m 48s | Avg:  8m 36s | Max: 16m 01s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 42m | Avg: 12m 47s | Max: 30m 19s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 57s | Avg:  6m 57s | Max:  6m 57s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 45s | Avg: 15m 45s | Max: 15m 45s | Hits:  99%/781   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 12m 53s | Avg: 12m 53s | Max: 12m 53s | Hits:  99%/781   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 25m 44s | Avg: 12m 52s | Max: 12m 58s | Hits:  99%/1562  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 47s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 34m | Avg:  8m 09s | Max: 37m 51s
      🟩 GCC                Pass: 100%/21  | Total:  2h 58m | Avg:  8m 30s | Max: 30m 19s
      🟩 Intel              Pass: 100%/1   | Total:  6m 57s | Avg:  6m 57s | Max:  6m 57s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 22s | Avg: 13m 35s | Max: 15m 45s | Hits:  99%/3124  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 38s | Avg:  9m 19s | Max:  9m 47s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 16s | Avg: 10m 08s | Max: 16m 01s
      🟩 v100               Pass: 100%/45  | Total:  6h 33m | Avg:  8m 44s | Max: 37m 51s | Hits:  99%/3124  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 12m | Avg:  6m 18s | Max: 15m 45s | Hits:  99%/3124  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 16s | Avg: 18m 16s | Max: 18m 16s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
      🟩 HostLaunch         Pass: 100%/3   | Total: 59m 23s | Avg: 19m 47s | Max: 26m 04s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 05s | Max: 37m 51s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 16s | Avg: 10m 08s | Max: 16m 01s
      🟩 90a                Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 24m 00s | Avg:  4m 48s | Max:  5m 40s
      🟩 14                 Pass: 100%/4   | Total: 30m 48s | Avg:  7m 42s | Max: 15m 45s | Hits:  99%/781   
      🟩 17                 Pass: 100%/12  | Total:  1h 25m | Avg:  7m 06s | Max: 12m 53s | Hits:  99%/1562  
      🟩 20                 Pass: 100%/26  | Total:  4h 33m | Avg: 10m 30s | Max: 37m 51s | Hits:  99%/781   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 02m | Avg: 7m 52s | Max: 22m 18s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 21s | Avg: 10m 40s | Max: 15m 07s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  5h 52m | Avg:  8m 00s | Max: 22m 18s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  5m 02s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 42m 36s | Avg:  6m 05s | Max: 17m 21s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 29m 45s | Avg: 14m 52s | Max: 15m 28s
      🟩 12.6               Pass: 100%/37  | Total:  4h 49m | Avg:  7m 49s | Max: 22m 18s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  4m 58s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 42m 36s | Avg:  6m 05s | Max: 17m 21s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 45s | Avg: 14m 52s | Max: 15m 28s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 39m | Avg:  7m 59s | Max: 22m 18s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  4m 58s
      🟩 nvcc               Pass: 100%/44  | Total:  5h 52m | Avg:  8m 00s | Max: 22m 18s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 02s | Avg:  5m 00s | Max:  5m 57s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 55s | Avg:  6m 55s | Max:  6m 55s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 57s | Avg:  4m 57s | Max:  4m 57s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 07s | Avg:  5m 07s | Max:  5m 07s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 10s | Avg:  5m 10s | Max:  5m 10s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 58s | Avg:  4m 58s | Max:  4m 58s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s
      🟩 Clang18            Pass: 100%/7   | Total: 51m 24s | Avg:  7m 20s | Max: 18m 17s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 15s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  5m 05s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 53s | Avg:  4m 37s | Max:  5m 27s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 05m | Avg:  8m 14s | Max: 15m 07s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 05s | Avg:  7m 05s | Max:  7m 05s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 21s | Avg: 17m 21s | Max: 17m 21s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 15m 19s | Avg: 15m 19s | Max: 15m 19s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 57m 18s | Avg: 19m 06s | Max: 22m 18s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 45s | Avg: 14m 52s | Max: 15m 28s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 55m | Avg:  6m 03s | Max: 18m 17s
      🟩 GCC                Pass: 100%/19  | Total:  1h 59m | Avg:  6m 18s | Max: 15m 07s
      🟩 Intel              Pass: 100%/1   | Total:  7m 05s | Avg:  7m 05s | Max:  7m 05s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 29m | Avg: 17m 59s | Max: 22m 18s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 45s | Avg: 14m 52s | Max: 15m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 02m | Avg:  7m 52s | Max: 22m 18s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 36m | Avg:  6m 54s | Max: 18m 13s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 37m 11s | Avg: 12m 23s | Max: 22m 18s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 48m 31s | Avg: 16m 10s | Max: 18m 17s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 21m 48s | Avg:  4m 21s | Max:  5m 19s
      🟩 14                 Pass: 100%/4   | Total: 32m 38s | Avg:  8m 09s | Max: 17m 21s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 38m | Avg:  8m 11s | Max: 16m 47s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 07m | Avg:  8m 10s | Max: 22m 18s | Hits:  99%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 07s | Avg: 5m 03s | Max: 8m 06s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  8m 06s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 01s | Avg:  2m 01s | Max:  2m 01s
      🟩 Test               Pass: 100%/1   | Total:  8m 06s | Avg:  8m 06s | Max:  8m 06s
    
  • 🟩 python: Pass: 100%/1 | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@ericniebler
Copy link
Collaborator Author

/ok to test

@ericniebler
Copy link
Collaborator Author

/ok to test

Copy link
Contributor

🟩 CI finished in 1h 26m: Pass: 100%/170 | Total: 1d 00h | Avg: 8m 44s | Max: 41m 44s | Hits: 96%/22502
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 8h 55m | Avg: 11m 09s | Max: 24m 31s | Hits: 94%/9806

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  8h 31m | Avg: 11m 07s | Max: 24m 31s | Hits:  94%/9806  
      🟩 arm64              Pass: 100%/2   | Total: 23m 59s | Avg: 11m 59s | Max: 20m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 08m | Avg:  9m 49s | Max: 22m 53s | Hits:  98%/2237  
      🟩 12.5               Pass: 100%/2   | Total: 16m 24s | Avg:  8m 12s | Max:  8m 22s
      🟩 12.6               Pass: 100%/39  | Total:  7h 30m | Avg: 11m 33s | Max: 24m 31s | Hits:  92%/7569  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 01m | Avg: 15m 19s | Max: 19m 06s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 08m | Avg:  9m 49s | Max: 22m 53s | Hits:  98%/2237  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 16m 24s | Avg:  8m 12s | Max:  8m 22s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 29m | Avg: 11m 07s | Max: 24m 31s | Hits:  92%/7569  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 01m | Avg: 15m 19s | Max: 19m 06s
      🟩 nvcc               Pass: 100%/44  | Total:  7h 54m | Avg: 10m 46s | Max: 24m 31s | Hits:  94%/9806  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 47m 01s | Avg: 11m 45s | Max: 22m 53s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 15s | Avg:  4m 15s | Max:  4m 15s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 07m | Avg: 15m 52s | Max: 20m 39s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 09s | Avg:  2m 34s | Max:  2m 43s
      🟩 GCC7               Pass: 100%/2   | Total: 30m 44s | Avg: 15m 22s | Max: 16m 43s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟩 GCC9               Pass: 100%/3   | Total: 21m 35s | Avg:  7m 11s | Max: 15m 24s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 GCC12              Pass: 100%/1   | Total: 15m 01s | Avg: 15m 01s | Max: 15m 01s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 15m | Avg: 13m 31s | Max: 24m 31s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 23m 23s | Avg: 23m 23s | Max: 23m 23s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 17s | Avg: 19m 17s | Max: 19m 17s | Hits:  98%/2237  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 18m 08s | Avg: 18m 08s | Max: 18m 08s | Hits:  81%/2474  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 30m 59s | Avg: 15m 29s | Max: 16m 03s | Hits:  98%/5095  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 16m 24s | Avg:  8m 12s | Max:  8m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  3h 28m | Avg: 10m 25s | Max: 22m 53s
      🟩 GCC                Pass: 100%/21  | Total:  3h 38m | Avg: 10m 25s | Max: 24m 31s
      🟩 Intel              Pass: 100%/1   | Total: 23m 23s | Avg: 23m 23s | Max: 23m 23s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 08m | Avg: 17m 06s | Max: 19m 17s | Hits:  94%/9806  
      🟩 NVHPC              Pass: 100%/2   | Total: 16m 24s | Avg:  8m 12s | Max:  8m 22s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total:  8h 55m | Avg: 11m 09s | Max: 24m 31s | Hits:  94%/9806  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  6h 49m | Avg:  9m 59s | Max: 23m 23s | Hits:  94%/9806  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 29m | Avg: 22m 21s | Max: 24m 31s
      🟩 Test               Pass: 100%/2   | Total: 34m 46s | Avg: 17m 23s | Max: 18m 18s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 11m 40s | Avg: 11m 40s | Max: 11m 40s
      🟩 90a                Pass: 100%/2   | Total: 16m 15s | Avg:  8m 07s | Max: 12m 19s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total:  1h 20m | Avg: 13m 28s | Max: 23m 12s
      🟩 14                 Pass: 100%/5   | Total:  1h 07m | Avg: 13m 34s | Max: 24m 31s | Hits:  98%/2237  
      🟩 17                 Pass: 100%/13  | Total:  2h 26m | Avg: 11m 16s | Max: 23m 23s | Hits:  90%/4948  
      🟩 20                 Pass: 100%/23  | Total:  3h 58m | Avg: 10m 21s | Max: 21m 42s | Hits:  97%/2621  
    
  • 🟩 cub: Pass: 100%/47 | Total: 6h 52m | Avg: 8m 46s | Max: 41m 44s | Hits: 99%/3124

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  6h 42m | Avg:  8m 56s | Max: 41m 44s | Hits:  99%/3124  
      🟩 arm64              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  4m 56s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 41m 09s | Avg:  5m 52s | Max: 14m 57s | Hits:  99%/781   
      🟩 12.5               Pass: 100%/2   | Total: 19m 02s | Avg:  9m 31s | Max: 10m 05s
      🟩 12.6               Pass: 100%/38  | Total:  5h 52m | Avg:  9m 15s | Max: 41m 44s | Hits:  99%/2343  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 21s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 41m 09s | Avg:  5m 52s | Max: 14m 57s | Hits:  99%/781   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 02s | Avg:  9m 31s | Max: 10m 05s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  5h 43m | Avg:  9m 32s | Max: 41m 44s | Hits:  99%/2343  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 21s
      🟩 nvcc               Pass: 100%/45  | Total:  6h 43m | Avg:  8m 58s | Max: 41m 44s | Hits:  99%/3124  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 10s | Avg:  5m 17s | Max:  6m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 08s | Avg:  7m 08s | Max:  7m 08s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 10s | Avg:  5m 10s | Max:  5m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 02m | Avg:  8m 59s | Max: 20m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 28s | Avg:  4m 14s | Max:  4m 24s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 12s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 51s | Avg:  5m 51s | Max:  5m 51s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 39s | Avg:  5m 39s | Max:  5m 39s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 45s | Avg:  5m 45s | Max:  5m 45s
      🟩 GCC12              Pass: 100%/3   | Total: 26m 30s | Avg:  8m 50s | Max: 16m 17s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 05m | Avg: 15m 44s | Max: 41m 44s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 24s | Avg:  6m 24s | Max:  6m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 57s | Avg: 14m 57s | Max: 14m 57s | Hits:  99%/781   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 14m 04s | Avg: 14m 04s | Max: 14m 04s | Hits:  99%/781   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 25m 45s | Avg: 12m 52s | Max: 13m 00s | Hits:  99%/1562  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 02s | Avg:  9m 31s | Max: 10m 05s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 09m | Avg:  6m 47s | Max: 20m 49s
      🟩 GCC                Pass: 100%/21  | Total:  3h 22m | Avg:  9m 39s | Max: 41m 44s
      🟩 Intel              Pass: 100%/1   | Total:  6m 24s | Avg:  6m 24s | Max:  6m 24s
      🟩 MSVC               Pass: 100%/4   | Total: 54m 46s | Avg: 13m 41s | Max: 14m 57s | Hits:  99%/3124  
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 02s | Avg:  9m 31s | Max: 10m 05s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 25s | Avg: 10m 12s | Max: 16m 17s
      🟩 v100               Pass: 100%/45  | Total:  6h 31m | Avg:  8m 42s | Max: 41m 44s | Hits:  99%/3124  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 12m | Avg:  6m 18s | Max: 14m 57s | Hits:  99%/3124  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 11s | Avg: 25m 11s | Max: 25m 11s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 50s | Avg: 19m 50s | Max: 19m 50s
      🟩 HostLaunch         Pass: 100%/3   | Total: 52m 28s | Avg: 17m 29s | Max: 18m 10s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 16s | Max: 41m 44s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 25s | Avg: 10m 12s | Max: 16m 17s
      🟩 90a                Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 31s | Avg:  4m 42s | Max:  5m 43s
      🟩 14                 Pass: 100%/4   | Total: 30m 55s | Avg:  7m 43s | Max: 14m 57s | Hits:  99%/781   
      🟩 17                 Pass: 100%/12  | Total:  1h 26m | Avg:  7m 14s | Max: 14m 04s | Hits:  99%/1562  
      🟩 20                 Pass: 100%/26  | Total:  4h 31m | Avg: 10m 25s | Max: 41m 44s | Hits:  99%/781   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 01m | Avg: 7m 51s | Max: 21m 43s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 22m 48s | Avg: 11m 24s | Max: 16m 34s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  5h 51m | Avg:  7m 59s | Max: 21m 43s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  5m 07s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 44m 49s | Avg:  6m 24s | Max: 19m 24s | Hits:  99%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 28m 36s | Avg: 14m 18s | Max: 15m 01s
      🟩 12.6               Pass: 100%/37  | Total:  4h 48m | Avg:  7m 47s | Max: 21m 43s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 00s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 44m 49s | Avg:  6m 24s | Max: 19m 24s | Hits:  99%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 28m 36s | Avg: 14m 18s | Max: 15m 01s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 38m | Avg:  7m 56s | Max: 21m 43s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 00s
      🟩 nvcc               Pass: 100%/44  | Total:  5h 51m | Avg:  7m 59s | Max: 21m 43s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 52s | Avg:  5m 13s | Max:  6m 14s
      🟩 Clang10            Pass: 100%/1   | Total:  7m 21s | Avg:  7m 21s | Max:  7m 21s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 43s | Avg:  5m 43s | Max:  5m 43s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 20s | Avg:  5m 20s | Max:  5m 20s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang18            Pass: 100%/7   | Total: 44m 20s | Avg:  6m 20s | Max: 11m 48s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 00s | Avg:  4m 00s | Max:  4m 13s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 03s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 07s | Avg:  4m 42s | Max:  5m 37s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC11              Pass: 100%/1   | Total:  6m 03s | Avg:  6m 03s | Max:  6m 03s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 08m | Avg:  8m 30s | Max: 16m 34s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 24s | Avg: 19m 24s | Max: 19m 24s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 15m 47s | Avg: 15m 47s | Max: 15m 47s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 57m 28s | Avg: 19m 09s | Max: 21m 43s | Hits:  99%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 28m 36s | Avg: 14m 18s | Max: 15m 01s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 50m | Avg:  5m 49s | Max: 11m 48s
      🟩 GCC                Pass: 100%/19  | Total:  2h 02m | Avg:  6m 26s | Max: 16m 34s
      🟩 Intel              Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 32m | Avg: 18m 31s | Max: 21m 43s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 36s | Avg: 14m 18s | Max: 15m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 01m | Avg:  7m 51s | Max: 21m 43s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 40m | Avg:  7m 00s | Max: 19m 24s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 36m 52s | Avg: 12m 17s | Max: 21m 43s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 44m 20s | Avg: 14m 46s | Max: 16m 34s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 35s | Avg:  4m 31s | Max:  5m 43s
      🟩 14                 Pass: 100%/4   | Total: 34m 54s | Avg:  8m 43s | Max: 19m 24s | Hits:  99%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 39m | Avg:  8m 15s | Max: 17m 53s | Hits:  99%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 02m | Avg:  7m 55s | Max: 21m 43s | Hits:  99%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 19m | Avg: 5m 20s | Max: 16m 27s | Hits: 67%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 04m | Avg:  5m 40s | Max: 16m 27s | Hits:  67%/312   
      🟩 arm64              Pass: 100%/4   | Total: 14m 22s | Avg:  3m 35s | Max:  4m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 16m 57s | Avg:  5m 39s | Max:  9m 59s | Hits:  67%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 29s | Avg:  6m 14s | Max:  6m 16s
      🟩 12.6               Pass: 100%/21  | Total:  1h 49m | Avg:  5m 13s | Max: 16m 27s | Hits:  67%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 16m 57s | Avg:  5m 39s | Max:  9m 59s | Hits:  67%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 29s | Avg:  6m 14s | Max:  6m 16s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 49m | Avg:  5m 13s | Max: 16m 27s | Hits:  67%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 19m | Avg:  5m 20s | Max: 16m 27s | Hits:  67%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s
      🟩 Clang18            Pass: 100%/4   | Total: 27m 37s | Avg:  6m 54s | Max: 16m 16s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 40s | Avg:  3m 40s | Max:  3m 40s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟩 GCC12              Pass: 100%/2   | Total: 20m 12s | Avg: 10m 06s | Max: 16m 27s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 40s | Avg:  3m 25s | Max:  4m 00s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 59s | Avg:  9m 59s | Max:  9m 59s | Hits:  67%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 54s | Avg:  9m 54s | Max:  9m 54s | Hits:  67%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 29s | Avg:  6m 14s | Max:  6m 16s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 02m | Avg:  4m 46s | Max: 16m 16s
      🟩 GCC                Pass: 100%/9   | Total: 44m 35s | Avg:  4m 57s | Max: 16m 27s
      🟩 MSVC               Pass: 100%/2   | Total: 19m 53s | Avg:  9m 56s | Max:  9m 59s | Hits:  67%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 29s | Avg:  6m 14s | Max:  6m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 19m | Avg:  5m 20s | Max: 16m 27s | Hits:  67%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 46m | Avg:  4m 25s | Max:  9m 59s | Hits:  67%/312   
      🟩 Test               Pass: 100%/2   | Total: 32m 43s | Avg: 16m 21s | Max: 16m 27s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s
      🟩 90a                Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 23m 08s | Avg:  3m 51s | Max:  6m 13s
      🟩 20                 Pass: 100%/20  | Total:  1h 55m | Avg:  5m 47s | Max: 16m 27s | Hits:  67%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 38s | Avg: 5m 19s | Max: 8m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  8m 43s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
      🟩 Test               Pass: 100%/1   | Total:  8m 43s | Avg:  8m 43s | Max:  8m 43s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 57s | Avg: 27m 57s | Max: 27m 57s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@miscco
Copy link
Collaborator

miscco commented Dec 17, 2024

🥳

@ericniebler ericniebler merged commit 82cff38 into NVIDIA:main Dec 17, 2024
184 checks passed
shwina pushed a commit to shwina/cccl that referenced this pull request Dec 18, 2024
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 18, 2025
davebayer added a commit to davebayer/cccl that referenced this pull request Jan 20, 2025
implement `add_sat`

split `signed`/`unsigned` implementation, improve implementation for MSVC

improve device `add_sat` implementation

add `add_sat` test

improve generic `add_sat` implementation for signed types

implement `sub_sat`

allow more msvc intrinsics on x86

add op tests

partially implement `mul_sat`

implement `div_sat` and `saturate_cast`

add `saturate_cast` test

simplify `div_sat` test

Deprectate C++11 and C++14 for libcu++ (#3173)

* Deprectate C++11 and C++14 for libcu++

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Implement `abs` and `div` from `cstdlib` (#3153)

* implement integer abs functions
* improve tests, fix constexpr support
* just use the our implementation
* implement `cuda::std::div`
* prefer host's `div_t` like types
* provide `cuda::std::abs` overloads for floats
* allow fp abs for NVRTC
* silence msvc's warning about conversion from floating point to integral

Fix missing radix sort policies (#3174)

Fixes NVBug 5009941

Introduces new `DeviceReduce::Arg{Min,Max}` interface with two output iterators (#3148)

* introduces new arg{min,max} interface with two output iterators

* adds fp inf tests

* fixes docs

* improves code example

* fixes exec space specifier

* trying to fix deprecation warning for more compilers

* inlines unzip operator

* trying to fix deprecation warning for nvhpc

* integrates supression fixes in diagnostics

* pre-ctk 11.5 deprecation suppression

* fixes icc

* fix for pre-ctk11.5

* cleans up deprecation suppression

* cleanup

Extend tuning documentation (#3179)

Add codespell pre-commit hook, fix typos in CCCL (#3168)

* Add codespell pre-commit hook
* Automatic changes from codespell.
* Manual changes.

Fix parameter space for TUNE_LOAD in scan benchmark (#3176)

fix various old compiler checks (#3178)

implement C++26 `std::projected` (#3175)

Fix pre-commit config for codespell and remaining typos (#3182)

Massive cleanup of our config (#3155)

Fix UB in atomics with automatic storage (#2586)

* Adds specialized local cuda atomics and injects them into most atomics paths.

Co-authored-by: Georgy Evtushenko <evtushenko.georgy@gmail.com>
Co-authored-by: gonzalobg <65027571+gonzalobg@users.noreply.github.com>

* Allow CUDA 12.2 to keep perf, this addresses earlier comments in #478

* Remove extraneous double brackets in unformatted code.

* Merge unsafe atomic logic into `__cuda_is_local`.

* Use `const_cast` for type conversions in cuda_local.h

* Fix build issues from interface changes

* Fix missing __nanosleep on sm70-

* Guard __isLocal from NVHPC

* Use PTX instead of running nothing from NVHPC

* fixup /s/nvrtc/nvhpc

* Fixup missing CUDA ifdef surrounding device code

* Fix codegen

* Bypass some sort of compiler bug on GCC7

* Apply suggestions from code review

* Use unsafe automatic storage atomics in codegen tests

---------

Co-authored-by: Georgy Evtushenko <evtushenko.georgy@gmail.com>
Co-authored-by: gonzalobg <65027571+gonzalobg@users.noreply.github.com>
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Refactor the source code layout for `cuda.parallel` (#3177)

* Refactor the source layout for cuda.parallel

* Add copyright

* Address review feedback

* Don't import anything into `experimental` namespace

* fix import

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

new type-erased memory resources (#2824)

s/_LIBCUDACXX_DECLSPEC_EMPTY_BASES/_CCCL_DECLSPEC_EMPTY_BASES/g (#3186)

Document address stability of `thrust::transform` (#3181)

* Do not document _LIBCUDACXX_MARK_CAN_COPY_ARGUMENTS
* Reformat and fix UnaryFunction/BinaryFunction in transform docs
* Mention transform can use proclaim_copyable_arguments
* Document cuda::proclaims_copyable_arguments better
* Deprecate depending on transform functor argument addresses

Fixes: #3053

turn off cuda version check for clangd (#3194)

[STF] jacobi example based on parallel_for (#3187)

* Simple jacobi example with parallel for and reductions

* clang-format

* remove useless capture list

fixes pre-nv_diag suppression issues (#3189)

Prefer c2h::type_name over c2h::demangle (#3195)

Fix memcpy_async* tests (#3197)

* memcpy_async_tx: Fix bug in test

Two bugs, one of which occurs in practice:

1. There is a missing fence.proxy.space::global between the writes to
   global memory and the memcpy_async_tx. (Occurs in practice)

2. The end of the kernel should be fenced with `__syncthreads()`,
   because the barrier is invalidated in the destructor. If other
   threads are still waiting on it, there will be UB. (Has not yet
   manifested itself)

* cp_async_bulk_tensor: Pre-emptively fence more in test

Add type annotations and mypy checks for `cuda.parallel`  (#3180)

* Refactor the source layout for cuda.parallel

* Add initial type annotations

* Update pre-commit config

* More typing

* Fix bad merge

* Fix TYPE_CHECKING and numpy annotations

* typing bindings.py correctly

* Address review feedback

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Fix rendering of cuda.parallel docs (#3192)

* Fix pre-commit config for codespell and remaining typos

* Fix rendering of docs for cuda.parallel

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Enable PDL for DeviceMergeSortBlockSortKernel (#3199)

The kernel already contains a call to _CCCL_PDL_GRID_DEPENDENCY_SYNC.
This commit enables PDL when launching the kernel.

Adds support for large `num_items` to `DeviceReduce::{ArgMin,ArgMax}` (#2647)

* adds benchmarks for reduce::arg{min,max}

* preliminary streaming arg-extremum reduction

* fixes implicit conversion

* uses streaming dispatch class

* changes arg benches to use new streaming reduce

* streaming arg-extrema reduction

* fixes style

* fixes compilation failures

* cleanups

* adds rst style comments

* declare vars const and use clamp

* consolidates argmin argmax benchmarks

* fixes thrust usage

* drops offset type in arg-extrema benchmarks

* fixes clang cuda

* exec space macros

* switch to signed global offset type for slightly better perf

* clarifies documentation

* applies minor benchmark style changes from review comments

* fixes interface documentation and comments

* list-init accumulating output op

* improves style, comments, and tests

* cleans up aggregate init

* renames dispatch class usage in benchmarks

* fixes merge conflicts

* addresses review comments

* addresses review comments

* fixes assertion

* removes superseded implementation

* changes large problem tests to use new interface

* removes obsolete tests for deprecated interface

Fixes for Python 3.7 docs environment (#3206)

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Adds support for large number of items to `DeviceTransform` (#3172)

* moves large problem test helper to common file

* adds support for large num items to device transform

* adds tests for large number of items to device interface

* fixes format

* addresses review comments

cp_async_bulk: Fix test (#3198)

* memcpy_async_tx: Fix bug in test

Two bugs, one of which occurs in practice:

1. There is a missing fence.proxy.space::global between the writes to
   global memory and the memcpy_async_tx. (Occurs in practice)

2. The end of the kernel should be fenced with `__syncthreads()`,
   because the barrier is invalidated in the destructor. If other
   threads are still waiting on it, there will be UB. (Has not yet
   manifested itself)

* cp_async_bulk_tensor: Pre-emptively fence more in test

* cp_async_bulk: Fix test

The global memory pointer could be misaligned.

cudax fixes for msvc 14.41 (#3200)

avoid instantiating class templates in `is_same` implementation when possible (#3203)

Fix: make launchers a CUB detail; make kernel source functions hidden. (#3209)

* Fix: make launchers a CUB detail; make kernel source functions hidden.

* [pre-commit.ci] auto code formatting

* Address review comments, fix which macro gets fixed.

help the ranges concepts recognize standard contiguous iterators in c++14/17 (#3202)

unify macros and cmake options that control the suppression of deprecation warnings (#3220)

* unify macros and cmake options that control the suppression of deprecation warnings

* suppress nvcc warning #186 in thrust header tests

* suppress c++ dialect deprecation warnings in libcudacxx header tests

Fx thread-reduce performance regression (#3225)

cuda.parallel: In-memory caching of build objects (#3216)

* Define __eq__ and __hash__ for Iterators

* Define cache_with_key utility and use it to cache Reduce objects

* Add tests for caching Reduce objects

* Tighten up types

* Updates to support 3.7

* Address review feedback

* Introduce IteratorKind to hold iterator type information

* Use the .kind to generate an abi_name

* Remove __eq__ and __hash__ methods from IteratorBase

* Move helper function

* Formatting

* Don't unpack tuple in cache key

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Just enough ranges for c++14 `span` (#3211)

use generalized concepts portability macros to simplify the `range` concept (#3217)

fixes some issues in the concepts portability macros and then re-implements the `range` concept with `_CCCL_REQUIRES_EXPR`

Use Ruff to sort imports (#3230)

* Update pyproject.tomls for import sorting

* Update files after running pre-commit

* Move ruff config to pyproject.toml

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

fix tuning_scan sm90 config issue (#3236)

Co-authored-by: Shijie Chen <shijiec@nvidia.com>

[STF] Logical token (#3196)

* Split the implementation of the void interface into the definition of the interface, and its implementations on streams and graphs.

* Add missing files

* Check if a task implementation can match a prototype where the void_interface arguments are ignored

* Implement ctx.abstract_logical_data() which relies on a void data interface

* Illustrate how to use abstract handles in local contexts

* Introduce an is_void_interface() virtual method in the data interface to potentially optimize some stages

* Small improvements in the examples

* Do not try to allocate or move void data

* Do not use I as a variable

* fix linkage error

* rename abtract_logical_data into logical_token

* Document logical token

* fix spelling error

* fix sphinx error

* reflect name changes

* use meaningful variable names

* simplify logical_token implementation because writeback is already disabled

* add a unit test for token elision

* implement token elision in host_launch

* Remove unused type

* Implement helpers to check if a function can be invoked from a tuple, or from a tuple where we removed tokens

* Much simpler is_tuple_invocable_with_filtered implementation

* Fix buggy test

* Factorize code

* Document that we can ignore tokens for task and host_launch

* Documentation for logical data freeze

Fix ReduceByKey tuning (#3240)

Fix RLE tuning (#3239)

cuda.parallel: Forbid non-contiguous arrays as inputs (or outputs) (#3233)

* Forbid non-contiguous arrays as inputs (or outputs)

* Implement a more robust way to check for contiguity

* Don't bother if cublas unavailable

* Fix how we check for zero-element arrays

* sort imports

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

expands support for more offset types in segmented benchmark (#3231)

Add escape hatches to the cmake configuration of the header tests so that we can tests deprecated compilers / dialects (#3253)

* Add escape hatches to the cmake configuration of the header tests so that we can tests deprecated compilers / dialects

* Do not add option twice

ptx: Add add_instruction.py (#3190)

This file helps create the necessary structure for new PTX instructions.

Co-authored-by: Allard Hendriksen <ahendriksen@nvidia.com>

Bump main to 2.9.0. (#3247)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Drop cub::Mutex (#3251)

Fixes: #3250

Remove legacy macros from CUB util_arch.cuh (#3257)

Fixes: #3256

Remove thrust::[unary|binary]_traits (#3260)

Fixes: #3259

Architecture and OS identification macros (#3237)

Bump main to 3.0.0. (#3265)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Drop thrust not1 and not2 (#3264)

Fixes: #3263

CCCL Internal macro documentation (#3238)

Deprecate GridBarrier and GridBarrierLifetime (#3258)

Fixes: #1389

Require at least gcc7 (#3268)

Fixes: #3267

Drop thrust::[unary|binary]_function (#3274)

Fixes: #3273

Drop ICC from CI (#3277)

[STF] Corruption of the capture list of an extended lambda with a parallel_for construct on a host execution place (#3270)

* Add a test to reproduce a bug observed with parallel_for on a host place

* clang-format

* use _CCCL_ASSERT

* Attempt to debug

* do not create a tuple with a universal reference that is out of scope when we use it, use an lvalue instead

* fix lambda expression

* clang-format

Enable thrust::identity test for non-MSVC (#3281)

This seems to be an oversight when the test was added

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Enable PDL in triple chevron launch (#3282)

It seems PDL was disabled by accident when _THRUST_HAS_PDL was renamed
to _CCCL_HAS_PDL during the review introducing the feature.

Disambiguate line continuations and macro continuations in <nv/target> (#3244)

Drop VS 2017 from CI (#3287)

Fixes: #3286

Drop ICC support in code (#3279)

* Drop ICC from code

Fixes: #3278

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Make CUB NVRTC commandline arguments come from a cmake template (#3292)

Propose the same components (thrust, cub, libc++, cudax, cuda.parallel,...) in the bug report template than in the feature request template (#3295)

Use process isolation instead of default hyper-v for Windows. (#3294)

Try improving build times by using process isolation instead of hyper-v

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

[pre-commit.ci] pre-commit autoupdate (#3248)

* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/pre-commit/mirrors-clang-format: v18.1.8 → v19.1.6](https://github.com/pre-commit/mirrors-clang-format/compare/v18.1.8...v19.1.6)
- [github.com/astral-sh/ruff-pre-commit: v0.8.3 → v0.8.6](https://github.com/astral-sh/ruff-pre-commit/compare/v0.8.3...v0.8.6)
- [github.com/pre-commit/mirrors-mypy: v1.13.0 → v1.14.1](https://github.com/pre-commit/mirrors-mypy/compare/v1.13.0...v1.14.1)

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Drop Thrust legacy arch macros (#3298)

Which were disabled and could be re-enabled using THRUST_PROVIDE_LEGACY_ARCH_MACROS

Drop Thrust's compiler_fence.h (#3300)

Drop CTK 11.x from CI (#3275)

* Add cuda12.0-gcc7 devcontainer
* Move MSVC2017 jobs to CTK 12.6
Those is the only combination where rapidsai has devcontainers
* Add /Zc:__cplusplus for the libcudacxx tests
* Only add excape hatch for affected CTKs
* Workaround missing cudaLaunchKernelEx on MSVC
cudaLaunchKernelEx requires C++11, but unfortunately <cuda_runtime.h> checks this using the __cplusplus macro, which is reported wrongly for MSVC. CTK 12.3 fixed this by additionally detecting _MSV_VER. As a workaround, we provide our own copy of cudaLaunchKernelEx when it is not available from the CTK.
* Workaround nvcc+MSVC issue
* Regenerate devcontainers

Fixes: #3249

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Drop CUB's util_compiler.cuh (#3302)

All contained macros were deprecated

Update packman and repo_docs versions (#3293)

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Drop Thrust's deprecated compiler macros (#3301)

Drop CUB_RUNTIME_ENABLED and __THRUST_HAS_CUDART__ (#3305)

Adds support for large number of items to `DevicePartition::If` with the `ThreeWayPartition` overload (#2506)

* adds support for large number of items to three-way partition

* adapts interface to use choose_signed_offset_t

* integrates applicable feedback from device-select pr

* changes behavior for empty problems

* unifies grid constant macro

* fixes kernel template specialization mismatch

* integrates _CCCL_GRID_CONSTANT changes

* resolve merge conflicts

* fixes checks in test

* fixes test verification

* improves tests

* makes few improvements to streaming dispatch

* improves code comment on test

* fixes unrelated compiler error

* minor style improvements

Refactor scan tunings (#3262)

Require C++17 for compiling Thrust and CUB (#3255)

* Issue an unsuppressable warning when compiling with < C++17
* Remove C++11/14 presets
* Remove CCCL_IGNORE_DEPRECATED_CPP_DIALECT from headers
* Remove [CUB|THRUST|TCT]_IGNORE_DEPRECATED_CPP_[11|14]
* Remove CUB_ENABLE_DIALECT_CPP[11|14]
* Update CI runs
* Remove C++11/14 CI runs for CUB and Thrust
* Raise compiler minimum versions for C++17
* Update ReadMe
* Drop Thrust's cpp14_required.h
* Add escape hatch for C++17 removal

Fixes: #3252

Implement `views::empty` (#3254)

* Disable pair conversion of subrange with clang in C++17

* Fix namespace views

* Implement `views::empty`

This implements `std::ranges::views::empty`, see https://en.cppreference.com/w/cpp/ranges/empty_view

Refactor `limits` and `climits` (#3221)

* implement builtins for huge val, nan and nans

* change `INFINITY` and `NAN` implementation for NVRTC

cuda.parallel: Add documentation for the current iterators along with examples and tests (#3311)

* Add tests demonstrating usage of different iterators

* Update documentation of reduce_into by merging import code snippet with the rest of the example

* Add documentation for current iterators

* Run pre-commit checks and update accordingly

* Fix comments to refer to the proper lines in the code snippets in the docs

Drop clang<14 from CI, update devcontainers. (#3309)

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

[STF] Cleanup task dependencies object constructors (#3291)

* Define tag types for access modes

* - Rework how we build task_dep objects based on access mode tags
- pack_state is now responsible for using a const_cast for read only data

* Greatly simplify the previous attempt : do not define new types, but use integral constants based on the enums

* It seems the const_cast was not necessarily so we can simplify it and not even do some dispatch based on access modes

Disable test with a gcc-14 regression (#3297)

Deprecate Thrust's cpp_compatibility.h macros (#3299)

Remove dropped function objects from docs (#3319)

Document `NV_TARGET` macros (#3313)

[STF] Define ctx.pick_stream() which was missing for the unified context (#3326)

* Define ctx.pick_stream() which was missing for the unified context

* clang-format

Deprecate cub::IterateThreadStore (#3337)

Drop CUB's BinaryFlip operator (#3332)

Deprecate cub::Swap (#3333)

Clarify transform output can overlap input (#3323)

Drop CUB APIs with a debug_synchronous parameter (#3330)

Fixes: #3329

Drop CUB's util_compiler.cuh for real (#3340)

PR #3302 planned to drop the file, but only dropped its content. This
was an oversight. So let's drop the entire file.

Drop cub::ValueCache (#3346)

limits offset types for merge sort (#3328)

Drop CDPv1 (#3344)

Fixes: #3341

Drop thrust::void_t (#3362)

Use cuda::std::addressof in Thrust (#3363)

Fix all_of documentation for empty ranges (#3358)

all_of always returns true on an empty range.

[STF] Do not keep track of dangling events in a CUDA graph backend (#3327)

* Unlike the CUDA stream backend, nodes in a CUDA graph are necessarily done when
the CUDA graph completes. Therefore keeping track of "dangling events" is a
waste of time and resources.

* replace can_ignore_dangling_events by track_dangling_events which leads to more readable code

* When not storing the dangling events, we must still perform the deinit operations that were producing these events !

Extract scan kernels into NVRTC-compilable header (#3334)

* Extract scan kernels into NVRTC-compilable header

* Update cub/cub/device/dispatch/dispatch_scan.cuh

Co-authored-by: Georgii Evtushenko <evtushenko.georgy@gmail.com>

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>
Co-authored-by: Georgii Evtushenko <evtushenko.georgy@gmail.com>

Drop deprecated aliases in Thrust functional (#3272)

Fixes: #3271

Drop cub::DivideAndRoundUp (#3347)

Use cuda::std::min/max in Thrust (#3364)

Implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16` (#3361)

* implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16`

Cleanup util_arch (#2773)

Deprecate thrust::null_type (#3367)

Deprecate cub::DeviceSpmv (#3320)

Fixes: #896

Improves `DeviceSegmentedSort` test run time for large number of items and segments (#3246)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* fixes spelling

* adds tests for large number of segments

* fixes narrowing conversion in tests

* addresses review comments

* fixes includes

Compile basic infra test with C++17 (#3377)

Adds support for large number of items and large number of segments to `DeviceSegmentedSort` (#3308)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* addresses review comments

* introduces segment offset type

* adds tests for large number of segments

* adds support for large number of segments

* drops segment offset type

* fixes thrust namespace

* removes about-to-be-deprecated cub iterators

* no exec specifier on defaulted ctor

* fixes gcc7 linker error

* uses local_segment_index_t throughout

* determine offset type based on type returned by segment iterator begin/end iterators

* minor style improvements

Exit with error when RAPIDS CI fails. (#3385)

cuda.parallel: Support structured types as algorithm inputs (#3218)

* Introduce gpu_struct decorator and typing

* Enable `reduce` to accept arrays of structs as inputs

* Add test for reducing arrays-of-struct

* Update documentation

* Use a numpy array rather than ctypes object

* Change zeros -> empty for output array and temp storage

* Add a TODO for typing GpuStruct

* Documentation udpates

* Remove test_reduce_struct_type from test_reduce.py

* Revert to `to_cccl_value()` accepting ndarray + GpuStruct

* Bump copyrights

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Deprecate thrust::async (#3324)

Fixes: #100

Review/Deprecate CUB `util.ptx` for CCCL 2.x (#3342)

Fix broken `_CCCL_BUILTIN_ASSUME` macro (#3314)

* add compiler-specific path
* fix device code path
* add _CCC_ASSUME

Deprecate thrust::numeric_limits (#3366)

Replace `typedef` with `using` in libcu++ (#3368)

Deprecate thrust::optional (#3307)

Fixes: #3306

Upgrade to Catch2 3.8  (#3310)

Fixes: #1724

refactor `<cuda/std/cstdint>` (#3325)

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Update CODEOWNERS (#3331)

* Update CODEOWNERS

* Update CODEOWNERS

* Update CODEOWNERS

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Fix sign-compare warning (#3408)

Implement more cmath functions to be usable on host and device (#3382)

* Implement more cmath functions to be usable on host and device

* Implement math roots functions

* Implement exponential functions

Redefine and deprecate thrust::remove_cvref (#3394)

* Redefine and deprecate thrust::remove_cvref

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Fix assert definition for NVHPC due to constexpr issues (#3418)

NVHPC cannot decide at compile time where the code would run so _CCCL_ASSERT within a constexpr function breaks it.

Fix this by always using the host definition which should also work on device.

Fixes #3411

Extend CUB reduce benchmarks (#3401)

* Rename max.cu to custom.cu, since it uses a custom operator
* Extend types covered my min.cu to all fundamental types
* Add some notes on how to collect tuning parameters

Fixes: #3283

Update upload-pages-artifact to v3 (#3423)

* Update upload-pages-artifact to v3

* Empty commit

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Replace and deprecate thrust::cuda_cub::terminate (#3421)

`std::linalg` accessors and `transposed_layout` (#2962)

Add round up/down to multiple (#3234)

[FEA]: Introduce Python module with CCCL headers (#3201)

* Add cccl/python/cuda_cccl directory and use from cuda_parallel, cuda_cooperative

* Run `copy_cccl_headers_to_aude_include()` before `setup()`

* Create python/cuda_cccl/cuda/_include/__init__.py, then simply import cuda._include to find the include path.

* Add cuda.cccl._version exactly as for cuda.cooperative and cuda.parallel

* Bug fix: cuda/_include only exists after shutil.copytree() ran.

* Use `f"cuda-cccl @ file://{cccl_path}/python/cuda_cccl"` in setup.py

* Remove CustomBuildCommand, CustomWheelBuild in cuda_parallel/setup.py (they are equivalent to the default functions)

* Replace := operator (needs Python 3.8+)

* Fix oversights: remove `pip3 install ./cuda_cccl` lines from README.md

* Restore original README.md: `pip3 install -e` now works on first pass.

* cuda_cccl/README.md: FOR INTERNAL USE ONLY

* Remove `$pymajor.$pyminor.` prefix in cuda_cccl _version.py (as suggested under https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894035917)

Command used: ci/update_version.sh 2 8 0

* Modernize pyproject.toml, setup.py

Trigger for this change:

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894043178

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894044996

* Install CCCL headers under cuda.cccl.include

Trigger for this change:

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894048562

Unexpected accidental discovery: cuda.cooperative unit tests pass without CCCL headers entirely.

* Factor out cuda_cccl/cuda/cccl/include_paths.py

* Reuse cuda_cccl/cuda/cccl/include_paths.py from cuda_cooperative

* Add missing Copyright notice.

* Add missing __init__.py (cuda.cccl)

* Add `"cuda.cccl"` to `autodoc.mock_imports`

* Move cuda.cccl.include_paths into function where it is used. (Attempt to resolve Build and Verify Docs failure.)

* Add # TODO: move this to a module-level import

* Modernize cuda_cooperative/pyproject.toml, setup.py

* Convert cuda_cooperative to use hatchling as build backend.

* Revert "Convert cuda_cooperative to use hatchling as build backend."

This reverts commit 61637d608da06fcf6851ef6197f88b5e7dbc3bbe.

* Move numpy from [build-system] requires -> [project] dependencies

* Move pyproject.toml [project] dependencies -> setup.py install_requires, to be able to use CCCL_PATH

* Remove copy_license() and use license_files=["../../LICENSE"] instead.

* Further modernize cuda_cccl/setup.py to use pathlib

* Trivial simplifications in cuda_cccl/pyproject.toml

* Further simplify cuda_cccl/pyproject.toml, setup.py: remove inconsequential code

* Make cuda_cooperative/pyproject.toml more similar to cuda_cccl/pyproject.toml

* Add taplo-pre-commit to .pre-commit-config.yaml

* taplo-pre-commit auto-fixes

* Use pathlib in cuda_cooperative/setup.py

* CCCL_PYTHON_PATH in cuda_cooperative/setup.py

* Modernize cuda_parallel/pyproject.toml, setup.py

* Use pathlib in cuda_parallel/setup.py

* Add `# TOML lint & format` comment.

* Replace MANIFEST.in with `[tool.setuptools.package-data]` section in pyproject.toml

* Use pathlib in cuda/cccl/include_paths.py

* pre-commit autoupdate (EXCEPT clang-format, which was manually restored)

* Fixes after git merge main

* Resolve warning: AttributeError: '_Reduce' object has no attribute 'build_result'

```
=========================================================================== warnings summary ===========================================================================
tests/test_reduce.py::test_reduce_non_contiguous
  /home/coder/cccl/python/devenv/lib/python3.12/site-packages/_pytest/unraisableexception.py:85: PytestUnraisableExceptionWarning: Exception ignored in: <function _Reduce.__del__ at 0x7bf123139080>

  Traceback (most recent call last):
    File "/home/coder/cccl/python/cuda_parallel/cuda/parallel/experimental/algorithms/reduce.py", line 132, in __del__
      bindings.cccl_device_reduce_cleanup(ctypes.byref(self.build_result))
                                                       ^^^^^^^^^^^^^^^^^
  AttributeError: '_Reduce' object has no attribute 'build_result'

    warnings.warn(pytest.PytestUnraisableExceptionWarning(msg))

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================= 1 passed, 93 deselected, 1 warning in 0.44s ==============================================================
```

* Move `copy_cccl_headers_to_cuda_cccl_include()` functionality to `class CustomBuildPy`

* Introduce cuda_cooperative/constraints.txt

* Also add cuda_parallel/constraints.txt

* Add `--constraint constraints.txt` in ci/test_python.sh

* Update Copyright dates

* Switch to https://github.com/ComPWA/taplo-pre-commit (the other repo has been archived by the owner on Jul 1, 2024)

For completeness: The other repo took a long time to install into the pre-commit cache; so long it lead to timeouts in the CCCL CI.

* Remove unused cuda_parallel jinja2 dependency (noticed by chance).

* Remove constraints.txt files, advertise running `pip install cuda-cccl` first instead.

* Make cuda_cooperative, cuda_parallel testing completely independent.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Fix sign-compare warning (#3408) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]"

This reverts commit ea33a218ed77a075156cd1b332047202adb25aa2.

Error message: https://github.com/NVIDIA/cccl/pull/3201#issuecomment-2594012971

* Try using A100 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Also show cuda-cooperative site-packages, cuda-parallel site-packages (after pip install) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using l4 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Restore original ci/matrix.yaml [skip-rapids]

* Use for loop in test_python.sh to avoid code duplication.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]

* Comment out taplo-lint in pre-commit config [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]"

This reverts commit ec206fd8b50a6a293e00a5825b579e125010b13d.

* Implement suggestion by @shwina (https://github.com/NVIDIA/cccl/pull/3201#pullrequestreview-2556918460)

* Address feedback by @leofang

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

cuda.parallel: Add optional stream argument to reduce_into() (#3348)

* Add optional stream argument to reduce_into()

* Add tests to check for reduce_into() stream behavior

* Move protocol related utils to separate file and rework __cuda_stream__ error messages

* Fix synchronization issue in stream test and add one more invalid stream test case

* Rename cuda stream validation function after removing leading underscore

* Unpack values from __cuda_stream__ instead of indexing

* Fix linting errors

* Handle TypeError when unpacking invalid __cuda_stream__ return

* Use stream to allocate cupy memory in new stream test

Upgrade to actions/deploy-pages@v4 (from v2), as suggested by @leofang (#3434)

Deprecate `cub::{min, max}` and replace internal uses with those from libcu++ (#3419)

* Deprecate `cub::{min, max}` and replace internal uses with those from libcu++

Fixes #3404

move to c++17, finalize device optimization

fix msvc compilation, update tests

Deprectate C++11 and C++14 for libcu++ (#3173)

* Deprectate C++11 and C++14 for libcu++

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Implement `abs` and `div` from `cstdlib` (#3153)

* implement integer abs functions
* improve tests, fix constexpr support
* just use the our implementation
* implement `cuda::std::div`
* prefer host's `div_t` like types
* provide `cuda::std::abs` overloads for floats
* allow fp abs for NVRTC
* silence msvc's warning about conversion from floating point to integral

Fix missing radix sort policies (#3174)

Fixes NVBug 5009941

Introduces new `DeviceReduce::Arg{Min,Max}` interface with two output iterators (#3148)

* introduces new arg{min,max} interface with two output iterators

* adds fp inf tests

* fixes docs

* improves code example

* fixes exec space specifier

* trying to fix deprecation warning for more compilers

* inlines unzip operator

* trying to fix deprecation warning for nvhpc

* integrates supression fixes in diagnostics

* pre-ctk 11.5 deprecation suppression

* fixes icc

* fix for pre-ctk11.5

* cleans up deprecation suppression

* cleanup

Extend tuning documentation (#3179)

Add codespell pre-commit hook, fix typos in CCCL (#3168)

* Add codespell pre-commit hook
* Automatic changes from codespell.
* Manual changes.

Fix parameter space for TUNE_LOAD in scan benchmark (#3176)

fix various old compiler checks (#3178)

implement C++26 `std::projected` (#3175)

Fix pre-commit config for codespell and remaining typos (#3182)

Massive cleanup of our config (#3155)

Fix UB in atomics with automatic storage (#2586)

* Adds specialized local cuda atomics and injects them into most atomics paths.

Co-authored-by: Georgy Evtushenko <evtushenko.georgy@gmail.com>
Co-authored-by: gonzalobg <65027571+gonzalobg@users.noreply.github.com>

* Allow CUDA 12.2 to keep perf, this addresses earlier comments in #478

* Remove extraneous double brackets in unformatted code.

* Merge unsafe atomic logic into `__cuda_is_local`.

* Use `const_cast` for type conversions in cuda_local.h

* Fix build issues from interface changes

* Fix missing __nanosleep on sm70-

* Guard __isLocal from NVHPC

* Use PTX instead of running nothing from NVHPC

* fixup /s/nvrtc/nvhpc

* Fixup missing CUDA ifdef surrounding device code

* Fix codegen

* Bypass some sort of compiler bug on GCC7

* Apply suggestions from code review

* Use unsafe automatic storage atomics in codegen tests

---------

Co-authored-by: Georgy Evtushenko <evtushenko.georgy@gmail.com>
Co-authored-by: gonzalobg <65027571+gonzalobg@users.noreply.github.com>
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Refactor the source code layout for `cuda.parallel` (#3177)

* Refactor the source layout for cuda.parallel

* Add copyright

* Address review feedback

* Don't import anything into `experimental` namespace

* fix import

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

new type-erased memory resources (#2824)

s/_LIBCUDACXX_DECLSPEC_EMPTY_BASES/_CCCL_DECLSPEC_EMPTY_BASES/g (#3186)

Document address stability of `thrust::transform` (#3181)

* Do not document _LIBCUDACXX_MARK_CAN_COPY_ARGUMENTS
* Reformat and fix UnaryFunction/BinaryFunction in transform docs
* Mention transform can use proclaim_copyable_arguments
* Document cuda::proclaims_copyable_arguments better
* Deprecate depending on transform functor argument addresses

Fixes: #3053

turn off cuda version check for clangd (#3194)

[STF] jacobi example based on parallel_for (#3187)

* Simple jacobi example with parallel for and reductions

* clang-format

* remove useless capture list

fixes pre-nv_diag suppression issues (#3189)

Prefer c2h::type_name over c2h::demangle (#3195)

Fix memcpy_async* tests (#3197)

* memcpy_async_tx: Fix bug in test

Two bugs, one of which occurs in practice:

1. There is a missing fence.proxy.space::global between the writes to
   global memory and the memcpy_async_tx. (Occurs in practice)

2. The end of the kernel should be fenced with `__syncthreads()`,
   because the barrier is invalidated in the destructor. If other
   threads are still waiting on it, there will be UB. (Has not yet
   manifested itself)

* cp_async_bulk_tensor: Pre-emptively fence more in test

Add type annotations and mypy checks for `cuda.parallel`  (#3180)

* Refactor the source layout for cuda.parallel

* Add initial type annotations

* Update pre-commit config

* More typing

* Fix bad merge

* Fix TYPE_CHECKING and numpy annotations

* typing bindings.py correctly

* Address review feedback

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Fix rendering of cuda.parallel docs (#3192)

* Fix pre-commit config for codespell and remaining typos

* Fix rendering of docs for cuda.parallel

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Enable PDL for DeviceMergeSortBlockSortKernel (#3199)

The kernel already contains a call to _CCCL_PDL_GRID_DEPENDENCY_SYNC.
This commit enables PDL when launching the kernel.

Adds support for large `num_items` to `DeviceReduce::{ArgMin,ArgMax}` (#2647)

* adds benchmarks for reduce::arg{min,max}

* preliminary streaming arg-extremum reduction

* fixes implicit conversion

* uses streaming dispatch class

* changes arg benches to use new streaming reduce

* streaming arg-extrema reduction

* fixes style

* fixes compilation failures

* cleanups

* adds rst style comments

* declare vars const and use clamp

* consolidates argmin argmax benchmarks

* fixes thrust usage

* drops offset type in arg-extrema benchmarks

* fixes clang cuda

* exec space macros

* switch to signed global offset type for slightly better perf

* clarifies documentation

* applies minor benchmark style changes from review comments

* fixes interface documentation and comments

* list-init accumulating output op

* improves style, comments, and tests

* cleans up aggregate init

* renames dispatch class usage in benchmarks

* fixes merge conflicts

* addresses review comments

* addresses review comments

* fixes assertion

* removes superseded implementation

* changes large problem tests to use new interface

* removes obsolete tests for deprecated interface

Fixes for Python 3.7 docs environment (#3206)

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Adds support for large number of items to `DeviceTransform` (#3172)

* moves large problem test helper to common file

* adds support for large num items to device transform

* adds tests for large number of items to device interface

* fixes format

* addresses review comments

cp_async_bulk: Fix test (#3198)

* memcpy_async_tx: Fix bug in test

Two bugs, one of which occurs in practice:

1. There is a missing fence.proxy.space::global between the writes to
   global memory and the memcpy_async_tx. (Occurs in practice)

2. The end of the kernel should be fenced with `__syncthreads()`,
   because the barrier is invalidated in the destructor. If other
   threads are still waiting on it, there will be UB. (Has not yet
   manifested itself)

* cp_async_bulk_tensor: Pre-emptively fence more in test

* cp_async_bulk: Fix test

The global memory pointer could be misaligned.

cudax fixes for msvc 14.41 (#3200)

avoid instantiating class templates in `is_same` implementation when possible (#3203)

Fix: make launchers a CUB detail; make kernel source functions hidden. (#3209)

* Fix: make launchers a CUB detail; make kernel source functions hidden.

* [pre-commit.ci] auto code formatting

* Address review comments, fix which macro gets fixed.

help the ranges concepts recognize standard contiguous iterators in c++14/17 (#3202)

unify macros and cmake options that control the suppression of deprecation warnings (#3220)

* unify macros and cmake options that control the suppression of deprecation warnings

* suppress nvcc warning #186 in thrust header tests

* suppress c++ dialect deprecation warnings in libcudacxx header tests

Fx thread-reduce performance regression (#3225)

cuda.parallel: In-memory caching of build objects (#3216)

* Define __eq__ and __hash__ for Iterators

* Define cache_with_key utility and use it to cache Reduce objects

* Add tests for caching Reduce objects

* Tighten up types

* Updates to support 3.7

* Address review feedback

* Introduce IteratorKind to hold iterator type information

* Use the .kind to generate an abi_name

* Remove __eq__ and __hash__ methods from IteratorBase

* Move helper function

* Formatting

* Don't unpack tuple in cache key

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Just enough ranges for c++14 `span` (#3211)

use generalized concepts portability macros to simplify the `range` concept (#3217)

fixes some issues in the concepts portability macros and then re-implements the `range` concept with `_CCCL_REQUIRES_EXPR`

Use Ruff to sort imports (#3230)

* Update pyproject.tomls for import sorting

* Update files after running pre-commit

* Move ruff config to pyproject.toml

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

fix tuning_scan sm90 config issue (#3236)

Co-authored-by: Shijie Chen <shijiec@nvidia.com>

[STF] Logical token (#3196)

* Split the implementation of the void interface into the definition of the interface, and its implementations on streams and graphs.

* Add missing files

* Check if a task implementation can match a prototype where the void_interface arguments are ignored

* Implement ctx.abstract_logical_data() which relies on a void data interface

* Illustrate how to use abstract handles in local contexts

* Introduce an is_void_interface() virtual method in the data interface to potentially optimize some stages

* Small improvements in the examples

* Do not try to allocate or move void data

* Do not use I as a variable

* fix linkage error

* rename abtract_logical_data into logical_token

* Document logical token

* fix spelling error

* fix sphinx error

* reflect name changes

* use meaningful variable names

* simplify logical_token implementation because writeback is already disabled

* add a unit test for token elision

* implement token elision in host_launch

* Remove unused type

* Implement helpers to check if a function can be invoked from a tuple, or from a tuple where we removed tokens

* Much simpler is_tuple_invocable_with_filtered implementation

* Fix buggy test

* Factorize code

* Document that we can ignore tokens for task and host_launch

* Documentation for logical data freeze

Fix ReduceByKey tuning (#3240)

Fix RLE tuning (#3239)

cuda.parallel: Forbid non-contiguous arrays as inputs (or outputs) (#3233)

* Forbid non-contiguous arrays as inputs (or outputs)

* Implement a more robust way to check for contiguity

* Don't bother if cublas unavailable

* Fix how we check for zero-element arrays

* sort imports

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

expands support for more offset types in segmented benchmark (#3231)

Add escape hatches to the cmake configuration of the header tests so that we can tests deprecated compilers / dialects (#3253)

* Add escape hatches to the cmake configuration of the header tests so that we can tests deprecated compilers / dialects

* Do not add option twice

ptx: Add add_instruction.py (#3190)

This file helps create the necessary structure for new PTX instructions.

Co-authored-by: Allard Hendriksen <ahendriksen@nvidia.com>

Bump main to 2.9.0. (#3247)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Drop cub::Mutex (#3251)

Fixes: #3250

Remove legacy macros from CUB util_arch.cuh (#3257)

Fixes: #3256

Remove thrust::[unary|binary]_traits (#3260)

Fixes: #3259

Architecture and OS identification macros (#3237)

Bump main to 3.0.0. (#3265)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Drop thrust not1 and not2 (#3264)

Fixes: #3263

CCCL Internal macro documentation (#3238)

Deprecate GridBarrier and GridBarrierLifetime (#3258)

Fixes: #1389

Require at least gcc7 (#3268)

Fixes: #3267

Drop thrust::[unary|binary]_function (#3274)

Fixes: #3273

Drop ICC from CI (#3277)

[STF] Corruption of the capture list of an extended lambda with a parallel_for construct on a host execution place (#3270)

* Add a test to reproduce a bug observed with parallel_for on a host place

* clang-format

* use _CCCL_ASSERT

* Attempt to debug

* do not create a tuple with a universal reference that is out of scope when we use it, use an lvalue instead

* fix lambda expression

* clang-format

Enable thrust::identity test for non-MSVC (#3281)

This seems to be an oversight when the test was added

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Enable PDL in triple chevron launch (#3282)

It seems PDL was disabled by accident when _THRUST_HAS_PDL was renamed
to _CCCL_HAS_PDL during the review introducing the feature.

Disambiguate line continuations and macro continuations in <nv/target> (#3244)

Drop VS 2017 from CI (#3287)

Fixes: #3286

Drop ICC support in code (#3279)

* Drop ICC from code

Fixes: #3278

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Make CUB NVRTC commandline arguments come from a cmake template (#3292)

Propose the same components (thrust, cub, libc++, cudax, cuda.parallel,...) in the bug report template than in the feature request template (#3295)

Use process isolation instead of default hyper-v for Windows. (#3294)

Try improving build times by using process isolation instead of hyper-v

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

[pre-commit.ci] pre-commit autoupdate (#3248)

* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/pre-commit/mirrors-clang-format: v18.1.8 → v19.1.6](https://github.com/pre-commit/mirrors-clang-format/compare/v18.1.8...v19.1.6)
- [github.com/astral-sh/ruff-pre-commit: v0.8.3 → v0.8.6](https://github.com/astral-sh/ruff-pre-commit/compare/v0.8.3...v0.8.6)
- [github.com/pre-commit/mirrors-mypy: v1.13.0 → v1.14.1](https://github.com/pre-commit/mirrors-mypy/compare/v1.13.0...v1.14.1)

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Drop Thrust legacy arch macros (#3298)

Which were disabled and could be re-enabled using THRUST_PROVIDE_LEGACY_ARCH_MACROS

Drop Thrust's compiler_fence.h (#3300)

Drop CTK 11.x from CI (#3275)

* Add cuda12.0-gcc7 devcontainer
* Move MSVC2017 jobs to CTK 12.6
Those is the only combination where rapidsai has devcontainers
* Add /Zc:__cplusplus for the libcudacxx tests
* Only add excape hatch for affected CTKs
* Workaround missing cudaLaunchKernelEx on MSVC
cudaLaunchKernelEx requires C++11, but unfortunately <cuda_runtime.h> checks this using the __cplusplus macro, which is reported wrongly for MSVC. CTK 12.3 fixed this by additionally detecting _MSV_VER. As a workaround, we provide our own copy of cudaLaunchKernelEx when it is not available from the CTK.
* Workaround nvcc+MSVC issue
* Regenerate devcontainers

Fixes: #3249

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Update packman and repo_docs versions (#3293)

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Drop Thrust's deprecated compiler macros (#3301)

Drop CUB_RUNTIME_ENABLED and __THRUST_HAS_CUDART__ (#3305)

Adds support for large number of items to `DevicePartition::If` with the `ThreeWayPartition` overload (#2506)

* adds support for large number of items to three-way partition

* adapts interface to use choose_signed_offset_t

* integrates applicable feedback from device-select pr

* changes behavior for empty problems

* unifies grid constant macro

* fixes kernel template specialization mismatch

* integrates _CCCL_GRID_CONSTANT changes

* resolve merge conflicts

* fixes checks in test

* fixes test verification

* improves tests

* makes few improvements to streaming dispatch

* improves code comment on test

* fixes unrelated compiler error

* minor style improvements

Refactor scan tunings (#3262)

Require C++17 for compiling Thrust and CUB (#3255)

* Issue an unsuppressable warning when compiling with < C++17
* Remove C++11/14 presets
* Remove CCCL_IGNORE_DEPRECATED_CPP_DIALECT from headers
* Remove [CUB|THRUST|TCT]_IGNORE_DEPRECATED_CPP_[11|14]
* Remove CUB_ENABLE_DIALECT_CPP[11|14]
* Update CI runs
* Remove C++11/14 CI runs for CUB and Thrust
* Raise compiler minimum versions for C++17
* Update ReadMe
* Drop Thrust's cpp14_required.h
* Add escape hatch for C++17 removal

Fixes: #3252

Implement `views::empty` (#3254)

* Disable pair conversion of subrange with clang in C++17

* Fix namespace views

* Implement `views::empty`

This implements `std::ranges::views::empty`, see https://en.cppreference.com/w/cpp/ranges/empty_view

Refactor `limits` and `climits` (#3221)

* implement builtins for huge val, nan and nans

* change `INFINITY` and `NAN` implementation for NVRTC

cuda.parallel: Add documentation for the current iterators along with examples and tests (#3311)

* Add tests demonstrating usage of different iterators

* Update documentation of reduce_into by merging import code snippet with the rest of the example

* Add documentation for current iterators

* Run pre-commit checks and update accordingly

* Fix comments to refer to the proper lines in the code snippets in the docs

Drop clang<14 from CI, update devcontainers. (#3309)

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

[STF] Cleanup task dependencies object constructors (#3291)

* Define tag types for access modes

* - Rework how we build task_dep objects based on access mode tags
- pack_state is now responsible for using a const_cast for read only data

* Greatly simplify the previous attempt : do not define new types, but use integral constants based on the enums

* It seems the const_cast was not necessarily so we can simplify it and not even do some dispatch based on access modes

Disable test with a gcc-14 regression (#3297)

Deprecate Thrust's cpp_compatibility.h macros (#3299)

Remove dropped function objects from docs (#3319)

Document `NV_TARGET` macros (#3313)

[STF] Define ctx.pick_stream() which was missing for the unified context (#3326)

* Define ctx.pick_stream() which was missing for the unified context

* clang-format

Deprecate cub::IterateThreadStore (#3337)

Drop CUB's BinaryFlip operator (#3332)

Deprecate cub::Swap (#3333)

Clarify transform output can overlap input (#3323)

Drop CUB APIs with a debug_synchronous parameter (#3330)

Fixes: #3329

Drop CUB's util_compiler.cuh for real (#3340)

PR #3302 planned to drop the file, but only dropped its content. This
was an oversight. So let's drop the entire file.

Drop cub::ValueCache (#3346)

limits offset types for merge sort (#3328)

Drop CDPv1 (#3344)

Fixes: #3341

Drop thrust::void_t (#3362)

Use cuda::std::addressof in Thrust (#3363)

Fix all_of documentation for empty ranges (#3358)

all_of always returns true on an empty range.

[STF] Do not keep track of dangling events in a CUDA graph backend (#3327)

* Unlike the CUDA stream backend, nodes in a CUDA graph are necessarily done when
the CUDA graph completes. Therefore keeping track of "dangling events" is a
waste of time and resources.

* replace can_ignore_dangling_events by track_dangling_events which leads to more readable code

* When not storing the dangling events, we must still perform the deinit operations that were producing these events !

Extract scan kernels into NVRTC-compilable header (#3334)

* Extract scan kernels into NVRTC-compilable header

* Update cub/cub/device/dispatch/dispatch_scan.cuh

Co-authored-by: Georgii Evtushenko <evtushenko.georgy@gmail.com>

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>
Co-authored-by: Georgii Evtushenko <evtushenko.georgy@gmail.com>

Drop deprecated aliases in Thrust functional (#3272)

Fixes: #3271

Drop cub::DivideAndRoundUp (#3347)

Use cuda::std::min/max in Thrust (#3364)

Implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16` (#3361)

* implement `cuda::std::numeric_limits` for `__half` and `__nv_bfloat16`

Cleanup util_arch (#2773)

Deprecate thrust::null_type (#3367)

Deprecate cub::DeviceSpmv (#3320)

Fixes: #896

Improves `DeviceSegmentedSort` test run time for large number of items and segments (#3246)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* fixes spelling

* adds tests for large number of segments

* fixes narrowing conversion in tests

* addresses review comments

* fixes includes

Compile basic infra test with C++17 (#3377)

Adds support for large number of items and large number of segments to `DeviceSegmentedSort` (#3308)

* fixes segment offset generation

* switches to analytical verification

* switches to analytical verification for pairs

* addresses review comments

* introduces segment offset type

* adds tests for large number of segments

* adds support for large number of segments

* drops segment offset type

* fixes thrust namespace

* removes about-to-be-deprecated cub iterators

* no exec specifier on defaulted ctor

* fixes gcc7 linker error

* uses local_segment_index_t throughout

* determine offset type based on type returned by segment iterator begin/end iterators

* minor style improvements

Exit with error when RAPIDS CI fails. (#3385)

cuda.parallel: Support structured types as algorithm inputs (#3218)

* Introduce gpu_struct decorator and typing

* Enable `reduce` to accept arrays of structs as inputs

* Add test for reducing arrays-of-struct

* Update documentation

* Use a numpy array rather than ctypes object

* Change zeros -> empty for output array and temp storage

* Add a TODO for typing GpuStruct

* Documentation udpates

* Remove test_reduce_struct_type from test_reduce.py

* Revert to `to_cccl_value()` accepting ndarray + GpuStruct

* Bump copyrights

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Deprecate thrust::async (#3324)

Fixes: #100

Review/Deprecate CUB `util.ptx` for CCCL 2.x (#3342)

Fix broken `_CCCL_BUILTIN_ASSUME` macro (#3314)

* add compiler-specific path
* fix device code path
* add _CCC_ASSUME

Deprecate thrust::numeric_limits (#3366)

Replace `typedef` with `using` in libcu++ (#3368)

Deprecate thrust::optional (#3307)

Fixes: #3306

Upgrade to Catch2 3.8  (#3310)

Fixes: #1724

refactor `<cuda/std/cstdint>` (#3325)

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

Update CODEOWNERS (#3331)

* Update CODEOWNERS

* Update CODEOWNERS

* Update CODEOWNERS

* [pre-commit.ci] auto code formatting

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Fix sign-compare warning (#3408)

Implement more cmath functions to be usable on host and device (#3382)

* Implement more cmath functions to be usable on host and device

* Implement math roots functions

* Implement exponential functions

Redefine and deprecate thrust::remove_cvref (#3394)

* Redefine and deprecate thrust::remove_cvref

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>

Fix assert definition for NVHPC due to constexpr issues (#3418)

NVHPC cannot decide at compile time where the code would run so _CCCL_ASSERT within a constexpr function breaks it.

Fix this by always using the host definition which should also work on device.

Fixes #3411

Extend CUB reduce benchmarks (#3401)

* Rename max.cu to custom.cu, since it uses a custom operator
* Extend types covered my min.cu to all fundamental types
* Add some notes on how to collect tuning parameters

Fixes: #3283

Update upload-pages-artifact to v3 (#3423)

* Update upload-pages-artifact to v3

* Empty commit

---------

Co-authored-by: Ashwin Srinath <shwina@users.noreply.github.com>

Replace and deprecate thrust::cuda_cub::terminate (#3421)

`std::linalg` accessors and `transposed_layout` (#2962)

Add round up/down to multiple (#3234)

[FEA]: Introduce Python module with CCCL headers (#3201)

* Add cccl/python/cuda_cccl directory and use from cuda_parallel, cuda_cooperative

* Run `copy_cccl_headers_to_aude_include()` before `setup()`

* Create python/cuda_cccl/cuda/_include/__init__.py, then simply import cuda._include to find the include path.

* Add cuda.cccl._version exactly as for cuda.cooperative and cuda.parallel

* Bug fix: cuda/_include only exists after shutil.copytree() ran.

* Use `f"cuda-cccl @ file://{cccl_path}/python/cuda_cccl"` in setup.py

* Remove CustomBuildCommand, CustomWheelBuild in cuda_parallel/setup.py (they are equivalent to the default functions)

* Replace := operator (needs Python 3.8+)

* Fix oversights: remove `pip3 install ./cuda_cccl` lines from README.md

* Restore original README.md: `pip3 install -e` now works on first pass.

* cuda_cccl/README.md: FOR INTERNAL USE ONLY

* Remove `$pymajor.$pyminor.` prefix in cuda_cccl _version.py (as suggested under https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894035917)

Command used: ci/update_version.sh 2 8 0

* Modernize pyproject.toml, setup.py

Trigger for this change:

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894043178

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894044996

* Install CCCL headers under cuda.cccl.include

Trigger for this change:

* https://github.com/NVIDIA/cccl/pull/3201#discussion_r1894048562

Unexpected accidental discovery: cuda.cooperative unit tests pass without CCCL headers entirely.

* Factor out cuda_cccl/cuda/cccl/include_paths.py

* Reuse cuda_cccl/cuda/cccl/include_paths.py from cuda_cooperative

* Add missing Copyright notice.

* Add missing __init__.py (cuda.cccl)

* Add `"cuda.cccl"` to `autodoc.mock_imports`

* Move cuda.cccl.include_paths into function where it is used. (Attempt to resolve Build and Verify Docs failure.)

* Add # TODO: move this to a module-level import

* Modernize cuda_cooperative/pyproject.toml, setup.py

* Convert cuda_cooperative to use hatchling as build backend.

* Revert "Convert cuda_cooperative to use hatchling as build backend."

This reverts commit 61637d608da06fcf6851ef6197f88b5e7dbc3bbe.

* Move numpy from [build-system] requires -> [project] dependencies

* Move pyproject.toml [project] dependencies -> setup.py install_requires, to be able to use CCCL_PATH

* Remove copy_license() and use license_files=["../../LICENSE"] instead.

* Further modernize cuda_cccl/setup.py to use pathlib

* Trivial simplifications in cuda_cccl/pyproject.toml

* Further simplify cuda_cccl/pyproject.toml, setup.py: remove inconsequential code

* Make cuda_cooperative/pyproject.toml more similar to cuda_cccl/pyproject.toml

* Add taplo-pre-commit to .pre-commit-config.yaml

* taplo-pre-commit auto-fixes

* Use pathlib in cuda_cooperative/setup.py

* CCCL_PYTHON_PATH in cuda_cooperative/setup.py

* Modernize cuda_parallel/pyproject.toml, setup.py

* Use pathlib in cuda_parallel/setup.py

* Add `# TOML lint & format` comment.

* Replace MANIFEST.in with `[tool.setuptools.package-data]` section in pyproject.toml

* Use pathlib in cuda/cccl/include_paths.py

* pre-commit autoupdate (EXCEPT clang-format, which was manually restored)

* Fixes after git merge main

* Resolve warning: AttributeError: '_Reduce' object has no attribute 'build_result'

```
=========================================================================== warnings summary ===========================================================================
tests/test_reduce.py::test_reduce_non_contiguous
  /home/coder/cccl/python/devenv/lib/python3.12/site-packages/_pytest/unraisableexception.py:85: PytestUnraisableExceptionWarning: Exception ignored in: <function _Reduce.__del__ at 0x7bf123139080>

  Traceback (most recent call last):
    File "/home/coder/cccl/python/cuda_parallel/cuda/parallel/experimental/algorithms/reduce.py", line 132, in __del__
      bindings.cccl_device_reduce_cleanup(ctypes.byref(self.build_result))
                                                       ^^^^^^^^^^^^^^^^^
  AttributeError: '_Reduce' object has no attribute 'build_result'

    warnings.warn(pytest.PytestUnraisableExceptionWarning(msg))

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================= 1 passed, 93 deselected, 1 warning in 0.44s ==============================================================
```

* Move `copy_cccl_headers_to_cuda_cccl_include()` functionality to `class CustomBuildPy`

* Introduce cuda_cooperative/constraints.txt

* Also add cuda_parallel/constraints.txt

* Add `--constraint constraints.txt` in ci/test_python.sh

* Update Copyright dates

* Switch to https://github.com/ComPWA/taplo-pre-commit (the other repo has been archived by the owner on Jul 1, 2024)

For completeness: The other repo took a long time to install into the pre-commit cache; so long it lead to timeouts in the CCCL CI.

* Remove unused cuda_parallel jinja2 dependency (noticed by chance).

* Remove constraints.txt files, advertise running `pip install cuda-cccl` first instead.

* Make cuda_cooperative, cuda_parallel testing completely independent.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Fix sign-compare warning (#3408) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Try using another runner (because V100 runners seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]"

This reverts commit ea33a218ed77a075156cd1b332047202adb25aa2.

Error message: https://github.com/NVIDIA/cccl/pull/3201#issuecomment-2594012971

* Try using A100 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Also show cuda-cooperative site-packages, cuda-parallel site-packages (after pip install) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Try using l4 runner (because V100 runners still seem to be stuck) [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Restore original ci/matrix.yaml [skip-rapids]

* Use for loop in test_python.sh to avoid code duplication.

* Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]

* Comment out taplo-lint in pre-commit config [skip-rapids][skip-matx][skip-docs][skip-vdc]

* Revert "Run only test_python.sh [skip-rapids][skip-matx][skip-docs][skip-vdc][skip pre-commit.ci]"

This reverts commit ec206fd8b50a6a293e00a5825b579e125010b13d.

* Implement suggestion by @shwina (https://github.com/NVIDIA/cccl/pull/3201#pullrequestreview-2556918460)

* Address feedback by @leofang

---------

Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>

cuda.parallel: Add optional stream argument to reduce_into() (#3348)

* Add optional stream argument to reduce_into()

* Add tests to check for reduce_into() stream behavior

* Move protocol related utils to separate file and rework __cuda_stream__ error messages

* Fix synchronization issue in stream test and add one more invalid stream test case

* Rename cuda stream validation function after removing leading underscore

* Unpack values from __cuda_stream__ instead of indexing

* Fix linting errors

* Handle TypeError when unpacking invalid __cuda_stream__ return

* Use stream to allocate cupy memory in new stream test

Upgrade to actions/deploy-pages@v4 (from v2), as suggested by @leofang (#3434)

Deprecate `cub::{min, max}` and replace internal uses with those from libcu++ (#3419)

* Deprecate `cub::{min, max}` and replace internal uses with those from libcu++

Fixes #3404

Fix CI issues (#3443)

update docs

fix review

restrict allowed types

replace constexpr implementations with generic

optimize `__is_arithmetic_integral`
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

3 participants