Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add b200 tunings for scan.exclusive.by_key #3560

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber changed the title scan.exclusive.by_key sm100 tuning Add b200 tunings for scan.exclusive.by_key Jan 28, 2025
typename Tuning::delay_constructor>;

template <typename Tuning>
// FIXME(bgruber): should we rather use `AccumT` instead of `ValueT` like the other default policies?
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: This comment seems unrelated here.

Suggested change
// FIXME(bgruber): should we rather use `AccumT` instead of `ValueT` like the other default policies?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I left this in for @gevtushenko to answer. @gevtushenko any action required here? Otherwise, I will just drop the comment.

Copy link
Contributor

🟨 CI finished in 4h 15m: Pass: 97%/90 | Total: 2d 15h | Avg: 42m 10s | Max: 1h 15m | Hits: 273%/10928
  • 🟨 cub: Pass: 97%/44 | Total: 1d 14h | Avg: 52m 31s | Max: 1h 15m | Hits: 369%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total:  1d 12h | Avg: 52m 10s | Max:  1h 15m | Hits: 369%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 49s | Max:  1h 02m
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 59m | Avg: 59m 57s | Max:  1h 06m | Hits: 369%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m
      🔍 12.6               Pass:  97%/37  | Total:  1d 07h | Avg: 50m 49s | Max:  1h 15m | Hits: 369%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 04m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 59m | Avg: 59m 57s | Max:  1h 06m | Hits: 369%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  1d 05h | Avg: 50m 08s | Max:  1h 15m | Hits: 369%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 04m
      🔍 nvcc               Pass:  97%/42  | Total:  1d 12h | Avg: 52m 01s | Max:  1h 15m | Hits: 369%/3552  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  3h 46m | Avg: 56m 44s | Max: 59m 38s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 48m | Avg: 54m 10s | Max: 54m 42s
      🟩 Clang16            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟩 Clang17            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 59m 28s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 48m | Avg: 49m 43s | Max:  1h 04m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max: 59m 56s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 59s | Avg: 58m 59s | Max: 58m 59s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max: 59m 27s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 43s | Max: 57m 09s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 32s | Max:  1h 00m
      🟩 GCC12              Pass: 100%/4   | Total:  2h 39m | Avg: 39m 55s | Max: 59m 30s
      🔍 GCC13              Pass:  87%/8   | Total:  4h 50m | Avg: 36m 15s | Max:  1h 02m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m | Hits: 369%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 15m | Hits: 369%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/17  | Total: 15h 17m | Avg: 53m 58s | Max:  1h 04m
      🔍 GCC                Pass:  95%/21  | Total: 16h 14m | Avg: 46m 24s | Max:  1h 02m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 48m | Avg:  1h 12m | Max:  1h 15m | Hits: 369%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 09m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 42m 09s | Avg: 21m 04s | Max: 22m 43s
      🔍 v100               Pass:  97%/42  | Total:  1d 13h | Avg: 54m 01s | Max:  1h 15m | Hits: 369%/3552  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/37  | Total:  1d 11h | Avg: 58m 01s | Max:  1h 15m | Hits: 369%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 11s | Avg: 22m 11s | Max: 22m 11s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 01s | Avg: 19m 01s | Max: 19m 01s
      🔍 HostLaunch         Pass:  66%/3   | Total: 53m 44s | Avg: 17m 54s | Max: 27m 12s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 28s | Max: 39m 58s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 20h 02m | Avg:  1h 00m | Max:  1h 13m | Hits: 369%/2664  
      🔍 20                 Pass:  95%/24  | Total: 18h 28m | Avg: 46m 12s | Max:  1h 15m | Hits: 368%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 09s | Avg: 21m 04s | Max: 22m 43s
      🟩 90a                Pass: 100%/1   | Total: 23m 55s | Avg: 23m 55s | Max: 23m 55s
    
  • 🟨 thrust: Pass: 97%/43 | Total: 23h 46m | Avg: 33m 10s | Max: 1h 03m | Hits: 227%/7376

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/41  | Total: 22h 46m | Avg: 33m 19s | Max:  1h 03m | Hits: 227%/7376  
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 12s | Max: 33m 02s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 08m | Avg: 37m 37s | Max: 54m 43s | Hits: 227%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  1h 48m | Avg: 54m 17s | Max: 57m 25s
      🔍 12.6               Pass:  97%/36  | Total: 18h 50m | Avg: 31m 23s | Max:  1h 03m | Hits: 227%/5532  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 57m 56s | Avg: 28m 58s | Max: 29m 20s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 08m | Avg: 37m 37s | Max: 54m 43s | Hits: 227%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 48m | Avg: 54m 17s | Max: 57m 25s
      🔍 nvcc12.6           Pass:  97%/34  | Total: 17h 52m | Avg: 31m 31s | Max:  1h 03m | Hits: 227%/5532  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 56s | Avg: 28m 58s | Max: 29m 20s
      🔍 nvcc               Pass:  97%/41  | Total: 22h 48m | Avg: 33m 23s | Max:  1h 03m | Hits: 227%/7376  
    🔍 cxx: MSVC14.39 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 05m | Avg: 31m 24s | Max: 32m 30s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 52s | Max: 31m 59s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 23s | Max: 31m 59s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 21s | Max: 34m 28s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 49m | Avg: 24m 09s | Max: 32m 13s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 32s | Max: 34m 14s
      🟩 GCC8               Pass: 100%/1   | Total: 30m 49s | Avg: 30m 49s | Max: 30m 49s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 50s | Max: 35m 50s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 01s | Max: 34m 48s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 32m 20s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 09m | Avg: 34m 44s | Max: 35m 40s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 21m | Avg: 25m 10s | Max: 34m 58s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 46s | Max: 54m 50s | Hits: 227%/3688  
      🔍 MSVC14.39          Pass:  66%/3   | Total:  2h 35m | Avg: 51m 42s | Max:  1h 03m | Hits: 227%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 17s | Max: 57m 25s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  8h 03m | Avg: 28m 28s | Max: 34m 28s
      🟩 GCC                Pass: 100%/19  | Total:  9h 29m | Avg: 29m 58s | Max: 35m 50s
      🔍 MSVC               Pass:  80%/5   | Total:  4h 24m | Avg: 52m 56s | Max:  1h 03m | Hits: 227%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 48m | Avg: 54m 17s | Max: 57m 25s
    🔍 jobs: TestCPU 🔍
      🟩 Build              Pass: 100%/37  | Total: 22h 00m | Avg: 35m 40s | Max:  1h 03m | Hits: 227%/7376  
      🔍 TestCPU            Pass:  66%/3   | Total: 47m 37s | Avg: 15m 52s | Max: 32m 00s
      🟩 TestGPU            Pass: 100%/3   | Total: 58m 50s | Avg: 19m 36s | Max: 33m 10s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 12h 15m | Avg: 36m 46s | Max: 59m 52s | Hits: 227%/5532  
      🔍 20                 Pass:  95%/21  | Total: 10h 30m | Avg: 30m 02s | Max:  1h 03m | Hits: 227%/1844  
    🟨 gpu
      🟨 v100               Pass:  97%/43  | Total: 23h 46m | Avg: 33m 10s | Max:  1h 03m | Hits: 227%/7376  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total:  1h 00m | Avg: 30m 05s | Max: 33m 10s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 18m 25s | Avg: 18m 25s | Max: 18m 25s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 37s | Avg: 5m 18s | Max: 8m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  8m 35s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  8m 35s | Avg:  8m 35s | Max:  8m 35s
    
  • 🟩 python: Pass: 100%/1 | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 48m 11s | Avg: 48m 11s | Max: 48m 11s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟩 CI finished in 2h 53m: Pass: 100%/89 | Total: 15h 51m | Avg: 10m 41s | Max: 1h 02m | Hits: 422%/10928
  • 🟩 cub: Pass: 100%/44 | Total: 8h 12m | Avg: 11m 12s | Max: 46m 17s | Hits: 540%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  8h 03m | Avg: 11m 30s | Max: 46m 17s | Hits: 540%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  4m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 45m 48s | Avg:  9m 09s | Max: 24m 10s | Hits: 540%/888   
      🟩 12.5               Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 11s
      🟩 12.6               Pass: 100%/37  | Total:  7h 07m | Avg: 11m 33s | Max: 46m 17s | Hits: 540%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  4m 32s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 45m 48s | Avg:  9m 09s | Max: 24m 10s | Hits: 540%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 11s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 58m | Avg: 11m 57s | Max: 46m 17s | Hits: 540%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  4m 32s
      🟩 nvcc               Pass: 100%/42  | Total:  8h 04m | Avg: 11m 31s | Max: 46m 17s | Hits: 540%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 37s | Avg:  5m 24s | Max:  5m 39s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  5m 59s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 47s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 51s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 33m | Avg: 13m 17s | Max: 46m 17s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 47s | Avg:  5m 23s | Max:  5m 35s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  6m 04s
      🟩 GCC10              Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 26s | Avg:  5m 43s | Max:  6m 00s
      🟩 GCC12              Pass: 100%/4   | Total: 36m 24s | Avg:  9m 06s | Max: 19m 34s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 08m | Avg: 16m 06s | Max: 34m 19s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 51m 41s | Avg: 25m 50s | Max: 27m 31s | Hits: 540%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 57m 46s | Avg: 28m 53s | Max: 29m 41s | Hits: 540%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 28m | Avg:  8m 45s | Max: 46m 17s
      🟩 GCC                Pass: 100%/21  | Total:  3h 35m | Avg: 10m 14s | Max: 34m 19s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 49m | Avg: 27m 21s | Max: 29m 41s | Hits: 540%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 11s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 24m 09s | Avg: 12m 04s | Max: 19m 34s
      🟩 v100               Pass: 100%/42  | Total:  7h 48m | Avg: 11m 09s | Max: 46m 17s | Hits: 540%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 57m | Avg:  8m 02s | Max: 29m 41s | Hits: 540%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 00s | Avg: 23m 00s | Max: 23m 00s
      🟩 GraphCapture       Pass: 100%/1   | Total: 22m 19s | Avg: 22m 19s | Max: 22m 19s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 10s | Max: 27m 50s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 20m | Avg: 40m 18s | Max: 46m 17s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 09s | Avg: 12m 04s | Max: 19m 34s
      🟩 90a                Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 59m | Avg:  8m 57s | Max: 28m 05s | Hits: 540%/2664  
      🟩 20                 Pass: 100%/24  | Total:  5h 13m | Avg: 13m 04s | Max: 46m 17s | Hits: 540%/888   
    
  • 🟩 thrust: Pass: 100%/42 | Total: 6h 23m | Avg: 9m 08s | Max: 28m 59s | Hits: 365%/7376

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 27s | Avg: 10m 43s | Max: 14m 53s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total:  6h 13m | Avg:  9m 20s | Max: 28m 59s | Hits: 365%/7376  
      🟩 arm64              Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 28s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 45m 48s | Avg:  9m 09s | Max: 24m 51s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 29m 46s | Avg: 14m 53s | Max: 15m 05s
      🟩 12.6               Pass: 100%/35  | Total:  5h 08m | Avg:  8m 48s | Max: 28m 59s | Hits: 365%/5532  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 51s | Avg:  5m 25s | Max:  5m 31s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 45m 48s | Avg:  9m 09s | Max: 24m 51s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 46s | Avg: 14m 53s | Max: 15m 05s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  4h 57m | Avg:  9m 00s | Max: 28m 59s | Hits: 365%/5532  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 51s | Avg:  5m 25s | Max:  5m 31s
      🟩 nvcc               Pass: 100%/40  | Total:  6h 12m | Avg:  9m 19s | Max: 28m 59s | Hits: 365%/7376  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 35s | Avg:  5m 23s | Max:  5m 40s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 44s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 28s | Avg:  5m 44s | Max:  6m 00s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 32s | Avg:  5m 46s | Max:  6m 02s
      🟩 Clang18            Pass: 100%/7   | Total: 48m 34s | Avg:  6m 56s | Max: 14m 54s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 59s | Avg:  5m 29s | Max:  5m 48s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC9               Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 32s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 16s | Avg:  5m 38s | Max:  5m 41s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 47s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  6m 07s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 18m | Avg:  9m 47s | Max: 26m 57s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 51m 24s | Avg: 25m 42s | Max: 26m 33s | Hits: 365%/3688  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 07s | Avg: 28m 03s | Max: 28m 59s | Hits: 365%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 46s | Avg: 14m 53s | Max: 15m 05s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 44m | Avg:  6m 09s | Max: 14m 54s
      🟩 GCC                Pass: 100%/19  | Total:  2h 21m | Avg:  7m 27s | Max: 26m 57s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 47m | Avg: 26m 52s | Max: 28m 59s | Hits: 365%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 46s | Avg: 14m 53s | Max: 15m 05s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total:  6h 23m | Avg:  9m 08s | Max: 28m 59s | Hits: 365%/7376  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 12m | Avg:  8m 26s | Max: 28m 59s | Hits: 365%/7376  
      🟩 TestCPU            Pass: 100%/2   | Total: 14m 52s | Avg:  7m 26s | Max:  7m 45s
      🟩 TestGPU            Pass: 100%/3   | Total: 56m 44s | Avg: 18m 54s | Max: 26m 57s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 03m | Avg:  9m 11s | Max: 27m 08s | Hits: 365%/5532  
      🟩 20                 Pass: 100%/20  | Total:  2h 58m | Avg:  8m 55s | Max: 28m 59s | Hits: 365%/1844  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 30s | Avg: 6m 15s | Max: 10m 25s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max: 10m 25s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
      🟩 Test               Pass: 100%/1   | Total: 10m 25s | Avg: 10m 25s | Max: 10m 25s
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 02m | Avg: 1h 02m | Max: 1h 02m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
8 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

4 participants