
Env-based API for CUB part 3/3 #4877

Open
wants to merge 16 commits into
base: main

gevtushenko
Collaborator

Description

Closes #2126 and works towards #3855.

This PR adds one env-based, single-phase overload of device reduction. The environment argument can be used to:

  • specify determinism requirements
  • tune without reaching for the dispatch layer (a user-facing API will come later)
  • specify a memory resource for temporary storage allocation
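
To make the idea concrete, here is a minimal host-only sketch of how a single environment argument can carry all three kinds of settings. Every name in it (`env`, `make_env`, `query_or`, `determinism_prop`, `block_size_prop`, `counting_resource`, `reduce_sketch`) is hypothetical and chosen for illustration only. It is not the CUB or libcu++ API introduced by this PR, and it runs a plain CPU loop rather than a device reduction.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <tuple>
#include <type_traits>
#include <vector>

// NOTE: all names here are hypothetical; this is not the API added by the PR.

enum class determinism { not_guaranteed, run_to_run, gpu_to_gpu };

struct determinism_prop { determinism value; };  // determinism requirement
struct block_size_prop  { int value; };          // stand-in for a tuning knob

// Stand-in for a memory resource: it only counts the bytes requested
// for temporary storage instead of allocating anything.
struct counting_resource { std::size_t* bytes_requested; };

// An environment is a bag of properties, each identified by its type.
template <class... Props>
struct env { std::tuple<Props...> props; };

template <class... Props>
env<Props...> make_env(Props... props) { return {std::make_tuple(props...)}; }

// Return the property of type Prop if the environment has one,
// otherwise return the supplied fallback.
template <class Prop, class... Props>
Prop query_or(const env<Props...>& e, Prop fallback)
{
  if constexpr ((std::is_same_v<Prop, Props> || ...)) {
    return std::get<Prop>(e.props);
  } else {
    return fallback;
  }
}

// Single-phase call: no separate "query temporary storage size" step.
// Temporary storage is requested internally from the resource in the env.
template <class Env = env<>>
int reduce_sketch(const std::vector<int>& in, const Env& e = Env{})
{
  const determinism_prop det = query_or(e, determinism_prop{determinism::run_to_run});
  const block_size_prop tune = query_or(e, block_size_prop{256});
  std::size_t unused = 0;
  const counting_resource mr = query_or(e, counting_resource{&unused});

  const std::size_t block      = static_cast<std::size_t>(tune.value);
  const std::size_t num_blocks = (in.size() + block - 1) / block;

  // "Allocate" temporary storage for one partial sum per block.
  *mr.bytes_requested += num_blocks * sizeof(int);

  std::vector<int> partials(num_blocks, 0);
  for (std::size_t i = 0; i < in.size(); ++i) {
    partials[i / block] += in[i];
  }

  // Integer addition is order independent, so this sketch does not need
  // to branch on the determinism requirement; it is only queried.
  (void) det;
  return std::accumulate(partials.begin(), partials.end(), 0);
}

// Usage: defaults apply to every property the caller does not mention.
inline void example_usage()
{
  std::vector<int> v(1000, 1);
  std::size_t bytes = 0;
  assert(reduce_sketch(v) == 1000);  // all defaults
  assert(reduce_sketch(v, make_env(block_size_prop{100},
                                   counting_resource{&bytes})) == 1000);
  assert(bytes == 10 * sizeof(int)); // 10 blocks of 100 elements
}
```

The point of the pattern is that new knobs can be added as new property types without changing the function signature, and callers only name the properties they care about.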

This PR is based on top of #4876, so disregard the "Add require, determinism, and tuning" commit for now.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@gevtushenko gevtushenko requested review from a team as code owners June 3, 2025 05:23
@gevtushenko gevtushenko requested a review from gonidelis June 3, 2025 05:23
@github-project-automation github-project-automation bot moved this to Todo in CCCL Jun 3, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Jun 3, 2025
Contributor

github-actions bot commented Jun 3, 2025

🟨 CI finished in 2h 01m: Pass: 80%/187 | Total: 3d 01h | Avg: 23m 31s | Max: 1h 27m | Hits: 85%/251419
  • 🟨 cub: Pass: 29%/47 | Total: 1d 05h | Avg: 37m 16s | Max: 1h 27m | Hits: 69%/17324

    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 58s | Max: 34m 26s | Hits:  74%/2177  
      🔍 nvcc               Pass:  26%/45  | Total:  1d 04h | Avg: 37m 30s | Max:  1h 27m | Hits:  68%/15147 
    🟨 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 03m | Avg: 31m 58s | Max: 34m 26s | Hits:  74%/2177  
      🟨 nvcc12.0           Pass:  80%/5   | Total:  4h 56m | Avg: 59m 16s | Max:  1h 14m | Hits:  68%/5053  
      🟨 nvcc12.9           Pass:  20%/40  | Total: 23h 11m | Avg: 34m 47s | Max:  1h 27m | Hits:  68%/10094 
    🟨 cxx
      🟨 Clang14            Pass:  75%/4   | Total:  3h 16m | Avg: 49m 09s | Max: 57m 23s | Hits:  68%/3790  
      🟨 Clang15            Pass:  50%/2   | Total:  1h 08m | Avg: 34m 17s | Max: 56m 37s | Hits:  68%/1261  
      🟨 Clang16            Pass:  50%/2   | Total:  1h 10m | Avg: 35m 12s | Max: 58m 49s | Hits:  68%/1261  
      🟨 Clang17            Pass:  50%/2   | Total:  1h 13m | Avg: 36m 48s | Max:  1h 01m | Hits:  68%/1261  
      🟨 Clang18            Pass:  50%/2   | Total:  1h 08m | Avg: 34m 28s | Max: 56m 46s | Hits:  68%/1261  
      🟨 Clang19            Pass:  42%/7   | Total:  3h 13m | Avg: 27m 36s | Max: 55m 39s | Hits:  72%/3438  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max: 59m 25s | Hits:  68%/2526  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  68%/1263  
      🟨 GCC9               Pass:  50%/2   | Total:  1h 39m | Avg: 49m 45s | Max: 58m 13s | Hits:  68%/1263  
      🟥 GCC10              Pass:   0%/2   | Total:  1h 23m | Avg: 41m 35s | Max: 41m 38s
      🟥 GCC11              Pass:   0%/2   | Total:  1h 22m | Avg: 41m 05s | Max: 41m 45s
      🟥 GCC12              Pass:   0%/2   | Total:  1h 22m | Avg: 41m 07s | Max: 41m 49s
      🟥 GCC13              Pass:   0%/11  | Total:  3h 14m | Avg: 17m 40s | Max: 54m 50s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 35m | Avg:  1h 17m | Max:  1h 20m
      🟥 MSVC14.43          Pass:   0%/2   | Total:  2h 50m | Avg:  1h 25m | Max:  1h 27m
      🟥 NVHPC25.5          Pass:   0%/2   | Total: 32m 15s | Avg: 16m 07s | Max: 16m 18s
    🟨 cpu
      🟨 amd64              Pass:  31%/45  | Total:  1d 03h | Avg: 37m 17s | Max:  1h 27m | Hits:  69%/17324 
      🟥 arm64              Pass:   0%/2   | Total:  1h 13m | Avg: 36m 54s | Max: 38m 35s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  4h 56m | Avg: 59m 16s | Max:  1h 14m | Hits:  68%/5053  
      🟨 12.9               Pass:  23%/42  | Total:  1d 00h | Avg: 34m 38s | Max:  1h 27m | Hits:  69%/12271 
    🟨 cxx_family
      🟨 Clang              Pass:  52%/19  | Total: 11h 11m | Avg: 35m 20s | Max:  1h 01m | Hits:  69%/12272 
      🟨 GCC                Pass:  18%/22  | Total: 12h 02m | Avg: 32m 49s | Max:  1h 03m | Hits:  68%/5052  
      🟥 MSVC               Pass:   0%/4   | Total:  5h 25m | Avg:  1h 21m | Max:  1h 27m
      🟥 NVHPC              Pass:   0%/2   | Total: 32m 15s | Avg: 16m 07s | Max: 16m 18s
    🟨 gpu
      🟥 h100               Pass:   0%/3   | Total: 20m 18s | Avg:  6m 46s | Max: 20m 18s
      🟨 rtx2080            Pass:  38%/36  | Total:  1d 03h | Avg: 45m 30s | Max:  1h 27m | Hits:  69%/17324 
      🟥 rtxa6000           Pass:   0%/8   | Total:  1h 33m | Avg: 11m 39s | Max: 54m 50s
    🟨 jobs
      🟨 Build              Pass:  35%/39  | Total:  1d 05h | Avg: 44m 54s | Max:  1h 27m | Hits:  69%/17324 
      🟥 DeviceLaunch       Pass:   0%/1  
      🟥 GraphCapture       Pass:   0%/1  
      🟥 HostLaunch         Pass:   0%/3  
      🟥 TestGPU            Pass:   0%/3  
    🟥 sm
      🟥 90                 Pass:   0%/3   | Total: 20m 18s | Avg:  6m 46s | Max: 20m 18s
      🟥 90;90a;100         Pass:   0%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟨 std
      🟨 17                 Pass:  57%/21  | Total: 18h 59m | Avg: 54m 15s | Max:  1h 27m | Hits:  68%/14971 
      🟨 20                 Pass:   7%/26  | Total: 10h 12m | Avg: 23m 33s | Max:  1h 23m | Hits:  71%/2353  
    
  • 🟨 cudax: Pass: 88%/26 | Total: 3h 16m | Avg: 7m 34s | Max: 14m 58s | Hits: 91%/13340

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  86%/22  | Total:  2h 53m | Avg:  7m 53s | Max: 14m 58s | Hits:  91%/10968 
      🟩 arm64              Pass: 100%/4   | Total: 23m 11s | Avg:  5m 47s | Max:  6m 16s | Hits:  90%/2372  
    🚨 ctk: 12.0 🚨
      🔥 12.0               Pass:   0%/3   | Total: 23m 31s | Avg:  7m 50s | Max: 13m 01s
      🟩 12.9               Pass: 100%/23  | Total:  2h 53m | Avg:  7m 32s | Max: 14m 58s | Hits:  91%/13340 
    🚨 cudacxx: nvcc12.0 🚨
      🔥 nvcc12.0           Pass:   0%/3   | Total: 23m 31s | Avg:  7m 50s | Max: 13m 01s
      🟩 nvcc12.9           Pass: 100%/23  | Total:  2h 53m | Avg:  7m 32s | Max: 14m 58s | Hits:  91%/13340 
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 18m 05s | Avg:  9m 02s | Max: 12m 41s | Hits:  94%/1186  
      🔍 rtx2080            Pass:  87%/24  | Total:  2h 58m | Avg:  7m 27s | Max: 14m 58s | Hits:  90%/12154 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  86%/23  | Total:  2h 42m | Avg:  7m 03s | Max: 14m 58s | Hits:  89%/11561 
      🟩 Test               Pass: 100%/3   | Total: 34m 46s | Avg: 11m 35s | Max: 12m 51s | Hits:  99%/1779  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 27m 55s | Avg:  6m 58s | Max: 11m 40s | Hits:  89%/2370  
      🔍 20                 Pass:  86%/22  | Total:  2h 49m | Avg:  7m 41s | Max: 14m 58s | Hits:  91%/10970 
    🟨 cxx
      🟨 Clang14            Pass:  50%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 54s | Hits:  90%/595   
      🟩 Clang15            Pass: 100%/1   | Total:  6m 21s | Avg:  6m 21s | Max:  6m 21s | Hits:  90%/593   
      🟩 Clang16            Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s | Hits:  90%/593   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s | Hits:  90%/593   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s | Hits:  90%/593   
      🟩 Clang19            Pass: 100%/4   | Total: 25m 28s | Avg:  6m 22s | Max:  9m 14s | Hits:  92%/2372  
      🟨 GCC10              Pass:  50%/2   | Total: 11m 53s | Avg:  5m 56s | Max:  6m 11s | Hits:  90%/595   
      🟩 GCC11              Pass: 100%/1   | Total:  6m 29s | Avg:  6m 29s | Max:  6m 29s | Hits:  90%/593   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 30s | Avg:  6m 30s | Max:  6m 30s | Hits:  90%/593   
      🟩 GCC13              Pass: 100%/8   | Total:  1h 00m | Avg:  7m 31s | Max: 12m 51s | Hits:  92%/4744  
      🟥 MSVC14.39          Pass:   0%/1   | Total: 13m 01s | Avg: 13m 01s | Max: 13m 01s
      🟩 MSVC14.43          Pass: 100%/1   | Total: 14m 58s | Avg: 14m 58s | Max: 14m 58s | Hits:  78%/294   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 23m 23s | Avg: 11m 41s | Max: 11m 43s | Hits:  88%/1182  
    🟨 cxx_family
      🟨 Clang              Pass:  90%/10  | Total:  1h 00m | Avg:  6m 02s | Max:  9m 14s | Hits:  91%/5339  
      🟨 GCC                Pass:  91%/12  | Total:  1h 25m | Avg:  7m 05s | Max: 12m 51s | Hits:  91%/6525  
      🟨 MSVC               Pass:  50%/2   | Total: 27m 59s | Avg: 13m 59s | Max: 14m 58s | Hits:  78%/294   
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 23s | Avg: 11m 41s | Max: 11m 43s | Hits:  88%/1182  
    🟨 cudacxx_family
      🟨 nvcc               Pass:  88%/26  | Total:  3h 16m | Avg:  7m 34s | Max: 14m 58s | Hits:  91%/13340 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 22m 53s | Avg:  7m 37s | Max: 12m 41s | Hits:  93%/1779  
      🟩 90a                Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s | Hits:  90%/593   
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 04h | Avg: 36m 46s | Max: 1h 23m | Hits: 81%/89895

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 51m 06s | Avg: 25m 33s | Max: 35m 01s | Hits:  90%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 03h | Avg: 36m 53s | Max:  1h 23m | Hits:  81%/86068 
      🟩 arm64              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 07s | Max: 36m 42s | Hits:  80%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 16m | Avg: 39m 21s | Max:  1h 08m | Hits:  76%/9560  
      🟩 12.9               Pass: 100%/42  | Total:  1d 01h | Avg: 36m 27s | Max:  1h 23m | Hits:  81%/80335 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 54m 32s | Avg: 27m 16s | Max: 28m 08s | Hits:  80%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 16m | Avg: 39m 21s | Max:  1h 08m | Hits:  76%/9560  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  1d 00h | Avg: 36m 55s | Max:  1h 23m | Hits:  81%/76509 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 32s | Avg: 27m 16s | Max: 28m 08s | Hits:  80%/3826  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 03h | Avg: 37m 11s | Max:  1h 23m | Hits:  81%/86069 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 10s | Max: 34m 47s | Hits:  80%/7652  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 34s | Max: 35m 53s | Hits:  80%/3826  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 18s | Max: 33m 45s | Hits:  80%/3826  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 15s | Max: 34m 47s | Hits:  80%/3826  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 29s | Max: 33m 41s | Hits:  80%/3826  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 55m | Avg: 25m 04s | Max: 34m 47s | Hits:  86%/13391 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 12m | Avg: 36m 03s | Max: 39m 03s | Hits:  80%/3828  
      🟩 GCC8               Pass: 100%/1   | Total: 35m 11s | Avg: 35m 11s | Max: 35m 11s | Hits:  80%/1914  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 26m | Avg: 43m 15s | Max: 51m 01s | Hits:  80%/3828  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 17m | Avg: 38m 52s | Max: 40m 06s | Hits:  80%/3828  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 18s | Max: 38m 54s | Hits:  80%/3828  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 13m | Avg: 36m 43s | Max: 36m 55s | Hits:  80%/3828  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 26m | Avg: 26m 40s | Max: 38m 48s | Hits:  88%/19140 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 10m | Hits:  60%/3812  
      🟩 MSVC14.43          Pass: 100%/3   | Total:  3h 05m | Avg:  1h 01m | Max:  1h 23m | Hits:  73%/5718  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits:  60%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 33m | Avg: 30m 10s | Max: 35m 53s | Hits:  82%/36347 
      🟩 GCC                Pass: 100%/21  | Total: 11h 26m | Avg: 32m 40s | Max: 51m 01s | Hits:  84%/40194 
      🟩 MSVC               Pass: 100%/5   | Total:  5h 24m | Avg:  1h 04m | Max:  1h 23m | Hits:  68%/9530  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits:  60%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 40m 56s | Avg: 20m 28s | Max: 21m 53s | Hits:  90%/3828  
      🟩 rtx2080            Pass: 100%/35  | Total: 23h 22m | Avg: 40m 04s | Max:  1h 13m | Hits:  77%/66946 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 44m | Avg: 28m 27s | Max:  1h 23m | Hits:  90%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 02h | Avg: 40m 22s | Max:  1h 23m | Hits:  77%/76507 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 30s | Avg: 16m 50s | Max: 32m 00s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 02m | Avg: 15m 38s | Max: 19m 03s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 40m 56s | Avg: 20m 28s | Max: 21m 53s | Hits:  90%/3828  
      🟩 90;90a;100         Pass: 100%/1   | Total: 37m 54s | Avg: 37m 54s | Max: 37m 54s | Hits:  80%/1914  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 14h 43m | Avg: 42m 03s | Max:  1h 11m | Hits:  76%/40160 
      🟩 20                 Pass: 100%/24  | Total: 13h 13m | Avg: 33m 04s | Max:  1h 23m | Hits:  83%/45907 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 8h 56m | Avg: 11m 55s | Max: 44m 29s | Hits: 90%/130532

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 42m | Avg: 12m 08s | Max: 44m 29s | Hits:  90%/123773
      🟩 arm64              Pass: 100%/2   | Total: 14m 28s | Avg:  7m 14s | Max:  9m 48s | Hits:  96%/6759  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 54m 36s | Avg: 10m 55s | Max: 26m 18s | Hits:  96%/16533 
      🟩 12.9               Pass: 100%/40  | Total:  8h 01m | Avg: 12m 02s | Max: 44m 29s | Hits:  89%/113999
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 51m 47s | Avg: 25m 53s | Max: 27m 33s | Hits:  26%/6723  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 54m 36s | Avg: 10m 55s | Max: 26m 18s | Hits:  96%/16533 
      🟩 nvcc12.9           Pass: 100%/38  | Total:  7h 10m | Avg: 11m 19s | Max: 44m 29s | Hits:  93%/107276
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 51m 47s | Avg: 25m 53s | Max: 27m 33s | Hits:  26%/6723  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 04m | Avg: 11m 16s | Max: 44m 29s | Hits:  94%/123809
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 49s | Avg:  6m 12s | Max:  9m 48s | Hits:  97%/13402 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 18s | Avg:  5m 09s | Max:  5m 20s | Hits:  98%/6719  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 58s | Avg:  5m 29s | Max:  5m 42s | Hits:  98%/6719  
      🟩 Clang17            Pass: 100%/2   | Total: 15m 30s | Avg:  7m 45s | Max: 10m 17s | Hits:  95%/6719  
      🟩 Clang18            Pass: 100%/2   | Total: 21m 13s | Avg: 10m 36s | Max: 11m 11s | Hits:  93%/6719  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 19m | Avg: 13m 15s | Max: 27m 33s | Hits:  69%/16821 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 43s | Hits:  99%/6655  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s | Hits:  99%/3338  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 33s | Avg:  7m 16s | Max:  9m 24s | Hits:  95%/6667  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 01s | Hits:  98%/6721  
      🟩 GCC11              Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 09s | Hits:  98%/6717  
      🟩 GCC12              Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  5m 36s | Hits:  98%/6721  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 50m | Avg: 11m 05s | Max: 22m 59s | Hits:  97%/17067 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 54m 56s | Avg: 27m 28s | Max: 28m 38s | Hits:  99%/6393  
      🟩 MSVC14.43          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 33s | Max: 32m 57s | Hits:  96%/6445  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 26m | Avg: 43m 11s | Max: 44m 29s | Hits:  27%/6709  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 42m | Avg:  9m 01s | Max: 27m 33s | Hits:  89%/57099 
      🟩 GCC                Pass: 100%/21  | Total:  2h 49m | Avg:  8m 05s | Max: 22m 59s | Hits:  98%/53886 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 58m | Avg: 29m 30s | Max: 32m 57s | Hits:  97%/12838 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 26m | Avg: 43m 11s | Max: 44m 29s | Hits:  27%/6709  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 24m 46s | Avg: 12m 23s | Max: 19m 46s | Hits:  99%/3463  
      🟩 rtx2080            Pass: 100%/43  | Total:  8h 31m | Avg: 11m 54s | Max: 44m 29s | Hits:  90%/127069
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  7h 24m | Avg: 11m 23s | Max: 44m 29s | Hits:  90%/130492
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 32s | Avg: 22m 16s | Max: 22m 59s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 45m 22s | Avg: 15m 07s | Max: 19m 46s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 44m 32s | Avg: 22m 16s | Max: 22m 59s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 24m 46s | Avg: 12m 23s | Max: 19m 46s | Hits:  99%/3463  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s | Hits:  99%/3463  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  4h 31m | Avg: 12m 21s | Max: 41m 53s | Hits:  91%/69677 
      🟩 20                 Pass: 100%/22  | Total:  4h 22m | Avg: 11m 55s | Max: 44m 29s | Hits:  89%/60855 
    
  • 🟩 python: Pass: 100%/12 | Total: 2h 03m | Avg: 10m 19s | Max: 24m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 ctk
      🟩 12.9               Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  2h 03m | Avg: 10m 19s | Max: 24m 47s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 26s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  6m 54s | Avg:  3m 27s | Max:  3m 30s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 16m 14s | Avg:  8m 07s | Max:  8m 27s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  5m 00s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 49m 27s | Avg: 24m 43s | Max: 24m 47s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 34m 37s | Avg: 17m 18s | Max: 17m 29s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total:  1h 01m | Avg: 10m 19s | Max: 24m 40s
      🟩 3.13               Pass: 100%/6   | Total:  1h 01m | Avg: 10m 19s | Max: 24m 47s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 24m 58s | Avg: 6m 14s | Max: 10m 09s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 24m 58s | Avg:  6m 14s | Max: 10m 09s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 54s
      🟩 12.9               Pass: 100%/2   | Total: 15m 14s | Avg:  7m 37s | Max: 10m 09s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 54s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 15m 14s | Avg:  7m 37s | Max: 10m 09s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 24m 58s | Avg:  6m 14s | Max: 10m 09s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 54s | Avg:  4m 54s | Max:  4m 54s
      🟩 Clang19            Pass: 100%/1   | Total:  5m 05s | Avg:  5m 05s | Max:  5m 05s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 50s | Avg:  4m 50s | Max:  4m 50s
      🟩 GCC13              Pass: 100%/1   | Total: 10m 09s | Avg: 10m 09s | Max: 10m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  5m 05s
      🟩 GCC                Pass: 100%/2   | Total: 14m 59s | Avg:  7m 29s | Max: 10m 09s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 24m 58s | Avg:  6m 14s | Max: 10m 09s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 24m 58s | Avg:  6m 14s | Max: 10m 09s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 22m 19s | Avg: 5m 34s | Max: 6m 00s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 00s
      🟩 arm64              Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 12s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 22m 19s | Avg:  5m 34s | Max:  6m 00s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  6m 00s
      🟩 20                 Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 59s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 13s | Avg: 7m 36s | Max: 11m 34s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 ctk
      🟩 12.9               Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max: 11m 34s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s | Hits:  97%/164   
      🟩 Test               Pass: 100%/1   | Total: 11m 34s | Avg: 11m 34s | Max: 11m 34s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 187)

# Runner
129 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 linux-amd64-gpu-rtx2080-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@gevtushenko gevtushenko mentioned this pull request Jun 3, 2025
2 tasks
@gevtushenko gevtushenko force-pushed the enh-main/github/env-based-api branch from 3661201 to 603ec97 Compare June 3, 2025 22:09
Contributor

github-actions bot commented Jun 4, 2025

🟨 CI finished in 2h 04m: Pass: 93%/187 | Total: 3d 10h | Avg: 26m 26s | Max: 1h 30m | Hits: 85%/280760
  • 🟨 cub: Pass: 78%/47 | Total: 1d 14h | Avg: 49m 20s | Max: 1h 30m | Hits: 75%/46373

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  77%/45  | Total:  1d 12h | Avg: 49m 04s | Max:  1h 30m | Hits:  75%/43847 
      🟩 arm64              Pass: 100%/2   | Total:  1h 50m | Avg: 55m 20s | Max: 59m 43s | Hits:  68%/2526  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 06m | Avg: 33m 08s | Max: 35m 14s | Hits:  74%/2177  
      🔍 nvcc               Pass:  77%/45  | Total:  1d 13h | Avg: 50m 03s | Max:  1h 30m | Hits:  75%/44196 
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 33m | Avg: 31m 15s | Max: 33m 15s | Hits:  89%/3792  
      🔍 rtx2080            Pass:  72%/36  | Total:  1d 07h | Avg: 52m 40s | Max:  1h 30m | Hits:  68%/32475 
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 29m | Avg: 41m 07s | Max:  1h 01m | Hits:  91%/10106 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  74%/39  | Total:  1d 10h | Avg: 52m 19s | Max:  1h 30m | Hits:  68%/36265 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 38m 55s | Avg: 38m 55s | Max: 38m 55s | Hits:  99%/1264  
      🟩 GraphCapture       Pass: 100%/1   | Total: 31m 45s | Avg: 31m 45s | Max: 31m 45s | Hits:  99%/1264  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 45m | Avg: 35m 01s | Max: 40m 36s | Hits:  99%/3790  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 42m | Avg: 34m 07s | Max: 35m 41s | Hits:  99%/3790  
    🟨 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 06m | Avg: 33m 08s | Max: 35m 14s | Hits:  74%/2177  
      🟨 nvcc12.0           Pass:  80%/5   | Total:  4h 56m | Avg: 59m 12s | Max:  1h 12m | Hits:  68%/5053  
      🟨 nvcc12.9           Pass:  77%/40  | Total:  1d 08h | Avg: 48m 54s | Max:  1h 30m | Hits:  76%/39143 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 40m | Avg: 55m 05s | Max: 59m 19s | Hits:  68%/5054  
      🟨 Clang15            Pass:  50%/2   | Total:  1h 06m | Avg: 33m 00s | Max: 54m 44s | Hits:  68%/1261  
      🟨 Clang16            Pass:  50%/2   | Total:  1h 05m | Avg: 32m 44s | Max: 54m 29s | Hits:  68%/1261  
      🟨 Clang17            Pass:  50%/2   | Total:  1h 07m | Avg: 33m 38s | Max: 55m 51s | Hits:  68%/1261  
      🟨 Clang18            Pass:  50%/2   | Total:  1h 13m | Avg: 36m 36s | Max:  1h 01m | Hits:  68%/1261  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 52m | Avg: 41m 43s | Max: 54m 31s | Hits:  79%/8486  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 58s | Max: 58m 21s | Hits:  68%/2526  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  68%/1263  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 59m | Avg: 59m 47s | Max:  1h 00m | Hits:  68%/2526  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m | Hits:  68%/2527  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  68%/2523  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m | Hits:  68%/2523  
      🟩 GCC13              Pass: 100%/11  | Total:  8h 10m | Avg: 44m 34s | Max:  1h 05m | Hits:  85%/13901 
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 35m | Avg:  1h 17m | Max:  1h 22m
      🟥 MSVC14.43          Pass:   0%/2   | Total:  2h 47m | Avg:  1h 23m | Max:  1h 30m
      🟥 NVHPC25.5          Pass:   0%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 16m 04s
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 04m | Avg: 41m 17s | Max:  1h 01m | Hits:  73%/18584 
      🟩 GCC                Pass: 100%/22  | Total: 19h 40m | Avg: 53m 39s | Max:  1h 08m | Hits:  76%/27789 
      🟥 MSVC               Pass:   0%/4   | Total:  5h 23m | Avg:  1h 20m | Max:  1h 30m
      🟥 NVHPC              Pass:   0%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 16m 04s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total:  4h 56m | Avg: 59m 12s | Max:  1h 12m | Hits:  68%/5053  
      🟨 12.9               Pass:  78%/42  | Total:  1d 09h | Avg: 48m 09s | Max:  1h 30m | Hits:  76%/41320 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 33m | Avg: 31m 15s | Max: 33m 15s | Hits:  89%/3792  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  68%/1264  
    🟨 std
      🟨 17                 Pass:  80%/21  | Total: 20h 31m | Avg: 58m 37s | Max:  1h 22m | Hits:  68%/21280 
      🟨 20                 Pass:  76%/26  | Total: 18h 07m | Avg: 41m 50s | Max:  1h 30m | Hits:  81%/25093 
    
  • 🟨 cudax: Pass: 88%/26 | Total: 3h 24m | Avg: 7m 53s | Max: 16m 21s | Hits: 89%/13593

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  86%/22  | Total:  3h 02m | Avg:  8m 17s | Max: 16m 21s | Hits:  88%/11177 
      🟩 arm64              Pass: 100%/4   | Total: 22m 43s | Avg:  5m 40s | Max:  6m 03s | Hits:  90%/2416  
    🚨 ctk: 12.0 🚨
      🔥 12.0               Pass:   0%/3   | Total: 26m 34s | Avg:  8m 51s | Max: 15m 41s
      🟩 12.9               Pass: 100%/23  | Total:  2h 58m | Avg:  7m 45s | Max: 16m 21s | Hits:  89%/13593 
    🚨 cudacxx: nvcc12.0 🚨
      🔥 nvcc12.0           Pass:   0%/3   | Total: 26m 34s | Avg:  8m 51s | Max: 15m 41s
      🟩 nvcc12.9           Pass: 100%/23  | Total:  2h 58m | Avg:  7m 45s | Max: 16m 21s | Hits:  89%/13593 
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 18m 02s | Avg:  9m 01s | Max: 12m 40s | Hits:  94%/1208  
      🔍 rtx2080            Pass:  87%/24  | Total:  3h 06m | Avg:  7m 47s | Max: 16m 21s | Hits:  88%/12385 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  86%/23  | Total:  2h 49m | Avg:  7m 23s | Max: 16m 21s | Hits:  87%/11781 
      🟩 Test               Pass: 100%/3   | Total: 35m 07s | Avg: 11m 42s | Max: 12m 40s | Hits:  99%/1812  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 27m 54s | Avg:  6m 58s | Max: 11m 54s | Hits:  86%/2414  
      🔍 20                 Pass:  86%/22  | Total:  2h 57m | Avg:  8m 02s | Max: 16m 21s | Hits:  89%/11179 
    🟨 cxx
      🟨 Clang14            Pass:  50%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  6m 06s | Hits:  90%/606   
      🟩 Clang15            Pass: 100%/1   | Total:  5m 59s | Avg:  5m 59s | Max:  5m 59s | Hits:  90%/604   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s | Hits:  90%/604   
      🟩 Clang17            Pass: 100%/1   | Total:  6m 00s | Avg:  6m 00s | Max:  6m 00s | Hits:  90%/604   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 40s | Avg:  5m 40s | Max:  5m 40s | Hits:  90%/604   
      🟩 Clang19            Pass: 100%/4   | Total: 27m 32s | Avg:  6m 53s | Max: 10m 31s | Hits:  92%/2416  
      🟨 GCC10              Pass:  50%/2   | Total: 11m 45s | Avg:  5m 52s | Max:  6m 13s | Hits:  90%/606   
      🟩 GCC11              Pass: 100%/1   | Total:  6m 50s | Avg:  6m 50s | Max:  6m 50s | Hits:  90%/604   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 35s | Avg:  6m 35s | Max:  6m 35s | Hits:  90%/604   
      🟩 GCC13              Pass: 100%/8   | Total: 59m 24s | Avg:  7m 25s | Max: 12m 40s | Hits:  92%/4832  
      🟥 MSVC14.39          Pass:   0%/1   | Total: 15m 41s | Avg: 15m 41s | Max: 15m 41s
      🟩 MSVC14.43          Pass: 100%/1   | Total: 16m 21s | Avg: 16m 21s | Max: 16m 21s | Hits:  54%/305   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 25m 47s | Avg: 12m 53s | Max: 13m 53s | Hits:  73%/1204  
    🟨 cxx_family
      🟨 Clang              Pass:  90%/10  | Total:  1h 02m | Avg:  6m 15s | Max: 10m 31s | Hits:  91%/5438  
      🟨 GCC                Pass:  91%/12  | Total:  1h 24m | Avg:  7m 02s | Max: 12m 40s | Hits:  91%/6646  
      🟨 MSVC               Pass:  50%/2   | Total: 32m 02s | Avg: 16m 01s | Max: 16m 21s | Hits:  54%/305   
      🟩 NVHPC              Pass: 100%/2   | Total: 25m 47s | Avg: 12m 53s | Max: 13m 53s | Hits:  73%/1204  
    🟨 cudacxx_family
      🟨 nvcc               Pass:  88%/26  | Total:  3h 24m | Avg:  7m 53s | Max: 16m 21s | Hits:  89%/13593 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 22m 52s | Avg:  7m 37s | Max: 12m 40s | Hits:  93%/1812  
      🟩 90a                Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s | Hits:  90%/604   
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 05h | Avg: 37m 12s | Max: 1h 20m | Hits: 81%/89895

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 52m 28s | Avg: 26m 14s | Max: 36m 19s | Hits:  90%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 04h | Avg: 37m 23s | Max:  1h 20m | Hits:  81%/86068 
      🟩 arm64              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 09s | Max: 35m 28s | Hits:  80%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 15m | Avg: 39m 02s | Max:  1h 03m | Hits:  77%/9560  
      🟩 12.9               Pass: 100%/42  | Total:  1d 01h | Avg: 36m 59s | Max:  1h 20m | Hits:  82%/80335 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 55m 39s | Avg: 27m 49s | Max: 29m 45s | Hits:  80%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 15m | Avg: 39m 02s | Max:  1h 03m | Hits:  77%/9560  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  1d 00h | Avg: 37m 26s | Max:  1h 20m | Hits:  82%/76509 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 39s | Avg: 27m 49s | Max: 29m 45s | Hits:  80%/3826  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 04h | Avg: 37m 37s | Max:  1h 20m | Hits:  81%/86069 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 18m | Avg: 34m 31s | Max: 41m 34s | Hits:  80%/7652  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 26s | Max: 35m 29s | Hits:  80%/3826  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 07s | Max: 35m 46s | Hits:  80%/3826  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 35s | Max: 37m 00s | Hits:  80%/3826  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 55s | Max: 33m 45s | Hits:  80%/3826  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 04m | Avg: 26m 25s | Max: 40m 05s | Hits:  86%/13391 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 24s | Max: 35m 19s | Hits:  80%/3828  
      🟩 GCC8               Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s | Hits:  80%/1914  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 15m | Avg: 37m 48s | Max: 41m 13s | Hits:  80%/3828  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 43s | Max: 37m 53s | Hits:  80%/3828  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 13m | Avg: 36m 34s | Max: 38m 09s | Hits:  80%/3828  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 32s | Max: 38m 55s | Hits:  80%/3828  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 37m | Avg: 27m 45s | Max: 44m 46s | Hits:  88%/19140 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m | Hits:  66%/3812  
      🟩 MSVC14.43          Pass: 100%/3   | Total:  3h 01m | Avg:  1h 00m | Max:  1h 15m | Hits:  77%/5718  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 20m | Hits:  65%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 59m | Avg: 31m 32s | Max: 41m 34s | Hits:  82%/36347 
      🟩 GCC                Pass: 100%/21  | Total: 11h 23m | Avg: 32m 33s | Max: 44m 46s | Hits:  84%/40194 
      🟩 MSVC               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 15m | Hits:  72%/9530  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 20m | Hits:  65%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 37m 09s | Avg: 18m 34s | Max: 21m 23s | Hits:  90%/3828  
      🟩 rtx2080            Pass: 100%/35  | Total: 23h 41m | Avg: 40m 36s | Max:  1h 20m | Hits:  78%/66946 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 49m | Avg: 28m 59s | Max:  1h 15m | Hits:  90%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 03h | Avg: 40m 54s | Max:  1h 20m | Hits:  78%/76507 
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 57s | Avg: 17m 19s | Max: 32m 53s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 00m | Avg: 15m 05s | Max: 16m 09s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 37m 09s | Avg: 18m 34s | Max: 21m 23s | Hits:  90%/3828  
      🟩 90;90a;100         Pass: 100%/1   | Total: 44m 46s | Avg: 44m 46s | Max: 44m 46s | Hits:  80%/1914  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 14h 58m | Avg: 42m 48s | Max:  1h 13m | Hits:  77%/40160 
      🟩 20                 Pass: 100%/24  | Total: 13h 17m | Avg: 33m 13s | Max:  1h 20m | Hits:  84%/45907 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 8h 24m | Avg: 11m 12s | Max: 46m 14s | Hits: 91%/130571

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 15m | Avg: 11m 30s | Max: 46m 14s | Hits:  91%/123810
      🟩 arm64              Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 50s | Hits:  98%/6761  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 47s | Avg:  9m 33s | Max: 29m 35s | Hits:  98%/16538 
      🟩 12.9               Pass: 100%/40  | Total:  7h 36m | Avg: 11m 25s | Max: 46m 14s | Hits:  90%/114033
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 50s | Avg: 26m 55s | Max: 29m 35s | Hits:  26%/6725  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 47s | Avg:  9m 33s | Max: 29m 35s | Hits:  98%/16538 
      🟩 nvcc12.9           Pass: 100%/38  | Total:  6h 43m | Avg: 10m 36s | Max: 46m 14s | Hits:  94%/107308
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 50s | Avg: 26m 55s | Max: 29m 35s | Hits:  26%/6725  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 30m | Avg: 10m 29s | Max: 46m 14s | Hits:  94%/123846
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 19s | Avg:  5m 04s | Max:  5m 36s | Hits:  98%/13406 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 58s | Avg:  5m 29s | Max:  5m 33s | Hits:  98%/6721  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 54s | Hits:  98%/6721  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  5m 21s | Hits:  98%/6721  
      🟩 Clang18            Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 22s | Hits:  98%/6721  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 22m | Avg: 13m 41s | Max: 29m 35s | Hits:  69%/16826 
      🟩 GCC7               Pass: 100%/2   | Total:  9m 06s | Avg:  4m 33s | Max:  4m 45s | Hits:  99%/6657  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 43s | Avg:  4m 43s | Max:  4m 43s | Hits:  99%/3339  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  4m 52s | Hits:  99%/6669  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  5m 34s | Hits:  98%/6723  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 50s | Hits:  99%/6719  
      🟩 GCC12              Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  5m 19s | Hits:  98%/6723  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 45m | Avg: 10m 32s | Max: 22m 49s | Hits:  98%/17072 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 56s | Avg: 29m 28s | Max: 29m 35s | Hits:  98%/6395  
      🟩 MSVC14.43          Pass: 100%/2   | Total: 55m 28s | Avg: 27m 44s | Max: 28m 42s | Hits:  98%/6447  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 26m | Avg: 43m 10s | Max: 46m 14s | Hits:  27%/6711  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 24m | Avg:  8m 03s | Max: 29m 35s | Hits:  90%/57116 
      🟩 GCC                Pass: 100%/21  | Total:  2h 38m | Avg:  7m 34s | Max: 22m 49s | Hits:  98%/53902 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 54m | Avg: 28m 36s | Max: 29m 35s | Hits:  98%/12842 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 26m | Avg: 43m 10s | Max: 46m 14s | Hits:  27%/6711  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 16m 43s | Hits:  99%/3464  
      🟩 rtx2080            Pass: 100%/43  | Total:  8h 02m | Avg: 11m 13s | Max: 46m 14s | Hits:  91%/127107
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  6h 52m | Avg: 10m 34s | Max: 46m 14s | Hits:  91%/130531
      🟩 NVRTC              Pass: 100%/2   | Total: 44m 25s | Avg: 22m 12s | Max: 22m 49s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 45m 18s | Avg: 15m 06s | Max: 16m 43s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 44m 25s | Avg: 22m 12s | Max: 22m 49s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 16m 43s | Hits:  99%/3464  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 44s | Avg:  5m 44s | Max:  5m 44s | Hits:  98%/3464  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  4h 12m | Avg: 11m 28s | Max: 40m 07s | Hits:  92%/69698 
      🟩 20                 Pass: 100%/22  | Total:  4h 10m | Avg: 11m 21s | Max: 46m 14s | Hits:  90%/60873 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 54m | Avg: 9m 32s | Max: 20m 18s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 ctk
      🟩 12.9               Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 54m | Avg:  9m 32s | Max: 20m 18s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  6m 50s | Avg:  3m 25s | Max:  3m 30s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 21s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 15m 44s | Avg:  7m 52s | Max:  7m 57s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 08s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 39m 50s | Avg: 19m 55s | Max: 20m 18s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 35m 32s | Avg: 17m 46s | Max: 18m 20s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total: 57m 34s | Avg:  9m 35s | Max: 19m 32s
      🟩 3.13               Pass: 100%/6   | Total: 56m 58s | Avg:  9m 29s | Max: 20m 18s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 19m 19s | Avg: 4m 49s | Max: 5m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 04s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 18s | Avg:  4m 39s | Max:  4m 40s
      🟩 12.9               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 04s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 18s | Avg:  4m 39s | Max:  4m 40s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 04s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
      🟩 Clang19            Pass: 100%/1   | Total:  5m 04s | Avg:  5m 04s | Max:  5m 04s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 57s | Avg:  4m 57s | Max:  4m 57s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 04s
      🟩 GCC                Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  4m 57s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  5m 04s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 19m 24s | Avg: 4m 51s | Max: 5m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 35s
      🟩 arm64              Pass: 100%/2   | Total:  8m 28s | Avg:  4m 14s | Max:  4m 19s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 35s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  5m 21s
      🟩 20                 Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 35s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 14m 53s | Avg: 7m 26s | Max: 12m 00s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 ctk
      🟩 12.9               Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 14m 53s | Avg:  7m 26s | Max: 12m 00s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 53s | Avg:  2m 53s | Max:  2m 53s | Hits:  98%/164   
      🟩 Test               Pass: 100%/1   | Total: 12m 00s | Avg: 12m 00s | Max: 12m 00s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 187)

# Runner
129 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 linux-amd64-gpu-rtx2080-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@gevtushenko mentioned this pull request Jun 4, 2025
2 tasks
@gevtushenko force-pushed the enh-main/github/env-based-api branch from 1f09d4e to 7ff6bf6 on June 5, 2025 00:55
Contributor

github-actions bot commented Jun 5, 2025

🟩 CI finished in 1h 40m: Pass: 100%/187 | Total: 2d 04h | Avg: 17m 00s | Max: 1h 23m | Hits: 74%/294458
  • 🟩 cub: Pass: 100%/47 | Total: 17h 31m | Avg: 22m 22s | Max: 1h 23m | Hits: 93%/58060

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 17h 17m | Avg: 23m 02s | Max:  1h 23m | Hits:  93%/55534 
      🟩 arm64              Pass: 100%/2   | Total: 14m 35s | Avg:  7m 17s | Max:  8m 21s | Hits:  99%/2526  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 40m | Avg: 20m 06s | Max:  1h 10m | Hits:  91%/6131  
      🟩 12.9               Pass: 100%/42  | Total: 15h 51m | Avg: 22m 38s | Max:  1h 23m | Hits:  93%/51929 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 53s | Hits:  99%/2177  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 40m | Avg: 20m 06s | Max:  1h 10m | Hits:  91%/6131  
      🟩 nvcc12.9           Pass: 100%/40  | Total: 15h 39m | Avg: 23m 29s | Max:  1h 23m | Hits:  93%/49752 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 22s | Avg:  5m 41s | Max:  5m 53s | Hits:  99%/2177  
      🟩 nvcc               Pass: 100%/45  | Total: 17h 20m | Avg: 23m 07s | Max:  1h 23m | Hits:  93%/55883 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 27m 48s | Avg:  6m 57s | Max:  7m 12s | Hits:  99%/5054  
      🟩 Clang15            Pass: 100%/2   | Total: 14m 16s | Avg:  7m 08s | Max:  7m 29s | Hits:  99%/2523  
      🟩 Clang16            Pass: 100%/2   | Total: 14m 28s | Avg:  7m 14s | Max:  7m 36s | Hits:  99%/2523  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 43s | Avg:  7m 21s | Max:  7m 42s | Hits:  99%/2523  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 59s | Avg:  6m 59s | Max:  7m 11s | Hits:  99%/2523  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 41m | Avg: 14m 33s | Max: 36m 50s | Hits:  99%/8486  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max:  8m 56s | Hits:  99%/2526  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 40s | Avg:  8m 40s | Max:  8m 40s | Hits:  99%/1263  
      🟩 GCC9               Pass: 100%/2   | Total: 59m 27s | Avg: 29m 43s | Max: 50m 35s | Hits:  87%/2526  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max:  9m 17s | Hits:  99%/2527  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 23s | Avg:  8m 41s | Max:  8m 42s | Hits:  99%/2523  
      🟩 GCC12              Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max:  9m 36s | Hits:  99%/2523  
      🟩 GCC13              Pass: 100%/11  | Total:  4h 29m | Avg: 24m 30s | Max: 42m 57s | Hits:  99%/13901 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits:  52%/2156  
      🟩 MSVC14.43          Pass: 100%/2   | Total:  2h 41m | Avg:  1h 20m | Max:  1h 23m | Hits:  52%/2156  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 16m | Hits:  45%/2327  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  3h 07m | Avg:  9m 50s | Max: 36m 50s | Hits:  99%/23632 
      🟩 GCC                Pass: 100%/22  | Total:  6h 48m | Avg: 18m 34s | Max: 50m 35s | Hits:  98%/27789 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 07m | Avg:  1h 16m | Max:  1h 23m | Hits:  52%/4312  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 16m | Hits:  45%/2327  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 53s | Max: 34m 11s | Hits:  99%/3792  
      🟩 rtx2080            Pass: 100%/36  | Total: 12h 12m | Avg: 20m 20s | Max:  1h 23m | Hits:  91%/44162 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 07m | Avg: 30m 59s | Max: 42m 57s | Hits:  99%/10106 
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total: 12h 34m | Avg: 19m 20s | Max:  1h 23m | Hits:  92%/47952 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 41m 30s | Avg: 41m 30s | Max: 41m 30s | Hits:  99%/1264  
      🟩 GraphCapture       Pass: 100%/1   | Total: 35m 02s | Avg: 35m 02s | Max: 35m 02s | Hits:  99%/1264  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 53m | Avg: 37m 59s | Max: 42m 57s | Hits:  99%/3790  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 46m | Avg: 35m 37s | Max: 41m 53s | Hits:  99%/3790  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 11m | Avg: 23m 53s | Max: 34m 11s | Hits:  99%/3792  
      🟩 90;90a;100         Pass: 100%/1   | Total:  9m 27s | Avg:  9m 27s | Max:  9m 27s | Hits:  99%/1264  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  8h 03m | Avg: 23m 01s | Max:  1h 23m | Hits:  89%/25677 
      🟩 20                 Pass: 100%/26  | Total:  9h 28m | Avg: 21m 51s | Max:  1h 18m | Hits:  96%/32383 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 12h 07m | Avg: 15m 28s | Max: 1h 19m | Hits: 96%/89895

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 26s | Avg: 10m 43s | Max: 13m 35s | Hits:  99%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 11h 55m | Avg: 15m 54s | Max:  1h 19m | Hits:  96%/86068 
      🟩 arm64              Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  6m 35s | Hits:  99%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 19m | Avg: 15m 54s | Max: 55m 11s | Hits:  96%/9560  
      🟩 12.9               Pass: 100%/42  | Total: 10h 47m | Avg: 15m 25s | Max:  1h 19m | Hits:  96%/80335 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  5m 36s | Hits: 100%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 19m | Avg: 15m 54s | Max: 55m 11s | Hits:  96%/9560  
      🟩 nvcc12.9           Pass: 100%/40  | Total: 10h 36m | Avg: 15m 54s | Max:  1h 19m | Hits:  96%/76509 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  5m 36s | Hits: 100%/3826  
      🟩 nvcc               Pass: 100%/45  | Total: 11h 56m | Avg: 15m 54s | Max:  1h 19m | Hits:  96%/86069 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 03s | Avg:  5m 45s | Max:  6m 13s | Hits: 100%/7652  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 31s | Avg:  6m 15s | Max:  6m 16s | Hits: 100%/3826  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  5m 57s | Hits: 100%/3826  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 59s | Avg:  5m 59s | Max:  6m 02s | Hits: 100%/3826  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  5m 48s | Hits: 100%/3826  
      🟩 Clang19            Pass: 100%/7   | Total: 46m 33s | Avg:  6m 39s | Max: 10m 39s | Hits: 100%/13391 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 18s | Avg:  6m 39s | Max:  6m 58s | Hits:  99%/3828  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 09s | Avg:  7m 09s | Max:  7m 09s | Hits:  99%/1914  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 04s | Avg:  7m 02s | Max:  7m 02s | Hits:  99%/3828  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 29s | Avg:  7m 14s | Max:  7m 20s | Hits:  99%/3828  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 47s | Avg:  7m 23s | Max:  7m 40s | Hits:  99%/3828  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 15s | Avg:  7m 37s | Max:  7m 51s | Hits:  99%/3828  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 39m | Avg:  9m 55s | Max: 17m 40s | Hits:  99%/19140 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 16s | Max:  1h 03m | Hits:  80%/3812  
      🟩 MSVC14.43          Pass: 100%/3   | Total:  2h 35m | Avg: 51m 49s | Max:  1h 01m | Hits:  87%/5718  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 37m | Avg:  1h 18m | Max:  1h 19m | Hits:  65%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 57m | Avg:  6m 11s | Max: 10m 39s | Hits: 100%/36347 
      🟩 GCC                Pass: 100%/21  | Total:  2h 58m | Avg:  8m 29s | Max: 17m 40s | Hits:  99%/40194 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 34m | Avg: 54m 48s | Max:  1h 03m | Hits:  84%/9530  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 37m | Avg:  1h 18m | Max:  1h 19m | Hits:  65%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 17m 40s | Hits:  99%/3828  
      🟩 rtx2080            Pass: 100%/35  | Total:  8h 51m | Avg: 15m 10s | Max:  1h 19m | Hits:  96%/66946 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 52m | Avg: 17m 14s | Max:  1h 01m | Hits:  97%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 10h 19m | Avg: 15m 29s | Max:  1h 19m | Hits:  96%/76507 
      🟩 TestCPU            Pass: 100%/3   | Total: 52m 07s | Avg: 17m 22s | Max: 33m 39s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 55m 18s | Avg: 13m 49s | Max: 17m 40s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 17m 40s | Hits:  99%/3828  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 16s | Avg:  8m 16s | Max:  8m 16s | Hits:  99%/1914  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  6h 09m | Avg: 17m 35s | Max:  1h 19m | Hits:  95%/40160 
      🟩 20                 Pass: 100%/24  | Total:  5h 36m | Avg: 14m 00s | Max:  1h 18m | Hits:  97%/45907 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 18h 31m | Avg: 24m 41s | Max: 1h 05m | Hits: 48%/131117

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 18h 00m | Avg: 25m 07s | Max:  1h 05m | Hits:  47%/124328
      🟩 arm64              Pass: 100%/2   | Total: 31m 16s | Avg: 15m 38s | Max: 26m 58s | Hits:  71%/6789  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 59m | Avg: 23m 49s | Max: 27m 08s | Hits:  49%/16608 
      🟩 12.9               Pass: 100%/40  | Total: 16h 32m | Avg: 24m 48s | Max:  1h 05m | Hits:  48%/114509
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 50m 19s | Avg: 25m 09s | Max: 25m 53s | Hits:  26%/6753  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 59m | Avg: 23m 49s | Max: 27m 08s | Hits:  49%/16608 
      🟩 nvcc12.9           Pass: 100%/38  | Total: 15h 42m | Avg: 24m 47s | Max:  1h 05m | Hits:  50%/107756
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 19s | Avg: 25m 09s | Max: 25m 53s | Hits:  26%/6753  
      🟩 nvcc               Pass: 100%/43  | Total: 17h 41m | Avg: 24m 40s | Max:  1h 05m | Hits:  50%/124364
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 46m | Avg: 26m 34s | Max: 30m 21s | Hits:  33%/13462 
      🟩 Clang15            Pass: 100%/2   | Total: 58m 19s | Avg: 29m 09s | Max: 33m 20s | Hits:  37%/6749  
      🟩 Clang16            Pass: 100%/2   | Total: 53m 37s | Avg: 26m 48s | Max: 28m 52s | Hits:  43%/6749  
      🟩 Clang17            Pass: 100%/2   | Total: 56m 12s | Avg: 28m 06s | Max: 29m 48s | Hits:  43%/6749  
      🟩 Clang18            Pass: 100%/2   | Total: 51m 48s | Avg: 25m 54s | Max: 27m 12s | Hits:  43%/6749  
      🟩 Clang19            Pass: 100%/6   | Total:  2h 38m | Avg: 26m 25s | Max:  1h 05m | Hits:  56%/16896 
      🟩 GCC7               Pass: 100%/2   | Total: 47m 08s | Avg: 23m 34s | Max: 26m 33s | Hits:  34%/6685  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s | Hits:  99%/3353  
      🟩 GCC9               Pass: 100%/2   | Total: 28m 35s | Avg: 14m 17s | Max: 23m 43s | Hits:  71%/6697  
      🟩 GCC10              Pass: 100%/2   | Total: 54m 27s | Avg: 27m 13s | Max: 28m 37s | Hits:  43%/6751  
      🟩 GCC11              Pass: 100%/2   | Total: 52m 33s | Avg: 26m 16s | Max: 28m 08s | Hits:  42%/6747  
      🟩 GCC12              Pass: 100%/2   | Total: 33m 48s | Avg: 16m 54s | Max: 29m 14s | Hits:  71%/6751  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 27m | Avg: 20m 43s | Max: 43m 00s | Hits:  46%/17142 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 56m 01s | Avg: 28m 00s | Max: 28m 53s | Hits:  70%/6423  
      🟩 MSVC14.43          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 33m 46s | Hits:  36%/6475  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 19m | Avg: 39m 46s | Max: 43m 17s | Hits:  41%/6739  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  8h 04m | Avg: 26m 55s | Max:  1h 05m | Hits:  44%/57354 
      🟩 GCC                Pass: 100%/21  | Total:  7h 08m | Avg: 20m 23s | Max: 43m 00s | Hits:  53%/54126 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 58m | Avg: 29m 44s | Max: 33m 46s | Hits:  53%/12898 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 19m | Avg: 39m 46s | Max: 43m 17s | Hits:  41%/6739  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 43s | Avg: 11m 21s | Max: 18m 22s | Hits:  99%/3478  
      🟩 rtx2080            Pass: 100%/43  | Total: 18h 08m | Avg: 25m 19s | Max:  1h 05m | Hits:  47%/127639
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total: 16h 13m | Avg: 24m 56s | Max: 43m 17s | Hits:  48%/131077
      🟩 NVRTC              Pass: 100%/2   | Total: 40m 45s | Avg: 20m 22s | Max: 20m 46s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total:  1h 35m | Avg: 31m 55s | Max:  1h 05m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 40m 45s | Avg: 20m 22s | Max: 20m 46s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 22m 43s | Avg: 11m 21s | Max: 18m 22s | Hits:  99%/3478  
      🟩 90;90a;100         Pass: 100%/1   | Total: 43m 00s | Avg: 43m 00s | Max: 43m 00s | Hits:  29%/3478  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  8h 08m | Avg: 22m 11s | Max: 36m 16s | Hits:  53%/69992 
      🟩 20                 Pass: 100%/22  | Total: 10h 21m | Avg: 28m 14s | Max:  1h 05m | Hits:  43%/61125 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 17m | Avg: 5m 18s | Max: 13m 30s | Hits: 98%/15058

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 05m | Avg:  5m 41s | Max: 13m 30s | Hits:  97%/12650 
      🟩 arm64              Pass: 100%/4   | Total: 12m 38s | Avg:  3m 09s | Max:  3m 30s | Hits:  99%/2408  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 20m 05s | Avg:  6m 41s | Max: 13m 30s | Hits:  96%/1511  
      🟩 12.9               Pass: 100%/23  | Total:  1h 57m | Avg:  5m 07s | Max: 13m 03s | Hits:  98%/13547 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 20m 05s | Avg:  6m 41s | Max: 13m 30s | Hits:  96%/1511  
      🟩 nvcc12.9           Pass: 100%/23  | Total:  1h 57m | Avg:  5m 07s | Max: 13m 03s | Hits:  98%/13547 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 17m | Avg:  5m 18s | Max: 13m 30s | Hits:  98%/15058 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 32s | Hits: 100%/1208  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits: 100%/602   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s | Hits: 100%/602   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s | Hits: 100%/602   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits: 100%/602   
      🟩 Clang19            Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  8m 13s | Hits: 100%/2408  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 15s | Avg:  3m 37s | Max:  3m 42s | Hits:  99%/1208  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s | Hits:  99%/602   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s | Hits:  99%/602   
      🟩 GCC13              Pass: 100%/8   | Total: 37m 50s | Avg:  4m 43s | Max:  9m 41s | Hits:  99%/4816  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 30s | Avg: 13m 30s | Max: 13m 30s | Hits:  81%/303   
      🟩 MSVC14.43          Pass: 100%/1   | Total: 13m 03s | Avg: 13m 03s | Max: 13m 03s | Hits:  81%/303   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 20m 22s | Avg: 10m 11s | Max: 10m 21s | Hits:  88%/1200  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 38m 03s | Avg:  3m 48s | Max:  8m 13s | Hits: 100%/6024  
      🟩 GCC                Pass: 100%/12  | Total: 52m 55s | Avg:  4m 24s | Max:  9m 41s | Hits:  99%/7228  
      🟩 MSVC               Pass: 100%/2   | Total: 26m 33s | Avg: 13m 16s | Max: 13m 30s | Hits:  81%/606   
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 22s | Avg: 10m 11s | Max: 10m 21s | Hits:  88%/1200  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  7m 49s | Hits:  99%/1204  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 06m | Avg:  5m 17s | Max: 13m 30s | Hits:  98%/13854 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  1h 52m | Avg:  4m 52s | Max: 13m 30s | Hits:  97%/13252 
      🟩 Test               Pass: 100%/3   | Total: 25m 43s | Avg:  8m 34s | Max:  9m 41s | Hits:  99%/1806  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 14m 11s | Avg:  4m 43s | Max:  7m 49s | Hits:  99%/1806  
      🟩 90a                Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits:  99%/602   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 19m 26s | Avg:  4m 51s | Max: 10m 01s | Hits:  96%/2406  
      🟩 20                 Pass: 100%/22  | Total:  1h 58m | Avg:  5m 23s | Max: 13m 30s | Hits:  98%/12652 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 47m | Avg: 8m 56s | Max: 21m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 ctk
      🟩 12.9               Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 47m | Avg:  8m 56s | Max: 21m 32s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  5m 44s | Avg:  2m 52s | Max:  2m 57s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  5m 58s | Avg:  2m 59s | Max:  3m 02s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max:  8m 09s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total:  6m 53s | Avg:  3m 26s | Max:  3m 32s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 42m 26s | Avg: 21m 13s | Max: 21m 32s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 30m 41s | Avg: 15m 20s | Max: 15m 24s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total: 54m 18s | Avg:  9m 03s | Max: 21m 32s
      🟩 3.13               Pass: 100%/6   | Total: 53m 01s | Avg:  8m 50s | Max: 20m 54s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 14m 19s | Avg: 3m 34s | Max: 4m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 29s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  2m 52s
      🟩 12.9               Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  2m 52s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  8m 37s | Avg:  4m 18s | Max:  4m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 29s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
      🟩 GCC12              Pass: 100%/1   | Total:  2m 50s | Avg:  2m 50s | Max:  2m 50s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 29s | Avg:  4m 29s | Max:  4m 29s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 00s | Avg:  3m 30s | Max:  4m 08s
      🟩 GCC                Pass: 100%/2   | Total:  7m 19s | Avg:  3m 39s | Max:  4m 29s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 29s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 14m 19s | Avg:  3m 34s | Max:  4m 29s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 17s | Avg: 4m 19s | Max: 4m 36s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 28s | Avg:  4m 14s | Max:  4m 16s
      🟩 arm64              Pass: 100%/2   | Total:  8m 49s | Avg:  4m 24s | Max:  4m 36s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 17s | Avg:  4m 19s | Max:  4m 36s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 25s | Avg:  4m 12s | Max:  4m 13s
      🟩 20                 Pass: 100%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 36s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 17s | Avg: 6m 08s | Max: 10m 10s | Hits: 98%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 ctk
      🟩 12.9               Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max: 10m 10s | Hits:  98%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s | Hits:  98%/164   
      🟩 Test               Pass: 100%/1   | Total: 10m 10s | Avg: 10m 10s | Max: 10m 10s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 187)

# Runner
129 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 linux-amd64-gpu-rtx2080-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

github-actions bot commented Jun 6, 2025

🟩 CI finished in 2h 00m: Pass: 100%/187 | Total: 3d 18h | Avg: 29m 01s | Max: 1h 20m | Hits: 75%/294458
  • 🟩 cub: Pass: 100%/47 | Total: 1d 19h | Avg: 54m 56s | Max: 1h 20m | Hits: 68%/58060

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 17h | Avg: 54m 42s | Max:  1h 20m | Hits:  70%/55534 
      🟩 arm64              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 05m | Hits:  42%/2526  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 53m | Avg: 58m 41s | Max:  1h 07m | Hits:  69%/6131  
      🟩 12.9               Pass: 100%/42  | Total:  1d 14h | Avg: 54m 29s | Max:  1h 20m | Hits:  68%/51929 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 03m | Avg: 31m 32s | Max: 33m 09s | Hits:  74%/2177  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 53m | Avg: 58m 41s | Max:  1h 07m | Hits:  69%/6131  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  1d 13h | Avg: 55m 38s | Max:  1h 20m | Hits:  68%/49752 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 32s | Max: 33m 09s | Hits:  74%/2177  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 17h | Avg: 55m 59s | Max:  1h 20m | Hits:  68%/55883 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 54s | Max: 58m 19s | Hits:  68%/5054  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 57m 24s | Hits:  68%/2523  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 01s | Max: 59m 12s | Hits:  68%/2523  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 48m | Avg: 54m 21s | Max: 55m 37s | Hits:  68%/2523  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 51m | Avg: 55m 37s | Max: 57m 52s | Hits:  68%/2523  
      🟩 Clang19            Pass: 100%/7   | Total:  4h 50m | Avg: 41m 31s | Max: 55m 16s | Hits:  79%/8486  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max: 59m 39s | Hits:  68%/2526  
      🟩 GCC8               Pass: 100%/1   | Total: 59m 11s | Avg: 59m 11s | Max: 59m 11s | Hits:  68%/1263  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 57m | Avg: 58m 43s | Max: 59m 21s | Hits:  68%/2526  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  68%/2527  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 48s | Max:  1h 01m | Hits:  68%/2523  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  68%/2523  
      🟩 GCC13              Pass: 100%/11  | Total:  8h 45m | Avg: 47m 45s | Max:  1h 17m | Hits:  61%/13901 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 15m | Hits:  74%/2156  
      🟩 MSVC14.43          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 20m | Hits:  74%/2156  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 15m | Hits:  68%/2327  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 15h 58m | Avg: 50m 25s | Max: 59m 12s | Hits:  72%/23632 
      🟩 GCC                Pass: 100%/22  | Total: 19h 43m | Avg: 53m 47s | Max:  1h 17m | Hits:  64%/27789 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 56m | Avg:  1h 14m | Max:  1h 20m | Hits:  74%/4312  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 15m | Hits:  68%/2327  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 41m | Avg: 33m 40s | Max: 35m 27s | Hits:  71%/3792  
      🟩 rtx2080            Pass: 100%/36  | Total:  1d 11h | Avg: 59m 23s | Max:  1h 20m | Hits:  64%/44162 
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 43m | Avg: 42m 55s | Max:  1h 17m | Hits:  85%/10106 
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total:  1d 14h | Avg: 59m 07s | Max:  1h 20m | Hits:  62%/47952 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 41m 35s | Avg: 41m 35s | Max: 41m 35s | Hits:  99%/1264  
      🟩 GraphCapture       Pass: 100%/1   | Total: 28m 53s | Avg: 28m 53s | Max: 28m 53s | Hits:  99%/1264  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 44m | Avg: 34m 40s | Max: 37m 18s | Hits:  99%/3790  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 42m | Avg: 34m 00s | Max: 40m 37s | Hits:  99%/3790  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 41m | Avg: 33m 40s | Max: 35m 27s | Hits:  71%/3792  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m | Hits:  15%/1264  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 21h 03m | Avg:  1h 00m | Max:  1h 15m | Hits:  66%/25677 
      🟩 20                 Pass: 100%/26  | Total: 21h 59m | Avg: 50m 44s | Max:  1h 20m | Hits:  70%/32383 
    
  • 🟩 thrust: Pass: 100%/47 | Total: 1d 04h | Avg: 36m 44s | Max: 1h 14m | Hits: 81%/89895

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 53m 51s | Avg: 26m 55s | Max: 40m 20s | Hits:  79%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 03h | Avg: 36m 46s | Max:  1h 14m | Hits:  81%/86068 
      🟩 arm64              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 45s | Max: 40m 22s | Hits:  70%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 12m | Avg: 38m 28s | Max:  1h 01m | Hits:  79%/9560  
      🟩 12.9               Pass: 100%/42  | Total:  1d 01h | Avg: 36m 31s | Max:  1h 14m | Hits:  81%/80335 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 21s | Avg: 26m 40s | Max: 27m 02s | Hits:  80%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 12m | Avg: 38m 28s | Max:  1h 01m | Hits:  79%/9560  
      🟩 nvcc12.9           Pass: 100%/40  | Total:  1d 00h | Avg: 37m 01s | Max:  1h 14m | Hits:  81%/76509 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 21s | Avg: 26m 40s | Max: 27m 02s | Hits:  80%/3826  
      🟩 nvcc               Pass: 100%/45  | Total:  1d 03h | Avg: 37m 10s | Max:  1h 14m | Hits:  81%/86069 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 36m 33s | Hits:  80%/7652  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 07m | Avg: 33m 56s | Max: 34m 11s | Hits:  80%/3826  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 45s | Max: 38m 27s | Hits:  80%/3826  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 12m | Avg: 36m 02s | Max: 38m 01s | Hits:  80%/3826  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 53s | Max: 33m 31s | Hits:  80%/3826  
      🟩 Clang19            Pass: 100%/7   | Total:  2h 56m | Avg: 25m 10s | Max: 37m 23s | Hits:  86%/13391 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 03s | Max: 36m 14s | Hits:  80%/3828  
      🟩 GCC8               Pass: 100%/1   | Total: 38m 34s | Avg: 38m 34s | Max: 38m 34s | Hits:  80%/1914  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 12m | Avg: 36m 11s | Max: 38m 18s | Hits:  80%/3828  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 18m | Avg: 39m 14s | Max: 39m 31s | Hits:  80%/3828  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 17m | Avg: 38m 41s | Max: 39m 09s | Hits:  80%/3828  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 38m 34s | Hits:  80%/3828  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 44m | Avg: 28m 27s | Max: 44m 55s | Hits:  78%/19140 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits:  76%/3812  
      🟩 MSVC14.43          Pass: 100%/3   | Total:  2h 57m | Avg: 59m 08s | Max:  1h 14m | Hits:  84%/5718  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits:  76%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 48m | Avg: 30m 56s | Max: 38m 27s | Hits:  82%/36347 
      🟩 GCC                Pass: 100%/21  | Total: 11h 36m | Avg: 33m 08s | Max: 44m 55s | Hits:  79%/40194 
      🟩 MSVC               Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 14m | Hits:  80%/9530  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 08m | Hits:  76%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 40m 32s | Avg: 20m 16s | Max: 26m 55s | Hits:  79%/3828  
      🟩 rtx2080            Pass: 100%/35  | Total: 23h 29m | Avg: 40m 15s | Max:  1h 14m | Hits:  78%/66946 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 36m | Avg: 27m 40s | Max:  1h 08m | Hits:  89%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 03h | Avg: 40m 34s | Max:  1h 14m | Hits:  77%/76507 
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 59s | Avg: 17m 19s | Max: 34m 20s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 16s | Avg: 12m 49s | Max: 13m 37s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 40m 32s | Avg: 20m 16s | Max: 26m 55s | Hits:  79%/3828  
      🟩 90;90a;100         Pass: 100%/1   | Total: 44m 55s | Avg: 44m 55s | Max: 44m 55s | Hits:  76%/1914  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 14h 38m | Avg: 41m 49s | Max:  1h 14m | Hits:  78%/40160 
      🟩 20                 Pass: 100%/24  | Total: 13h 14m | Avg: 33m 05s | Max:  1h 08m | Hits:  83%/45907 
    
  • 🟩 libcudacxx: Pass: 100%/45 | Total: 13h 08m | Avg: 17m 31s | Max: 1h 02m | Hits: 72%/131117

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 12h 18m | Avg: 17m 10s | Max:  1h 02m | Hits:  74%/124328
      🟩 arm64              Pass: 100%/2   | Total: 49m 56s | Avg: 24m 58s | Max: 26m 29s | Hits:  46%/6789  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 04m | Avg: 12m 50s | Max: 28m 19s | Hits:  74%/16608 
      🟩 12.9               Pass: 100%/40  | Total: 12h 04m | Avg: 18m 06s | Max:  1h 02m | Hits:  72%/114509
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 52m 28s | Avg: 26m 14s | Max: 26m 53s | Hits:  26%/6753  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 04m | Avg: 12m 50s | Max: 28m 19s | Hits:  74%/16608 
      🟩 nvcc12.9           Pass: 100%/38  | Total: 11h 12m | Avg: 17m 41s | Max:  1h 02m | Hits:  75%/107756
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 28s | Avg: 26m 14s | Max: 26m 53s | Hits:  26%/6753  
      🟩 nvcc               Pass: 100%/43  | Total: 12h 16m | Avg: 17m 07s | Max:  1h 02m | Hits:  75%/124364
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 38m 53s | Avg:  9m 43s | Max: 25m 00s | Hits:  85%/13462 
      🟩 Clang15            Pass: 100%/2   | Total: 47m 13s | Avg: 23m 36s | Max: 29m 50s | Hits:  55%/6749  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 10s | Hits:  98%/6749  
      🟩 Clang17            Pass: 100%/2   | Total: 33m 07s | Avg: 16m 33s | Max: 28m 05s | Hits:  71%/6749  
      🟩 Clang18            Pass: 100%/2   | Total: 15m 17s | Avg:  7m 38s | Max: 10m 08s | Hits:  96%/6749  
      🟩 Clang19            Pass: 100%/6   | Total:  2h 12m | Avg: 22m 00s | Max: 26m 53s | Hits:  48%/16896 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 21s | Hits:  99%/6685  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s | Hits:  99%/3353  
      🟩 GCC9               Pass: 100%/2   | Total: 28m 13s | Avg: 14m 06s | Max: 23m 24s | Hits:  71%/6697  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  4m 54s | Hits:  99%/6751  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 24s | Avg:  4m 42s | Max:  4m 52s | Hits:  99%/6747  
      🟩 GCC12              Pass: 100%/2   | Total: 30m 15s | Avg: 15m 07s | Max: 25m 13s | Hits:  75%/6751  
      🟩 GCC13              Pass: 100%/10  | Total:  4h 08m | Avg: 24m 48s | Max:  1h 02m | Hits:  54%/17142 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 26s | Avg: 29m 13s | Max: 30m 07s | Hits:  65%/6423  
      🟩 MSVC14.43          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 49s | Max: 33m 10s | Hits:  36%/6475  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 49m 50s | Avg: 24m 55s | Max: 39m 45s | Hits:  69%/6739  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  4h 36m | Avg: 15m 21s | Max: 29m 50s | Hits:  72%/57354 
      🟩 GCC                Pass: 100%/21  | Total:  5h 38m | Avg: 16m 06s | Max:  1h 02m | Hits:  78%/54126 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 04m | Avg: 31m 01s | Max: 33m 10s | Hits:  51%/12898 
      🟩 NVHPC              Pass: 100%/2   | Total: 49m 50s | Avg: 24m 55s | Max: 39m 45s | Hits:  69%/6739  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 52s | Avg:  9m 26s | Max: 13m 49s | Hits:  98%/3478  
      🟩 rtx2080            Pass: 100%/43  | Total: 12h 49m | Avg: 17m 54s | Max:  1h 02m | Hits:  71%/127639
    🟩 jobs
      🟩 Build              Pass: 100%/39  | Total: 10h 42m | Avg: 16m 27s | Max: 39m 45s | Hits:  72%/131077
      🟩 NVRTC              Pass: 100%/2   | Total: 46m 47s | Avg: 23m 23s | Max: 24m 44s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total:  1h 38m | Avg: 32m 41s | Max:  1h 02m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 46m 47s | Avg: 23m 23s | Max: 24m 44s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 52s | Avg:  9m 26s | Max: 13m 49s | Hits:  98%/3478  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 06s | Avg: 31m 06s | Max: 31m 06s | Hits:  43%/3478  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  5h 56m | Avg: 16m 11s | Max: 33m 10s | Hits:  73%/69992 
      🟩 20                 Pass: 100%/22  | Total:  7h 10m | Avg: 19m 34s | Max:  1h 02m | Hits:  71%/61125 
    
  • 🟩 cudax: Pass: 100%/26 | Total: 3h 01m | Avg: 6m 58s | Max: 14m 26s | Hits: 91%/15058

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 39m | Avg:  7m 15s | Max: 14m 26s | Hits:  91%/12650 
      🟩 arm64              Pass: 100%/4   | Total: 21m 44s | Avg:  5m 26s | Max:  5m 58s | Hits:  90%/2408  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 25m 02s | Avg:  8m 20s | Max: 14m 26s | Hits:  88%/1511  
      🟩 12.9               Pass: 100%/23  | Total:  2h 36m | Avg:  6m 48s | Max: 13m 13s | Hits:  91%/13547 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 25m 02s | Avg:  8m 20s | Max: 14m 26s | Hits:  88%/1511  
      🟩 nvcc12.9           Pass: 100%/23  | Total:  2h 36m | Avg:  6m 48s | Max: 13m 13s | Hits:  91%/13547 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  3h 01m | Avg:  6m 58s | Max: 14m 26s | Hits:  91%/15058 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 46s | Hits:  90%/1208  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 29s | Avg:  5m 29s | Max:  5m 29s | Hits:  90%/602   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 34s | Avg:  5m 34s | Max:  5m 34s | Hits:  90%/602   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s | Hits:  90%/602   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s | Hits:  90%/602   
      🟩 Clang19            Pass: 100%/4   | Total: 24m 19s | Avg:  6m 04s | Max:  8m 03s | Hits:  93%/2408  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 14s | Hits:  90%/1208  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 22s | Avg:  6m 22s | Max:  6m 22s | Hits:  90%/602   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 05s | Avg:  6m 05s | Max:  6m 05s | Hits:  90%/602   
      🟩 GCC13              Pass: 100%/8   | Total: 51m 12s | Avg:  6m 24s | Max: 10m 57s | Hits:  92%/4816  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s | Hits:  79%/303   
      🟩 MSVC14.43          Pass: 100%/1   | Total: 13m 13s | Avg: 13m 13s | Max: 13m 13s | Hits:  79%/303   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 21m 14s | Avg: 10m 37s | Max: 10m 47s | Hits:  88%/1200  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 57m 00s | Avg:  5m 42s | Max:  8m 03s | Hits:  91%/6024  
      🟩 GCC                Pass: 100%/12  | Total:  1h 15m | Avg:  6m 17s | Max: 10m 57s | Hits:  92%/7228  
      🟩 MSVC               Pass: 100%/2   | Total: 27m 39s | Avg: 13m 49s | Max: 14m 26s | Hits:  79%/606   
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 14s | Avg: 10m 37s | Max: 10m 47s | Hits:  88%/1200  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  7m 56s | Hits:  95%/1204  
      🟩 rtx2080            Pass: 100%/24  | Total:  2h 48m | Avg:  7m 00s | Max: 14m 26s | Hits:  90%/13854 
    🟩 jobs
      🟩 Build              Pass: 100%/23  | Total:  2h 34m | Avg:  6m 43s | Max: 14m 26s | Hits:  89%/13252 
      🟩 Test               Pass: 100%/3   | Total: 26m 56s | Avg:  8m 58s | Max: 10m 57s | Hits:  99%/1806  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 35s | Avg:  5m 51s | Max:  7m 56s | Hits:  93%/1806  
      🟩 90a                Pass: 100%/1   | Total:  4m 51s | Avg:  4m 51s | Max:  4m 51s | Hits:  90%/602   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 25m 38s | Avg:  6m 24s | Max: 10m 27s | Hits:  90%/2406  
      🟩 20                 Pass: 100%/22  | Total:  2h 35m | Avg:  7m 04s | Max: 14m 26s | Hits:  91%/12652 
    
  • 🟩 python: Pass: 100%/12 | Total: 1h 41m | Avg: 8m 29s | Max: 18m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 ctk
      🟩 12.9               Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 gpu
      🟩 rtxa6000           Pass: 100%/12  | Total:  1h 41m | Avg:  8m 29s | Max: 18m 29s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total:  6m 06s | Avg:  3m 03s | Max:  3m 04s
      🟩 Build cuda.cooperative Pass: 100%/2   | Total:  6m 06s | Avg:  3m 03s | Max:  3m 04s
      🟩 Build cuda.parallel Pass: 100%/2   | Total: 15m 20s | Avg:  7m 40s | Max:  7m 41s
      🟩 Test cuda.cccl     Pass: 100%/2   | Total:  6m 57s | Avg:  3m 28s | Max:  3m 29s
      🟩 Test cuda.cooperative Pass: 100%/2   | Total: 35m 58s | Avg: 17m 59s | Max: 18m 29s
      🟩 Test cuda.parallel Pass: 100%/2   | Total: 31m 30s | Avg: 15m 45s | Max: 16m 03s
    🟩 py_version
      🟩 3.10               Pass: 100%/6   | Total: 50m 10s | Avg:  8m 21s | Max: 17m 29s
      🟩 3.13               Pass: 100%/6   | Total: 51m 47s | Avg:  8m 37s | Max: 18m 29s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 16m 47s | Avg: 4m 11s | Max: 5m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  5m 14s
      🟩 12.9               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 07s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  5m 14s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 07s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 14s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 Clang19            Pass: 100%/1   | Total:  3m 40s | Avg:  3m 40s | Max:  3m 40s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 26s | Avg:  3m 43s | Max:  3m 46s
      🟩 GCC                Pass: 100%/2   | Total:  9m 21s | Avg:  4m 40s | Max:  5m 14s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 16m 47s | Avg:  4m 11s | Max:  5m 14s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 54s | Avg: 4m 13s | Max: 4m 31s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 00s | Avg:  4m 30s | Max:  4m 31s
      🟩 arm64              Pass: 100%/2   | Total:  7m 54s | Avg:  3m 57s | Max:  4m 06s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 54s | Avg:  4m 13s | Max:  4m 31s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 35s | Avg:  4m 17s | Max:  4m 29s
      🟩 20                 Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 31s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 13m 42s | Avg: 6m 51s | Max: 10m 14s | Hits: 92%/328

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 ctk
      🟩 12.9               Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max: 10m 14s | Hits:  92%/328   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits:  87%/164   
      🟩 Test               Pass: 100%/1   | Total: 10m 14s | Avg: 10m 14s | Max: 10m 14s | Hits:  98%/164   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 187)

# Runner
129 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
12 linux-amd64-gpu-rtxa6000-latest-1
11 linux-amd64-gpu-rtx2080-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

@bernhardmgruber bernhardmgruber left a comment

I reviewed everything except catch2_test_env_launch_helper.h, which seems to duplicate code from the existing catch2_test_launch_helper.h; I have not assessed whether that incurs any new technical debt.


// RFA is only supported for float and double accumulators
constexpr bool is_float_or_double = _CUDA_VSTD::is_same_v<accum_t, float> || _CUDA_VSTD::is_same_v<accum_t, double>;
constexpr bool is_sum = _CUDA_VSTD::is_same_v<ReductionOpT, ::cuda::std::plus<>>;
Contributor

Important: Use detail::reduce::is_plus here so that plus<T> is also detected, not only plus<>.

Suggested change
- constexpr bool is_sum = _CUDA_VSTD::is_same_v<ReductionOpT, ::cuda::std::plus<>>;
+ constexpr bool is_sum = detail::reduce::is_plus<ReductionOpT>;
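
To illustrate why the exact-type comparison misses plus<T>, here is a minimal sketch in standard C++17. It uses std::plus in place of ::cuda::std::plus so that it compiles on the host alone, and is_plus_sketch is a hypothetical stand-in for the trait named in the suggestion; the actual definition of detail::reduce::is_plus is not shown in this thread and may differ.

```cpp
#include <functional>
#include <type_traits>

// Hypothetical stand-in for the trait named in the suggestion.
// This is an assumption for illustration, not CUB's implementation.
template <typename Op>
struct is_plus_sketch : std::false_type {};

// Matches every specialization of std::plus, including std::plus<void>,
// which is what std::plus<> names.
template <typename T>
struct is_plus_sketch<std::plus<T>> : std::true_type {};

template <typename Op>
inline constexpr bool is_plus_sketch_v = is_plus_sketch<Op>::value;

// std::plus<> is shorthand for the transparent functor std::plus<void>.
static_assert(std::is_same_v<std::plus<>, std::plus<void>>);

// The exact-type comparison rejects a typed functor such as std::plus<float>.
static_assert(!std::is_same_v<std::plus<float>, std::plus<>>);

// The partial specialization accepts both spellings of "sum"...
static_assert(is_plus_sketch_v<std::plus<>>);
static_assert(is_plus_sketch_v<std::plus<float>>);

// ...and still rejects other operators.
static_assert(!is_plus_sketch_v<std::multiplies<>>);
```

With the is_same_v check, a caller passing plus<float> would presumably not be recognized as a sum; a trait of this shape avoids that.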

Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

Publish reproducible reduction API through requirements
5 participants