Skip to content

Conversation

miscco
Copy link
Contributor

@miscco miscco commented Oct 13, 2025

  • Replace __clz in warp_scan_shfl.cuh.

  • Replace __clz in block_radix_rank.cuh

  • Replace __clz in warp_reduce_shfl.cuh

  • Replace __clz in warp_reduce_smem.cuh

  • Replace thrust's clz with cuda::std::countl

  • Fully qualify with ::cuda

  • Fixup types or copy paste mistakes

  • Address review comments, countr_zero instead of countl(brev())

  • Use __bit_log2 for warp ballot index.

  • Use __bit_log2 for block leader in ComputeRanksItem

  • Ensure that we static cast in __clz to int in case we deal with ARM

  • Rename variable to not conflict with builtin

  • Use __bit_log2

  • Fix incorrect transformation

  • Drop internal clz function in favor of countl_zero

  • Drop unneeded include

  • Fix return type of __ballot_sync to unsigned

  • fix typo

  • Be super safe about unsigned integers

  • Fix argument type in radix_rank

* Replace `__clz` in `warp_scan_shfl.cuh`.

* Replace `__clz` in `block_radix_rank.cuh`

* Replace `__clz` in `warp_reduce_shfl.cuh`

* Replace `__clz` in `warp_reduce_smem.cuh`

* Replace thrust's `clz` with `cuda::std::countl`

* Fully qualify with `::cuda`

* Fixup types or copy paste mistakes

* Address review comments, `countr_zero` instead of `countl(brev())`

* Use __bit_log2 for warp ballot index.

* Use `__bit_log2` for block leader in ComputeRanksItem

* Ensure that we static cast in `__clz` to int in case we deal with ARM

* Rename variable to not conflict with builtin

* Use `__bit_log2`

* Fix incorrect transformation

* Drop internal `clz` function in favor of `countl_zero`

* Drop unneeded include

* Fix return type of `__ballot_sync` to unsigned

* fix typo

* Be super safe about unsigned integers

* Fix argument type in radix_rank

---------

Co-authored-by: Michael Schellenberger Costa <[email protected]>
@miscco miscco requested review from a team as code owners October 13, 2025 13:11
@miscco miscco requested a review from ericniebler October 13, 2025 13:11
@github-project-automation github-project-automation bot moved this to Todo in CCCL Oct 13, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Oct 13, 2025
@bernhardmgruber
Copy link
Contributor

#6099, the source of this backport, omitted a performance analysis of the change. Let's wait with merging this until we know that it does not regress.

Copy link
Contributor

🟨 CI finished in 2h 22m: Pass: 99%/205 | Total: 3d 14h | Avg: 25m 11s | Max: 1h 57m | Hits: 75%/338630
  • 🟨 cub: Pass: 98%/50 | Total: 1d 14h | Avg: 45m 38s | Max: 1h 39m | Hits: 83%/60253

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/48  | Total:  1d 12h | Avg: 45m 46s | Max:  1h 39m | Hits:  83%/57745 
      🟩 arm64              Pass: 100%/2   | Total:  1h 25m | Avg: 42m 34s | Max: 43m 39s | Hits:  84%/2508  
    🔍 ctk: 12.9 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 20m | Avg: 52m 07s | Max:  1h 29m | Hits:  82%/6166  
      🔍 12.9               Pass:  97%/45  | Total:  1d 09h | Avg: 44m 55s | Max:  1h 39m | Hits:  83%/54087 
    🔍 cudacxx: nvcc12.9 🔍
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 08s | Hits:  99%/2157  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 20m | Avg: 52m 07s | Max:  1h 29m | Hits:  82%/6166  
      🔍 nvcc12.9           Pass:  97%/43  | Total:  1d 09h | Avg: 46m 46s | Max:  1h 39m | Hits:  82%/51930 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 08s | Hits:  99%/2157  
      🔍 nvcc               Pass:  97%/48  | Total:  1d 13h | Avg: 47m 19s | Max:  1h 39m | Hits:  82%/58096 
    🔍 cxx: Clang19 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 40m | Avg: 40m 12s | Max: 43m 28s | Hits:  84%/5018  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 24m | Avg: 42m 05s | Max: 46m 26s | Hits:  83%/2505  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 22m | Avg: 41m 11s | Max: 42m 17s | Hits:  80%/2505  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 19m | Avg: 39m 56s | Max: 43m 55s | Hits:  83%/2505  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 29m | Avg: 44m 55s | Max: 48m 58s | Hits:  83%/2505  
      🔍 Clang19            Pass:  85%/7   | Total:  3h 28m | Avg: 29m 45s | Max: 50m 22s | Hits:  91%/7168  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 42m | Avg: 51m 03s | Max: 54m 03s | Hits:  80%/2508  
      🟩 GCC8               Pass: 100%/1   | Total: 44m 56s | Avg: 44m 56s | Max: 44m 56s | Hits:  81%/1254  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 43m | Avg: 51m 52s | Max: 56m 57s | Hits:  80%/2508  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 38m | Avg: 49m 25s | Max: 51m 56s | Hits:  80%/2509  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 37m | Avg: 48m 51s | Max: 49m 39s | Hits:  78%/2505  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 42m | Avg: 51m 27s | Max: 51m 27s | Hits:  76%/2505  
      🟩 GCC13              Pass: 100%/12  | Total:  6h 55m | Avg: 34m 35s | Max: 52m 05s | Hits:  91%/15057 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 57m | Avg:  1h 28m | Max:  1h 29m | Hits:  69%/2298  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  5h 19m | Avg:  1h 19m | Max:  1h 39m | Hits:  69%/4596  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 58s | Max:  1h 00m | Hits:  74%/2307  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  94%/19  | Total: 11h 45m | Avg: 37m 07s | Max: 50m 22s | Hits:  86%/22206 
      🟩 GCC                Pass: 100%/23  | Total: 16h 05m | Avg: 41m 58s | Max: 56m 57s | Hits:  85%/28846 
      🟩 MSVC               Pass: 100%/6   | Total:  8h 17m | Avg:  1h 22m | Max:  1h 39m | Hits:  69%/6894  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 58s | Max:  1h 00m | Hits:  74%/2307  
    🔍 gpu: rtxa6000 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 25s | Max: 26m 17s | Hits:  95%/3765  
      🟩 rtx2080            Pass: 100%/39  | Total:  1d 07h | Avg: 49m 01s | Max:  1h 39m | Hits:  80%/47707 
      🔍 rtxa6000           Pass:  87%/8   | Total:  4h 59m | Avg: 37m 28s | Max: 49m 33s | Hits:  94%/8781  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/42  | Total:  1d 09h | Avg: 48m 08s | Max:  1h 39m | Hits:  80%/51470 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 35m 14s | Avg: 35m 14s | Max: 35m 14s | Hits:  99%/1255  
      🟩 GraphCapture       Pass: 100%/1   | Total: 33m 41s | Avg: 33m 41s | Max: 33m 41s | Hits:  99%/1255  
      🔍 HostLaunch         Pass:  66%/3   | Total:  1h 38m | Avg: 32m 42s | Max: 39m 16s | Hits:  99%/2510  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 32m | Avg: 30m 58s | Max: 35m 50s | Hits:  99%/3763  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/21  | Total: 18h 12m | Avg: 52m 01s | Max:  1h 39m | Hits:  79%/25726 
      🔍 20                 Pass:  96%/29  | Total: 19h 49m | Avg: 41m 01s | Max:  1h 34m | Hits:  86%/34527 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 10m | Avg: 23m 25s | Max: 26m 17s | Hits:  95%/3765  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 33m | Avg: 46m 38s | Max:  1h 05m | Hits:  78%/2404  
      🟩 100;120            Pass: 100%/2   | Total:  1h 27m | Avg: 43m 59s | Max:  1h 00m | Hits:  79%/2404  
    
  • 🟩 thrust: Pass: 100%/50 | Total: 19h 17m | Avg: 23m 08s | Max: 1h 18m | Hits: 93%/95771

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 33m 22s | Avg: 16m 41s | Max: 18m 42s | Hits:  97%/3834  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 18h 43m | Avg: 23m 24s | Max:  1h 18m | Hits:  93%/91938 
      🟩 arm64              Pass: 100%/2   | Total: 33m 33s | Avg: 16m 46s | Max: 18m 19s | Hits:  95%/3833  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 05m | Avg: 25m 06s | Max: 58m 42s | Hits:  90%/9575  
      🟩 12.9               Pass: 100%/45  | Total: 17h 11m | Avg: 22m 55s | Max:  1h 18m | Hits:  93%/86196 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  5m 55s | Hits: 100%/3832  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 05m | Avg: 25m 06s | Max: 58m 42s | Hits:  90%/9575  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 16h 59m | Avg: 23m 43s | Max:  1h 18m | Hits:  93%/82364 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  5m 55s | Hits: 100%/3832  
      🟩 nvcc               Pass: 100%/48  | Total: 19h 05m | Avg: 23m 51s | Max:  1h 18m | Hits:  92%/91939 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 06m | Avg: 16m 43s | Max: 18m 44s | Hits:  95%/7664  
      🟩 Clang15            Pass: 100%/2   | Total: 33m 39s | Avg: 16m 49s | Max: 17m 33s | Hits:  95%/3832  
      🟩 Clang16            Pass: 100%/2   | Total: 33m 31s | Avg: 16m 45s | Max: 17m 10s | Hits:  95%/3832  
      🟩 Clang17            Pass: 100%/2   | Total: 33m 02s | Avg: 16m 31s | Max: 16m 47s | Hits:  95%/3832  
      🟩 Clang18            Pass: 100%/2   | Total: 34m 22s | Avg: 17m 11s | Max: 17m 34s | Hits:  95%/3832  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 18m | Avg: 11m 10s | Max: 16m 13s | Hits:  97%/13412 
      🟩 GCC7               Pass: 100%/2   | Total: 34m 40s | Avg: 17m 20s | Max: 17m 26s | Hits:  95%/3834  
      🟩 GCC8               Pass: 100%/1   | Total: 17m 47s | Avg: 17m 47s | Max: 17m 47s | Hits:  95%/1917  
      🟩 GCC9               Pass: 100%/2   | Total: 37m 38s | Avg: 18m 49s | Max: 19m 52s | Hits:  95%/3834  
      🟩 GCC10              Pass: 100%/2   | Total: 35m 58s | Avg: 17m 59s | Max: 18m 54s | Hits:  95%/3834  
      🟩 GCC11              Pass: 100%/2   | Total: 34m 57s | Avg: 17m 28s | Max: 18m 06s | Hits:  95%/3834  
      🟩 GCC12              Pass: 100%/2   | Total: 40m 21s | Avg: 20m 10s | Max: 20m 30s | Hits:  95%/3834  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 45m | Avg: 15m 01s | Max: 19m 43s | Hits:  96%/21087 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 39s | Max: 58m 42s | Hits:  75%/3818  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  4h 04m | Avg: 48m 48s | Max:  1h 07m | Hits:  85%/9545  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 18m | Hits:  70%/3830  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  4h 39m | Avg: 14m 43s | Max: 18m 44s | Hits:  96%/36404 
      🟩 GCC                Pass: 100%/22  | Total:  6h 06m | Avg: 16m 39s | Max: 20m 30s | Hits:  96%/42174 
      🟩 MSVC               Pass: 100%/7   | Total:  6h 01m | Avg: 51m 37s | Max:  1h 07m | Hits:  82%/13363 
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 18m | Hits:  70%/3830  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 21m 23s | Avg: 10m 41s | Max: 11m 28s | Hits:  97%/3834  
      🟩 rtx2080            Pass: 100%/38  | Total: 15h 36m | Avg: 24m 38s | Max:  1h 18m | Hits:  91%/72786 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 19m | Avg: 19m 57s | Max: 53m 30s | Hits:  97%/19151 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total: 17h 31m | Avg: 24m 27s | Max:  1h 18m | Hits:  92%/82362 
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 25s | Avg: 17m 48s | Max: 35m 45s | Hits:  99%/5742  
      🟩 TestGPU            Pass: 100%/4   | Total: 52m 03s | Avg: 13m 00s | Max: 14m 43s | Hits:  99%/7667  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 21m 23s | Avg: 10m 41s | Max: 11m 28s | Hits:  97%/3834  
      🟩 90;90a             Pass: 100%/2   | Total: 58m 49s | Avg: 29m 24s | Max: 43m 39s | Hits:  91%/3826  
      🟩 100;120            Pass: 100%/2   | Total: 59m 03s | Avg: 29m 31s | Max: 43m 35s | Hits:  90%/3826  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  9h 14m | Avg: 26m 23s | Max:  1h 18m | Hits:  91%/40223 
      🟩 20                 Pass: 100%/27  | Total:  9h 29m | Avg: 21m 05s | Max:  1h 11m | Hits:  94%/51714 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 21h 50m | Avg: 27m 18s | Max: 1h 57m | Hits: 59%/166985

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 20h 47m | Avg: 27m 07s | Max:  1h 57m | Hits:  61%/159518
      🟩 arm64              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 32m 14s | Hits:  31%/7467  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 42m | Avg: 20m 25s | Max: 45m 24s | Hits:  66%/18302 
      🟩 12.9               Pass: 100%/43  | Total: 20h 08m | Avg: 28m 06s | Max:  1h 57m | Hits:  58%/148683
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 38s | Avg: 26m 49s | Max: 29m 55s | Hits:  29%/7433  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 42m | Avg: 20m 25s | Max: 45m 24s | Hits:  66%/18302 
      🟩 nvcc12.9           Pass: 100%/41  | Total: 19h 15m | Avg: 28m 10s | Max:  1h 57m | Hits:  60%/141250
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 38s | Avg: 26m 49s | Max: 29m 55s | Hits:  29%/7433  
      🟩 nvcc               Pass: 100%/46  | Total: 20h 57m | Avg: 27m 19s | Max:  1h 57m | Hits:  61%/159552
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 20m | Avg: 20m 09s | Max: 30m 51s | Hits:  60%/14818 
      🟩 Clang15            Pass: 100%/2   | Total: 41m 49s | Avg: 20m 54s | Max: 30m 49s | Hits:  60%/7427  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 54s | Max: 37m 00s | Hits:  32%/7427  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 26s | Max: 36m 48s | Hits:  32%/7427  
      🟩 Clang18            Pass: 100%/2   | Total: 42m 44s | Avg: 21m 22s | Max: 31m 52s | Hits:  60%/7427  
      🟩 Clang19            Pass: 100%/6   | Total:  2h 46m | Avg: 27m 40s | Max: 37m 53s | Hits:  50%/22326 
      🟩 GCC7               Pass: 100%/2   | Total: 35m 08s | Avg: 17m 34s | Max: 27m 39s | Hits:  60%/7363  
      🟩 GCC8               Pass: 100%/1   | Total: 26m 30s | Avg: 26m 30s | Max: 26m 30s | Hits:  32%/3692  
      🟩 GCC9               Pass: 100%/2   | Total: 43m 43s | Avg: 21m 51s | Max: 33m 42s | Hits:  60%/7375  
      🟩 GCC10              Pass: 100%/2   | Total: 43m 06s | Avg: 21m 33s | Max: 32m 08s | Hits:  60%/7429  
      🟩 GCC11              Pass: 100%/2   | Total: 21m 22s | Avg: 10m 41s | Max: 11m 04s | Hits:  88%/7425  
      🟩 GCC12              Pass: 100%/2   | Total: 21m 49s | Avg: 10m 54s | Max: 11m 07s | Hits:  88%/7429  
      🟩 GCC13              Pass: 100%/11  | Total:  5h 00m | Avg: 27m 20s | Max:  1h 57m | Hits:  75%/30207 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 31m | Avg: 45m 30s | Max: 45m 36s | Hits:  31%/7103  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 50m | Avg: 42m 36s | Max: 47m 43s | Hits:  73%/14697 
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 33m | Avg: 46m 38s | Max: 50m 28s | Hits:  29%/7413  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  7h 43m | Avg: 25m 46s | Max: 37m 53s | Hits:  50%/66852 
      🟩 GCC                Pass: 100%/22  | Total:  8h 12m | Avg: 22m 22s | Max:  1h 57m | Hits:  71%/70920 
      🟩 MSVC               Pass: 100%/6   | Total:  4h 21m | Avg: 43m 34s | Max: 47m 43s | Hits:  60%/21800 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 33m | Avg: 46m 38s | Max: 50m 28s | Hits:  29%/7413  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 28m 59s | Avg: 14m 29s | Max: 18m 57s | Hits:  93%/7634  
      🟩 rtx2080            Pass: 100%/46  | Total: 21h 21m | Avg: 27m 52s | Max:  1h 57m | Hits:  58%/159351
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total: 18h 08m | Avg: 25m 55s | Max: 50m 28s | Hits:  57%/155659
      🟩 NVRTC              Pass: 100%/2   | Total: 45m 48s | Avg: 22m 54s | Max: 25m 15s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  2h 54m | Avg: 58m 09s | Max:  1h 57m | Hits:  92%/11284 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 45s | Avg:  1m 45s | Max:  1m 45s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 45m 48s | Avg: 22m 54s | Max: 25m 15s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 28m 59s | Avg: 14m 29s | Max: 18m 57s | Hits:  93%/7634  
      🟩 90;90a             Pass: 100%/2   | Total: 53m 43s | Avg: 26m 51s | Max: 37m 37s | Hits:  87%/7585  
      🟩 100;120            Pass: 100%/2   | Total: 53m 15s | Avg: 26m 37s | Max: 38m 47s | Hits:  87%/7585  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  9h 35m | Avg: 26m 10s | Max: 47m 43s | Hits:  50%/77117 
      🟩 20                 Pass: 100%/25  | Total: 12h 13m | Avg: 29m 20s | Max:  1h 57m | Hits:  67%/89868 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 2h 55m | Avg: 6m 16s | Max: 16m 14s | Hits: 98%/15126

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  2h 41m | Avg:  6m 42s | Max: 16m 14s | Hits:  98%/12794 
      🟩 arm64              Pass: 100%/4   | Total: 14m 24s | Avg:  3m 36s | Max:  3m 53s | Hits:  99%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 22m 33s | Avg:  7m 31s | Max: 15m 47s | Hits:  98%/1448  
      🟩 12.9               Pass: 100%/25  | Total:  2h 33m | Avg:  6m 07s | Max: 16m 14s | Hits:  98%/13678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 22m 33s | Avg:  7m 31s | Max: 15m 47s | Hits:  98%/1448  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  2h 33m | Avg:  6m 07s | Max: 16m 14s | Hits:  98%/13678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  2h 55m | Avg:  6m 16s | Max: 16m 14s | Hits:  98%/15126 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 49s | Avg:  3m 24s | Max:  3m 29s | Hits:  99%/1168  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits:  99%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s | Hits:  99%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s | Hits:  99%/583   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 25s | Avg:  3m 25s | Max:  3m 25s | Hits:  99%/583   
      🟩 Clang19            Pass: 100%/4   | Total: 21m 01s | Avg:  5m 15s | Max: 10m 35s | Hits:  99%/2332  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  4m 06s | Hits:  99%/1168  
      🟩 GCC11              Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s | Hits:  99%/583   
      🟩 GCC12              Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s | Hits:  99%/583   
      🟩 GCC13              Pass: 100%/8   | Total: 39m 12s | Avg:  4m 54s | Max:  9m 49s | Hits:  99%/4664  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 47s | Avg: 15m 47s | Max: 15m 47s | Hits:  95%/282   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 47m 13s | Avg: 15m 44s | Max: 16m 14s | Hits:  95%/852   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 15m 16s | Avg:  7m 38s | Max:  7m 53s | Hits:  97%/1162  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 42m 10s | Avg:  4m 13s | Max: 10m 35s | Hits:  99%/5832  
      🟩 GCC                Pass: 100%/12  | Total: 55m 07s | Avg:  4m 35s | Max:  9m 49s | Hits:  99%/6998  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 45s | Max: 16m 14s | Hits:  95%/1134  
      🟩 NVHPC              Pass: 100%/2   | Total: 15m 16s | Avg:  7m 38s | Max:  7m 53s | Hits:  97%/1162  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  6m 43s | Hits:  99%/1166  
      🟩 rtx2080            Pass: 100%/26  | Total:  2h 45m | Avg:  6m 22s | Max: 16m 14s | Hits:  98%/13960 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  2h 28m | Avg:  5m 56s | Max: 16m 14s | Hits:  98%/13377 
      🟩 Test               Pass: 100%/3   | Total: 27m 07s | Avg:  9m 02s | Max: 10m 35s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  6m 43s | Hits:  99%/1166  
      🟩 90;90a             Pass: 100%/2   | Total: 19m 15s | Avg:  9m 37s | Max: 15m 31s | Hits:  97%/867   
      🟩 100;120            Pass: 100%/2   | Total: 20m 07s | Avg: 10m 03s | Max: 16m 14s | Hits:  97%/867   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  7m 23s | Hits:  98%/1747  
      🟩 20                 Pass: 100%/25  | Total:  2h 41m | Avg:  6m 26s | Max: 16m 14s | Hits:  98%/13379 
    
  • 🟩 python: Pass: 100%/18 | Total: 2h 59m | Avg: 9m 59s | Max: 20m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  2h 59m | Avg:  9m 59s | Max: 20m 44s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 09m | Avg:  8m 40s | Max: 14m 31s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 50m | Avg: 11m 02s | Max: 20m 44s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 16m 36s | Avg:  8m 18s | Max:  8m 25s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 00m | Avg: 15m 05s | Max: 20m 44s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 19m 27s | Avg:  4m 51s | Max:  5m 13s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 19m 19s | Avg:  4m 49s | Max:  4m 55s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 04m | Avg: 16m 02s | Max: 18m 40s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 28m | Avg:  9m 48s | Max: 18m 02s
      🟩 3.13               Pass: 100%/9   | Total:  1h 31m | Avg: 10m 10s | Max: 20m 44s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 19m 16s | Avg: 4m 49s | Max: 6m 58s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  6m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  6m 07s | Avg:  3m 03s | Max:  3m 06s
      🟩 12.9               Pass: 100%/2   | Total: 13m 09s | Avg:  6m 34s | Max:  6m 58s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  6m 07s | Avg:  3m 03s | Max:  3m 06s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 13m 09s | Avg:  6m 34s | Max:  6m 58s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  6m 58s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 01s | Avg:  3m 01s | Max:  3m 01s
      🟩 Clang19            Pass: 100%/1   | Total:  6m 58s | Avg:  6m 58s | Max:  6m 58s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 06s | Avg:  3m 06s | Max:  3m 06s
      🟩 GCC13              Pass: 100%/1   | Total:  6m 11s | Avg:  6m 11s | Max:  6m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 59s | Avg:  4m 59s | Max:  6m 58s
      🟩 GCC                Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  6m 11s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  6m 58s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  6m 58s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 15m 31s | Avg: 3m 52s | Max: 3m 57s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  3m 57s
      🟩 arm64              Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  3m 52s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  3m 57s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  3m 57s
      🟩 20                 Pass: 100%/2   | Total:  7m 43s | Avg:  3m 51s | Max:  3m 52s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 23m 54s | Avg: 7m 58s | Max: 10m 59s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 23m 54s | Avg:  7m 58s | Max: 10m 59s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 10m 41s | Avg: 10m 41s | Max: 10m 41s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 13s | Avg:  6m 36s | Max: 10m 59s | Hits:  98%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s | Hits:  98%/165   
      🟩 Test               Pass: 100%/2   | Total: 21m 40s | Avg: 10m 50s | Max: 10m 59s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Integer num_bits = 8 * sizeof(Integer);
Integer num_bits_minus_one = num_bits - 1;

return num_bits_minus_one - clz(x);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thrust::detail::log2 will be deleted in #6188

Comment on lines +52 to +53
#include <cuda/std/__algorithm/clamp.h>
#include <cuda/std/__bit/has_single_bit.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

these two headers are never used if I'm not wrong

#include <cuda/std/__algorithm/max.h>
#include <cuda/std/__bit/integral.h>
#include <cuda/std/__functional/operations.h>
#include <cuda/std/__type_traits/conditional.h>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks unused

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We use cuda::std::_If AFAIK

@github-project-automation github-project-automation bot moved this from In Review to In Progress in CCCL Oct 13, 2025
@wmaxey
Copy link
Member

wmaxey commented Oct 13, 2025

We shouldn't be addressing review comments in the backport unless there are material conflicts with the old branch. We should fix these issues in main first and either cherry-pick the changes to the backport or do another backport. That way we aren't cross-pollinating changes between branches.

@github-project-automation github-project-automation bot moved this from In Progress to In Review in CCCL Oct 13, 2025
Copy link
Contributor

🟩 CI finished in 4h 47m: Pass: 100%/205 | Total: 4d 14h | Avg: 32m 13s | Max: 3h 06m | Hits: 75%/339883
  • 🟩 cub: Pass: 100%/50 | Total: 2d 05h | Avg: 1h 03m | Max: 3h 06m | Hits: 70%/61506

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2d 03h | Avg:  1h 03m | Max:  3h 06m | Hits:  70%/58998 
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 36s | Max:  1h 03m | Hits:  64%/2508  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 29m | Avg:  1h 05m | Max:  1h 31m | Hits:  64%/6166  
      🟩 12.9               Pass: 100%/45  | Total:  1d 23h | Avg:  1h 03m | Max:  3h 06m | Hits:  70%/55340 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 24s | Hits:  70%/2157  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 29m | Avg:  1h 05m | Max:  1h 31m | Hits:  64%/6166  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 22h | Avg:  1h 04m | Max:  3h 06m | Hits:  70%/53183 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 24s | Hits:  70%/2157  
      🟩 nvcc               Pass: 100%/48  | Total:  2d 03h | Avg:  1h 04m | Max:  3h 06m | Hits:  70%/59349 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 49m | Avg: 57m 29s | Max:  1h 06m | Hits:  65%/5018  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m | Hits:  65%/2505  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  65%/2505  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  65%/2505  
      🟩 Clang18            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 07m | Hits:  65%/2505  
      🟩 Clang19            Pass: 100%/7   | Total:  5h 10m | Avg: 44m 21s | Max:  1h 04m | Hits:  76%/8421  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 07m | Hits:  64%/2508  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  64%/1254  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 15m | Hits:  64%/2508  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 11m | Hits:  64%/2509  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:  64%/2505  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 15m | Hits:  64%/2505  
      🟩 GCC13              Pass: 100%/12  | Total: 11h 19m | Avg: 56m 35s | Max:  3h 06m | Hits:  79%/15057 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  3h 12m | Avg:  1h 36m | Max:  1h 41m | Hits:  65%/2298  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  5h 54m | Avg:  1h 28m | Max:  1h 41m | Hits:  64%/4596  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 23m | Hits:  64%/2307  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 17h 32m | Avg: 55m 24s | Max:  1h 08m | Hits:  69%/23459 
      🟩 GCC                Pass: 100%/23  | Total: 23h 41m | Avg:  1h 01m | Max:  3h 06m | Hits:  72%/28846 
      🟩 MSVC               Pass: 100%/6   | Total:  9h 07m | Avg:  1h 31m | Max:  1h 41m | Hits:  64%/6894  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 23m | Hits:  64%/2307  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 33m 18s | Hits:  88%/3765  
      🟩 rtx2080            Pass: 100%/39  | Total:  1d 19h | Avg:  1h 06m | Max:  1h 41m | Hits:  65%/47707 
      🟩 rtxa6000           Pass: 100%/8   | Total:  8h 05m | Avg:  1h 00m | Max:  3h 06m | Hits:  87%/10034 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  1d 22h | Avg:  1h 05m | Max:  1h 41m | Hits:  65%/51470 
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  3h 06m | Avg:  3h 06m | Max:  3h 06m | Hits:  67%/1255  
      🟩 GraphCapture       Pass: 100%/1   | Total: 32m 13s | Avg: 32m 13s | Max: 32m 13s | Hits:  99%/1255  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 39m | Avg: 33m 11s | Max: 38m 45s | Hits:  99%/3763  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 41m | Avg: 33m 47s | Max: 39m 02s | Hits:  99%/3763  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 31m | Avg: 30m 28s | Max: 33m 18s | Hits:  88%/3765  
      🟩 90;90a             Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 24m | Hits:  65%/2404  
      🟩 100;120            Pass: 100%/2   | Total:  1h 55m | Avg: 57m 34s | Max:  1h 09m | Hits:  64%/2404  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  1d 00h | Avg:  1h 09m | Max:  1h 41m | Hits:  65%/25726 
      🟩 20                 Pass: 100%/29  | Total:  1d 04h | Avg: 59m 12s | Max:  3h 06m | Hits:  73%/35780 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 1d 09h | Avg: 39m 47s | Max: 1h 19m | Hits: 82%/95771

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 49m 26s | Avg: 24m 43s | Max: 34m 53s | Hits:  90%/3834  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 07h | Avg: 39m 52s | Max:  1h 19m | Hits:  82%/91938 
      🟩 arm64              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 47s | Max: 40m 53s | Hits:  80%/3833  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 35m | Avg: 43m 08s | Max:  1h 07m | Hits:  79%/9575  
      🟩 12.9               Pass: 100%/45  | Total:  1d 05h | Avg: 39m 24s | Max:  1h 19m | Hits:  83%/86196 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 56m 05s | Avg: 28m 02s | Max: 30m 00s | Hits:  80%/3832  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 35m | Avg: 43m 08s | Max:  1h 07m | Hits:  79%/9575  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 04h | Avg: 39m 56s | Max:  1h 19m | Hits:  83%/82364 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 05s | Avg: 28m 02s | Max: 30m 00s | Hits:  80%/3832  
      🟩 nvcc               Pass: 100%/48  | Total:  1d 08h | Avg: 40m 16s | Max:  1h 19m | Hits:  82%/91939 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 32m | Avg: 38m 13s | Max: 39m 09s | Hits:  80%/7664  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 19m | Avg: 39m 46s | Max: 45m 39s | Hits:  80%/3832  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 39m 23s | Hits:  80%/3832  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 20m | Avg: 40m 06s | Max: 41m 31s | Hits:  80%/3832  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 09m | Avg: 34m 52s | Max: 35m 30s | Hits:  80%/3832  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 03m | Avg: 26m 16s | Max: 38m 25s | Hits:  86%/13412 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 16m | Avg: 38m 03s | Max: 40m 40s | Hits:  80%/3834  
      🟩 GCC8               Pass: 100%/1   | Total: 46m 09s | Avg: 46m 09s | Max: 46m 09s | Hits:  80%/1917  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 21m | Avg: 40m 33s | Max: 44m 29s | Hits:  80%/3834  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 18s | Max: 43m 53s | Hits:  80%/3834  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 19m | Avg: 39m 57s | Max: 40m 32s | Hits:  80%/3834  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 24m | Avg: 42m 27s | Max: 44m 33s | Hits:  80%/3834  
      🟩 GCC13              Pass: 100%/11  | Total:  4h 52m | Avg: 26m 35s | Max: 41m 38s | Hits:  87%/21087 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 14m | Hits:  76%/3818  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  5h 09m | Avg:  1h 01m | Max:  1h 13m | Hits:  80%/9545  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 19m | Hits:  76%/3830  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 10h 42m | Avg: 33m 50s | Max: 45m 39s | Hits:  82%/36404 
      🟩 GCC                Pass: 100%/22  | Total: 12h 23m | Avg: 33m 47s | Max: 46m 09s | Hits:  84%/42174 
      🟩 MSVC               Pass: 100%/7   | Total:  7h 31m | Avg:  1h 04m | Max:  1h 14m | Hits:  79%/13363 
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 19m | Hits:  76%/3830  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 35m 31s | Avg: 17m 45s | Max: 20m 21s | Hits:  90%/3834  
      🟩 rtx2080            Pass: 100%/38  | Total:  1d 03h | Avg: 44m 01s | Max:  1h 19m | Hits:  79%/72786 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 40m | Avg: 28m 03s | Max:  1h 13m | Hits:  91%/19151 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  1d 07h | Avg: 43m 45s | Max:  1h 19m | Hits:  79%/82362 
      🟩 TestCPU            Pass: 100%/3   | Total: 54m 12s | Avg: 18m 04s | Max: 36m 55s | Hits:  99%/5742  
      🟩 TestGPU            Pass: 100%/4   | Total: 53m 37s | Avg: 13m 24s | Max: 15m 10s | Hits:  99%/7667  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 35m 31s | Avg: 17m 45s | Max: 20m 21s | Hits:  90%/3834  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 37m | Avg: 48m 47s | Max:  1h 04m | Hits:  78%/3826  
      🟩 100;120            Pass: 100%/2   | Total:  1h 31m | Avg: 45m 33s | Max:  1h 00m | Hits:  78%/3826  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 15h 58m | Avg: 45m 39s | Max:  1h 14m | Hits:  79%/40223 
      🟩 20                 Pass: 100%/27  | Total: 16h 21m | Avg: 36m 20s | Max:  1h 19m | Hits:  84%/51714 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 15h 52m | Avg: 19m 50s | Max: 41m 24s | Hits: 72%/166985

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 15h 09m | Avg: 19m 46s | Max: 41m 24s | Hits:  73%/159518
      🟩 arm64              Pass: 100%/2   | Total: 42m 39s | Avg: 21m 19s | Max: 32m 20s | Hits:  60%/7467  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 58m | Avg: 23m 37s | Max: 34m 26s | Hits:  50%/18302 
      🟩 12.9               Pass: 100%/43  | Total: 13h 54m | Avg: 19m 24s | Max: 41m 24s | Hits:  75%/148683
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 58m 31s | Avg: 29m 15s | Max: 29m 46s | Hits:  29%/7433  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 58m | Avg: 23m 37s | Max: 34m 26s | Hits:  50%/18302 
      🟩 nvcc12.9           Pass: 100%/41  | Total: 12h 55m | Avg: 18m 55s | Max: 41m 24s | Hits:  77%/141250
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 58m 31s | Avg: 29m 15s | Max: 29m 46s | Hits:  29%/7433  
      🟩 nvcc               Pass: 100%/46  | Total: 14h 53m | Avg: 19m 25s | Max: 41m 24s | Hits:  74%/159552
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 22m | Avg: 20m 44s | Max: 28m 16s | Hits:  53%/14818 
      🟩 Clang15            Pass: 100%/2   | Total: 41m 00s | Avg: 20m 30s | Max: 28m 03s | Hits:  59%/7427  
      🟩 Clang16            Pass: 100%/2   | Total: 23m 58s | Avg: 11m 59s | Max: 12m 12s | Hits:  88%/7427  
      🟩 Clang17            Pass: 100%/2   | Total: 18m 14s | Avg:  9m 07s | Max: 12m 48s | Hits:  93%/7427  
      🟩 Clang18            Pass: 100%/2   | Total: 44m 01s | Avg: 22m 00s | Max: 31m 26s | Hits:  59%/7427  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 56m | Avg: 19m 20s | Max: 29m 46s | Hits:  70%/22326 
      🟩 GCC7               Pass: 100%/2   | Total: 28m 53s | Avg: 14m 26s | Max: 23m 15s | Hits:  74%/7363  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s | Hits:  98%/3692  
      🟩 GCC9               Pass: 100%/2   | Total: 44m 18s | Avg: 22m 09s | Max: 33m 11s | Hits:  59%/7375  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 24s | Avg:  9m 12s | Max: 13m 35s | Hits:  91%/7429  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 13s | Max: 31m 57s | Hits:  31%/7425  
      🟩 GCC12              Pass: 100%/2   | Total: 18m 11s | Avg:  9m 05s | Max: 12m 11s | Hits:  93%/7429  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 09m | Avg: 17m 12s | Max: 41m 24s | Hits:  86%/30207 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 26s | Max: 38m 27s | Hits:  31%/7103  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 26m | Avg: 36m 33s | Max: 39m 22s | Hits:  73%/14697 
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 40m 12s | Avg: 20m 06s | Max: 20m 58s | Hits:  85%/7413  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  5h 26m | Avg: 18m 07s | Max: 31m 26s | Hits:  68%/66852 
      🟩 GCC                Pass: 100%/22  | Total:  6h 06m | Avg: 16m 40s | Max: 41m 24s | Hits:  78%/70920 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 39m | Avg: 36m 31s | Max: 39m 22s | Hits:  59%/21800 
      🟩 NVHPC              Pass: 100%/2   | Total: 40m 12s | Avg: 20m 06s | Max: 20m 58s | Hits:  85%/7413  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 35s | Avg: 11m 17s | Max: 17m 17s | Hits:  99%/7634  
      🟩 rtx2080            Pass: 100%/46  | Total: 15h 29m | Avg: 20m 12s | Max: 41m 24s | Hits:  71%/159351
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total: 13h 38m | Avg: 19m 29s | Max: 39m 22s | Hits:  70%/155659
      🟩 NVRTC              Pass: 100%/2   | Total: 48m 50s | Avg: 24m 25s | Max: 24m 42s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  1h 22m | Avg: 27m 38s | Max: 41m 24s | Hits:  95%/11284 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 36s | Avg:  1m 36s | Max:  1m 36s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 48m 50s | Avg: 24m 25s | Max: 24m 42s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 22m 35s | Avg: 11m 17s | Max: 17m 17s | Hits:  99%/7634  
      🟩 90;90a             Pass: 100%/2   | Total: 51m 57s | Avg: 25m 58s | Max: 35m 31s | Hits:  86%/7585  
      🟩 100;120            Pass: 100%/2   | Total: 49m 26s | Avg: 24m 43s | Max: 35m 27s | Hits:  86%/7585  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  7h 18m | Avg: 19m 55s | Max: 38m 27s | Hits:  64%/77117 
      🟩 20                 Pass: 100%/25  | Total:  8h 32m | Avg: 20m 30s | Max: 41m 24s | Hits:  79%/89868 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 3h 51m | Avg: 8m 16s | Max: 17m 45s | Hits: 89%/15126

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  3h 28m | Avg:  8m 42s | Max: 17m 45s | Hits:  89%/12794 
      🟩 arm64              Pass: 100%/4   | Total: 23m 01s | Avg:  5m 45s | Max:  6m 25s | Hits:  89%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 28m 10s | Avg:  9m 23s | Max: 17m 45s | Hits:  87%/1448  
      🟩 12.9               Pass: 100%/25  | Total:  3h 23m | Avg:  8m 08s | Max: 17m 38s | Hits:  89%/13678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 28m 10s | Avg:  9m 23s | Max: 17m 45s | Hits:  87%/1448  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  3h 23m | Avg:  8m 08s | Max: 17m 38s | Hits:  89%/13678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  3h 51m | Avg:  8m 16s | Max: 17m 45s | Hits:  89%/15126 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  5m 15s | Hits:  90%/1168  
      🟩 Clang15            Pass: 100%/1   | Total:  6m 21s | Avg:  6m 21s | Max:  6m 21s | Hits:  89%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  6m 21s | Avg:  6m 21s | Max:  6m 21s | Hits:  89%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s | Hits:  89%/583   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s | Hits:  89%/583   
      🟩 Clang19            Pass: 100%/4   | Total: 24m 46s | Avg:  6m 11s | Max:  8m 39s | Hits:  92%/2332  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 18s | Avg:  6m 09s | Max:  6m 32s | Hits:  89%/1168  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 20s | Avg:  6m 20s | Max:  6m 20s | Hits:  89%/583   
      🟩 GCC12              Pass: 100%/1   | Total:  7m 11s | Avg:  7m 11s | Max:  7m 11s | Hits:  89%/583   
      🟩 GCC13              Pass: 100%/8   | Total: 55m 38s | Avg:  6m 57s | Max: 12m 31s | Hits:  92%/4664  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 17m 45s | Avg: 17m 45s | Max: 17m 45s | Hits:  76%/282   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 51m 50s | Avg: 17m 16s | Max: 17m 38s | Hits:  76%/852   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 22m 06s | Avg: 11m 03s | Max: 11m 10s | Hits:  87%/1162  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 58m 46s | Avg:  5m 52s | Max:  8m 39s | Hits:  90%/5832  
      🟩 GCC                Pass: 100%/12  | Total:  1h 21m | Avg:  6m 47s | Max: 12m 31s | Hits:  91%/6998  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 09m | Avg: 17m 23s | Max: 17m 45s | Hits:  76%/1134  
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 06s | Avg: 11m 03s | Max: 11m 10s | Hits:  87%/1162  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  6m 36s | Hits:  94%/1166  
      🟩 rtx2080            Pass: 100%/26  | Total:  3h 40m | Avg:  8m 28s | Max: 17m 45s | Hits:  89%/13960 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  3h 24m | Avg:  8m 09s | Max: 17m 45s | Hits:  88%/13377 
      🟩 Test               Pass: 100%/3   | Total: 27m 46s | Avg:  9m 15s | Max: 12m 31s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  6m 36s | Hits:  94%/1166  
      🟩 90;90a             Pass: 100%/2   | Total: 22m 48s | Avg: 11m 24s | Max: 16m 58s | Hits:  85%/867   
      🟩 100;120            Pass: 100%/2   | Total: 23m 37s | Avg: 11m 48s | Max: 17m 14s | Hits:  85%/867   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 22m 04s | Avg:  7m 21s | Max: 10m 56s | Hits:  89%/1747  
      🟩 20                 Pass: 100%/25  | Total:  3h 29m | Avg:  8m 23s | Max: 17m 45s | Hits:  89%/13379 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 06m | Avg: 10m 22s | Max: 20m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 06m | Avg: 10m 22s | Max: 20m 29s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 16m | Avg:  9m 35s | Max: 15m 38s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 50m | Avg: 11m 00s | Max: 20m 29s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 17m 01s | Avg:  8m 30s | Max:  8m 44s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 06m | Avg: 16m 30s | Max: 20m 29s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 19m 00s | Avg:  4m 45s | Max:  4m 58s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 20m 18s | Avg:  5m 04s | Max:  6m 11s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 04m | Avg: 16m 07s | Max: 17m 52s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 28m | Avg:  9m 51s | Max: 19m 22s
      🟩 3.13               Pass: 100%/9   | Total:  1h 38m | Avg: 10m 54s | Max: 20m 29s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 18m 52s | Avg: 4m 43s | Max: 5m 57s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 57s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 15s
      🟩 12.9               Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 57s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 15s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 57s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 57s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 15s | Avg:  4m 15s | Max:  4m 15s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 04s | Avg:  4m 04s | Max:  4m 04s
      🟩 GCC13              Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  8m 51s | Avg:  4m 25s | Max:  4m 36s
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 57s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 57s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 57s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 18m 02s | Avg: 4m 30s | Max: 4m 50s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  4m 50s
      🟩 arm64              Pass: 100%/2   | Total:  8m 35s | Avg:  4m 17s | Max:  4m 30s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 18m 02s | Avg:  4m 30s | Max:  4m 50s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 37s
      🟩 20                 Pass: 100%/2   | Total:  9m 20s | Avg:  4m 40s | Max:  4m 50s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 25m 54s | Avg: 8m 38s | Max: 11m 46s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 25m 54s | Avg:  8m 38s | Max: 11m 46s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 11m 08s | Avg: 11m 08s | Max: 11m 08s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 14m 46s | Avg:  7m 23s | Max: 11m 46s | Hits:  97%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s | Hits:  96%/165   
      🟩 Test               Pass: 100%/2   | Total: 22m 54s | Avg: 11m 27s | Max: 11m 46s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@wmaxey wmaxey merged commit 30ae2ac into NVIDIA:branch/3.1.x Oct 14, 2025
214 of 215 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Oct 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

6 participants