Conversation

gonidelis
Member

Fixes #6173 by applying sm80 tuning parameters to sm90 compilations (server) and sm86 tuning parameters to sm120 compilations (workstation).
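
For readers skimming the thread, the mapping described above can be sketched as a small compile-time lookup. This is only an illustration of the idea, not the code in this PR: `tuning_arch_for` is a hypothetical helper name, and the sketch assumes architectures not mentioned in the description are left unchanged.

```cpp
// Hypothetical helper, not part of the library's API: given the SM version
// being compiled for, return the SM version whose tuning parameters are reused.
constexpr int tuning_arch_for(int compile_sm)
{
  switch (compile_sm)
  {
    case 90:
      return 80; // server: sm90 compilations reuse the sm80 tunings
    case 120:
      return 86; // workstation: sm120 compilations reuse the sm86 tunings
    default:
      return compile_sm; // assumption: other architectures are untouched here
  }
}

// Checked at compile time, so a wrong mapping fails the build.
static_assert(tuning_arch_for(90) == 80, "sm90 should reuse sm80 tunings");
static_assert(tuning_arch_for(120) == 86, "sm120 should reuse sm86 tunings");
```

Keeping the lookup `constexpr` means the choice is resolved during compilation, which is how per-architecture tunings are normally selected.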

Contributor

copy-pr-bot bot commented Oct 9, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

cccl-authenticator-app bot moved this from Todo to In Progress in CCCL on Oct 9, 2025
@gonidelis
Member Author

Posting the results in Google Sheets rather than inline here, for readability.

@gonidelis
Member Author

B200 (SM100)

SM100 perf results spreadsheet

@gonidelis
Member Author

NVIDIA GeForce RTX 5090 (SM120)

SM120 perf results spreadsheet

@gonidelis
Member Author

NVIDIA H100 PCIe (SM90)

SM90 perf results spreadsheet

Labels

None yet

Projects

Status: In Progress

Development

Successfully merging this pull request may close these issues.

[BUG]: Dated segmented sort tuning

1 participant