benchmarking suite should initialize cuda graphs / profiler interaction #4008

Open
tfogal opened this issue Mar 4, 2025 · 0 comments
Collaborator

tfogal commented Mar 4, 2025

Per the PyTorch documentation on working around CUDA graph profiling issues (https://pytorch.org/docs/stable/torch.compiler_profiling_torch_compile.html#working-around-cuda-graph-profiling-issues), we may need to do some initialization when profiling code that uses CUDA graphs.

We are not yet using CUDA graphs, but the benchmarking code should invoke this initialization at the start of execution anyway. That way, if we add a benchmark that captures a graph around something nvFuser produces, or if we start using graphs internally down the road, we won't hit surprising profiling issues.
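
A minimal sketch of what that startup hook might look like. It assumes the workaround from the linked page is the private PyTorch helper `torch.profiler._utils._init_for_cuda_graphs()`. The wrapper name `init_profiler_for_cuda_graphs` and the module-level flag are hypothetical, not existing nvFuser code. Because the helper is private and could move or disappear, the sketch looks it up defensively and reports whether it ran.

```python
import importlib

# Hypothetical module-level flag so the warm-up runs at most once per process.
_profiler_initialized = False


def init_profiler_for_cuda_graphs() -> bool:
    """Run the profiler warm-up once; return True if it has been run.

    Assumption: the workaround is torch.profiler._utils._init_for_cuda_graphs,
    a private helper. Returns False if PyTorch or the helper is unavailable.
    """
    global _profiler_initialized
    if _profiler_initialized:
        return True
    try:
        profiler_utils = importlib.import_module("torch.profiler._utils")
    except ImportError:
        # PyTorch is not installed, so there is nothing to initialize.
        return False
    init = getattr(profiler_utils, "_init_for_cuda_graphs", None)
    if init is None:
        # The private helper is missing in this PyTorch version.
        return False
    init()
    _profiler_initialized = True
    return True


if __name__ == "__main__":
    # Call once, before any benchmark sets up profiling or captures a graph.
    print("profiler warm-up ran:", init_profiler_for_cuda_graphs())
```

In a pytest-based suite this could be called from a session-scoped autouse fixture, so every benchmark gets it without opting in. That wiring is a suggestion, not something the issue specifies.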
