Use rocprofv2 instead of rocprof.
Abstract the boilerplate for collecting results from a process.
Account for .MLIR_N_REPEATS in rocprofv2 results, which don't include it.
Account for nrepeats in a smarter way -- count the rows, while verifying.
Don't run the attention perfRunner.py benchmarks on gfx110x.
Don't run the CK benchmarking for gfx110x, because ck-benchmark-driver won't compile.
getFusionTestInfo and runFusionKernel turn out to be mostly the same.
Add --rocprof-version to switch between rocprof and rocprofv2.
Change default to rocprofv2.
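
A minimal sketch of how a --rocprof-version switch defaulting to rocprofv2 could be wired up with argparse. This is not the code from the commit: the accepted values ("rocprof", "rocprofv2") and the helper names build_parser and profiler_command are assumptions for illustration, and the real profiler flags are deliberately left out.

```python
import argparse
from typing import List


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing a hypothetical --rocprof-version option."""
    parser = argparse.ArgumentParser(description="Illustrative perf runner options")
    parser.add_argument(
        "--rocprof-version",
        choices=["rocprof", "rocprofv2"],  # assumed value names
        default="rocprofv2",               # the commit makes rocprofv2 the default
        help="Which profiler executable to invoke",
    )
    return parser


def profiler_command(version: str, kernel_cmd: List[str]) -> List[str]:
    """Prefix the benchmark command with the chosen profiler executable.

    Profiler-specific flags are omitted; a real runner would add them here.
    """
    return [version, *kernel_cmd]


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(profiler_command(args.rocprof_version, ["./kernel"]))
```

Restricting the option with choices makes a mistyped profiler name fail at argument parsing rather than when the subprocess is launched.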
pcf000 committed Nov 11, 2024
1 parent 99fc9d2 commit 3ee8364
Showing 1 changed file with 192 additions and 145 deletions.