[FEA] Qualification tool: Add operators stats output csv file #1157
Labels
feature request
New feature or request
user_tools
Scope the wrapper module running CSP, QualX, and reports (python)
Is your feature request related to a problem? Please describe.
Currently we have
rapids_4_spark_qualification_output_unsupportedOperators.csv
which has details about unsupported operators andrapids_4_spark_qualification_output_execs.csv
which has details about the execs. It would be good to add another csv file which has additional stats on the operators. It would help to determine the frequency of the operators in an application, task durations of the operators and so on.Sample output format:
operator_stats.csv
where
Total Task Exec Duration = sum of task exec duration which the operator is part of
Add statistics to the output file incrementally:
Tasks
Adding stats for supported operators is a bit tricky. We have to filter out the operators if they are part of the unsupported operators. We will miss some of the operators in stats file if they are not mapped to any stages
The text was updated successfully, but these errors were encountered: