229 parse all arguments in pytorch trace file #230

fengxizhou · 2025-03-08T21:09:00Z

What does this PR do?

HTA currently parses only the specific set of arguments defined by the ParserConfig class. This keeps the memory footprint small, which is crucial when dealing with fixed traces.
When working with new types of traces, such as MTIA traces, the current implementation requires modifying the ParserConfig class, which adds effort and complexity each time a new trace type appears.

Proposed Solution

We're adding a new boolean attribute, parse_all_arguments (default False), to the ParserConfig class. When set to True, the parser will parse all arguments in the trace file. Arguments defined in events_args will be parsed as specified, while undefined arguments will be parsed using a standard naming convention with inferred default values.
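As a rough illustration of the toggle described above, the sketch below shows how such a flag could change which arguments of a single trace event are kept. It is not HTA's implementation: `select_args` and its `defined` parameter are hypothetical names introduced only for this example, and the flag name follows the description above.

```python
from typing import Any, Dict, Set


def select_args(
    raw_args: Dict[str, Any],
    defined: Set[str],
    parse_all_arguments: bool = False,
) -> Dict[str, Any]:
    """Return the arguments of one trace event that would be kept.

    Hypothetical helper: `defined` stands in for the argument names
    that the parser configuration already knows about.
    """
    if parse_all_arguments:
        # Keep everything found in the trace event.
        return dict(raw_args)
    # Default behavior: keep only the arguments that are explicitly defined.
    return {key: value for key, value in raw_args.items() if key in defined}


event_args = {"stream": 7, "Custom Field": "x"}
kept_default = select_args(event_args, {"stream"})
kept_all = select_args(event_args, {"stream"}, parse_all_arguments=True)
```

With the flag left at its default, only `stream` survives; with it set to True, the unknown `Custom Field` argument is kept as well.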

Each undefined argument is given the following attributes:
name: the raw name from the trace file, converted to lowercase with spaces replaced by underscores
raw_name: the original name as it appeared in the trace file
value_type: the data type inferred from the value given to the argument
default_value: -1 for int, "" for string, None for object
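The rules above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in this PR: `ArgSpec` and `infer_arg_spec` are hypothetical names, `transform_arg_name` only mirrors the behavior described for the static method of the same name, and treating `bool` values as object is a choice made for this sketch.

```python
from typing import Any, NamedTuple


class ArgSpec(NamedTuple):
    """Hypothetical container for the four attributes of an undefined argument."""

    name: str
    raw_name: str
    value_type: type
    default_value: Any


def transform_arg_name(raw_name: str) -> str:
    """Lowercase the raw name and replace spaces with underscores."""
    return raw_name.lower().replace(" ", "_")


def infer_arg_spec(raw_name: str, value: Any) -> ArgSpec:
    """Infer an ArgSpec from one observed value of the argument."""
    # bool is a subclass of int in Python; treating it as object here
    # is an assumption of this sketch, not something the PR states.
    if isinstance(value, int) and not isinstance(value, bool):
        value_type, default = int, -1
    elif isinstance(value, str):
        value_type, default = str, ""
    else:
        value_type, default = object, None
    return ArgSpec(transform_arg_name(raw_name), raw_name, value_type, default)
```

For example, `infer_arg_spec("Device ID", 3)` yields the name `device_id`, the raw name `Device ID`, the type `int`, and the default `-1`.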

Benefits

Flexibility: Enables seamless adaptation to new trace types without modifying the ParserConfig class.
Ease of Use: Provides a simple and intuitive way to toggle between parsing specific arguments and parsing all arguments.
Reduced Maintenance: Simplifies the codebase by eliminating the need for frequent updates to the ParserConfig class for new trace types.

Before submitting


fengxizhou added 2 commits March 8, 2025 15:57

update pre-commit hook versions in .pre-commit-config.yaml

0b6db42

Fix get_test_data_path to match current directory structure

445e941

fengxizhou linked an issue Mar 8, 2025 that may be closed by this pull request

Parse All Arguments in PyTorch Trace File #229

Open

fengxizhou self-assigned this Mar 8, 2025

facebook-github-bot added the CLA Signed label Mar 8, 2025

fengxizhou added 3 commits March 8, 2025 16:15

Restore pre-commit hooks to the trunk version

ea6f314

Enhance ParserConfig with new features and tests

f01465a

- Add parse_all_args attribute and corresponding setter method
- Set default parser_backend to JSON
- Introduce transform_arg_name static method for argument normalization
- Implement unit tests for new features in ParserConfig

Refactor ParserConfig to allow None as default parser_backend and fix…

5727738

… test case test_set_global_parser_config_version to prevent the side impact due to mocking parse_event_args_yaml.
