Add score averaging and convergence early stop #734

CedricHwong · 2025-12-30T10:30:52Z

What does this PR do?

Overview:
This PR extends GradNAS gradient scoring with:
(1) optional score averaging
(2) early stopping when gradient scores converge

New search config keys are added with defaults: average_scores, score_convergence_tol, score_convergence_patience, and score_convergence_min_updates. The gradient estimation loop now exits early once all hparam score trackers converge. Unit tests cover the averaging math, convergence boundaries, and disabled-convergence behavior.
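
For readers who want a concrete picture of the two mechanisms, here is a minimal standalone sketch. It is not the code in this PR: `ScoreTracker` and `estimate_scores` are hypothetical names, and the convergence rule (relative change of the running score staying within `tol` for `patience` consecutive updates, checked only after `min_updates` updates) is an assumption about how the new config keys might interact.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class ScoreTracker:
    """Hypothetical per-hparam score tracker (illustration only, not the ModelOpt class)."""

    average: bool = True      # mirrors "average_scores"
    tol: float = 1e-3         # mirrors "score_convergence_tol"
    patience: int = 5         # mirrors "score_convergence_patience"
    min_updates: int = 10     # mirrors "score_convergence_min_updates"
    total: float = 0.0
    updates: int = 0
    stable_steps: int = 0
    prev: Optional[float] = None

    def score(self) -> float:
        # With averaging, report the running mean instead of the raw sum.
        if self.average and self.updates > 0:
            return self.total / self.updates
        return self.total

    def update(self, batch_score: float) -> None:
        self.total += batch_score
        self.updates += 1
        current = self.score()
        if self.prev is not None:
            # Assumed criterion: relative change versus the previous score.
            rel_change = abs(current - self.prev) / max(abs(self.prev), 1e-12)
            self.stable_steps = self.stable_steps + 1 if rel_change <= self.tol else 0
        self.prev = current

    def converged(self) -> bool:
        return self.updates >= self.min_updates and self.stable_steps >= self.patience


def estimate_scores(
    batches: Iterable[Dict[str, float]], trackers: Dict[str, ScoreTracker]
) -> int:
    """Feed per-batch scores to the trackers; stop once all have converged.

    Returns the number of batches consumed.
    """
    used = 0
    for batch_scores in batches:
        for name, value in batch_scores.items():
            trackers[name].update(value)
        used += 1
        if all(t.converged() for t in trackers.values()):
            break  # early stop: every tracker is stable
    return used
```

In this sketch, feeding constant per-batch scores makes every tracker stable from its second update onward, so the loop stops as soon as `min_updates` batches have been processed.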

Usage

  import modelopt.torch.prune as mtp

  # Example: enable/adjust score averaging + convergence early stop
  pruned_model, search_history = mtp.prune(
      model=model,
      mode="gradnas",
      constraints={"flops": "90%"},
      dummy_input=dummy_input,
      config={
          "data_loader": train_loader,
          "loss_func": loss_func,
          # New options:
          "average_scores": True,
          "score_convergence_tol": 1e-3,
          "score_convergence_patience": 5,
          "score_convergence_min_updates": 10,
      },
  )

Testing

pytest tests/unit/torch/prune/test_gradnas.py
pre-commit run --all-files

Before your PR is "Ready for review"

Make sure you read and follow Contributor guidelines (https://github.com/NVIDIA/Model-Optimizer/blob/main/CONTRIBUTING.md) and your commits are signed.
Is this change backward compatible?: No. By default, GradNAS now averages gradient scores and may stop early on convergence, which can change the score scale and the number of batches processed compared with the previous behavior.
Did you write any new necessary tests?: Yes
Did you add or update any necessary documentation?: No
Did you update Changelog (https://github.com/NVIDIA/Model-Optimizer/blob/main/CHANGELOG.rst)?: No

Additional Information

N/A

Signed-off-by: CedricHwong <[email protected]>

copy-pr-bot · 2025-12-30T10:30:56Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

feats/gradnas:add score averaging and convergence early stop

5599a0f

Signed-off-by: CedricHwong <[email protected]>

CedricHwong requested a review from a team as a code owner December 30, 2025 10:30

CedricHwong requested a review from AAnoosheh December 30, 2025 10:30
