Fix error when trying to access `state_dict` after activation quantization #371

DN6 · 2025-02-07T11:08:33Z

What does this PR do?

Accessing the model's `state_dict` after activation quantization raises an `AttributeError`:

Traceback (most recent call last):
  File "/home/dhruv/diffusers/../scripts/create_dummy_flux.py", line 42, in <module>
    model.state_dict()
  File "/home/dhruv/miniconda3/envs/diffusers/lib/python3.10/site-packages/torch/nn/modules/module.py", line 2219, in state_dict
    module.state_dict(
  File "/home/dhruv/miniconda3/envs/diffusers/lib/python3.10/site-packages/torch/nn/modules/module.py", line 2219, in state_dict
    module.state_dict(
  File "/home/dhruv/miniconda3/envs/diffusers/lib/python3.10/site-packages/torch/nn/modules/module.py", line 2219, in state_dict
    module.state_dict(
  [Previous line repeated 1 more time]
  File "/home/dhruv/miniconda3/envs/diffusers/lib/python3.10/site-packages/torch/nn/modules/module.py", line 2216, in state_dict
    self._save_to_state_dict(destination, prefix, keep_vars)
  File "/home/dhruv/miniconda3/envs/diffusers/lib/python3.10/site-packages/optimum/quanto/nn/qmodule.py", line 150, in _save_to_state_dict
    destination[prefix + "weight"] = self.weight if keep_vars else self.weight.detach()
AttributeError: 'NoneType' object has no attribute 'detach'

This happens because, when the LayerNorms in the model are replaced with QLayerNorm, their weights are set to None, so the following line raises an error:

optimum-quanto/optimum/quanto/nn/qmodule.py

Line 150 in 2202f84

    destination[prefix + "weight"] = self.weight if keep_vars else self.weight.detach()
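The failure mode can be reproduced with plain PyTorch, without quanto or diffusers. The sketch below is illustrative only: `NoneSafeNorm` is a hypothetical stand-in for a module with a custom save hook, and the guard it uses is one possible way to handle a missing weight, not necessarily the change made in this PR (the diff is not shown here).

```python
import torch
from torch import nn

# A LayerNorm created without affine parameters has no weight and no bias.
norm = nn.LayerNorm(8, elementwise_affine=False)
assert norm.weight is None and norm.bias is None

# Calling .detach() on the missing weight reproduces the error in the traceback.
try:
    norm.weight.detach()
except AttributeError as err:
    message = str(err)


class NoneSafeNorm(nn.LayerNorm):
    """Hypothetical module whose save hook skips parameters that are None."""

    def _save_to_state_dict(self, destination, prefix, keep_vars):
        for name in ("weight", "bias"):
            tensor = getattr(self, name)
            if tensor is None:
                # Nothing to serialize for a non-affine norm.
                continue
            destination[prefix + name] = tensor if keep_vars else tensor.detach()


# The affine variant serializes both parameters, the non-affine one serializes none.
affine_keys = sorted(NoneSafeNorm(8).state_dict().keys())
non_affine_keys = sorted(NoneSafeNorm(8, elementwise_affine=False).state_dict().keys())
```

With the guard in place, `state_dict()` returns normally for both variants instead of raising on the non-affine one.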

Snippet to test

import torch
from diffusers import FluxTransformer2DModel
from optimum.quanto import freeze, quantize, qint8, Calibration

torch_device = torch.device("cuda")


def get_dummy_inputs():
    return {
        "hidden_states": torch.randn(
            (1, 4096, 64), generator=torch.Generator("cpu").manual_seed(0)
        ).to(torch_device, torch.bfloat16),
        "encoder_hidden_states": torch.randn(
            (1, 512, 4096),
            generator=torch.Generator("cpu").manual_seed(0),
        ).to(torch_device, torch.bfloat16),
        "pooled_projections": torch.randn(
            (1, 768),
            generator=torch.Generator("cpu").manual_seed(0),
        ).to(torch_device, torch.bfloat16),
        "timestep": torch.tensor([1]).to(torch_device, torch.bfloat16),
        "img_ids": torch.randn(
            (4096, 3), generator=torch.Generator("cpu").manual_seed(0)
        ).to(torch_device, torch.bfloat16),
        "txt_ids": torch.randn(
            (512, 3), generator=torch.Generator("cpu").manual_seed(0)
        ).to(torch_device, torch.bfloat16),
        "guidance": torch.tensor([3.5]).to(torch_device, torch.bfloat16),
    }


model = FluxTransformer2DModel.from_pretrained(
    "hf-internal-testing/tiny-flux-transformer"
)
quantize(model, weights=qint8, activations=qint8)
model.to(torch_device)

with Calibration(momentum=0.9):
    model(**get_dummy_inputs())

freeze(model)
model.state_dict()
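To check which layers would trigger the error before calling `state_dict()`, one option is to list the submodules whose `weight` attribute exists but is None. This is a rough diagnostic sketch in plain PyTorch; `modules_without_weight` is a hypothetical helper, not part of optimum-quanto, and the toy `nn.Sequential` model stands in for the real transformer.

```python
import torch
from torch import nn


def modules_without_weight(model: nn.Module) -> list:
    """Return the names of submodules that expose `weight` but have it set to None."""
    return [
        name
        for name, module in model.named_modules()
        if hasattr(module, "weight") and module.weight is None
    ]


# Toy model: only the non-affine LayerNorm (index "1") has weight=None.
toy = nn.Sequential(
    nn.Linear(4, 4),
    nn.LayerNorm(4, elementwise_affine=False),
    nn.LayerNorm(4),
)
missing = modules_without_weight(toy)
```

Running the same helper on the quantized model from the snippet above should list the layers affected, assuming they expose a `weight` attribute in the same way.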

Fixes # (issue)

Before submitting

Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you run all tests locally and make sure they pass?
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

dacorvo

Thanks for the pull request. Can you squash your commits and add a meaningful commit message so that the CI can run?

git reset --soft HEAD~2
git commit
git push -f

github-actions · 2025-03-02T02:07:37Z

This PR is stale because it has been open 15 days with no activity. Remove stale label or comment or this will be closed in 5 days.

dacorvo · 2025-03-03T15:15:42Z

The CI errors are fixed in #373.

dacorvo

LGTM, thanks!

DN6 requested a review from dacorvo as a code owner February 7, 2025 11:08

dacorvo requested changes Feb 11, 2025


fix: correct serialization of non-affine QLayerNorm

1801dbc

DN6 force-pushed the state-dict-fix branch from 48c23e0 to 1801dbc February 14, 2025 12:15

DN6 mentioned this pull request Feb 18, 2025

[Quantization] Add Quanto backend huggingface/diffusers#10756


github-actions bot added the Stale label Mar 2, 2025

DN6 requested a review from dacorvo March 3, 2025 07:09

github-actions bot removed the Stale label Mar 4, 2025

Merge branch 'main' into state-dict-fix

1b60259

dacorvo approved these changes Mar 5, 2025


dacorvo merged commit 9af9869 into huggingface:main Mar 5, 2025
15 checks passed
