Torchao weights only compability #34355

SunMarc · 2024-10-23T17:14:39Z

What does this PR do ?

This PR makes torchao serialized model loadable with weights_only=True which is the default. Otherwise, you need to set weights_only=False which is not recommended.

cc @jerryzh168 cc @MekkCyber

HuggingFaceDocBuilderDev · 2024-10-23T17:40:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jerryzh168 · 2024-10-23T18:44:02Z

src/transformers/quantizers/quantizer_torchao.py

@@ -73,6 +73,50 @@ def validate_environment(self, *args, **kwargs):
                    )
                else:
                    self.offload = True
+        if self.pre_quantized:
+            safe_globals = []


if we do import torchao, I think we should get everything here (classes etc. being added to safeglobals)? otherwise we'd need to fix torchao

I'm using torchao 0.5.0 and it's not working on my side. I can try with the latest tomorrow !

I see, it's not expected I think, I think it should be fixed in torchao side, I feel 0.5 should have this functionality already actually. if you can have a standalone repro that will be very helpful for us. I remember I have tested in https://huggingface.co/docs/transformers/main/en/quantization/torchao

Actually, we ran into this issue with @MekkCyber on the example you shared in the docs.
Here's a the reproducer, let us know if you also have this issue :

from transformers import TorchAoConfig, AutoTokenizer, AutoModelForCausalLM import torch model_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0" quant_config = TorchAoConfig("int4_weight_only", group_size=32) quantized_model = AutoModelForCausalLM.from_pretrained( model_name, torch_dtype=torch.bfloat16, device_map="cuda:0", quantization_config=quant_config, ) output_dir = "llama3-8b-int4wo-128" quantized_model.save_pretrained(output_dir, safe_serialization=False) loaded_quantized_model = AutoModelForCausalLM.from_pretrained(output_dir, device_map="cuda:0")

OK will test and report back

src/transformers/quantizers/quantizer_torchao.py

tests/quantization/torchao_integration/test_torchao.py

MekkCyber · 2024-10-23T22:46:59Z

Thanks for this PR @SunMarc, really helpful !

weights only compability

ef4976d

SunMarc requested a review from MekkCyber October 23, 2024 17:14

jerryzh168 reviewed Oct 23, 2024

View reviewed changes

MekkCyber reviewed Oct 23, 2024

View reviewed changes

src/transformers/quantizers/quantizer_torchao.py Show resolved Hide resolved

MekkCyber reviewed Oct 23, 2024

View reviewed changes

tests/quantization/torchao_integration/test_torchao.py Outdated Show resolved Hide resolved

MekkCyber reviewed Oct 23, 2024

View reviewed changes

tests/quantization/torchao_integration/test_torchao.py Outdated Show resolved Hide resolved

MekkCyber reviewed Oct 23, 2024

View reviewed changes

tests/quantization/torchao_integration/test_torchao.py Outdated Show resolved Hide resolved

better tests from code review

b7a7fba

MekkCyber approved these changes Oct 31, 2024

View reviewed changes

Merge branch 'main' into torchao_weights_only

54a8e5e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Torchao weights only compability #34355

Torchao weights only compability #34355

SunMarc commented Oct 23, 2024

HuggingFaceDocBuilderDev commented Oct 23, 2024

jerryzh168 Oct 23, 2024 •

edited

Loading

SunMarc Oct 23, 2024

jerryzh168 Oct 23, 2024

SunMarc Oct 24, 2024 •

edited

Loading

jerryzh168 Oct 31, 2024

MekkCyber commented Oct 23, 2024

Torchao weights only compability #34355

Are you sure you want to change the base?

Torchao weights only compability #34355

Conversation

SunMarc commented Oct 23, 2024

What does this PR do ?

HuggingFaceDocBuilderDev commented Oct 23, 2024

jerryzh168 Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

SunMarc Oct 23, 2024

Choose a reason for hiding this comment

jerryzh168 Oct 23, 2024

Choose a reason for hiding this comment

SunMarc Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

jerryzh168 Oct 31, 2024

Choose a reason for hiding this comment

MekkCyber commented Oct 23, 2024

jerryzh168 Oct 23, 2024 •

edited

Loading

SunMarc Oct 24, 2024 •

edited

Loading