Support huggingface popular weight format for weight-only quantization #1580

PenghuiCheng · 2024-05-30T07:29:29Z

Type of Change

feature
No API changed

Description

Support huggingface woq model format for intel GPU

Expected Behavior & Potential Risk

support AutoGPTQ model on huggingface models hub for WOQ on intel GPU

Signed-off-by: Cheng Penghui <[email protected]>

github-actions · 2024-05-30T07:30:05Z

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Format Scan Tests workflow

Check ID	Status
format-scan (pylint)	success	✅
format-scan (bandit)	success	✅
format-scan (cloc)	success	✅
format-scan (cpplint)	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/quantization/utils.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py.

🟢 Optimize Unit Test workflow

Check ID	Status
optimize-unit-test-baseline	success	✅
optimize-unit-test-PR-test	success	✅
Genreate-OptimizeUT-Report	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/quantization/utils.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py.

🟢 NeuralChat Unit Test

Check ID	Status
neuralchat-unit-test-baseline	success	✅
neuralchat-unit-test-PR-test	success	✅
Generate-NeuralChat-Report	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/quantization/utils.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py.

🟢 Engine Unit Test workflow

Check ID	Status
engine-unit-test-baseline	success	✅
engine-unit-test-PR-test	success	✅
Genreate-Engine-Report	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/llm/quantization/utils.py, intel_extension_for_transformers/transformers/modeling/modeling_auto.py.

Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact VincyZhang or XuehaoSun for help.

PenghuiCheng · 2024-06-07T10:26:54Z

depend on ipex gpu xetla kenel ready.

…ormat

Signed-off-by: Cheng Penghui <[email protected]>

intel_extension_for_transformers/transformers/llm/quantization/utils.py

…ormat

Signed-off-by: zhenwei-intel <[email protected]>

Support huggingface popular weight format for weight-only quantization

be3be16

Signed-off-by: Cheng Penghui <[email protected]>

PenghuiCheng requested review from changwangss, a32543254 and zhenwei-intel May 30, 2024 07:38

PenghuiCheng added the WIP label Jun 7, 2024

PenghuiCheng added 2 commits June 15, 2024 07:58

Merge remote-tracking branch 'origin/main' into penghuic/support_hf_f…

f648e25

…ormat

Fixed issue of loading woq model for intel gpu

c096d68

Signed-off-by: Cheng Penghui <[email protected]>

zhenwei-intel approved these changes Jun 16, 2024

View reviewed changes

a32543254 approved these changes Jun 21, 2024

View reviewed changes

intel_extension_for_transformers/transformers/llm/quantization/utils.py Show resolved Hide resolved

changwangss approved these changes Jun 21, 2024

View reviewed changes

PenghuiCheng removed the WIP label Jul 1, 2024

PenghuiCheng and others added 5 commits July 2, 2024 07:52

Merge remote-tracking branch 'origin/main' into penghuic/support_hf_f…

51a50d4

…ormat

update qconfig for xpu

9e438b5

Signed-off-by: zhenwei-intel <[email protected]>

Merge branch 'main' into penghuic/support_hf_format

e788e8d

Merge branch 'main' into penghuic/support_hf_format

42b9906

Merge branch 'main' into penghuic/support_hf_format

ac239db

kevinintel merged commit 3e85ca9 into main Jul 5, 2024
20 checks passed

kevinintel deleted the penghuic/support_hf_format branch July 5, 2024 09:20

DDEle restored the penghuic/support_hf_format branch July 8, 2024 01:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support huggingface popular weight format for weight-only quantization #1580

Support huggingface popular weight format for weight-only quantization #1580

PenghuiCheng commented May 30, 2024

github-actions bot commented May 30, 2024 •

edited

Loading

PenghuiCheng commented Jun 7, 2024

Support huggingface popular weight format for weight-only quantization #1580

Support huggingface popular weight format for weight-only quantization #1580

Conversation

PenghuiCheng commented May 30, 2024

Type of Change

Description

Expected Behavior & Potential Risk

github-actions bot commented May 30, 2024 • edited Loading

⚡ Required checks status: All passing 🟢

Groups summary

PenghuiCheng commented Jun 7, 2024

github-actions bot commented May 30, 2024 •

edited

Loading