LoRA Finetuning with IPEX-LLM

This example ports Alpaca-LoRA to IPEX-LLM (using the LoRA algorithm) on Intel GPUs.

0. Requirements

To run this example with IPEX-LLM on Intel GPUs, there are some recommended requirements for your machine; please refer to here for more information.

1. Install

conda create -n llm python=3.11
conda activate llm
# the command below installs intel_extension_for_pytorch==2.1.10+xpu by default
pip install --pre --upgrade ipex-llm[xpu] --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/
pip install transformers==4.45.0 "trl<0.12.0" datasets
pip install fire peft==0.10.0
pip install bitsandbytes==0.45.1 scipy
pip install oneccl_bind_pt==2.1.100 --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/ # necessary to run distributed finetuning
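
As an optional sanity check after installation (our own snippet, not part of the original steps), you can verify that the XPU builds import cleanly from the llm environment:

# Optional sanity check: confirm the XPU builds of PyTorch and IPEX import cleanly.
import torch
import intel_extension_for_pytorch as ipex  # must be imported after torch

print("torch:", torch.__version__)
print("ipex :", ipex.__version__)  # expected 2.1.10+xpu, as noted above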

2. Configure OneAPI environment variables

source /opt/intel/oneapi/setvars.sh
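
Once the oneAPI variables are sourced, the Intel GPU(s) should be visible to PyTorch through the XPU backend. The following quick check is our own sketch, not part of the original example:

# Quick check that Intel GPUs are visible through the XPU backend.
import torch
import intel_extension_for_pytorch  # noqa: F401, registers the "xpu" device

print("XPU available:", torch.xpu.is_available())
for i in range(torch.xpu.device_count()):
    print(f"xpu:{i}", torch.xpu.get_device_name(i))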

3. LoRA Finetune

Here we provide example usage for different hardware setups. Please refer to the appropriate script based on your device:

Finetuning LLaMA2-7B on a single Arc A770
bash lora_finetune_llama2_7b_arc_1_card.sh
Finetuning ChatGLM3-6B on two Arc A770 cards
# install deepspeed dependencies
source /opt/intel/oneapi/setvars.sh # necessary to run before installing deepspeed
pip install git+https://github.com/microsoft/DeepSpeed.git@78c518e
pip install git+https://github.com/intel/intel-extension-for-deepspeed.git@ec33277

# start finetuning
bash lora_deepspeed_zero3_finetune_chatglm3_6b_arc_2_card.sh
Finetuning LLaMA2-7B on a single Intel Data Center GPU Max 1100
bash lora_finetune_llama2_7b_pvc_1100_1_card.sh
Finetuning LLaMA2-7B on a single tile of an Intel Data Center GPU Max 1550
bash lora_finetune_llama2_7b_pvc_1550_1_tile.sh
Finetuning LLaMA2-7B on four Intel Data Center GPU Max 1550 cards
bash lora_finetune_llama2_7b_pvc_1550_4_card.sh
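
For orientation, the scripts above all drive alpaca_lora_finetuning.py, which attaches a LoRA adapter via peft. The sketch below shows the general shape of such a setup; the hyperparameter values are illustrative assumptions, not the exact ones used by the scripts:

# Illustrative LoRA setup with peft; the values of r, lora_alpha, and
# target_modules here are assumptions for demonstration only.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,                        # scaling factor for the update
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()        # only adapter weights are trainable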

4. (Optional) Resume Training

If the finetuning process is interrupted before it completes, you can resume training from a previously saved checkpoint by setting resume_from_checkpoint to the local checkpoint folder, as follows:

python ./alpaca_lora_finetuning.py \
    --base_model "meta-llama/Llama-2-7b-hf" \
    --data_path "yahma/alpaca-cleaned" \
    --output_dir "./ipex-llm-qlora-alpaca" \
    --resume_from_checkpoint "./ipex-llm-qlora-alpaca/checkpoint-1100"
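
To find the most recent checkpoint to resume from, you can list the checkpoint folders saved in the output directory. This helper is our own sketch and assumes the default checkpoint-<step> naming used by the transformers Trainer:

# List saved checkpoints (our own helper; assumes the default
# "checkpoint-<step>" naming used by the transformers Trainer).
import os
import re

out_dir = "./ipex-llm-qlora-alpaca"
ckpts = sorted(
    (d for d in os.listdir(out_dir) if re.fullmatch(r"checkpoint-\d+", d)),
    key=lambda d: int(d.rsplit("-", 1)[1]),
)
print(ckpts[-1] if ckpts else "no checkpoints found")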

5. Sample Output

{'loss': 1.9231, 'learning_rate': 2.9999945367033285e-05, 'epoch': 0.0}
{'loss': 1.8622, 'learning_rate': 2.9999781468531096e-05, 'epoch': 0.01}
{'loss': 1.9043, 'learning_rate': 2.9999508305687345e-05, 'epoch': 0.01}
{'loss': 1.8967, 'learning_rate': 2.999912588049185e-05, 'epoch': 0.01}
{'loss': 1.9658, 'learning_rate': 2.9998634195730358e-05, 'epoch': 0.01}
{'loss': 1.8386, 'learning_rate': 2.9998033254984483e-05, 'epoch': 0.02}
{'loss': 1.809, 'learning_rate': 2.999732306263172e-05, 'epoch': 0.02}
{'loss': 1.8552, 'learning_rate': 2.9996503623845395e-05, 'epoch': 0.02}
  1%|█                                                                                                                                                         | 8/1164 [xx:xx<xx:xx:xx, xx s/it]

6. Merge the adapter into the original model

python ./export_merged_model.py --repo-id-or-model-path REPO_ID_OR_MODEL_PATH --adapter_path ./outputs/checkpoint-200 --output_path ./outputs/checkpoint-200-merged

Then you can use ./outputs/checkpoint-200-merged as a normal Hugging Face Transformers model for inference.
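
Since the merged folder is a standard Hugging Face model directory, inference works with the usual transformers APIs. A minimal sketch (the prompt below is just an arbitrary example):

# Minimal inference with the merged model; the prompt is an arbitrary example.
from transformers import AutoModelForCausalLM, AutoTokenizer

path = "./outputs/checkpoint-200-merged"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(path)

inputs = tokenizer("### Instruction:\nWhat is AI?\n\n### Response:\n", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))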

7. Troubleshooting

Please refer to here for solutions to common issues during finetuning.