
Further fine tuning flan-alpaca-gpt4-lora-xl #3

Open
vinay1986 opened this issue May 19, 2023 · 3 comments

@vinay1986

Can reasonwang/flan-alpaca-gpt4-lora-xl be further fine-tuned?

If so, what are the steps?

@Reason-Wang
Owner

You can load the model and fine-tune it just as in train.py.

@vinay1986
Author

Thanks.

Do you mean fine-tuning the base model from scratch?

@Reason-Wang
Owner

No, you can load "flan-alpaca-gpt4-lora-xl" like this:

import transformers
from peft import PeftModel

model_name = "reasonwang/flan-t5-xl-8bit"
peft_model_id = "reasonwang/flan-alpaca-gpt4-lora-xl"
tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)
base_model = transformers.AutoModelForSeq2SeqLM.from_pretrained(model_name)
peft_model = PeftModel.from_pretrained(base_model, peft_model_id)

Then fine-tune this peft_model directly.
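For reference, a minimal sketch of what continuing the fine-tuning could look like with the Hugging Face Trainer. The dataset, collator, and hyperparameters below are illustrative assumptions, not the exact setup in train.py, and an 8-bit base model may need additional preparation (e.g. peft's prepare_model_for_kbit_training) before training:

# Illustrative only: my_tokenized_dataset and all hyperparameters are assumptions.
from transformers import DataCollatorForSeq2Seq, Seq2SeqTrainer, Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="flan-alpaca-gpt4-lora-xl-continued",
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    num_train_epochs=1,
    logging_steps=10,
    save_strategy="epoch",
)

trainer = Seq2SeqTrainer(
    model=peft_model,                    # the LoRA-wrapped model loaded above
    args=training_args,
    train_dataset=my_tokenized_dataset,  # hypothetical tokenized instruction dataset
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=peft_model),
)

trainer.train()
# Saves only the LoRA adapter weights, not the full base model.
peft_model.save_pretrained("flan-alpaca-gpt4-lora-xl-continued")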
