How do I load a LoRA fine-tuned model for testing? #298
Comments
You're asking exactly the right question, this has been driving me crazy. The readme is most likely wrong there: that `msg` should be `model`.
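For context on the `msg` vs `model` confusion: PyTorch's `load_state_dict` does not return a model at all. It updates the module in place and returns a named tuple of `missing_keys` and `unexpected_keys`, which is why the readme binds the result to a throwaway name like `msg`. A minimal sketch with a toy module:

```python
import torch
import torch.nn as nn

# Toy module standing in for the real model; load_state_dict behaves the same.
model = nn.Linear(4, 2)

# A partial state dict: one key that matches ('weight') and one that does not.
partial = {"weight": torch.zeros(2, 4), "extra": torch.ones(1)}

# With strict=False, the return value only reports what did NOT line up;
# the matching weights are loaded into `model` in place.
msg = model.load_state_dict(partial, strict=False)
print(msg.missing_keys)     # ['bias']
print(msg.unexpected_keys)  # ['extra']
```

So `msg = model.load_state_dict(...)` is correct as written: `model` is updated in place and `msg` is only a diagnostic. Rebinding the name `model` to the return value would replace the model with the key report.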
I'm confused about this too.
Merge the model first, then you can run inference.
How do I merge it? Is there an example or demo anywhere?
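I can't point to this repo's exact merge script, but conceptually "merging" a LoRA adapter means folding the low-rank update into the base weights once, so inference no longer needs the peft wrapper. A minimal sketch of the underlying arithmetic, with illustrative dimensions (not this model's):

```python
import torch

d, r, alpha = 8, 4, 16     # hidden size, LoRA rank, LoRA alpha (made-up values)
scale = alpha / r

W = torch.randn(d, d)      # frozen base weight
A = torch.randn(r, d)      # LoRA down-projection
B = torch.randn(d, r)      # LoRA up-projection

# Merging folds the adapter into the base weight ahead of inference.
W_merged = W + scale * (B @ A)

# The merged weight reproduces the base-plus-adapter output exactly.
x = torch.randn(d)
y_adapter = W @ x + scale * (B @ (A @ x))
y_merged = W_merged @ x
print(torch.allclose(y_adapter, y_merged, atol=1e-4))  # True
```

With peft itself, the usual pattern is `merged = peft_model.merge_and_unload()` followed by `merged.save_pretrained(...)`; the merge method referenced later in this thread was found in this repo's issues.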
Thanks. I saw the merging method in the issues yesterday, and it works now.
Is the merged model noticeably better than the base model?
Writing `model` raises an error, so just use `msg` and ignore the return value. But how much GPU memory does it need at runtime? My 24 GB card isn't enough, and multi-GPU runs throw errors too.
Has anyone loaded and tested a model after full-parameter fine-tuning? When I load with the official LoRA loading code for testing, I get errors.
After LoRA fine-tuning, debug with the official code but change device='auto' to your actual device, e.g. cpu or gpu (otherwise you get the error 'NotImplementedError: Cannot copy out of meta tensor; no data!'). With that change the test passes and the results look normal.
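The "meta tensor" error above can be reproduced without downloading any model: parameters created on PyTorch's meta device have shapes but no storage, so a plain `.to()` has no data to copy. A minimal sketch:

```python
import torch.nn as nn

# Build a module whose parameters live on the meta device (shapes only, no data).
m = nn.Linear(4, 4, device="meta")

caught = None
try:
    m.to("cpu")  # fails: meta tensors hold no data to copy
except NotImplementedError as e:
    caught = e
print(caught)  # NotImplementedError: Cannot copy out of meta tensor; no data! ...
```

Presumably the 'auto' device map leaves some weights as meta placeholders in this setup, which is why pinning a concrete cpu/cuda device (or materializing via `to_empty()` before loading real weights) sidesteps the error.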
@MonkeyNi Hi, when loading the LoRA with the official code, after reaching msg = model.load_state_dict(vpm_resampler_embedtokens_weight, strict=False), how exactly do I run a test?
Just follow the chat code.
Update to the latest code and try again.
```python
import json
import torch
from chat import MiniCPMVChat, img2base64
from peft import AutoPeftModelForCausalLM

torch.manual_seed(0)

model = AutoPeftModelForCausalLM.from_pretrained(
    # (the arguments were truncated in the original comment)
)
# path_to_adapter is assumed to be defined above (also omitted in the original)
vpm_resampler_embedtokens_weight = torch.load(f"{path_to_adapter}/vpm_resampler_embedtokens.pt")
msg = model.load_state_dict(vpm_resampler_embedtokens_weight, strict=False)

im_64 = img2base64('./assets/airplane.jpeg')
msgs = [{"role": "user", "content": "Tell me the model of this aircraft."}]
inputs = {"image": im_64, "question": json.dumps(msgs)}
```
This looks right.
After fine-tuning, can I test the model on a custom dataset? If so, how?
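One way, assuming the loading shown elsewhere in this thread already works: build the same `{"image", "question"}` inputs dict for each record of your dataset and loop. The record field names and the helper below are assumptions for illustration, not this repo's format:

```python
import json

def build_inputs(im_64, question):
    # im_64: base64-encoded image string (e.g. produced by img2base64)
    msgs = [{"role": "user", "content": question}]
    return {"image": im_64, "question": json.dumps(msgs)}

# Hypothetical evaluation loop over custom records (field names are made up).
records = [
    {"image_b64": "<base64 string>", "question": "What is the car mileage?"},
]
for rec in records:
    inputs = build_inputs(rec["image_b64"], rec["question"])
    # answer = chat_model.chat(inputs)  # then compare `answer` to your reference
```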
```python
from peft import AutoPeftModelForCausalLM

path_to_adapter = "xxxxxx/output_minicpmv2_lora/checkpoint-200"
vpm_resampler_embedtokens_weight = torch.load(f"{path_to_adapter}/vpm_resampler_embedtokens.pt")
model.load_state_dict(vpm_resampler_embedtokens_weight, strict=False)

im_64 = img2base64('xxxxxxxxxx')
msgs = [{"role": "user", "content": "What is the car mileage?"}]
inputs = {"image": im_64, "question": json.dumps(msgs)}
```

Traceback (most recent call last):
Try changing it to model.chat(image=im_64, msgs=json.dumps(msgs), tokenizer=tokenizer)
The LoRA loading method described under finetune/readme.md only covers part of the process. How exactly should I load the model for testing?