You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Encounter error when running qwen2-VL in ipex-llm processing input video with big frame number, below is detail error message and code, video attached as well.
Error information
File "/home/lvm/qwenvl/reproduce.py", line 53, in query_video
generated_ids = model.generate(**inputs, max_new_tokens=128)
File "/home/lvm/miniforge3/envs/qwen/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/home/lvm/miniforge3/envs/qwen/lib/python3.10/site-packages/ipex_llm/transformers/pipeline_parallel.py", line 283, in generate
return original_generate(self,
File "/home/lvm/miniforge3/envs/qwen/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/home/lvm/miniforge3/envs/qwen/lib/python3.10/site-packages/transformers/generation/utils.py", line 2215, in generate
result = self._sample(
File "/home/lvm/miniforge3/envs/qwen/lib/python3.10/site-packages/transformers/generation/utils.py", line 3249, in _sample
next_tokens = torch.multinomial(probs, num_samples=1).squeeze(1)
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
hi, this error is caused by fp16 overflow, we'll fix it as soon as possible.
For now, if you are using Arc A7xx/5xx/3xx or Lunar Lake (Ultra 2xxV), you can try model = model.float().to("xpu") instead of model = model.half().to("xpu").
Encounter error when running qwen2-VL in ipex-llm processing input video with big frame number, below is detail error message and code, video attached as well.
Error information
Video:
https://github.com/user-attachments/assets/fa970bd8-294b-44c3-b807-ffa3f85e1046
Code:
The text was updated successfully, but these errors were encountered: