Using the chat sample, it seems that instead of producing/detecting the EOS token, the model generates an 'assistant' string/token, which causes it to continually chat with itself until max tokens is reached.
I am using openvino-genai==2025.0.0.0rc2, and the Llama 3 model I am using was generated using:
The Optimum version used for conversion is from: pip install -r export-requirements.txt
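For illustration, here is a minimal pure-Python sketch of why this symptom appears (this is not openvino-genai's implementation; the token id and the scripted output are made up). If the Llama 3 end-of-turn token is not in the set of stop tokens the sampler checks, the loop never halts on it and simply runs until `max_new_tokens`, emitting the spurious follow-on "assistant" turn:

```python
# Hypothetical sketch of a stop-token check in a generation loop.
# The token id below is illustrative (Llama 3's <|eot_id|>), and the
# "model output" is a pre-scripted list rather than a real model.

EOT_ID = 128009  # assumed id for Llama 3's <|eot_id|> end-of-turn token


def generate(scripted_tokens, stop_token_ids, max_new_tokens):
    """Consume scripted model output, stopping on any stop token."""
    out = []
    for tok in scripted_tokens[:max_new_tokens]:
        if tok in stop_token_ids:
            break  # stop criterion hit: end the turn here
        out.append(tok)
    return out


# Scripted output: the answer, the end-of-turn token, then the extra
# self-chat tokens the model produces if it is never stopped.
scripted = [1, 2, 3, EOT_ID, 10, 11, 12]

# With <|eot_id|> registered as a stop token, generation ends cleanly.
print(generate(scripted, {EOT_ID}, max_new_tokens=100))  # → [1, 2, 3]

# Without it, generation runs on past the end of the turn,
# which matches the "model chats with itself" behaviour reported above.
print(generate(scripted, set(), max_new_tokens=100))
```

If this diagnosis applies, the fix would be making sure the exported model's generation config carries the correct `eos_token_id`/stop tokens, rather than changing the chat loop itself.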