[WIP][Bug] Issue 934 #962

riedgar-ms · 2024-07-23T17:49:08Z

Digging into #934

codecov-commenter · 2024-07-23T18:00:02Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 91.66667% with 1 line in your changes missing coverage. Please review.

Project coverage is 62.20%. Comparing base (b66f2a0) to head (362c7ec).
Report is 1 commits behind head on main.

Files	Patch %	Lines
guidance/models/llama_cpp/_llama_cpp.py	91.66%	1 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #962      +/-   ##
==========================================
+ Coverage   58.00%   62.20%   +4.19%     
==========================================
  Files          63       63              
  Lines        4848     4860      +12     
==========================================
+ Hits         2812     3023     +211     
+ Misses       2036     1837     -199

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

riedgar-ms · 2024-07-24T16:07:36Z

guidance/models/llama_cpp/_llama_cpp.py

+        # and has a tendency to segfault.
+        # To address this, shorten the bytes sent to the tokenizer to a valid
+        # UTF-8 value
+        # I hope this will not bite us with some subtle other bug in future


I'm not very optimistic about this. This 'fix' almost certainly breaks recode() when said fix kicks in. And fundamentally, these bytes came from the LLM, so why on earth is its tokeniser refusing to have anything to do with them?

Thanks so much for investigating this Richard. I agree, this feels like potentially buggy behavior directly in llama-cpp. Do we have a repro for 934 that directly uses the llama-cpp (or maybe llama-cpp-python if we need to) bindings? Instead of attempting to fix through this route, a lower level repo might mean we could/should raise an issue on llama-cpp?

@knilink had found:
printf '\xe6\xad' | ./llama-tokenize -m ./Meta-Llama-3-8B-Instruct.Q8_0.gguf --stdin

I have filed a llama-cpp-python-based issue (rewritten from @knilink 's investigation) directly on the HF repo whence I'm grabbing the GGUF

And filed the LlamaCpp bug:
ggerganov/llama.cpp#8691

riedgar-ms · 2024-07-24T18:00:28Z

tests/model_integration/test_model.py

+def test_with_multitokenchars(selected_model: guidance.models.Model):
+    # Taken from https://github.com/guidance-ai/guidance/issues/934
+    lm = selected_model
+    lm += "歪" + select(["打正着", "门邪道"])


For the record:

>>> a '打' >>> b '门' >>> a.encode() b'\xe6\x89\x93' >>> b.encode() b'\xe9\x97\xa8'

So the two leading characters in the select() do not share any common bytes here.

riedgar-ms · 2024-07-24T18:48:04Z

I have filed a bug on the HF Hub model:
https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF/discussions/9
But I think this is really a problem in the LlamaCpp layer

riedgar-ms added 5 commits July 23, 2024 13:27

Convert issue into test

91e6bc9

Add extra model

58994d3

Move test to better location

2b1c89c

Hook new model into PR gate

5477fd3

Add note into test

3ebd297

riedgar-ms added 4 commits July 24, 2024 10:33

Merge remote-tracking branch 'upstream/main' into riedgar-ms/issue-934

a0d8acd

Fixing model

9ff140a

Draft of a fix

6e79e50

Better check

ef0e515

riedgar-ms mentioned this pull request Jul 24, 2024

LlamaCpp model crashes with multi-token characters #934

Open

riedgar-ms commented Jul 24, 2024

View reviewed changes

mypy fix

1ec2793

riedgar-ms commented Jul 24, 2024

View reviewed changes

riedgar-ms added 2 commits July 25, 2024 08:35

Merge branch 'main' into riedgar-ms/issue-934

207b686

Merge branch 'main' into riedgar-ms/issue-934

362c7ec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP][Bug] Issue 934 #962

[WIP][Bug] Issue 934 #962

riedgar-ms commented Jul 23, 2024

codecov-commenter commented Jul 23, 2024 •

edited

Loading

riedgar-ms Jul 24, 2024 •

edited

Loading

Harsha-Nori Jul 25, 2024

riedgar-ms Jul 25, 2024 •

edited

Loading

riedgar-ms Jul 25, 2024

riedgar-ms Jul 24, 2024 •

edited

Loading

riedgar-ms commented Jul 24, 2024

[WIP][Bug] Issue 934 #962

Are you sure you want to change the base?

[WIP][Bug] Issue 934 #962

Conversation

riedgar-ms commented Jul 23, 2024

codecov-commenter commented Jul 23, 2024 • edited Loading

Codecov Report

riedgar-ms Jul 24, 2024 • edited Loading

Choose a reason for hiding this comment

Harsha-Nori Jul 25, 2024

Choose a reason for hiding this comment

riedgar-ms Jul 25, 2024 • edited Loading

Choose a reason for hiding this comment

riedgar-ms Jul 25, 2024

Choose a reason for hiding this comment

riedgar-ms Jul 24, 2024 • edited Loading

Choose a reason for hiding this comment

riedgar-ms commented Jul 24, 2024

codecov-commenter commented Jul 23, 2024 •

edited

Loading

riedgar-ms Jul 24, 2024 •

edited

Loading

riedgar-ms Jul 25, 2024 •

edited

Loading

riedgar-ms Jul 24, 2024 •

edited

Loading