
Fixing bug 857 (regression from 0.1.14 to 0.1.15) #858

Open
wants to merge 8 commits into main

Conversation

FoxBuchele

Fixing regression issue in 0.1.15 where capturing text would cause the LM to slice it incorrectly due to stop and start tokens.
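For reference, a minimal sketch of the kind of code path affected (the model name and prompt here are assumptions, not taken from the original report in #857):

from guidance import models, system, user, assistant, gen

# Any chat model that wraps generations in role blocks; the model name is an assumption.
lm = models.Transformers("microsoft/Phi-3-mini-4k-instruct")

with system():
    lm += "Answer with a single word."
with user():
    lm += "What is the capital of France?"
with assistant():
    lm += gen("answer", max_tokens=5)

# On 0.1.14 this returns only the generated word; under the 0.1.15 regression the
# captured slice can be offset by the role start/stop tokens.
print(lm["answer"])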

@FoxBuchele (Author)

Converted to draft because I noticed some errors when writing tests (specifically, guidance functions within f-strings were not converted properly).

Looking into fixing it the 'right' way, and will re-open the pull request then, unless someone else gets to it first.

FoxBuchele marked this pull request as ready for review on May 25, 2024 at 20:10.
@riedgar-ms (Collaborator)

Could you add a test which fails without your fix?

@codecov-commenter commented May 28, 2024

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 61.25%. Comparing base (4ae2ea6) to head (cf2bf6f).
Report is 1 commit behind head on main.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #858      +/-   ##
==========================================
+ Coverage   56.43%   61.25%   +4.81%     
==========================================
  Files          63       63              
  Lines        4791     4798       +7     
==========================================
+ Hits         2704     2939     +235     
+ Misses       2087     1859     -228     

☔ View full report in Codecov by Sentry.

@FoxBuchele (Author)

I can definitely try!

The fix is only relevant/used in models that use role blocks (with system(), with assistant(), etc) that also have closing tags whenever the role is completed, so I'll have to dig into the test suite to determine how to write a test properly that only runs on some types of LMs.
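As a rough illustration of what those closing tags mean for a capture (ChatML-style tags are used here purely as an example; actual templates vary by model):

# Role blocks surround the generation with opening and closing tags.
opening = "<|im_start|>assistant\n"
generated = "Paris"            # what gen() actually produced
closing = "<|im_end|>"         # added when the role block is closed

full_text = opening + generated + closing
# If the capture is sliced out of full_text without accounting for these tags,
# it can come back shifted, or with the closing tag still attached.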

@riedgar-ms (Collaborator)

> I can definitely try!
>
> The fix is only relevant/used in models that use role blocks (with system(), with assistant(), etc) that also have closing tags whenever the role is completed, so I'll have to dig into the test suite to determine how to write a test properly that only runs on some types of LMs.

Look at the selected_model_name and selected_model fixtures for that. You can see one approach here:

def llamacpp_model(selected_model, selected_model_name):

@FoxBuchele (Author)

> I can definitely try!
> The fix is only relevant/used in models that use role blocks (with system(), with assistant(), etc) that also have closing tags whenever the role is completed, so I'll have to dig into the test suite to determine how to write a test properly that only runs on some types of LMs.
>
> Look at the selected_model_name and selected_model fixtures for that. You can see one approach here:
>
> def llamacpp_model(selected_model, selected_model_name):

Thanks!! That was very helpful.

I added a single basic test that ensures if you capture text within a role block, the text captured is sliced from the model at the appropriate location.


from ..utils import get_model


@pytest.fixture(scope="module")
def instruct_model(selected_model, selected_model_name):
    if selected_model_name in ["transformers_phi3cpu_mini_4k_instruct"]:
Collaborator


Is this the only model for which the test works? I thought that some of the others supported the role tags? Perhaps move the fixture to conf.py and call it something like model_with_role_tags?

Author


Ah, it appears so! It looks like there are both transformers and non-transformers versions of Phi-3 mini instruct 4k, as well as a couple of other instruct versions I recognized.

Good idea to move it to the conftest.py file, though; I can definitely see situations where we could use more tests specifically for models that utilize roles.
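A sketch of how the fixture and test could look once moved to conftest.py, using the model_with_role_tags name suggested above (the model list and the final assertion are assumptions, not the PR's actual code):

# conftest.py (sketch; assumes the existing selected_model / selected_model_name fixtures)
import pytest

ROLE_TAG_MODELS = [
    "transformers_phi3cpu_mini_4k_instruct",
    # ...plus whichever other instruct/chat models emit role tags
]

@pytest.fixture(scope="module")
def model_with_role_tags(selected_model, selected_model_name):
    if selected_model_name in ROLE_TAG_MODELS:
        return selected_model
    pytest.skip(f"{selected_model_name} does not use role tags")

# test sketch: a capture inside a role block should contain only the generated text
from guidance import assistant, gen

def test_capture_inside_role_block(model_with_role_tags):
    lm = model_with_role_tags
    with assistant():
        lm += gen("captured", max_tokens=10)
    assert "<|" not in lm["captured"]  # illustrative check: no role/format tags leak in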

@riedgar-ms (Collaborator)

@FoxBuchele there seems to be a test failure related to this?

@@ -1,6 +1,9 @@
from .._guidance import guidance
from .._grammar import capture as grammar_capture, GrammarFunction

# Adapted from active_role_end in _model.py, functionality should be shared probably?
import re
format_pattern = re.compile(r"<\|\|_.*?_\|\|>", flags=re.DOTALL)
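For a quick sense of what that pattern strips, a standalone example (the tag contents here are made up; these markers are guidance-internal formatting tags rather than model output):

import re

format_pattern = re.compile(r"<\|\|_.*?_\|\|>", flags=re.DOTALL)

raw_capture = "The answer is Paris<||_html:</span>_||>"
clean = format_pattern.sub("", raw_capture)
print(clean)  # -> The answer is Paris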
Collaborator


Slightly concerned that this appears to be relying on ChatML tags, which not all models use.
