
Use chat template #135

Merged · 2 commits · Oct 22, 2024
Conversation

@DePasqualeOrg (Contributor) commented on Sep 29, 2024

Since huggingface/swift-transformers#104, it's now possible to use the chat template from tokenizer_config.json. I've updated LLMEval to use the chat template, but I noticed that the output from the Mistral models ends with <|im_end|>, even when this string is included in extraEOSTokens. Perhaps @pcuenca has an idea why this might be happening?
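For reference, a minimal sketch of the flow this PR adopts, using the swift-transformers API added in huggingface/swift-transformers#104 (the model ID and message content below are illustrative placeholders, not taken from this PR):

```swift
import Tokenizers

// Illustrative sketch: the tokenizer loads the chat template from
// tokenizer_config.json and renders the message list into prompt token IDs.
// The model ID is a placeholder, not necessarily what LLMEval uses.
let tokenizer = try await AutoTokenizer.from(pretrained: "mistralai/Mistral-7B-Instruct-v0.3")
let messages = [["role": "user", "content": "Why is the sky blue?"]]
let promptTokens = try tokenizer.applyChatTemplate(messages: messages)
```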

@DePasqualeOrg marked this pull request as draft on Sep 30, 2024
@DePasqualeOrg (Author) commented on Sep 30, 2024

Using the chat templates with Llama 3.2 will result in an error until none and tojson are implemented in the Jinja package: johnmai-dev/Jinja#4

@davidkoski (Collaborator) commented:

> Using the chat templates with Llama 3.2 will result in an error until none and tojson are implemented in the Jinja package: maiqingqiang/Jinja#4

OK, should we hold off on taking this until that is merged (and we can pick it up here)?

@DePasqualeOrg (Author) commented:

> OK, should we hold off on taking this until that is merged (and we can pick it up here)?

Yes, since LLMEval includes model configs for Llama 3.2. In the meantime, this branch can be used to test Llama 3.2 with the work in progress on Jinja.

@DePasqualeOrg (Author) commented on Sep 30, 2024

We had a similar issue with an EOS token at the end of the output of Gemma 2: #89

This shouldn't be appearing in the output if it's included in extraEOSTokens. It could be an issue with the tokenizer, because I can see when it's generating in my app that <|im_end|> is being produced as multiple tokens, and it sometimes even continues generating beyond this string.
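A sketch of the kind of check that would catch this (a hypothetical helper, not LLMEval's actual implementation): since <|im_end|> can arrive split across several tokens, stopping on a single token ID isn't enough, but a suffix check on the decoded text still works.

```swift
// Hypothetical helper, not LLMEval's actual code: stop generation when the
// decoded output ends with any of the extra EOS strings, even if the
// tokenizer emitted that string as multiple tokens.
func shouldStop(decodedText: String, extraEOSTokens: Set<String>) -> Bool {
    extraEOSTokens.contains { decodedText.hasSuffix($0) }
}

// e.g. shouldStop(decodedText: output, extraEOSTokens: ["<|im_end|>"])
```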

@DePasqualeOrg (Author) commented:
The problem with Mistral 7B is resolved here: huggingface/swift-transformers#134

@DePasqualeOrg marked this pull request as ready for review on Oct 5, 2024
@DePasqualeOrg (Author) commented:
Llama 3.2 1B and 3B and Mistral 7B now work with chat templates, thanks to the changes in swift-transformers and Jinja.

@davidkoski mentioned this pull request on Oct 22, 2024
@davidkoski (Collaborator) left a review comment:
Looks great, thank you!

@davidkoski merged commit 5a7a1a4 into ml-explore:main on Oct 22, 2024
3 checks passed