Fix LLaMA 3.2, add `clean_special_chars` #289

gsarti · 2024-10-15T15:04:59Z

Description

This PR fixes support for multi-EOS models (e.g. LLaMA 3.2, closes #287) and adds a new clean_special_chars: bool = False argument to model.attribute to support the cleaning of special characters from tokens in the out.source and out.target sequences using the native tokenizer.decode function provided by transformers.

Also adds GraniteForCausalLM, GraniteMoeForCausalLM and OlmoeForCausalLM to the model config.

gsarti added 2 commits October 15, 2024 16:59

Fix LLaMA 3.2, add clean_special_chars

2c12311

Add model configs and changelogs

2a607e6

gsarti merged commit d7c5269 into main Oct 15, 2024
3 checks passed

gsarti deleted the clean-char-llama-32 branch October 15, 2024 15:37

gsarti mentioned this pull request Oct 15, 2024

Does Inseq support attributions with example granularity as in Captum's few-shot? #285

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LLaMA 3.2, add `clean_special_chars` #289

Fix LLaMA 3.2, add `clean_special_chars` #289

gsarti commented Oct 15, 2024 •

edited

Loading

Fix LLaMA 3.2, add clean_special_chars #289

Fix LLaMA 3.2, add clean_special_chars #289

Conversation

gsarti commented Oct 15, 2024 • edited Loading

Description

Fix LLaMA 3.2, add `clean_special_chars` #289

Fix LLaMA 3.2, add `clean_special_chars` #289

gsarti commented Oct 15, 2024 •

edited

Loading