Enable padding_side as call time kwargs #33385
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
The failing test is not related to this PR; it seems to be related to training datasets and tasks. The PR is ready for review!
Awesome - thanks for adding!
Looks good to me, thanks @zucchini-nlp! This should unblock a lot of the kwargs uniformization PRs :). Anything blocking before this gets merged? Is it just the failing tests?
I wanted to get @ArthurZucker's review as well, since this is tokenizers-related. As for the tests, I can rerun them and hope they won't fail this time.
Looks great to me!
Thanks, can someone merge please, as the tests won't pass even after rerunning?
@zucchini-nlp Yep, I can merge
* fix
* add padding-side kwarg
* add padding side in all models & fix tests
* fix copies
* fix tests
What does this PR do?
This PR adds `padding_side` as a valid kwarg when calling tokenizers, so that users can set the padding side at tokenization time. It is a follow-up to #32858 (comment), where we found that most processors accept `padding_side` but do not use it when tokenizing. A test was added for this and run for all models.
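
A minimal sketch of the call-time usage this enables is below. The checkpoint name and example strings are illustrative, not taken from the PR diff:

```python
from transformers import AutoTokenizer

# Illustrative checkpoint; any tokenizer with a pad token behaves the same way.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

batch = ["a short sentence", "a much longer sentence that forces the first one to be padded"]

# Previously, the padding side had to be set as a tokenizer attribute.
tokenizer.padding_side = "left"
left_padded = tokenizer(batch, padding=True, return_tensors="pt")

# With this PR, padding_side can be passed at call time, overriding the
# attribute for this call only.
right_padded = tokenizer(batch, padding=True, padding_side="right", return_tensors="pt")
```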