Adding evaluation checks to prevent Transformer ValueError #3105

stsfaroz · 2024-12-01T09:51:43Z

When neither eval_dataset nor evaluator is provided, Transformers raises a ValueError stating that an eval_dataset must be passed or eval_strategy. However, this error does not account for the evaluator parameter.

Error raised by Transformers:

ValueError: You have set `args.eval_strategy` to "steps" but you didn't pass an `eval_dataset` to `Trainer`. 
Either set `args.eval_strategy` to "no" or pass an `eval_dataset`.

Instead, we now raise a more specific error:

ValueError: Either `eval_dataset` or `evaluator` must be provided for evaluation, 
or set `eval_strategy='no'` to skip evaluation.

… edge case

tomaarsen · 2024-12-02T11:51:09Z

Hello!

Thanks for tackling this, I think it's indeed quite smart to "get ahead" of the error that transformers will give and give our own Sentence Transformers-specific error. I updated the phrasing a bit because I quite like how the transformers error flowed ("you did X, but not Y. Please do Y or avoid X").

The edge case of no eval_strategy/no evaluator was already tackled a bit in #3035 to get Transformers v4.46 compatibility, but the "no eval_strategy and no evaluator" case was left as-is:

If evaluator is not set, then the ValueError is acceptable I think

So I also updated the test that I made back then with more details about what the expected ValueError should be.

What do you think? @stsfaroz

Tom Aarsen

tomaarsen · 2024-12-02T12:03:31Z

Looks like the tests failed for Python 3.9 and 3.10 because Python 3.11 changed how Enums are formatted by default: https://docs.python.org/3/whatsnew/3.11.html#enum

I.e. they now get printed as IntervalStrategy.STEPS instead of "steps" (the string they represent).

Tom Aarsen

Copilot reviewed 2 out of 2 changed files in this pull request and generated no suggestions.

stsfaroz and others added 7 commits December 1, 2024 15:14

Adding evaluation checks to prevent Transformer ValueError

2cf93ee

Update trainer.py

dddddae

Update trainer.py

42e8de2

Update trainer.py

5736c0c

Rewrite comments & ValueError somewhat

06606ce

Fix a typo that I introduced

1706dfb

Extend the no_eval_dataset_with_eval_strategy test with newly updated…

f168b5c

… edge case

Fix tests for Python 3.9, 3.10

5693bef

tomaarsen requested a review from Copilot December 2, 2024 12:25

Copilot AI reviewed Dec 2, 2024

View reviewed changes

tomaarsen approved these changes Dec 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding evaluation checks to prevent Transformer ValueError #3105

Adding evaluation checks to prevent Transformer ValueError #3105

stsfaroz commented Dec 1, 2024

tomaarsen commented Dec 2, 2024

tomaarsen commented Dec 2, 2024

Adding evaluation checks to prevent Transformer ValueError #3105

Are you sure you want to change the base?

Adding evaluation checks to prevent Transformer ValueError #3105

Conversation

stsfaroz commented Dec 1, 2024

tomaarsen commented Dec 2, 2024

tomaarsen commented Dec 2, 2024

Choose a reason for hiding this comment