-
Notifications
You must be signed in to change notification settings - Fork 100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
utterance ref cannot be empty ? #98
Comments
My reasoning at the time was that evaluation datasets like |
Ok, would you be open to changing the behavior ? |
Yes, do you think a |
Yes, the |
Do you know how |
Not sure how sclite treats that case. Anyway, this is unlikely to happen, as sclite is typically used with test-sets with touhsands of utterances. With 1 utterance, 0 ref-word edge case, you are right, that this leads to division by zero. So the WER sholud be be |
Hello,
is there a good reason why an utterance ref is required to be non-empty ?
https://github.com/jitsi/jiwer/blob/9db6e4649dfff1e91de5640e224ea51de01b0a50/jiwer/process.py#L158C1-L159C69
IMHO, i'd expect that it can be empty (sclite behavior).
It is a valid situation, if the utterance in test set contains just silence, it's reference is empty,
and the ASR system should produce an empty string and not hallucinate any symbol.
I hacked it accordingly here:
https://github.com/KarelVesely84/jiwer/tree/allow_empty_ref
Best regards
Karel Vesely
The text was updated successfully, but these errors were encountered: