Is it possible to get both the start and end timestamp for every token? Currently we only get the start time of each token.
A scenario we commonly face: many of our speech samples contain long silences (>0.5 seconds, up to 3 seconds). Because we only have start times, a word followed by such a silence appears to be spoken for up to 3 seconds, which distorts our duration analysis.
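To make the problem concrete, here is a minimal sketch of the workaround we could apply on our side when only start times are available: treat the next token's start as an upper bound on the end time and cap each token at a maximum duration. The function name `estimate_token_ends`, the `max_duration` cap, and the `final_end` argument are our own assumptions for illustration and are not part of any library API. It is only a heuristic, which is why real end timestamps would be preferable.

```python
from typing import List, Tuple


def estimate_token_ends(
    starts: List[float],
    final_end: float,
    max_duration: float = 0.5,
) -> List[Tuple[float, float]]:
    """Heuristically pair each token start time with an estimated end time.

    starts:       token start times in seconds, in ascending order
    final_end:    end of the audio (upper bound for the last token)
    max_duration: assumed longest plausible token duration (hypothetical cap)
    """
    spans = []
    for i, start in enumerate(starts):
        # Without real end times, the next token's start is the only upper bound.
        next_start = starts[i + 1] if i + 1 < len(starts) else final_end
        # Cap the duration so a trailing silence is not counted as speech.
        end = min(next_start, start + max_duration)
        spans.append((start, end))
    return spans


# The second token is followed by a 3-second silence; the cap limits it to 0.5 s.
print(estimate_token_ends([0.0, 0.25, 3.25], final_end=3.5))
# [(0.0, 0.25), (0.25, 0.75), (3.25, 3.5)]
```

The cap is arbitrary and will cut off genuinely long words or merge short pauses into a token.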
Another question: is it possible to get the logits in the result(s) along with the timestamps, so that we can apply other algorithms to them?