Getting end timestamp in result(s) #982

rohithkodali · 2024-06-06T17:03:59Z

Is there any possibility to get start and end timestamp both for every token, currently we are getting only start time of every token.

A scenario that we face in general is, we have multiple speech samples where there is a long silence (>0.5 seconds to 3 seconds) as we have only the start time, the particular word is kind of spoken for 3 seconds which is not a proper scenario when we check the duration analysis.

Another question, is it possible to get logits in the result(s) along with timestamps so that we can apply some other algorithms on them?

csukuangfj · 2024-06-11T09:05:17Z

Please see
#989

Currently, we can only get stop timestamp of each token for CTC models.

csukuangfj · 2024-06-11T09:06:27Z

Another question, is it possible to get logits in the result(s) along with timestamps so that we can apply some other algorithms on them?

Please see

sherpa-onnx/sherpa-onnx/csrc/online-recognizer.h

Line 44 in 09efe54

std::vector<float> ys_probs; //< log-prob scores from ASR model

csukuangfj mentioned this issue Jun 11, 2024

Return token stop timestamp for CTC decoding. #989

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting end timestamp in result(s) #982

Getting end timestamp in result(s) #982

rohithkodali commented Jun 6, 2024

csukuangfj commented Jun 11, 2024

csukuangfj commented Jun 11, 2024

Getting end timestamp in result(s) #982

Getting end timestamp in result(s) #982

Comments

rohithkodali commented Jun 6, 2024

csukuangfj commented Jun 11, 2024

csukuangfj commented Jun 11, 2024