Recommending a Japanese speech recognition tool: ReazonSpeech #99

Open
fanglangxinghai opened this issue Apr 2, 2024 · 2 comments

Comments

@fanglangxinghai

With roughly the parameter count of Whisper's tiny model, it can be more accurate than Whisper's Large v2 model.

@PingZi-Wing

I read the introduction and it looks impressive, but when I tried it on Colab it didn't work; it crashed.

@PingZi-Wing

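I tinkered with it today and finally got it running on Colab. Conclusion: it is not as good as fast whisper large v2. It uses far too much memory: a 25-minute audio file blew through the free tier's 12 GB of RAM, and only a 20-minute file succeeded, with peak usage at 10 GB. It is not especially fast either: transcribing 20 minutes of audio took 4 minutes. It also seems to have a problem where it does not recognize the first 10 seconds, and the accuracy feels worse than large v2.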
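
For reference, the timing above (4 minutes to transcribe 20 minutes of audio) works out to a real-time factor of 0.2. Below is a minimal sketch of that arithmetic, plus one possible way to plan shorter windows for a long recording so that each transcription call sees less audio. This assumes memory use grows with clip length, which was not verified in this thread. The helper names `real_time_factor` and `chunk_spans` and the 10-minute window are hypothetical choices for illustration; no ReazonSpeech or Whisper API is used here.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time divided by audio duration; lower means faster."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def chunk_spans(total_seconds: float, window_seconds: float,
                overlap_seconds: float = 0.0) -> list:
    """Return (start, end) spans in seconds that cover the whole recording.

    Hypothetical helper: consecutive windows share `overlap_seconds` of audio
    so that words at a boundary are not cut off in both windows.
    """
    if window_seconds <= 0:
        raise ValueError("window_seconds must be positive")
    if not 0 <= overlap_seconds < window_seconds:
        raise ValueError("overlap_seconds must be in [0, window_seconds)")

    spans = []
    start = 0.0
    step = window_seconds - overlap_seconds
    while start < total_seconds:
        end = min(start + window_seconds, total_seconds)
        spans.append((start, end))
        if end >= total_seconds:
            break  # the last window reached the end of the recording
        start += step
    return spans


# Figures reported above: 20 minutes of audio transcribed in 4 minutes.
print(real_time_factor(4 * 60, 20 * 60))

# Plan 10-minute windows for the 25-minute file that ran out of memory.
for start, end in chunk_spans(25 * 60, 10 * 60):
    print(f"{start:.0f}s to {end:.0f}s")
```

Each span could then be cut out with any audio tool and transcribed on its own. Whether that actually keeps peak memory under the 12 GB limit would need to be measured.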
