I could not resist doing a trial implementation, and it already works really well! With the free Konele app as the client, my current implementation gives my de-Googled Android phone a simple, self-hosted, server-based voice recognition service. My current websocket wrapper for whisper.cpp is very crude and simple, but with the accuracy of Whisper and the fantastic implementation by ggerganov, the result is still impressive!
I created a git repo to help anyone who is interested in setting this up. It really works great!
For some time I have been using Kaldi voice recognition as a service on my home server. My (de-Googled) Android (LineageOS) phone communicates with this server through the (open source) Konele app from the F-Droid repo. This gives me voice recognition input and search services similar to "Ok Google" and Siri, in a free and open source environment completely under my control. It really works quite well, though I think Kaldi is not up to Whisper's level as a speech recognition system.
The communication between Konele (the Android voice input part) and the voice server backend uses websockets and doesn't seem too complex. I am considering making a similar websocket server wrapper around whisper.cpp that could substitute for the Kaldi-based backend and work with the existing Konele frontend. I searched around and it doesn't seem that anyone has done this yet.
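To make the idea a bit more concrete, here is a rough sketch of the per-connection message handling such a wrapper might need. It is only an illustration under assumptions that are not stated in this post: that the client sends audio as binary frames, marks the end with the text frame `EOS`, and expects a JSON reply containing a list of hypotheses. Those message shapes should be checked against what Konele and the Kaldi-based server actually exchange. `RecognitionSession` and the `transcribe` hook are hypothetical names; the hook stands in for whatever hands the audio to whisper.cpp, and the websocket transport itself is left out.

```python
import json
from typing import Callable, Optional, Union


class RecognitionSession:
    """Collects audio for one connection and replies once the client signals the end."""

    def __init__(self, transcribe: Callable[[bytes], str]):
        # Hypothetical hook: would pass the collected audio to whisper.cpp.
        self._transcribe = transcribe
        self._audio = bytearray()

    def feed(self, message: Union[bytes, str]) -> Optional[str]:
        """Handle one incoming websocket message; return a JSON reply or None."""
        if isinstance(message, (bytes, bytearray)):
            # Binary frame: treat it as another chunk of audio.
            self._audio.extend(message)
            return None
        if message == "EOS":
            # Assumed end-of-stream marker: transcribe everything received so far.
            text = self._transcribe(bytes(self._audio))
            self._audio.clear()
            # Assumed reply shape: status code plus a list of hypotheses.
            return json.dumps({
                "status": 0,
                "result": {"hypotheses": [{"transcript": text}], "final": True},
            })
        # Any other text frame is ignored in this sketch.
        return None
```

A websocket server would create one `RecognitionSession` per connection, call `feed()` for each incoming message, and send back any non-`None` return value. Keeping this logic separate from the transport makes it easy to try out with a stand-in `transcribe` function before wiring in whisper.cpp.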
Anyone have any thoughts on this?
Rich