Hello!
Made some changes and added a few features! As always Dot can be installed through the website, or the binaries can be found on this very release. Please note that the Windows update will release in a few days.
🦔 FEATURES:
- Added whisper.cpp, you can now use your voice to interact with Dot!
- Added Text-To-Speech! A TTS model can now be used to read the LLM generated answers. TTS funcionalities are handled via sherpaonnx, the default TTS model is this really cool GLaDOS voice. Changing the voice model is not fully supported yet but I can provide instructions on how to do that if anyone is interested.
- Brand new settings button! Really regret not having done this before but users can now change the LLM settings in multiple ways, things like context length, max tokens, or default prompt can now be easily changed.
- LLM selection! It is now possible to use Dot with your local LLM of choice, any GGUF model should work without issues!
Screen.Recording.2024-05-20.at.20.27.52.1.mov
🦆CHANGES:
- Phi-3 is now the default LLM that will be used by Dot, it is not only extremely light but it works surprisingly well for RAG tasks.
- Minor UI changes
🐝 KNOWN BUGS:
- Sometimes answers that are too long are not displayed at all. This is due to the JSON output generated by python being split due to its length, the JS render process is then unable to read the truncated JSON. This issue can now be avoided by decreasing the max tokens setting.
- The LLM can take an annoyingly long time to start up once the first message is sent, but otherwise it works fine once loaded.
🐓 WORKING ON:
- Docker image
- Linux version
- MacOS intel version
- Improved performance
- Making it work on 8gb RAM devices
- Text streaming
- Make things faster
- It now talks and hears, maybe it should also see? 🤔
Anyway, hope you enjoy! Please let me know if there are any issues :)