|
20 | 20 | <img src="media/covers/seait_image.jpg">
|
21 | 21 |
|
22 | 22 | <h1 align="center">Super Easy AI Installer Tool</h1>
|
23 |
| - <h4 align="center">One SEAIT to Install Them, One Click to Launch Them, One Space-Saving Models Folder to Bind Them All.</h3> |
| 23 | + <h4 align="center"><em>One SEAIT to Install Them, One Click to Launch Them, One Space-Saving Models Folder to Bind Them All.</em></h4> |
24 | 24 | <h4 align="center">Don't forget to leave a star.</h4>
|
25 | 25 |
|
26 | 26 |
|
@@ -61,22 +61,44 @@ But remains a great solution for users with minimal technical knowledge or exper
|
61 | 61 |
|
62 | 62 | ### Tested on Windows 10+ and Nvidia GPU-based cards
|
63 | 63 |
|
64 |
| -### Update [0.1.3] |
| 64 | +### Update [0.1.4] |
65 | 65 |
|
66 | 66 | Added
|
67 | 67 |
|
68 |
| -- Toolbox tab |
69 |
| -- Under toolbox tab, added symlink link creator |
70 |
| - - Out of space? No problem, now you can put all your models in one directory and create symbolic links to them. |
71 |
| - you can do this for any directory you want including virtual environments if you want it too. |
| 68 | +### bark-gui and OpenAI whisper-ui were both tested on a GTX 970 4GB and worked great. |
72 | 69 |
|
73 |
| -- Symlink Tool - a tool that creates symbolic links. |
74 |
| -- Models Treasury - a centralized folder for all the popular model directories. Simply assign to use on all your projects. |
| 70 | +[Play man](media/preview/0.1.4/final_23-25-37.wav) |
75 | 71 |
|
76 |
| -Video tutorial soon. |
77 |
| - |
78 |
| -### Spread the word; don't only keep it to yourself. |
| 72 | +[Play woman](media/preview/0.1.4/final_23-33-38.wav) |
| 73 | + |
| 74 | +- bark-gui [text-to-speech and voice cloning] |
| 75 | + |
| 76 | + https://github.com/suno-ai/bark |
| 77 | + |
| 78 | + Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference. |
| 79 | + |
| 80 | + https://github.com/C0untFloyd/bark-gui |
| 81 | + |
| 82 | + bark-gui is a simple web UI for an extended version of Bark, built with Gradio and meant to be run locally. |
| 83 | + |
| 84 | +- whisper-ui [speech-to-text] |
79 | 85 |
|
| 86 | + https://github.com/openai/whisper |
| 87 | + |
| 88 | + Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. |
| 89 | + |
| 90 | + This UI is a bit old, and there may be newer GUIs for Whisper, but this is the one I used. |
| 91 | + |
| 92 | + https://github.com/hayabhay/whisper-ui |
| 93 | + |
| 94 | + whisper-ui is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It lets you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files. |
| 95 | + |
| 96 | + I also have an old fork of this project, with some differences that let you choose GPU or CPU, but it is older than this one. I might add it later if requested. |
| 97 | + |
| 98 | +Minor fixes and changes to the code. |
| 99 | + |
| 100 | +### Spread the word; don't only keep it to yourself. |
| 101 | +<img src="media/preview/0.1.4/1_0.1.4.jpg"> |
80 | 102 | <img src="media/preview/0.1.2/1_0.1.2.jpg">
|
81 | 103 | <img src="media/preview/0.1.3/1_0.1.3.jpg">
|
82 | 104 | <img src="media/preview/0.1.2/2_0.1.2.jpg">
|
|