Skip to content

Commit 62a0d83

Browse files
committed
0.1.4
added bark-gui and whisper-ui
1 parent 6d70d13 commit 62a0d83

File tree

9 files changed

+546
-361
lines changed

9 files changed

+546
-361
lines changed

README.md

Lines changed: 33 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@
2020
<img src="media/covers/seait_image.jpg">
2121

2222
<h1 align="center">Super Easy AI Installer Tool</h1>
23-
<h4 align="center">One SEAIT to Install Them, One Click to Launch Them, One Space-Saving Models Folder to Bind Them All.</h3>
23+
<h4 align="center"><em>One SEAIT to Install Them, One Click to Launch Them, One Space-Saving Models Folder to Bind Them All.</em></h4>
2424
<h4 align="center">Don't forget to leave a star.</h4>
2525

2626

@@ -61,22 +61,44 @@ But remains a great solution for users with minimal technical knowledge or exper
6161

6262
### Tested on Windows 10+ and Nvidia GPU-based cards
6363

64-
### Update [0.1.3]
64+
### Update [0.1.4]
6565

6666
Added
6767

68-
- Toolbox tab
69-
- Under toolbox tab, added symlink link creator
70-
- Out of space? No problem, now you can put all your models in one directory and create symbolic links to them.
71-
you can do this for any directory you want including virtual environments if you want it too.
68+
### bark-gui and openai whisper-ui both tested on GTX 970 4GB and worked great.
7269

73-
- Symlink Tool - a tool that creates symbolic links.
74-
- Models Treasury - a centralized folder for all the popular model directories. Simply assign to use on all your projects.
70+
[Play man](media/preview/0.1.4/final_23-25-37.wav)
7571

76-
Video tutorial soon.
77-
78-
### Spread the word; don't only keep it to yourself.
72+
[Play woman](media/preview/0.1.4/final_23-33-38.wav)
73+
74+
- bark-gui [text-to-speech and voice cloning]
75+
76+
https://github.com/suno-ai/bark
77+
78+
Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference.
79+
80+
https://github.com/C0untFloyd/bark-gui
81+
82+
bark-gui is This is a simple Web UI for an extended Bark Version using Gradio, meant to be run locally.
83+
84+
- whisper-ui [speech-to-text]
7985

86+
https://github.com/openai/whisper
87+
88+
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
89+
90+
A bit old maybe there new GUIs for whisper but i used this one.
91+
92+
https://github.com/hayabhay/whisper-ui
93+
94+
whisper-ui is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.
95+
96+
I have also have an old fork of this project with some differences that let chose gpu or cpu but its older then this one i might added later if requested.
97+
98+
Minor fixes and changes to the code.
99+
100+
### Spread the word; don't only keep it to yourself.
101+
<img src="media/preview/0.1.4/1_0.1.4.jpg">
80102
<img src="media/preview/0.1.2/1_0.1.2.jpg">
81103
<img src="media/preview/0.1.3/1_0.1.3.jpg">
82104
<img src="media/preview/0.1.2/2_0.1.2.jpg">

layout/project.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -81,7 +81,7 @@ def create_layout(project,lang_data):
8181
sg.Frame("",[
8282
[
8383
sg.MLine(f"""{project['description'][0]}""",k=f"{main_key}description_{project['id']}_console_ml-",visible=True,text_color=color.DIM_BLUE,border_width=10,sbar_width=20,sbar_trough_color=0,
84-
autoscroll=True, auto_refresh=True,expand_x=True,expand_y=True,font=FONT,no_scrollbar=True,disabled=True,size=(100,3)),
84+
autoscroll=True, auto_refresh=True,expand_x=True,expand_y=True,font=FONT,no_scrollbar=False,disabled=True,size=(100,3),),
8585
],
8686
],expand_x=True,expand_y=False,border_width=5,pad=(10,10),relief=sg.RELIEF_FLAT,element_justification="l",background_color=color.DARK_GRAY)
8787
] if project['description'] else [],

layout/toolbox.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -87,7 +87,7 @@ def create_layout(lang_data,tools):
8787
sg.Frame('',[
8888
[
8989
sg.Image(data=ic.args,background_color=color.DARK_GRAY),
90-
sg.Text(f"{lang_data[LOCAL_SET]} Symlink",font=FONT,text_color=color.LIGHT_GRAY,background_color=color.DARK_GRAY),
90+
sg.Text(f"Symlink Creator",font=FONT,text_color=color.LIGHT_GRAY,background_color=color.DARK_GRAY),
9191
],
9292
[
9393
sg.Frame('',[

media/preview/0.1.4/1_0.1.4.jpg

166 KB
Loading
2.15 MB
Binary file not shown.
3.16 MB
Binary file not shown.

seait.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -299,6 +299,8 @@ def set_project_path(window, id_number, input_project_path,trigger_event=True):
299299

300300
about_layout.events(event)
301301
toolbox_layout.events(event,values,window,lang_data)
302+
requirements_layout.events(window,event,lang_data)
303+
302304

303305

304306

util/CONSTANTS.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
VERSION = '0.1.3'
1+
VERSION = '0.1.4'
22
APP_TITLE = f"Super Easy AI Installer Tool - Ver {VERSION}"
33

44
FONT_S = 'Arial 8'

util/projects_data.py

Lines changed: 508 additions & 347 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)