-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Installed this to use Existing WhisperX installation Conda (that uses my Nvidia GPU) and chose GPU option, but it only uses CPU #15
Comments
Hi,
Rerun the GUI and you should be able to select the "cuda" option for GPU. Note that this will work if your environment was set up correctly and Pytorch for GPU is installed. So, if that didn't work, I suggest you select to create a new environment, as it's the recommended way. To do that, first delete the Let me know if you have more trouble and, if so, attach the logs that appear in the terminal when you set up the program for the first time, so that I can know exactly when the bug appears. I hope this helps. |
Thank you for the response, but strangely enough I don't appear to have a config.json file in that folder or anywhere else...I only have these two files in the config folder: Because I lack it entirely as if I "deleted" it (as per your instructions above), I ran the whisper-gui.bat and got this error/result: ?? |
Seems an error when trying to install additional dependencies. I will look into it and try to recreate the error. |
Thanks. Let me know if you need anything else from me. |
Hi, I could replicate your error on Windows, should be solved now. I suggest you delete your However, I realized some errors may raise due to the latest releases of some packages:
For now, this last error only happens on Windows for me, I had no problems running the program on Linux. Maybe you could find a workaround with WSL. |
Thanks for the details! I guess I'm stuck waiting for CTranslate2 to be fixed.... :-( I can try WSL...I have Ubuntu installed, but I have WhisperX already working via command line in Windows. I guess I can reinstall everything there for now. Same issue with my Ubuntu WSL installation....odd, I guess I'll just wait. ...also I don't have a models\whisperx\folder to delete a model in the Windows version (to fix the first thing you listed) but I'll re-run the command anyways, so I guess I'm just waiting in either case. |
I've posted an issue in the CTranslate2 repo: OpenNMT/CTranslate2#1630 |
Yeah, so basically, there appear to be 2 solutions:
Let me know if that solved your problem. |
I think I've already tried number 1, and it still had a problem...but I'm gonna double check that. I will let you know! |
Hmm, well I just ran whisper-gui.bat and it asked me to update, and enable auto-updates and then started up. I can select CUDA now, so I guess it's all working now! :-) One more question..how do I set --task translate to work via this GUI for a given video/audio file? I use this command flag often for most of the videos I process through Whisper-X. Thank you for all the help and info. |
I'm glad it's working for you now! I'm closing this issue as it's been solved. Feel free to open a new issue for a new suggestion for this project or if you encounter any more bugs. Thanks. |
Hello, your code is very good. I would like to ask how to add localization options. I want to translate the interface into Chinese. |
Also, when the subtitles and audio files are forced to align, if the wav2vec2 model of the corresponding language is selected for processing, will the effect be better than the current general model? |
Every time I run the .bat file, I get this message: "torchvision is not available - cannot save figures". What is going on? Is there anything I need to do? |
Hi, thank you!
Yes, I would say the transcription should be more accurate.
This is fine, not a problem. Just a warning you can avoid by installing torchvision, but you can safely ignore it. |
Nice. How to add the corresponding wav2vec2 model to the options? Does it mean to modify the corresponding item in the code? If so, how? |
Wav2vec2 is an alternative, different model to Whisper. If we were to use it, we would need to add an option to choose between both, and modify some functions like _transcribe() and add a function transcribe_wav2vec2(). Are you interested in contributing? |
The Gui won't let me switch to CPU. Did I miss something? The instance of WhipserX from the command line IS using my Nvidia GPU.
I'm using Windows 11.
The text was updated successfully, but these errors were encountered: