You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -61,23 +61,23 @@ This repository refines the timestamps of openAI's Whisper model via forced alig
61
61
62
62
63
63
<h2align="left"id="setup">Setup ⚙️</h2>
64
-
Tested for PyTorch 0.11, Python 3.8 (use other versions at your own risk!)
64
+
Tested for PyTorch 2.0, Python 3.10 (use other versions at your own risk!)
65
65
66
66
GPU execution requires the NVIDIA libraries cuBLAS 11.x and cuDNN 8.x to be installed on the system. Please refer to the [CTranslate2 documentation](https://opennmt.net/CTranslate2/installation.html).
67
67
68
68
69
-
### 1. Create Python3.8 environment
69
+
### 1. Create Python3.10 environment
70
70
71
-
`conda create --name whisperx python=3.8`
71
+
`conda create --name whisperx python=3.10`
72
72
73
73
`conda activate whisperx`
74
74
75
75
76
-
### 2. Install PyTorch 0.11.0, e.g. for Linux and Windows:
76
+
### 2. Install PyTorch2.0, e.g. for Linux and Windows CUDA11.7:
Copy file name to clipboardExpand all lines: whisperx/transcribe.py
+1-6Lines changed: 1 addition & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -72,7 +72,6 @@ def cli():
72
72
73
73
parser.add_argument("--hf_token", type=str, default=None, help="Hugging Face Access Token to access PyAnnote gated models")
74
74
# parser.add_argument("--model_flush", action="store_true", help="Flush memory from each model after use, reduces GPU requirement but slower processing >1 audio file.")
75
-
parser.add_argument("--tmp_dir", default=None, help="Temporary directory to write audio file if input if not .wav format (only for VAD).")
0 commit comments