Run speech to text for video for free. This outputs 3 files, 1 words.json
file that has all of the words said along with their timing and confidence, 1 transcript.txt
with just all the words read, and 1 captions.srt
file which is the captions for that video.
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt
This is the language library this speech to text uses. Download this Here
Once downloaded unzip in your project directory.
Video must be .mp4
or .mov
$ python main.py --video videoname.mp4
Video must be .wav
$ python main.py --audio audioname.wav