A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
-
Updated
Jun 21, 2024 - Python
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A video call application that recognizes gestures (signal language) and converts them into text and sound.
Generate captions for videos using the power of OpenAI's Whisper API
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
This repository is an implementation of the Wav2Vec2 model for converting speech into text through a series of speech recognition, noise removal and STT to transcribe the text from a video file.
Streamline your video/audio intake by transforming multimedia content into navigable collections of transcribed text and summaries!
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
An AI tools which helps to analyze any YouTube video, give the sentiment of the video and suggest description and topics related the content. Lastly, It extract the subtitles from the video by understanding the audio then transcribe it in any language with timestamps and also embed the subtitles into the video
Convert videos into colourful ASCII art for terminal display using Python and OpenCV.
A curated list of zero-shot captioning papers
SolutionAI App which can solve any problems or summarize any Image or Youtube video of any duration to the shortest summary you need.
Convert a video file or camera captured to display as text.
Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it will look the result of your conversion.
A Python tool for transcribing videos using Whisper
Unlimited Youtube-Transcript-Generator
Generate automatic transcripts and subtitles for your videos with the help of the neural network-based.
Add a description, image, and links to the video-to-text topic page so that developers can more easily learn about it.
To associate your repository with the video-to-text topic, visit your repo's landing page and select "manage topics."