Web app for transcribing audio file (.wav format) to text usingGoogle Cloud Speech API.
-
Updated
Jun 22, 2020 - HTML
Web app for transcribing audio file (.wav format) to text usingGoogle Cloud Speech API.
Transcribe Bangla Audio into Text
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
Timestamped ASR microservice
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. In this template, we will import the Whisper model on Inferless Platform.
AudioTextPro: Convert audio to text accurately in real-time using our advanced AI speech recognition technology.
There is simple backend project to use whisper-rs.
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. In this template, we will import the Whisper model on Inferless Platform.
A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using LexRank.
This application contains "Audio to text", "Dictation" and "Gender prediction" modules in it.
AWS Lambda Function which creates a transcribe job, that reads mp3 file and converts it into text format in a json file.
Whisper Large V3 is a pre-trained model developed by OpenAI and designed for tasks like automatic speech recognition (ASR), speech translation and language identification.
inter-convert between audio & text, easy to use with GUI desktop application by PaddleSpeech and PySide6.
TranscriptGen is an application for transcribing audio and video files. Transcription output is .txt or .srt. Most audio and video formats supported (with ffmpeg).
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
Transform audio recordings into text transcripts effortlessly with AudioTranscribe! 🎙️📝 Simplify your transcription process and enhance accessibility with top-notch accuracy. Explore the power of text-to-speech conversion today! 🚀🎧
Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
Speech-to-Text Realtime with Extension là tiện ích mở rộng chuyển giọng nói thành văn bản tức thì. Hỗ trợ nhiều ngôn ngữ, phù hợp cho ghi chép cuộc họp, dịch vụ khách hàng, và hỗ trợ người khuyết tật. Dễ cài đặt và sử dụng trên các trình duyệt phổ biến, mang lại sự tiện lợi và hiệu quả cao.
Streamline your video/audio intake by transforming multimedia content into navigable collections of transcribed text and summaries!
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."