Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
-
Updated
Jun 14, 2024 - Python
Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
Transform audio recordings into text transcripts effortlessly with AudioTranscribe! 🎙️📝 Simplify your transcription process and enhance accessibility with top-notch accuracy. Explore the power of text-to-speech conversion today! 🚀🎧
Implemented some of the models and techniques learned in NLP to help build systems that help in daily life.
Web app for transcribing audio file (.wav format) to text usingGoogle Cloud Speech API.
Transcribe Bangla Audio into Text
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
Timestamped ASR microservice
Speech-to-Text Realtime with Extension là tiện ích mở rộng chuyển giọng nói thành văn bản tức thì. Hỗ trợ nhiều ngôn ngữ, phù hợp cho ghi chép cuộc họp, dịch vụ khách hàng, và hỗ trợ người khuyết tật. Dễ cài đặt và sử dụng trên các trình duyệt phổ biến, mang lại sự tiện lợi và hiệu quả cao.
An efficient desktop application for transcribing audio files into text using Vosk speech recognition.
Converts speech to text from any audio/video file
core shell functions building blocks for advanced AI pipelines
AudioTextPro: Convert audio to text accurately in real-time using our advanced AI speech recognition technology.
There is simple backend project to use whisper-rs.
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using LexRank.
Whisper Large V3 is a pre-trained model developed by OpenAI and designed for tasks like automatic speech recognition (ASR), speech translation and language identification.
AWS Lambda Function which creates a transcribe job, that reads mp3 file and converts it into text format in a json file.
A SwiftUI App For People Who Need To Take Down Important Information Quickly.
Edge AI > AI app to easily perform transcriptions on regular computers. Quality on par with on-cloud alternatives. Lower costs. Reduced privacy risks.
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."