Open-source AI library (audio to text, simple NLP, and common algorithms)
-
Updated
Jun 29, 2024 - Python
Open-source AI library (audio to text, simple NLP, and common algorithms)
Converts speech to text from any audio/video file
Timestamped ASR microservice
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
GUI Showcase of using Whisper to transcribe and analyze Youtube video
There is simple backend project to use whisper-rs.
Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
The "Audio to Text Transcription with AssemblyAI and Streamlit" project is a web application that allows users to upload audio files and convert them into text using the AssemblyAI API.
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
Speech-to-Text Realtime with Extension là tiện ích mở rộng chuyển giọng nói thành văn bản tức thì. Hỗ trợ nhiều ngôn ngữ, phù hợp cho ghi chép cuộc họp, dịch vụ khách hàng, và hỗ trợ người khuyết tật. Dễ cài đặt và sử dụng trên các trình duyệt phổ biến, mang lại sự tiện lợi và hiệu quả cao.
AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.
An application in which you can record the output audio stream and turn it into text format.
Edge AI > AI app to easily perform transcriptions on regular computers. Quality on par with on-cloud alternatives. Lower costs. Reduced privacy risks.
An efficient desktop application for transcribing audio files into text using Vosk speech recognition.
This application contains "Audio to text", "Dictation" and "Gender prediction" modules in it.
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."