VoiceWorker

VoiceWorker is a Streamlit-based application for transcribing audio files or recordings using the Faster Whisper model. This app supports multiple languages and allows users to choose from various pre-trained model sizes for transcription tasks.

Why Faster Whisper?

Faster Whisper is a reimplementation of OpenAI's Whisper model built on the CTranslate2 inference engine. It offers:

Enhanced Speed: Faster Whisper runs inference noticeably faster than the original implementation (the project reports up to roughly 4x at the same accuracy), which suits interactive or large-scale transcription tasks.
Resource Efficiency: It uses less memory and compute, which makes transcription practical on devices with limited resources.
Customizability: It supports several compute types (e.g., float16 or 8-bit quantization), so performance can be tuned to the available hardware.

Difference Between Faster Whisper and Whisper

| Feature | Faster Whisper | Original Whisper |
| --- | --- | --- |
| Speed | Faster processing and inference | Slower inference |
| Resource Usage | Lower memory and computational needs | Higher memory and resource demand |
| Flexibility | Configurable compute types (e.g., float16, int8) | float16 or float32 only, no built-in int8 quantization |
| Scalability | Suitable for near-real-time, interactive transcription | Better suited to offline, batch processing |

Faster Whisper is especially advantageous for interactive applications like VoiceWorker, where speed and efficiency are critical.
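
The compute-type flexibility described above can be sketched as follows. This is a minimal illustration, not VoiceWorker's actual code: the helper names `pick_compute_type`, `load_model`, and `transcribe_file`, and the device-to-type mapping, are assumptions made for the example. `WhisperModel` and its `device` and `compute_type` arguments come from the faster-whisper package.

```python
def pick_compute_type(device: str) -> str:
    """Return a reasonable compute type for the device (assumed mapping)."""
    # float16 needs GPU support; int8 quantization keeps CPU memory use low.
    return "float16" if device == "cuda" else "int8"


def load_model(model_size: str = "small", device: str = "cpu"):
    """Load a Faster Whisper model; the import is deferred until it is needed."""
    from faster_whisper import WhisperModel

    return WhisperModel(
        model_size, device=device, compute_type=pick_compute_type(device)
    )


def transcribe_file(path: str, language: str = "en", device: str = "cpu") -> str:
    """Transcribe one file and return the joined text (hypothetical helper)."""
    model = load_model(device=device)
    # transcribe() returns a lazy generator of segments plus metadata.
    segments, _info = model.transcribe(path, language=language)
    return " ".join(segment.text.strip() for segment in segments)
```

Calling `transcribe_file("meeting.mp3")` would download the chosen model on first use, so the first call is slower than later ones.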

Features

Transcribe audio or video files.
Record live audio for real-time transcription.
Support for over 90 languages.
GPU acceleration with CUDA (if available).
Flexible model size selection.
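
The "if available" part of GPU acceleration can be handled with a small probe. The sketch below assumes PyTorch is the component used to detect CUDA; the function name `detect_device` is hypothetical and the app itself may decide differently.

```python
def detect_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # Without PyTorch there is no way to probe CUDA here, so use the CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Transcription device: {detect_device()}")
```

Falling back to the CPU keeps the app usable on machines without a GPU, at the cost of slower transcription.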

Installation

Clone this repository:

```
git clone https://github.com/amgawishx/VoiceWorker.git
cd VoiceWorker
```

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
streamlit run app.py
```

Usage

Launch the app with the command above.
Choose your desired Whisper model size and transcription language in the app interface.
Upload an audio/video file or use the built-in audio recorder.
View the transcription output directly in the app.
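
The steps above map onto a short Streamlit script. The following is a rough sketch of that flow under stated assumptions, not the contents of `app.py`: the widget labels, the language subset, the CPU/int8 settings, and the helper `format_segments` are all illustrative choices.

```python
import tempfile
from pathlib import Path


def format_segments(segments) -> str:
    """Render segments as one '[start -> end] text' line each."""
    return "\n".join(
        f"[{seg.start:.2f}s -> {seg.end:.2f}s] {seg.text.strip()}" for seg in segments
    )


def main() -> None:
    # Imports are deferred so format_segments works without these packages.
    import streamlit as st
    from faster_whisper import WhisperModel

    st.title("VoiceWorker (sketch)")
    model_size = st.selectbox("Model size", ["tiny", "base", "small", "medium"])
    language = st.selectbox("Language", ["en", "ar", "de", "fr"])  # assumed subset
    uploaded = st.file_uploader("Audio or video file")

    if uploaded is not None:
        # Write the upload to disk so the model can read it by path.
        suffix = Path(uploaded.name).suffix
        with tempfile.NamedTemporaryFile(suffix=suffix, delete=False) as tmp:
            tmp.write(uploaded.getvalue())
        model = WhisperModel(model_size, device="cpu", compute_type="int8")
        segments, _info = model.transcribe(tmp.name, language=language)
        # Segments are produced lazily, so the file must exist until this point.
        st.text(format_segments(segments))
        Path(tmp.name).unlink()
```

To try it, save the block as a script, add a final `main()` call, and launch it with `streamlit run`. Loading the model on every rerun is wasteful; a real app would likely cache it.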

Requirements

Python 3.8+
Streamlit
Faster Whisper
PyTorch
GPU with CUDA support (optional but recommended for faster processing)
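
A quick way to confirm these requirements before launching the app is a small check script. This is only a sketch: the import names in `REQUIRED_MODULES` are assumed to match the packages listed above, and `missing_requirements` is a hypothetical helper rather than part of the repository.

```python
import sys
from importlib.util import find_spec

# Import names assumed for the packages listed above.
REQUIRED_MODULES = ("streamlit", "faster_whisper", "torch")


def missing_requirements(modules=REQUIRED_MODULES, min_python=(3, 8)):
    """Return a list of human-readable problems; empty means all checks passed."""
    problems = []
    if sys.version_info < min_python:
        problems.append(f"Python {min_python[0]}.{min_python[1]}+ is required")
    problems.extend(
        f"missing package: {name}" for name in modules if find_spec(name) is None
    )
    return problems


if __name__ == "__main__":
    for problem in missing_requirements():
        print(problem)
```

No output means the interpreter version and all three packages were found; the optional GPU is not checked here.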

Acknowledgments

Faster Whisper for providing the optimized transcription model.
Streamlit for the interactive user interface framework.

VoiceWorker simplifies audio transcription tasks with speed and efficiency. Feel free to contribute or raise issues on the GitHub repository.
