Skip to content

gpapadopp/voice-recognition-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Transcript Generator

Table of Contents

About

The Audio Transcript Generator is a Python application that takes an MP3 or WAV audio file as input, processes the audio, and outputs the transcribed text of the audio content. This project leverages the power of speech recognition to convert spoken words into written text.

Demo

Features

  • Transcribe audio files (MP3/WAV) to text.
  • Easy-to-use command-line interface.
  • Supports multiple languages for transcription.
  • Customizable output options.

Getting Started

Prerequisites

  • Python 3.6 or higher
  • FFmpeg (required for processing audio files)

Installation

  1. Clone this repository to your local machine:

    git clone https://github.com/gpapadopp/voice-recognition-python.git
  2. Navigate to the project directory:

    cd voice-recognition-python
  3. Install the required Python packages:

    pip install -r requirements.txt

Usage

  1. Place your MP3 or WAV audio file in the project directory.

  2. Open a the main.py file of the project, go to the last row, and place the file name of your MP3 or WAV audio file.

  3. The transcription will be displayed in the terminal.

Contributing

Contributions are welcome! If you'd like to contribute to this project, please follow these steps:

  1. Fork the repository.
  2. Create a new branch for your feature/bugfix: git checkout -b feature-name.
  3. Make your changes and commit them: git commit -m "Description of changes".
  4. Push your changes to your fork: git push origin feature-name.
  5. Create a pull request detailing your changes.

License

This project is licensed under the MIT License.

Contact

For questions or inquiries, please contact [email protected].

About

Voice Recognition Software with Python

Resources

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages