
Containerized REST API for interacting with Hugging Face Faster Whisper models.

doppeltilde/automatic_speech_recognition


Automatic Speech Recognition utilizing Faster Whisper.


Installation

  • For ease of use, it's recommended to use the provided docker-compose.yml.

CPU Support: Use the latest tag.
services:
  automatic_speech_recognition:
    image: ghcr.io/doppeltilde/automatic_speech_recognition:latest
    ports:
      - "8000:8000"
    volumes:
      - models:/root/.cache/huggingface/hub:rw
    environment:
      - DEFAULT_ASR_MODEL_NAME
      - COMPUTE_TYPE
      - USE_API_KEYS
      - API_KEYS
    restart: unless-stopped

volumes:
  models:

NVIDIA GPU Support: Use the latest-cuda tag.

services:
  automatic_speech_recognition_cuda:
    image: ghcr.io/doppeltilde/automatic_speech_recognition:latest-cuda
    ports:
      - "8000:8000"
    volumes:
      - models:/root/.cache/huggingface/hub:rw
    environment:
      - DEFAULT_ASR_MODEL_NAME
      - COMPUTE_TYPE
      - USE_API_KEYS
      - API_KEYS
    restart: unless-stopped
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [ gpu ]

volumes:
  models:
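With either compose file in place, the stack can be started as usual. A minimal sketch (the service name below matches the CPU compose file above; substitute automatic_speech_recognition_cuda for the GPU variant):

```shell
# Start the container in the background; the image is pulled on first run.
docker compose up -d

# Follow the logs to watch the first-start model download.
docker compose logs -f automatic_speech_recognition
```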

  • Create a .env file and set the preferred values.
DEFAULT_ASR_MODEL_NAME=base
COMPUTE_TYPE=float16

# False == Public Access
# True == Access Only with API Key
USE_API_KEYS=False

# Comma-separated API keys
API_KEYS=abc,123,xyz
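The API-key settings behave like a simple allow-list: USE_API_KEYS toggles the check, and API_KEYS holds the accepted keys. As an illustration only (not the service's actual implementation), the values above could be interpreted like this:

```python
import os

# Example values matching the .env above (normally injected by docker compose).
os.environ["USE_API_KEYS"] = "True"
os.environ["API_KEYS"] = "abc,123,xyz"

# "True" enables key checking; anything else leaves the API public.
use_api_keys = os.environ.get("USE_API_KEYS", "False").lower() == "true"

# The comma-separated value becomes an allow-list of keys.
api_keys = [k.strip() for k in os.environ.get("API_KEYS", "").split(",") if k.strip()]

def is_authorized(key):
    """Hypothetical check: public when keys are disabled, otherwise the key must match."""
    return (not use_api_keys) or key in api_keys

print(is_authorized("abc"))   # a listed key is accepted
print(is_authorized("nope"))  # an unknown key is rejected
```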

Models

Any model designed for and compatible with faster-whisper should work.

Usage

Note

Please be aware that the initial request may take some time, as the model is downloaded first.

Tip

Interactive API documentation can be found at: http://localhost:8000/docs


Notice: This project was initially created for in-house use; as such, development is first and foremost aligned with internal requirements.