Skip to content

This project provides a simplified API for interacting with the Yandex TTS (Text-to-Speech) service, part of Yandex Cloud's Speech Kit. The API allows users to synthesize speech from text input and offers options to customize the voice, speech rate, language, and output file name.

Notifications You must be signed in to change notification settings

HELLPUSYY666/YandexCloudsyntesizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Yandex TTS API

Overview

This project provides a simplified API for interacting with the Yandex TTS (Text-to-Speech) service, part of Yandex Cloud's Speech Kit. The API allows users to synthesize speech from text input and offers options to customize the voice, speech rate, language, and output file name.

Features

Text Synthesis: Accepts text input for speech synthesis. Voice Options: Allows selection of the voice of the speaker. Speech Rate: Provides control over the speed of speech. Language Support: Enables selection of the language for synthesis. Audio Output: Returns a download link for the synthesized audio file in WAV format (8 kHz, 128-bit PCM, Mono). DynDNS Access: The API is accessible via DynDNS or similar services. Requirements

Docker Installation

Clone the repository: git clone cd Build the Docker image: sudo docker build -t yandex_tts_api . Run the Docker container: sudo docker run -p 80:5000 --rm --name yandex_tts_api yandex_tts_api API Usage

Endpoint: /synthesize Method: POST Request Body:{ "text": "Hello, world!", "voice": "oksana", "speech_rate": "1.0", "language": "ru-RU", "output_file_name": "output.wav" } Response: { "download_link": "http://example.com/path/to/output.wav" } License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Yandex Cloud Documentation for TTS service details.

About

This project provides a simplified API for interacting with the Yandex TTS (Text-to-Speech) service, part of Yandex Cloud's Speech Kit. The API allows users to synthesize speech from text input and offers options to customize the voice, speech rate, language, and output file name.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published