Skip to content

Sound Studio

noco-ai edited this page Feb 17, 2024 · 11 revisions

Text to Speech

The text to speech UI allows you to experiment directly with TTS model that Spell Book is running. The UI allows for each user to store generated sound files for later review and download. xTTS supports using ASR voice samples for voice generation.

image

  • #1 Text Prompt Text to turn into speech
  • #2 Selected Model Dropdown to select TTS generation model
  • #3 Current waveform Waveform of the currently loaded sound file
  • #4 Current WAV data Information and controls for currently loaded file
  • #5 Delete WAV Delete WAV file from the server
  • #6 Download WAV Download WAV file for TTS from server
  • #7 Load WAV Load and play the WAV file
  • #8 Advanced Settings Update generation voice used for each model

Speech Recognition

image

Music Generation

image

Clone this wiki locally