How Much Audio Data Is Needed for Fine-Tuning Voice Tone? #838

JiangNanDream · 2025-01-18T14:47:09Z

I fine-tuned on about 20 minutes of audio, then tested the model with merged weights at 200, 5,500, and 10,000 steps. However, the output was worse than the original model's and did not even sound like natural speech.
I want the model to generate a voice tone and pitch that match my real voice more closely. How much audio data is needed for that?

Replies: 0 comments
