Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如果只需要简单的文本合成语音,可以吗? #131

Open
bk111 opened this issue Dec 11, 2024 · 2 comments
Open

如果只需要简单的文本合成语音,可以吗? #131

bk111 opened this issue Dec 11, 2024 · 2 comments

Comments

@bk111
Copy link

bk111 commented Dec 11, 2024

如果只需要简单的文本合成语音,可以吗?该怎么做?
如果只需要简单的文本合成语音并生成srt,可以吗?该怎么做?

@baiyin
Copy link

baiyin commented Dec 11, 2024

glm-4-voice 的 audio decoder 是 token-to-speech (token 是 audio token), 而不是 text-to-speech
所以直接用来做文本到语音感觉不太适合。
只是个人理解,不一定正确

@bk111
Copy link
Author

bk111 commented Dec 11, 2024

glm-4-voice 的 audio decoder 是 token-to-speech (token 是 audio token), 而不是 text-to-speech 所以直接用来做文本到语音感觉不太适合。 只是个人理解,不一定正确

它可能用的是cosyvoice来文本生语音的,但我直接用cosyvoice的演示,效果却很差。 就想着谁能把text_input直接送给它的cosyvoice模块。只要后半截文本转语音部分。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants