[Help]: Sample code for TTA #336

Open
cpken opened this issue Nov 7, 2024 · 7 comments

@cpken

cpken commented Nov 7, 2024

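Sample code for TTA

The demo page at https://audit-demo.github.io/ shows many TTA examples, but the project only provides an example for text-to-audio generation. There are no examples for the other tasks, such as:

  • Adding: add another sound event to the input audio.
  • Dropping: remove one or more sound events from the input audio.
  • Replacement: replace one sound event in the input audio with another sound event.
  • Inpainting: complete a masked segment of the audio based on the context or a provided text description.
  • Super-resolution: this task can be viewed as completing the high-frequency information of a low-sample-rate input audio (converting a low-sample-rate input into a high-sample-rate output).

I hope more examples can be added. Thank you.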

@feifei788

Please describe the format of the valid.json file in the AudioCaps folder. Thanks!

@cpken
Author

cpken commented Nov 7, 2024

I could not find a valid.json file under an AudioCaps folder anywhere in the project.

@feifei788

The source code does not document the format of valid.json or include an example. Running it fails with: FileNotFoundError: [Errno 2] No such file or directory: 'data\AudioCaps\valid.json'
Could you post an example of a valid.json file?
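
A minimal sketch for confirming which metadata file is missing before launching training. It assumes only what the traceback shows, namely that the script looks for `valid.json` under `data/AudioCaps` relative to the working directory. The helper name `find_missing_metadata` is hypothetical and not part of Amphion, and the check says nothing about the file's expected format.

```python
from pathlib import Path


def find_missing_metadata(data_root, dataset="AudioCaps", names=("valid.json",)):
    """Return the expected metadata paths that do not exist on disk.

    The dataset folder and file name come from the error message above;
    the function itself is a hypothetical helper, not an Amphion API.
    """
    base = Path(data_root) / dataset
    return [base / name for name in names if not (base / name).is_file()]


if __name__ == "__main__":
    # Run from the directory where the training script is launched.
    for path in find_missing_metadata("data"):
        print(f"missing: {path}")
```

If the file is reported missing, the dataset preprocessing step has probably not been run yet, or it wrote its output somewhere other than `data/AudioCaps`.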

@cpken
Author

cpken commented Nov 7, 2024

Sorry, mate, I probably can't help with this. I'm not sure which example you are testing.

@feifei788

$ sh egs/tta/autoencoderkl/run_train.sh
I am testing the run_train.sh script under tta/autoencoderkl.

@cpken
Author

cpken commented Nov 7, 2024

I haven't tried running training yet. The training data seems to be obtained as described here: https://github.com/open-mmlab/Amphion/blob/main/egs/datasets/README.md

@yuantuo666 changed the title from "[Help]: TTA 的示例代码" to "[Help]: Sample code for TTA" on Nov 9, 2024
@yuantuo666
Collaborator

Hi, we prefer English issues so that more people worldwide can participate, and it is also easier to search for issues.

@HeCheng0625 Could you help with this?

4 participants