Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

为啥我生成的音频文件和视频的时间对不上 #13

Open
Cqy1995 opened this issue May 20, 2024 · 1 comment
Open

为啥我生成的音频文件和视频的时间对不上 #13

Cqy1995 opened this issue May 20, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@Cqy1995
Copy link

Cqy1995 commented May 20, 2024

image demo的视频时间是30s,生成后的字幕文件是57秒 image
@buxuku buxuku added the bug Something isn't working label May 20, 2024
@buxuku
Copy link
Owner

buxuku commented May 20, 2024

whisper 的文本生成比较依赖于上下文,这个视频我是通过 ffmpeg 截取了 30s, 这可能是视频导致出来的问题。当然, whisper 也确实会存在时间轴对不上的情况,我将尝试寻找更精确的输出方案。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants