基于CUDA11.8的 glm4voice 服务(vllm版) 部署参考 #123

baiyin · 2024-12-05T08:58:42Z

cuda 11.8 版本上搭建 glm4voice，踩了些坑，供参考

https://github.com/baiyin/baiyin.github.io/blob/main/_posts/2024-12-05-deploy-glm4voice-vllm-server-for-cuda118.md

sixsixcoder · 2024-12-09T09:48:11Z

感谢您的贡献，有问题随时提issues

wang-TJ-20 · 2025-01-14T05:25:53Z

cuda 11.8 版本上搭建 glm4voice，踩了些坑，供参考

https://github.com/baiyin/baiyin.github.io/blob/main/_posts/2024-12-05-deploy-glm4voice-vllm-server-for-cuda118.md

@baiyin ,hi,有试过对cosyvoice2.0进行vllm 加速吗，看到cosyvocie2.0的llm模块更换为了qwen0.5

changqingla · 2025-01-22T03:19:50Z

请问实测vllm版部署会加快推理速度吗

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

基于CUDA11.8的 glm4voice 服务(vllm版) 部署参考 #123

基于CUDA11.8的 glm4voice 服务(vllm版) 部署参考 #123

baiyin commented Dec 5, 2024

sixsixcoder commented Dec 9, 2024

wang-TJ-20 commented Jan 14, 2025

changqingla commented Jan 22, 2025

基于CUDA11.8的 glm4voice 服务(vllm版) 部署参考 #123

基于CUDA11.8的 glm4voice 服务(vllm版) 部署参考 #123

Comments

baiyin commented Dec 5, 2024

sixsixcoder commented Dec 9, 2024

wang-TJ-20 commented Jan 14, 2025

changqingla commented Jan 22, 2025