Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

【Help】请求MLX项目帮助 #1

Open
southkorea2013 opened this issue Nov 20, 2024 · 0 comments
Open

【Help】请求MLX项目帮助 #1

southkorea2013 opened this issue Nov 20, 2024 · 0 comments

Comments

@southkorea2013
Copy link

@AtomicVar 你好,我看到你这边之前贡献过MLX的代码。我们这边有一个客户的紧急MLX相关的项目,和视频推理相关的(Awni这边说需要用到fused attention。但是Awni最近没有时间做,不过客户的项目时间很紧),是否可以留一个邮箱或者手机号码方便我们联系?我的邮箱是:[email protected]

下面是Awni和我的聊天记录:
Hi Nan, there isn’t really a way to do that right now. But the good news is we already have a fused attention in the works which should dramatically reduce the memory use requirements. Unfortunately it’s not out yet and may yet take a few weeks to get out.

Nan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant