Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vulkan Backend Support for improved device compatibility #124

Closed
FruityWelsh opened this issue May 5, 2023 · 3 comments
Closed

Vulkan Backend Support for improved device compatibility #124

FruityWelsh opened this issue May 5, 2023 · 3 comments
Assignees
Labels
enhancement New feature or request fixed-in-next-release good first issue Good for newcomers

Comments

@FruityWelsh
Copy link

Please describe the feature you want
Support for the pytorch vulkan backend so that older nvidia gpus, as well as intell, amd, and some phone gpus can be supported.
https://pytorch.org/tutorials/prototype/vulkan_workflow.html

Additional context
Personally ran into difficulties testing this project, because my laptop is too old to support Nvidia, and my cloud accounts aren't authorized to deploy GPU compute. I imagine I am not the only one limited on working on this project by these kinds of lim facs.


Please reply with a 👍 if you want this feature.

@FruityWelsh FruityWelsh added the enhancement New feature or request label May 5, 2023
@wsxiaoys
Copy link
Member

related:webonnx/wonnx#169

@wsxiaoys
Copy link
Member

The recent merges of two Vulkan-based implementations in llama.cpp are as follows:

  1. The first merge: ggerganov/llama.cpp@2307523
  2. The second merge: ggerganov/llama.cpp@fbf1dde

We are quite interested in integrating one of these implementations into Tabby. For example, by adding a vulkan device option, similar to the metal option, we can make Tabby compatible with certain GPUs, such as the Intel Arc.

@wsxiaoys
Copy link
Member

wsxiaoys commented Mar 6, 2024

Note this is done in #1588 and a vulkan binary has been added in https://github.com/TabbyML/tabby/actions/runs/8179533225, consider give it a try!

cc @boxbeam

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request fixed-in-next-release good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

3 participants