[feat] Add jamba support #63

Open
yundai424 opened this issue Aug 24, 2024 · 0 comments · May be fixed by #102

@yundai424
Collaborator

🚀 The feature, motivation and pitch

Model code: https://github.com/huggingface/transformers/blob/main/src/transformers/models/jamba/modeling_jamba.py

It might also be interesting to see how a Triton implementation of the mixer forward compares to the existing CUDA forward 🤔

Alternatives

No response

Additional context

No response

@yundai424 yundai424 added the enhancement (New feature or request) label Aug 24, 2024
@ByronHsu ByronHsu added feature and removed enhancement (New feature or request) labels Aug 24, 2024
@ByronHsu ByronHsu assigned ByronHsu and unassigned ByronHsu Aug 26, 2024
@ByronHsu ByronHsu linked a pull request Aug 26, 2024 that will close this issue
3 tasks
@yundai424 yundai424 linked a pull request Aug 26, 2024 that will close this issue
3 tasks
2 participants