Add support for StarCoder models #18

Open
osanseviero opened this issue Aug 17, 2023 · 4 comments
Labels
enhancement New feature or request

Comments

@osanseviero

Hi there! StarCoder from BigCode was trained for this kind of task, so having some documentation/support for it would be great.

Very nice project btw 🔥

@ishaan-jaff

@osanseviero working on this PR for your issue: #19

@silvanmelchior
Owner

Thanks!

Regarding StarCoder: was this fine-tuned to work in chat-style interactions? When I first looked at it, it seemed to be meant mostly for coding / reading comments only.
Also, given the size of the model, I'm not sure it can work well in this setup.

If someone can demo that it works well, however, I'm very open to adding it!

@silvanmelchior added the enhancement (New feature or request) and question (Further information is requested) labels on Aug 19, 2023
@osanseviero
Author

There are multiple BigCode models (1.1B, 3B, 7B, and 15B), so there are indeed smaller versions. As for which model to use, it depends on the exact behaviour one wants. https://twitter.com/lvwerra/status/1691127139314159628 is a good explanation.

@silvanmelchior
Owner

silvanmelchior commented Aug 22, 2023

I tried it out a bit on HF, looks nice!

Definitely something to consider, or maybe just a better general explanation of how to add more models.

@silvanmelchior removed the question (Further information is requested) label on Aug 22, 2023
3 participants