Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Onnx - Consider as way to share models #2

Open
cjerzak opened this issue Mar 8, 2024 · 4 comments
Open

Onnx - Consider as way to share models #2

cjerzak opened this issue Mar 8, 2024 · 4 comments

Comments

@cjerzak
Copy link
Owner

cjerzak commented Mar 8, 2024

https://en.wikipedia.org/wiki/Open_Neural_Network_Exchange

https://huggingface.co/blog/convert-transformers-to-onnx

@cjerzak cjerzak changed the title Onnx - Share model Onnx - Consider as way to share models Apr 18, 2024
@beniaminogreen
Copy link
Collaborator

Hi Connor,

Did you manage to look into this? I am thinking of experimenting with this rust wrapper for the ONNX runtime as a way to provide fast inference in R for ML models without needing a python install. If we are able to export the models you have built using the ONNX format, we might be able to quickly perform inference using this approach.

Best,
Ben

@cjerzak
Copy link
Owner Author

cjerzak commented Jul 10, 2024

true. i haven't taken a close look but will do so later this/next week! we do want to simplify the inference pipeline...

@beniaminogreen
Copy link
Collaborator

Just got back from travels and after a long time working out the compilation + some dependency steps, I am excited to report that I am able to deploy GPT-2 on my laptop using this approach.

Seems like this could be a very promising avenue for deploying our models if they are easy to translate into the appropriate format. Ideally, we could pre-process everything in R, then pass the data to the ONNX runtime to do the difficult steps efficiently. Will provide updates as I make more progress.

@cjerzak
Copy link
Owner Author

cjerzak commented Jul 23, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants