-
Notifications
You must be signed in to change notification settings - Fork 154
Issues: microsoft/onnxruntime-genai
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
onnx.checker.check_model("phi-3.5-v-instruct-vision.onnx") Error
#1276
opened Feb 24, 2025 by
jason-engage
Support for DeepSeek-R1-Distilled-NPU-Optimized model that comes with AI Toolkit
#1258
opened Feb 15, 2025 by
max-krasnyansky
C examples refer to 0.4.0 in rel-0.5.2
documentation and samples
Improvements or additions to documentation
#1157
opened Dec 17, 2024 by
natke
MAIN BRANCH CONTAINS API CHANGES
0.6.0
bug
Something isn't working
question
Further information is requested
release
#1142
opened Dec 11, 2024 by
aciddelgado
.Net How to free GPU memory after each inference
bug
Something isn't working
enhancement
New feature or request
performance
#1131
opened Dec 9, 2024 by
strikene
0.5.2 DML 2x to 4x Slower than 0.4.0 (Big regression)
ep:DML
performance
#1114
opened Dec 3, 2024 by
elephantpanda
0.5.2 GPU crashes if initial input is 360 zeros.
crash
ep:DML
#1113
opened Dec 3, 2024 by
elephantpanda
Bug DMLFusedNode_0_0 on second token in 0.5.2 (DML) (Wrong tensor shape)
ep:DML
#1112
opened Dec 3, 2024 by
elephantpanda
.Net After updating to .5, Phi3.5Mini outputs some meaningless characters
model quality
#1109
opened Nov 30, 2024 by
strikene
onnxruntime-genai
generation speed very slow on int4
performance
#1098
opened Nov 23, 2024 by
tarekziade
awq example runs into error with llama 3.2 3b due to embedding layer
documentation and samples
Improvements or additions to documentation
ep:DML
#1089
opened Nov 22, 2024 by
tranlm
Previous Next
ProTip!
Follow long discussions with comments:>50.