Replies: 1 comment 3 replies
-
@npredey does this happen for a specific gemini model or for all gemini models? is it the same when you use curl/fetch directly against the gemini api? |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am using the AI SDK to generate a chat response like this:
However, even on a simple "hello", the gemini model takes anywhere from 6-12 seconds to respond. In the GCP Console, this takes < 30 ms. I understand there may be some latency, but is there anything that can be done to improve this performance? Other models I've tested locally have no performance issues.
Thank you!
Beta Was this translation helpful? Give feedback.
All reactions