Ideal parameters when serving only one client? #1268
Unanswered
andysalerno asked this question in Q&A
Replies: 0 comments
Hi,
I'm using TGI to serve up an assistant, for which I'm the only user.
TGI might be a bit overkill for this, since it is geared toward efficiently serving many clients. But it's so easy to set up, and so stable, that it's working beautifully for this scenario :)
So my question is, if I know I am only serving one request at a time, what would be the ideal parameters?
I'm guessing these:
but I'm not sure whether there are other considerations or anything else I could do.