@synw Sorry for the off-topic question, but you might have some experience with this:
I'm testing Llama 2 models and finding them extremely slow, especially when the prompt includes a bit of context. At the beginning the CPU was at full load; now it's at around 10-15%, and a prediction takes about 30 minutes.
I assume it looks different to you, right?
I am wondering whether any projects already use this one.