Skip to content

I have enabled CACHE=1, there is a long wait in the middle of the conversation #447

Open
@abc20220327

Description

@abc20220327

I have enabled CACHE=1, the API I use, in the process of continuous dialogue, every time I ask a question, I need to wait a lot of time, whether the model will be reloaded every time the complete call is made, why is it so slow?

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions