Is there an option to only return the retrieved text without the llm participating in the response?
This seems very important, otherwise we have to wait for the LLM to generate for a long time every time we retrieve. It should be more straightforward.