Optimize LLM usage #40

vivekuppal · 2023-07-26T17:31:49Z

Do not ping LLM if we do not need to. Previously we were always pinging LLM even when response suggestions were off. This was causing unnecessary costs of LLM and delays in UI updates.

Partial refactoring work towards separating conversation into its own object towards forward looking features.
Make OpenAI model configurable in parameters.yaml so it is easier for end user to change it without touching code. With new models like chatgpt 4 being available, end users should be able to change models easily.

…ng LLM even when response suggestions were off. Partial refactoring work towards separating the conversation object into its own entity.

… change for end user.

AudioTranscriber.py

Do not ping LLM if we do not need to. Previously we were always pingi…

fb492d1

…ng LLM even when response suggestions were off. Partial refactoring work towards separating the conversation object into its own entity.

vivekuppal self-assigned this Jul 26, 2023

Make openai model configurable using parameters.yaml so it is easy to…

9d25d86

… change for end user.

vivekuppal changed the title ~~DRAFT: Optimize LLM usage~~ Optimize LLM usage Jul 26, 2023

abhinavuppal1 reviewed Jul 26, 2023

View reviewed changes

AudioTranscriber.py Show resolved Hide resolved

abhinavuppal1 approved these changes Jul 26, 2023

View reviewed changes

vivekuppal merged commit 26cfaad into main Jul 26, 2023
2 checks passed

vivekuppal deleted the vu-optimize-llm-responses branch July 26, 2023 18:39

vivekuppal added the bug Something isn't working label Aug 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize LLM usage #40

Optimize LLM usage #40

vivekuppal commented Jul 26, 2023 •

edited

Loading

Optimize LLM usage #40

Optimize LLM usage #40

Conversation

vivekuppal commented Jul 26, 2023 • edited Loading

vivekuppal commented Jul 26, 2023 •

edited

Loading