
ChatPlugin response time too slow #1234

Open
nurkmez2 opened this issue Dec 12, 2024 · 1 comment

Comments


nurkmez2 commented Dec 12, 2024

Describe the bug
The following functions in https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs take too long to complete.
Model: GPT-4o on Azure

- GetAudienceAsync = 22965 ms

https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L363

- ExtractChatHistory = 9623 ms

https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L111

- GetUserIntentAsync = 10615 ms

https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L406

To Reproduce
Steps to reproduce the behavior:
Run the web API app and ask a question with context.

Expected behavior
Faster response time


Platform

  • Windows
  • Visual Studio, VS Code
  • Language: C#, JS
  • Source: latest version

Additional context

  • What can be done to improve the response time of these functions?
  • How can ExtractChatHistory, GetAudienceAsync, and GetUserIntentAsync be made more efficient with Semantic Kernel?
@imsharukh1994
Contributor

  1. Asynchronous Programming:
    Ensure all functions are asynchronous to avoid blocking the main thread.

  2. Cache Results:
    Cache frequently accessed data such as audience information and chat history to avoid redundant calculations.
    Use in-memory caching (e.g., MemoryCache or Redis) for storing repeated data.

  3. Optimize API Calls:
    For functions making API calls to GPT-4, use parallel requests to reduce wait time.
    Use batching for multiple requests, or streaming responses for large results.

  4. Database Optimization:
    For ExtractChatHistory:
    - Index your database on frequently queried fields (e.g., user_id, timestamp).
    - Use pagination to fetch only the necessary data instead of the entire history.

  5. Pre-process Input Locally:
    For GetUserIntentAsync:
    - Clean and pre-process the input before sending it to GPT-4.
    - Cache common intents to avoid recalculating for repeated queries.

```csharp
// Sketch using System.Runtime.Caching; GetIntentFromAPIAsync, FetchAudienceData,
// and AudienceData are placeholder names for illustration.
private static readonly MemoryCache Cache = MemoryCache.Default;

public async Task<string> GetUserIntentAsync(string input)
{
    // Return a cached intent for repeated queries
    if (Cache.Get($"intent_{input}") is string cachedIntent)
        return cachedIntent;

    // Use Semantic Kernel or GPT-4 with asynchronous processing
    var intent = await GetIntentFromAPIAsync(input);

    // Cache the result for subsequent calls
    Cache.Set($"intent_{input}", intent, DateTimeOffset.UtcNow.AddMinutes(10));

    return intent;
}

public async Task<AudienceData> GetAudienceAsync()
{
    // Parallelize API calls if there are several sources
    var tasks = new List<Task<AudienceData>> { Task.Run(() => FetchAudienceData()) };
    var results = await Task.WhenAll(tasks);
    return results[0];
}
```
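To make point 4 concrete, here is a rough sketch of paged chat-history retrieval, assuming Entity Framework Core; `ChatMessage`, `_messages`, and `GetRecentMessagesAsync` are hypothetical names, not the actual chat-copilot types:

```csharp
// Hypothetical sketch: fetch one page of recent messages instead of the full history.
// _messages is assumed to be an EF Core DbSet<ChatMessage>.
public async Task<List<ChatMessage>> GetRecentMessagesAsync(
    string chatId, int skip = 0, int take = 30)
{
    return await _messages
        .Where(m => m.ChatId == chatId)       // benefits from an index on ChatId
        .OrderByDescending(m => m.Timestamp)  // and on Timestamp for fast ordering
        .Skip(skip)                           // fetch only the requested page
        .Take(take)
        .ToListAsync();
}
```

Combined with an index on `(ChatId, Timestamp)`, this keeps the query cost roughly proportional to the page size rather than the full history length.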
