
1s Latency Definition #14

Open
RuchirB opened this issue Feb 9, 2024 · 6 comments

Comments

@RuchirB commented Feb 9, 2024

Tried out the project, very impressed. Thanks for open sourcing. Quick question on latency.

Noticed a minimum latency of at least 3-4s. I am measuring latency as delay between when the human speaks and when the AI responds. This was with everything deployed on fly.io in Ashburn using the exact demo as instructed.

Looks like the biggest bottleneck is the request from Twilio -> Fly.io and Fly.io -> Twilio. Second biggest bottleneck looks like transcription via deepgram.

The ReadMe suggests a latency of 1s—can you clarify the definition of latency here? Is that just looking at gpt response + TTS?

Any ideas on how to reduce latency? Is there a roadmap for this project we can follow somewhere?
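In case it helps with debugging, here is a minimal per-stage timing sketch; the run_stt / run_llm / run_tts functions are hypothetical stand-ins for this repo's actual STT, GPT, and TTS hooks, not its real API:

```python
import time
from contextlib import contextmanager

timings = {}

@contextmanager
def timed(stage):
    """Record the wall-clock duration of one pipeline stage."""
    start = time.monotonic()
    try:
        yield
    finally:
        timings[stage] = time.monotonic() - start

# Stand-ins so the sketch runs; replace with the project's real calls.
def run_stt(audio): time.sleep(0.05); return "hello"
def run_llm(text):  time.sleep(0.05); return "hi there"
def run_tts(text):  time.sleep(0.05); return b"\x00" * 160

with timed("stt"):
    transcript = run_stt(b"...")
with timed("llm"):
    reply = run_llm(transcript)
with timed("tts"):
    audio = run_tts(reply)

total = sum(timings.values())
print({k: f"{v * 1000:.0f} ms" for k, v in timings.items()},
      f"total={total * 1000:.0f} ms")
```

Logging these on every turn makes it easy to see whether the network hops (Twilio <-> Fly.io) or the STT/TTS services dominate.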

@ansario commented Mar 8, 2024

You could use the gpt-4-turbo-preview (or gpt-3.5-turbo) model for a small speed boost.
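Streaming the completion can also cut perceived latency, since TTS can start on the first tokens instead of waiting for the full reply. A minimal sketch with the openai Python SDK (the prompt and model choice here are illustrative):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Stream tokens so downstream TTS can begin before the reply is complete.
stream = client.chat.completions.create(
    model="gpt-3.5-turbo",  # or "gpt-4-turbo-preview" for higher quality
    messages=[{"role": "user", "content": "Say hello in one short sentence."}],
    stream=True,
)

for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)  # hand these tokens to TTS instead
print()
```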

@ANIL-KADURKA

I am achieving the same latency. I am using Groq for faster inference, but the latency is still around 4s. STT and TTS are taking the most time: STT takes about 1.5s and TTS about 1.2s. Please help me out with the best configuration for Deepgram (or whatever else). How are you getting 1s?
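In case it helps, Deepgram's streaming endpoint exposes a couple of parameters that directly affect latency: interim_results streams partial transcripts immediately, and endpointing controls how many milliseconds of silence count as end-of-speech. A rough sketch of a raw WebSocket connection tuned for Twilio-style mulaw audio; the endpointing value of 300 is a starting point to tune, not a verified optimum:

```python
import asyncio
import json
import os
import websockets  # pip install websockets

DG_URL = (
    "wss://api.deepgram.com/v1/listen"
    "?encoding=mulaw&sample_rate=8000&channels=1"  # Twilio media format
    "&interim_results=true&endpointing=300"        # latency-related knobs
)

async def transcribe(audio_chunks):
    headers = {"Authorization": f"Token {os.environ['DEEPGRAM_API_KEY']}"}
    # NOTE: keyword is extra_headers in websockets<14, additional_headers later.
    async with websockets.connect(DG_URL, extra_headers=headers) as ws:
        async def sender():
            for chunk in audio_chunks:
                await ws.send(chunk)
            await ws.send(json.dumps({"type": "CloseStream"}))
        asyncio.ensure_future(sender())
        async for message in ws:
            result = json.loads(message)
            print(result)  # check is_final / speech_final for turn-taking

# asyncio.run(transcribe(list_of_mulaw_chunks))
```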

@devsalman247

@RuchirB Same case here... I am using Groq instead of OpenAI GPT models, but I am still experiencing some delay between the human speaking and receiving the response audio packets from Twilio...

@badereddineqodia

I think using OpenAI's Realtime API is now the way to go, as it eliminates the intermediate services that would otherwise add latency.
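For anyone exploring that route, here is a bare-bones sketch of opening a Realtime session over WebSocket; the model name and event shapes match the public docs at the time of writing, but check the current docs before relying on them:

```python
import asyncio
import json
import os
import websockets  # pip install websockets

URL = "wss://api.openai.com/v1/realtime?model=gpt-4o-realtime-preview"

async def main():
    headers = {
        "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
        "OpenAI-Beta": "realtime=v1",
    }
    # NOTE: keyword is extra_headers in websockets<14, additional_headers later.
    async with websockets.connect(URL, extra_headers=headers) as ws:
        # Ask the server for a spoken + text response in one round trip.
        await ws.send(json.dumps({
            "type": "response.create",
            "response": {
                "modalities": ["audio", "text"],
                "instructions": "Greet the caller briefly.",
            },
        }))
        async for raw in ws:
            event = json.loads(raw)
            print(event["type"])  # e.g. response.audio.delta
            if event["type"] == "response.done":
                break

asyncio.run(main())
```

Since speech-to-speech happens inside one model, the STT -> LLM -> TTS handoffs (and their network hops) disappear, which is where most of the 3-4s was going.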

@boxed-dev

I created an application using the OpenAI Realtime API with function calling and Twilio integration, but it seems too rigid and robotic. Additionally, I find it too expensive to be feasible for real-world use, at least for now.

@devsalman247

Try ElevenLabs Conversational AI.
