How to use rate limit in add_routes #683

kientv · 2024-06-14T10:05:51Z

@eyurtsev
I have two problems:

1. Handling the rate limits of OpenAI and other LLM providers.
2. Rate limiting requests on the LangServe side (add_routes).

How can I do this with LangServe?

Thanks in advance.


eyurtsev · 2024-06-26T16:03:42Z

eyurtsev
Jun 26, 2024
Maintainer

The easiest way is to prepend a RunnableLambda to your runnable. That RunnableLambda should be a passthrough that implements a rate-limiting algorithm; the simplest choice is a token bucket. We haven't built support for this into the framework yet. In the meantime, you can adapt the code here: https://github.com/langchain-ai/langchain-benchmarks/blob/main/langchain_benchmarks/rate_limiting.py#L90
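
For readers who want a starting point, here is a minimal sketch of that idea. It is not the code from the linked file. The names `TokenBucket` and `rate_limited_passthrough` and the chosen rate and capacity are assumptions made for illustration. The bucket uses only the standard library; the LangServe wiring is shown in comments because it assumes `app` and `chain` are defined elsewhere.

```python
import threading
import time


class TokenBucket:
    """Thread-safe token bucket (hypothetical helper, not a LangChain class).

    Tokens refill continuously at `rate` per second, up to `capacity`.
    Each request consumes one token.
    """

    def __init__(self, rate: float, capacity: float) -> None:
        if rate <= 0 or capacity < 1:
            raise ValueError("rate must be > 0 and capacity must be >= 1")
        self.rate = rate
        self.capacity = capacity
        self._tokens = capacity  # start full so an initial burst is allowed
        self._last = time.monotonic()
        self._lock = threading.Lock()

    def try_acquire(self) -> bool:
        """Take one token if available; return False instead of waiting."""
        with self._lock:
            now = time.monotonic()
            elapsed = now - self._last
            self._last = now
            # Add the tokens earned since the last call, capped at capacity.
            self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
            if self._tokens >= 1:
                self._tokens -= 1
                return True
            return False

    def acquire(self, poll_interval: float = 0.01) -> None:
        """Block the calling thread until a token is available."""
        while not self.try_acquire():
            time.sleep(poll_interval)


# Assumed limits for illustration: about 5 requests/second, bursts of up to 5.
bucket = TokenBucket(rate=5, capacity=5)


def rate_limited_passthrough(value):
    """Wait for a token, then return the input unchanged."""
    bucket.acquire()
    return value


# Hypothetical wiring (assumes langchain-core and langserve are installed,
# and that `app` and `chain` are defined elsewhere):
#
#   from langchain_core.runnables import RunnableLambda
#   from langserve import add_routes
#
#   add_routes(app, RunnableLambda(rate_limited_passthrough) | chain, path="/chain")
```

Because the passthrough sits in front of the chain, it throttles both incoming requests on the route and the downstream LLM calls they trigger. Note that this sketch delays requests rather than rejecting them, and the bucket is per process, so several server workers would each get their own limit.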
