Feature: tlm batch api #149
Conversation
```
@@ -46,21 +63,147 @@ def __init__(self, api_key: str, quality_preset: QualityPreset) -> None:
        self._quality_preset = quality_preset

        self._event_loop = asyncio.get_event_loop()
        self._query_semaphore = asyncio.Semaphore(max_concurrent_requests)
```
Do we want to limit concurrency with the asyncio semaphore? If I set it to a high value, e.g. 1000, I get the generic API error. Note: it doesn't hit the RateLimitError, which seems to be the expected one based on the error message suggesting to lower max_concurrent_requests. I'm not sure how likely it is that the typical user will set this value rather than just use the default.
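For reference, a minimal sketch of how an `asyncio.Semaphore` caps the number of in-flight requests at `max_concurrent_requests`; the `_make_request` helper and its sleep-based work are placeholders for illustration, not the actual client code:

```python
import asyncio

async def _make_request(prompt: str) -> str:
    # Hypothetical stand-in for the real TLM API call.
    await asyncio.sleep(0.1)
    return f"response to {prompt!r}"

async def query(semaphore: asyncio.Semaphore, prompt: str) -> str:
    # At most max_concurrent_requests coroutines get past this point at
    # once; the rest wait here until a slot frees up.
    async with semaphore:
        return await _make_request(prompt)

async def main(max_concurrent_requests: int = 16) -> None:
    semaphore = asyncio.Semaphore(max_concurrent_requests)
    prompts = [f"prompt {i}" for i in range(100)]
    results = await asyncio.gather(*(query(semaphore, p) for p in prompts))
    print(f"{len(results)} responses")

asyncio.run(main())
```

Even with the semaphore in place, a very large cap (like the 1000 above) can still overrun server-side limits, which would explain seeing a generic API error rather than RateLimitError.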
Running async code in a Jupyter notebook will require the following: https://github.com/erdewit/nest_asyncio
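A minimal sketch of that notebook workaround, assuming nest_asyncio is installed; the `demo` coroutine is a placeholder:

```python
import asyncio

import nest_asyncio

# Jupyter already runs an event loop, so a plain run_until_complete call
# raises "This event loop is already running". nest_asyncio patches the
# loop to allow re-entrant runs from notebook cells.
nest_asyncio.apply()

async def demo() -> str:
    await asyncio.sleep(0.1)
    return "done"

# Safe inside a notebook cell once nest_asyncio.apply() has been called.
result = asyncio.get_event_loop().run_until_complete(demo())
print(result)
```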
Adds `batch_prompt` and `batch_get_confidence_score` APIs to enable use of TLM at scale. Includes a refactor to async requests (enabling request concurrency).
Both batch methods are intended to gracefully handle query exceptions with retries. For rate-limit errors, the retry occurs after a wait specified by the backend; for other errors, an exponential backoff is applied.
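A rough sketch of that retry policy; the `RateLimitError` class with a `retry_after` attribute and the `_make_request` helper are illustrative assumptions, not the actual implementation:

```python
import asyncio
import random

class RateLimitError(Exception):
    # Hypothetical error type carrying the backend-specified wait.
    def __init__(self, retry_after: float) -> None:
        super().__init__(f"rate limited; retry after {retry_after}s")
        self.retry_after = retry_after

async def _make_request(prompt: str) -> str:
    # Hypothetical stand-in for the real TLM API call; fails randomly
    # here just to exercise both retry paths.
    if random.random() < 0.3:
        raise RateLimitError(retry_after=0.5)
    if random.random() < 0.1:
        raise RuntimeError("transient API error")
    return f"response to {prompt!r}"

async def query_with_retries(prompt: str, max_retries: int = 5) -> str:
    for attempt in range(max_retries):
        try:
            return await _make_request(prompt)
        except RateLimitError as e:
            # Rate-limit errors: wait exactly as long as the backend asks.
            await asyncio.sleep(e.retry_after)
        except Exception:
            # Other errors: exponential backoff with a little jitter.
            await asyncio.sleep(2 ** attempt + random.random())
    raise RuntimeError(f"query failed after {max_retries} retries")

print(asyncio.run(query_with_retries("hello")))
```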
See testing instructions: https://github.com/cleanlab/cleanlab-studio-backend/pull/988