Display correct progress bar when resuming with batch #197

RyanMarten · 2024-12-04T03:13:21Z

Fixes #196

vutrung96 · 2024-12-04T17:58:33Z

src/bespokelabs/curator/request_processor/openai_batch_request_processor.py

-                self.remaining_batch_ids.remove(batch_id)
-                response_files_found += 1
-        if response_files_found > 0:
+        tasks = [self.resume(batch_id, all_response_files) for batch_id in self.remaining_batch_ids]


can we rename this to .update_finished_batches or something more descriptive? the current name confuses me a little, since i thought it was actually resuming the batch downloads.

vutrung96 · 2024-12-04T18:02:02Z

src/bespokelabs/curator/request_processor/openai_batch_request_processor.py

-            pbar.n += self.tracker.n_completed_in_progress_requests
-            pbar.n += self.tracker.n_failed_in_progress_requests
-            pbar.refresh()
+            self.pbar.n = 0


we should also do this pbar update outside of the loop. otherwise i think the log is a bit confusing when all batches hit the cache and the loop never gets triggered:

Completed OpenAI requests in batches: 0%| | 0/3 [00:00<?, ?request/s]
2024-12-04 17:59:40,228 - bespokelabs.curator.request_processor.openai_batch_request_processor - INFO - File /home/trung/.cache/curator/47f1b0209c82f62b/responses_2.jsonl found for batch batch_67509878d7f48191882c1c055f27a8e1, skipping status check and download.
2024-12-04 17:59:40,229 - bespokelabs.curator.request_processor.openai_batch_request_processor - INFO - File /home/trung/.cache/curator/47f1b0209c82f62b/responses_1.jsonl found for batch batch_67509878f4a48191b133673cfddc8f27, skipping status check and download.
2024-12-04 17:59:40,230 - bespokelabs.curator.request_processor.openai_batch_request_processor - INFO - File /home/trung/.cache/curator/47f1b0209c82f62b/responses_0.jsonl found for batch batch_675098793790819197d298dc42ace1e1, skipping status check and download.
2024-12-04 17:59:40,233 - bespokelabs.curator.request_processor.openai_batch_request_processor - INFO - Found 3 out of 3 completed batches, resuming polling for the remaining 0 batches.
Completed OpenAI requests in batches: 0%| | 0/3 [00:00<?, ?request/s]
2024-12-04 17:59:40,240 - bespokelabs.curator.request_processor.base_request_processor - INFO - Using existing dataset file /home/trung/.cache/curator/47f1b0209c82f62b/ef46db3751d8e999.arrow

note how Completed OpenAI requests is 0% even though all batches hit the cache because we never actually entered the loop
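
To illustrate the suggestion, here is a minimal sketch of setting the progress count before the polling loop rather than inside it. Everything in it is hypothetical and not taken from the curator codebase: `FakeProgressBar` is a stand-in exposing only the `n` and `total` attributes of a tqdm-style bar, and `resume_progress`, `cached_counts`, and `poll_remaining` are names invented for the example.

```python
from dataclasses import dataclass


@dataclass
class FakeProgressBar:
    """Hypothetical stand-in for a tqdm-style bar: only `n` and `total`."""

    total: int
    n: int = 0


def resume_progress(pbar, cached_counts, poll_remaining):
    """Count requests from cached batches before polling the rest.

    cached_counts: request counts of batches whose response files exist.
    poll_remaining: iterable yielding request counts as live batches finish.
    """
    # Set the bar from the cache first, outside the polling loop, so it is
    # correct even when there is nothing left to poll.
    pbar.n = sum(cached_counts)

    # This loop body never runs when every batch was found in the cache.
    for finished in poll_remaining:
        pbar.n += finished
    return pbar


# All three (assumed single-request) batches hit the cache.
bar = resume_progress(FakeProgressBar(total=3), [1, 1, 1], [])
print(f"{bar.n}/{bar.total}")
```

With the update placed before the loop, the fully cached case reports all requests as completed instead of staying at 0, which is the behavior the comment above asks for.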

RyanMarten · 2024-12-04T19:57:20Z

ah feel like this should just be an in memory database

vutrung96 · 2024-12-04T20:06:19Z

why in-memory database? i think the current logic works fine, just needs that one fix?

RyanMarten · 2024-12-04T20:07:53Z

Yeah, it's not necessary. It's just that even I am getting confused by all the places where you need to keep track of everything.
I'll send you the fixed version shortly

RyanMarten · 2024-12-05T01:09:04Z

Addressing this now in #198

fix batch pbar on resume

7cfbfef

RyanMarten requested review from vutrung96 and CharlieJCJ December 4, 2024 03:13

RyanMarten mentioned this pull request Dec 4, 2024

Allow user to switch keys during batch and resume #198

Merged

vutrung96 reviewed Dec 4, 2024

RyanMarten closed this Dec 5, 2024

RyanMarten deleted the ryam/batch-pbar-resume branch December 5, 2024 01:09
