WIP: Add RPS based load option #65

dagrayvid · 2024-10-21T00:53:35Z

No description provided.

sjmonson · 2024-10-21T16:16:32Z

user.py

@@ -59,17 +63,48 @@ def _init_user_process_logging(self):
        self.logger = logging.getLogger("user")
        return logging.getLogger("user")

+    def _user_loop(self, test_end_time):
+        while self.stop_q.empty():


Just a thought for the future. What if we have the main process SIGTERM (or SIGUSR1) the subprocesses as a stop message and write a custom signal handler to clean up?

Yeah totally, that's long overdue. I was looking into it and should work on it sometime.
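
For reference, a minimal sketch of what the signal-based stop could look like on the subprocess side. Everything here is hypothetical (`StopRequested`, `install_stop_handler`, and `user_loop` are illustrative names, not code from this PR), and it assumes a POSIX platform where `SIGUSR1` exists. The handler only flips a flag; the loop checks it between requests and then cleans up.

```python
import os
import signal
import time


class StopRequested:
    """Flag flipped by the signal handler and checked by the work loop."""

    def __init__(self):
        self.stop = False

    def handle(self, signum, frame):
        # Keep the handler tiny: record the request and return.
        self.stop = True


def install_stop_handler(signals=(signal.SIGTERM, signal.SIGUSR1)):
    """Register one handler for every stop signal; must run in the main thread."""
    flag = StopRequested()
    for sig in signals:
        signal.signal(sig, flag.handle)
    return flag


def user_loop(flag, do_request, poll_interval=0.01):
    """Call do_request until a stop signal arrives; return the call count."""
    handled = 0
    while not flag.stop:
        do_request()
        handled += 1
        time.sleep(poll_interval)
    return handled
```

The main process would then stop a child with something like `os.kill(child.pid, signal.SIGTERM)` instead of writing to `stop_q`.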

sjmonson · 2024-10-21T16:32:48Z

load_test.py

@@ -89,48 +119,50 @@ def main(args):
    log_reader_thread = logging_utils.init_logging(args.log_level, logger_q)

    # Create processes and their Users
+    schedule_q = mp_ctx.Queue(1)


Suggested change

schedule_q = mp_ctx.Queue(1)

schedule_q = mp_ctx.Queue(1)

schedule_q.cancel_join_thread()

Call cancel_join_thread() here so the queue's feeder thread doesn't block the process on exit.
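
A small sketch of the suggestion in isolation. `make_schedule_queue` is a hypothetical helper, and the "spawn" context is only an assumption for the example. Note the trade-off documented for `cancel_join_thread()`: items still buffered in the queue may be lost when the process exits, which is usually acceptable for a scheduling queue that is being torn down anyway.

```python
import multiprocessing as mp


def make_schedule_queue(mp_ctx, maxsize=1):
    """Create a bounded queue whose feeder thread is not joined at exit."""
    q = mp_ctx.Queue(maxsize)
    # Without this, a process that has put items on the queue waits at
    # exit until the feeder thread has flushed them to the pipe.
    q.cancel_join_thread()
    return q
```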

sjmonson · 2024-11-04T20:18:58Z

load_test.py

    for query in dataset.get_next_n_queries(2 * concurrency):
-        dataset_q.put(query)
+        request_q.put((None, query))


For clarity, I think this would be better as a dict or an object, e.g.:

Suggested change

request_q.put((None, query))

request_q.put(dict(query=query, req_time=None))

or set a field on the query dict.

Using a field in the query dict is much more elegant!

Yeah, I like the idea of making it a field in the query dict.
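
A rough sketch of the "field on the query dict" variant, assuming each query is already a dict. `enqueue_query` is a hypothetical helper and the `req_time` key name is taken from the suggestion above; the actual field name in the PR may differ.

```python
import queue


def enqueue_query(request_q, query, req_time=None):
    """Put a copy of the query on the queue with its scheduled send time."""
    item = dict(query)  # copy so the dataset's own dict is left untouched
    item["req_time"] = req_time  # None means "send as soon as possible"
    request_q.put(item)
    return item
```

The consumer can then read `item["req_time"]` by name instead of unpacking a positional tuple.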

sjmonson · 2024-11-04T20:29:13Z

load_test.py

+
+    return


Drop this return?

Suggested change

return

It doesn't, but I wonder whether adding a dedicated try/except block in this function is worth it. We currently catch all the cascading exceptions with the generic Exception class in the main function, but that's probably not the cleanest way to handle them, IMO.

I'm not suggesting this should be addressed in this PR, but a follow-up PR to clean up our exception handling might be good.
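
To make the follow-up idea concrete, here is one possible shape for a dedicated handler around a main loop. This is only a sketch: `run_main_loop` is a hypothetical wrapper, the set of exception types is an assumption, and it relies on users treating a non-empty `stop_q` as the stop signal.

```python
import logging
import queue


def run_main_loop(loop_fn, stop_q, logger=logging):
    """Run loop_fn and always tell users to stop afterwards.

    Returns True if loop_fn finished normally, False if it failed
    with one of the errors handled here.
    """
    try:
        loop_fn()
        return True
    except (queue.Full, ValueError, OSError) as err:
        # Failures we expect from queue operations: log and report.
        logger.error("Main loop failed: %s", err)
        return False
    finally:
        # Runs on success, on the handled errors above, and on anything
        # unexpected that still propagates up to main().
        stop_q.put(None)
```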

npalaska

Some minor nits and comments but this looks ready to go.

npalaska · 2024-11-13T21:42:57Z

config.yaml

  max_sequence_tokens: 2048
 load_options:
-  type: constant #Future options: loadgen, stair-step
-  concurrency: 1
+  type: rps #Options: concurrency, rps, loadgen, stair-step


Any reason you replaced the constant load type with concurrency? IMO, constant sounds closer to Constant Load, which is a continuous stream of requests.

I was thinking that constant is ambiguous, as RPS can also be constant. My other thought is that we might later add dynamically changing RPS or dynamically changing concurrency, so either RPS or concurrency could be constant or dynamic.
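
To illustrate the constant-versus-dynamic distinction for RPS, a minimal sketch of how send times could be generated in each case. Both function names are hypothetical and this is not the scheduling code from the PR; `rate_fn` is an assumed callable that returns the target requests per second at a given time.

```python
def constant_rps_schedule(rps, start_time, end_time):
    """Yield evenly spaced send times in [start_time, end_time)."""
    interval = 1.0 / rps
    n = 0
    while True:
        t = start_time + n * interval
        if t >= end_time:
            return
        yield t
        n += 1


def variable_rps_schedule(rate_fn, start_time, end_time):
    """Yield send times where the gap after time t is 1 / rate_fn(t)."""
    t = start_time
    while t < end_time:
        yield t
        t += 1.0 / rate_fn(t)
```

With a rate function that always returns the same value, the second generator reduces to the first, which is why "constant" on its own does not identify the load type.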

npalaska · 2024-11-13T22:01:42Z

load_test.py

+
+    return


It doesn't, but I wonder whether adding a dedicated try/except block in this function is worth it. We currently catch all the cascading exceptions with the generic Exception class in the main function, but that's probably not the cleanest way to handle them, IMO.

I'm not suggesting this should be addressed in this PR, but a follow-up PR to clean up our exception handling might be good.

npalaska · 2024-11-13T22:05:49Z

load_test.py

+
+def main_loop_concurrency_mode(dataset, request_q, start_time, end_time):
+    """Let all users send requests repeatedly until end_time"""
+    logging.info("Test from main process")


Do we still need this logging statement here?

No, I'll remove this thanks!

npalaska · 2024-11-13T22:20:22Z

load_test.py

    for query in dataset.get_next_n_queries(2 * concurrency):
-        dataset_q.put(query)
+        request_q.put((None, query))


Yeah, I like the idea of making it a field in the query dict.

npalaska · 2024-11-13T22:22:38Z

plugins/caikit_client_plugin.py

@@ -79,7 +79,6 @@ def request_grpc(self, query, user_id, test_end_time: float=0):
        result.output_tokens_before_timeout = result.output_tokens
        result.output_text = response

-        result.calculate_results()


I wonder if it's time to deprecate the caikit_client_plugin?

npalaska · 2024-11-13T22:23:46Z

plugins/dummy_plugin.py

@@ -35,7 +35,6 @@ def request_http(self, query, user_id, test_end_time: float=0):

        result.end_time = time.time()

-        result.calculate_results()


When we do a cleanup we probably should remove this file.

Yeah, this was originally added with the thought that it could be used in some test cases but we may want to remove it depending on how we decide to handle testing (unit tests, e2e tests, etc...)

npalaska · 2024-11-13T22:45:48Z

user.py

@@ -59,17 +63,48 @@ def _init_user_process_logging(self):
        self.logger = logging.getLogger("user")
        return logging.getLogger("user")

+    def _user_loop(self, test_end_time):
+        while self.stop_q.empty():


Yeah totally, that's long overdue. I was looking into it and should work on it sometime.

npalaska · 2024-11-13T22:48:02Z

user.py

+            except queue.Empty:
+                # if timeout passes, queue.Empty will be thrown
+                # User should check if stop_q has been set, else poll again
+                # self.debug.info("User waiting for a request to be scheduled")


Should this line be uncommented?
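
For context, a standalone sketch of the polling pattern in this hunk. `poll_for_request` is a hypothetical free function rather than the `User` method from the PR. One thing to check before uncommenting: the commented line calls `self.debug.info`, while `_init_user_process_logging` stores the logger as `self.logger`, so the sketch assumes a logger object and uses `logger.debug`.

```python
import queue


def poll_for_request(request_q, stop_q, timeout=0.1, logger=None):
    """Return the next request, or None once stop_q has been set."""
    while stop_q.empty():
        try:
            # Block for at most `timeout` seconds waiting for work.
            return request_q.get(timeout=timeout)
        except queue.Empty:
            # Timed out: loop back, re-check stop_q, then poll again.
            if logger is not None:
                logger.debug("User waiting for a request to be scheduled")
    return None
```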

dagrayvid added 2 commits October 19, 2024 17:23

Split dataset before process creation

94dd326

Add RPS based load option

439df72

dagrayvid force-pushed the dataset-split branch from bf19e71 to 439df72 Compare October 21, 2024 02:09

dagrayvid added 2 commits October 21, 2024 10:05

Add RPS summary including late requests

f0da2c6

Rewrite main_process to support variable req rate in the future

f025833

dagrayvid requested a review from sjmonson October 21, 2024 14:05

sjmonson suggested changes Oct 21, 2024

dagrayvid added 2 commits October 21, 2024 21:30

Switch back to prompts being passed through a queue

0bfeb9e

Fix rps main loop bug

f375f55

sjmonson suggested changes Nov 4, 2024

npalaska approved these changes Nov 13, 2024
