
Make load test more generic for other LM tasks #53

Open · wants to merge 14 commits into base: main
Conversation

@drewrip (Contributor) commented Jul 19, 2024

In order to better use llm-load-test to evaluate embedding tasks, I had to chop the code up a bit. I don't think this necessarily should or needs to be merged, but I wanted to open it for discussion and to publicize this code as I work on it.

@drewrip (Contributor, Author) commented Aug 1, 2024

@dagrayvid Definitely no rush on this, but whenever you have a second could you take a look at this PR and let me know what you think in terms of where this should/could go? It restructures the current llm-load-test model quite a bit to make the embedding work fit, so it might not match the vision for llm-load-test. Happy to keep this as a fork otherwise :)

Dockerfile (review context):

RUN git switch $GIT_BRANCH
RUN pip3 install -r requirements.txt

CMD python3 load_test.py -c $LLM_LOAD_TEST_CONFIG -log $LLM_LOAD_TEST_LOG_LEVEL
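
(For context, a hedged sketch of how this image might be built and run; the image tag, and the assumption that GIT_BRANCH is declared as a build ARG, are illustrative and not part of the PR:)

# Hypothetical invocation; assumes the Dockerfile declares ARG GIT_BRANCH
docker build --build-arg GIT_BRANCH=main -t llm-load-test .
docker run \
  -e LLM_LOAD_TEST_CONFIG=config.yaml \
  -e LLM_LOAD_TEST_LOG_LEVEL=info \
  llm-load-test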
Review comment (Contributor):

Can you add an ENV here for the output files?
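
(For illustration only, a hedged sketch of what such ENV lines could look like; LLM_LOAD_TEST_OUTPUT_DIR is a hypothetical name, and as the reply below notes, the tool actually reads the output path from its config file:)

# Hypothetical defaults; each can be overridden at `docker run` time with -e
ENV LLM_LOAD_TEST_CONFIG=config.yaml
ENV LLM_LOAD_TEST_LOG_LEVEL=info
ENV LLM_LOAD_TEST_OUTPUT_DIR=/output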

drewrip (Author) replied:

Not 100% sure what you mean, but the path for the output files should be included in the llm-load-test config file.
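
(For reference, a hedged sketch of the output section of an llm-load-test config file; the exact key names may differ between versions:)

# Output settings; values shown are examples only
output:
  format: json        # serialization format for results
  dir: "/tmp"         # directory the results file is written to
  file: "output.json" # results filename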

@dagrayvid (Collaborator) commented:
@ccamacho @drewrip for now, I'm thinking that testing embedding models is out of scope for llm-load-test, and this should remain a separate fork. One reason is that it requires making the output processing "pluggable", which has otherwise been unnecessary so far. Another reason is that I think a tool made specifically for embedding tasks would probably need a different type of dataset.

Does that make sense? Open to more discussion on this.
