Ks/refactor scoring endpoint #204

shehadak · 2023-09-13T12:36:40Z

This PR splits the language submission endpoint into two separate methods: run_scoring, which calls the core scoring endpoint on every model/benchmark pair provided, and get_models_and_benchmarks, which returns the list of model/benchmark pairs to score given a list of new models and new benchmarks. Previously, run_scoring did both. To run scoring jobs across multiple nodes of an HPC cluster instead of in a single job, scripts need to identify the list of model/benchmark pairs separately from the calls to the core scoring endpoint, which this PR enables.
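To illustrate the split, here is a minimal sketch of the two steps as independent functions. It is not the code in this PR: `list_pairs`, `score_pairs`, and the `score_one` callback are hypothetical names, and pairing every model with every benchmark is an assumption about how the pairs are formed.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple

Pair = Tuple[str, str]


def list_pairs(model_ids: List[str], benchmark_ids: List[str]) -> List[Pair]:
    # Assumption: every model is paired with every benchmark.
    return list(product(model_ids, benchmark_ids))


def score_pairs(pairs: List[Pair],
                score_one: Callable[[str, str], float]) -> Dict[Pair, float]:
    # `score_one` is a hypothetical stand-in for a call to the core scoring endpoint.
    return {pair: score_one(*pair) for pair in pairs}
```

Because `list_pairs` does no scoring itself, a driver script could call it once and then submit each pair (or batch of pairs) as its own cluster job.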

This PR is complementary to brain-score/core#40

A note on backward compatibility: Although this PR splits run_scoring into two separate methods, the default behavior is unchanged. If a user does not specify --fn=get_models_and_benchmarks, the run_scoring method is called, which matches the behavior before this PR.
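The default-preserving dispatch could look roughly like the sketch below. The `--fn` flag and its `run_scoring` default come from this PR; the `FUNCTIONS` table, the stub bodies, the `const` value, and the `dispatch` helper are assumptions made for illustration only.

```python
import argparse


def run_scoring(args_dict):
    # Stub: the real method calls the core scoring endpoint.
    return "scoring"


def get_models_and_benchmarks(args_dict):
    # Stub: the real method returns the model/benchmark pairs to score.
    return "listing"


# Hypothetical lookup table from flag value to function.
FUNCTIONS = {
    "run_scoring": run_scoring,
    "get_models_and_benchmarks": get_models_and_benchmarks,
}


def dispatch(argv):
    parser = argparse.ArgumentParser()
    # `const` covers a bare `--fn` with no value; without it argparse yields None.
    parser.add_argument("--fn", type=str, nargs="?", const="run_scoring",
                        default="run_scoring", choices=list(FUNCTIONS))
    args = parser.parse_args(argv)
    return FUNCTIONS[args.fn](vars(args))
```

Callers that never pass `--fn` still reach `run_scoring`, so existing invocations keep working.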

A note on pyproject.toml: The commit aa21570 is for debugging purposes and will be reverted before merging.

brainscore_language/submission/endpoints.py

pyproject.toml

kvfairchild · 2023-11-13T15:22:07Z

brainscore_language/submission/endpoints.py

@@ -41,11 +51,20 @@ def send_email_to_submitter(uid: int, domain: str, pr_number: str,

 if __name__ == '__main__':
    parser = make_argparser()
+    parser.add_argument('--fn', type=str, nargs='?', default='run_scoring',


Is there a reason to add this here instead of including it directly in make_argparser()?

shehadak · 2023-11-13

Good point, especially since the same functionality will apply to other domains. I moved it to make_argparser in Core (12fcfbc) and removed it from Language (6cfd50c).

kvfairchild · 2023-11-13T15:35:48Z

brainscore_language/submission/endpoints.py

+def resolve_models_benchmarks(args_dict: Dict[str, Union[str, List]]):
    benchmarks, models = retrieve_models_and_benchmarks(args_dict)

-    run_scoring_endpoint(domain="language", jenkins_id=args_dict["jenkins_id"],
-                         models=models, benchmarks=benchmarks, user_id=args_dict["user_id"],
-                         model_type="artificialsubject", public=args_dict["public"],
-                         competition=args_dict["competition"])
-
+    model_ids = resolve_models(domain="language", models=models)
+    benchmark_ids = resolve_benchmarks(domain="language", benchmarks=benchmarks)
+    print("BS_NEW_MODELS=" + " ".join(model_ids))
+    print("BS_NEW_BENCHMARKS=" + " ".join(benchmark_ids))
+    return model_ids, benchmark_ids


Suggest moving resolve_models_benchmarks() to core

shehadak · 2023-11-13

I agree. I moved it to Core (6eb46a4) and removed it from Language (d05eaf2).
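Since resolve_models_benchmarks prints its results as `BS_NEW_MODELS=...` and `BS_NEW_BENCHMARKS=...` lines, a calling script can recover the identifiers from captured stdout. The sketch below is one way to do that; `parse_resolved_ids` is a hypothetical helper, and it assumes identifiers contain no whitespace (they are joined with single spaces in the diff above).

```python
from typing import Dict, List

EXPECTED_KEYS = ("BS_NEW_MODELS", "BS_NEW_BENCHMARKS")


def parse_resolved_ids(stdout: str) -> Dict[str, List[str]]:
    # Collect only the two KEY=value lines; ignore any other output.
    result: Dict[str, List[str]] = {}
    for line in stdout.splitlines():
        key, sep, value = line.partition("=")
        if sep and key in EXPECTED_KEYS:
            # An empty value yields an empty list.
            result[key] = value.split()
    return result
```

Other log lines are skipped, so the helper tolerates extra output from the job that ran the endpoint.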

shehadak requested review from kvfairchild and mschrimpf September 13, 2023 18:07

mschrimpf reviewed Sep 28, 2023

kvfairchild reviewed Nov 2, 2023

shehadak force-pushed the ks/refactor_scoring_endpoint branch from 9736df3 to 1e28aef Compare November 2, 2023 17:47

shehadak force-pushed the ks/refactor_scoring_endpoint branch 2 times, most recently from f0098de to 46bf7fd Compare November 11, 2023 05:36

kvfairchild reviewed Nov 13, 2023

kvfairchild approved these changes Nov 13, 2023

shehadak force-pushed the ks/refactor_scoring_endpoint branch from 1827097 to 67515e2 Compare November 16, 2023 00:26

shehadak added 12 commits November 22, 2023 09:10

Modified scoring endpoint to separate getting models/benchmarks from scoring

0cd635f

Change core reference to custom fork

d651872

Fixed bug in flag parsing

e9a0867

added unit test for model/benchmark retrieval method

f75e596

Added unit test for scoring with BS_INSTALL_DEPENDENCIES

e501d1c

Changed resolving methods to use refactored core methods

d20ec6b

minor bug fix

ea37d5a

fixed minor typo bug in unit test

95add0d

minor renaming for clarity

fa39789

Moved '--fn' to core

b465aa5

Moved 'resolve_models_benchmarks' to core

6642e88

undo 93456ee

ba4fa4c

shehadak force-pushed the ks/refactor_scoring_endpoint branch from 9425003 to ba4fa4c Compare November 22, 2023 14:11

shehadak merged commit 78766a6 into main Nov 22, 2023
3 checks passed

shehadak deleted the ks/refactor_scoring_endpoint branch November 28, 2023 13:27
