
Switch from Multi Threading to Multi Processing#206

Draft
r-sharp wants to merge 16 commits into MetOffice:main from r-sharp:switch_to_multiprocessing

Conversation

@r-sharp
Contributor

@r-sharp r-sharp commented Mar 6, 2026

PR Summary

Sci/Tech Reviewer:
Code Reviewer:

Performance tests on the original code, used to scan a full UM clone, revealed that varying the maximum number of workers anywhere between 1 and 64 gave no noticeable performance improvement whatsoever. A small improvement at low thread counts would not have been surprising, but seeing nothing at all was; this is likely because the work is CPU-bound, and CPython's Global Interpreter Lock prevents threads from executing Python code in parallel.

However, there had always been an intention to switch to using multiple processes, as the tasks of opening files and scanning their contents were likely to work well as completely independent tasks on different processes.
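The pattern above can be sketched as follows. This is a minimal, hypothetical illustration, not the checker's actual code: `scan_file`, the line-counting stand-in "scan", and the `*.py` glob are all assumptions for the sake of the example.

```python
from concurrent.futures import ProcessPoolExecutor
from pathlib import Path


def scan_file(path: str) -> tuple[str, int]:
    # Stand-in for the real UMDP3 checks: count lines as a dummy "scan".
    # Each call is fully independent, which is what makes processes suit it.
    with open(path, encoding="utf-8", errors="ignore") as f:
        return path, sum(1 for _ in f)


def scan_tree(root: str, max_workers: int = 2) -> dict[str, int]:
    # Gather the files, then farm them out to worker processes.
    # Unlike threads, separate processes sidestep the GIL for CPU-bound work.
    files = [str(p) for p in Path(root).rglob("*.py")]
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(scan_file, files))
```

Because each file scan shares no state with the others, no locking or inter-process communication is needed beyond the pool's own result collection.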

Initial tests of this change have demonstrated timing improvements on the VDI when using 2 processors:

umdp3_checker_timings/2nd_draft_mulitiprocessor_01_time.txt : real 2m38.319s
umdp3_checker_timings/2nd_draft_mulitiprocessor_02_time.txt : real 1m29.531s
umdp3_checker_timings/2nd_draft_mulitiprocessor_03_time.txt : real 1m8.422s
umdp3_checker_timings/2nd_draft_mulitiprocessor_04_time.txt : real 1m5.695s

The numbers in the file names indicate the maximum number of workers specified at runtime.

Also, when submitted to 16 processors on SPICE using salloc --time=30 --mem=8G --ntasks=16 --x11 --bell, reasonable scaling occurs up to the 16 processors requested.

umdp3_checker_timings/SPICE_16proc_UM_01_time.txt : real 2m41.835s
umdp3_checker_timings/SPICE_16proc_UM_02_time.txt : real 1m8.262s
umdp3_checker_timings/SPICE_16proc_UM_04_time.txt : real 0m35.004s
umdp3_checker_timings/SPICE_16proc_UM_08_time.txt : real 0m18.225s
umdp3_checker_timings/SPICE_16proc_UM_16_time.txt : real 0m13.493s
umdp3_checker_timings/SPICE_16proc_UM_32_time.txt : real 0m14.478s

Again, the numbers in the file names indicate the maximum number of workers specified at runtime.
All timings were simply gathered with the bash built-in time command.

Based on these results, setting the default maximum number of workers to 2 is presumed best, anticipating use on the VDI. Automated use, such as within rose-stem or as a GitHub Action, can specify other values more suitable for those environments.
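A command-line default like this might be exposed as sketched below. The option name --max-workers and the parser are illustrative assumptions, not the checker's real interface.

```python
import argparse


def parse_args(argv=None):
    # Hypothetical CLI sketch: a default of 2 workers suits the VDI,
    # while rose-stem or GitHub Actions can pass a larger value explicitly.
    parser = argparse.ArgumentParser(description="UMDP3 checker (sketch)")
    parser.add_argument(
        "--max-workers",
        type=int,
        default=2,
        help="maximum number of worker processes (default: 2)",
    )
    return parser.parse_args(argv)
```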

Code Quality Checklist

  • I have performed a self-review of my own code
  • My code follows the project's style guidelines
  • Comments have been included that aid understanding and enhance the readability of the code
  • My changes generate no new warnings
  • All automated checks in the CI pipeline have completed successfully

Testing

  • This change has been tested appropriately (please describe)

Run on the command line on both the VDI and SPICE to test a full clone of the UM and also a UM branch.
Results were as expected: performance on 2 or more processors was improved.

Security Considerations

  • I have reviewed my changes for potential security issues
  • Sensitive data is properly handled (if applicable)
  • Authentication and authorisation are properly implemented (if applicable)

AI Assistance and Attribution

  • Some of the content of this change has been produced with the assistance of Generative AI tool name (e.g., Met Office Github Copilot Enterprise, Github Copilot Personal, ChatGPT GPT-4, etc) and I have followed the Simulation Systems AI policy (including attribution labels)

AI has been used for line completion. Curiously, quite a bit of what it suggested (using multiprocessing.Pool instead of ThreadPoolExecutor) was taken back out to make the code easier to follow, and removing it did not affect performance.
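For the independent-task workload described here, the two APIs mentioned are largely interchangeable, as this sketch shows (the square function and wrappers are illustrative only, not code from this PR):

```python
from concurrent.futures import ProcessPoolExecutor
from multiprocessing import Pool


def square(n: int) -> int:
    # Trivial stand-in for an independent, CPU-bound task.
    return n * n


def with_executor(items, workers=2):
    # concurrent.futures style: the higher-level, newer interface.
    with ProcessPoolExecutor(max_workers=workers) as ex:
        return list(ex.map(square, items))


def with_pool(items, workers=2):
    # multiprocessing.Pool style: the older interface, same result here.
    with Pool(processes=workers) as pool:
        return pool.map(square, items)
```

Since both produce the same behaviour for a simple map over independent tasks, preferring whichever reads more clearly in context is a reasonable call.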

Sci/Tech Review

  • I understand this area of code and the changes being added
  • The proposed changes correspond to the pull request description
  • Documentation is sufficient (do documentation papers need updating)
  • Sufficient testing has been completed

(Please alert the code reviewer via a tag when you have approved the SR)

Code Review

  • All dependencies have been resolved
  • Related Issues have been properly linked and addressed
  • Code quality standards have been met
  • Tests are adequate and have passed
  • Security considerations have been addressed
  • Performance impact is acceptable
