perf: query in-memory filtering as predicate [DHIS2-18041] #20094

jbee · 2025-02-26T13:49:50Z

Refactors the matching within the in-memory query engine to run the matching in 2 steps

This is an important performance improvement because...

the 1st step which contains multiple lookups is only done once (as compared to before where is was done for each object candidate).
the matching itself does not create temporary collections. The algorithm before the refactoring would matrix multiply nested paths with collections and then test each result. For example, a path like x.y.z with x and y being collections of size N and M would result in a intermediate collection of size N * M if z objects. These then each would be tested against the filter. This would happen for each object candidate tested. But building the intermediate collections is entirely unnecessary. The same can be done using "exists any" logic testing along the path instead. So in this example the code checks: exists and e in N where there exists any e in M where z matches the test.

There are countless tests directly and indirectly testing this. No new tests were added.

Would be any sort of metadata API query using a filter: ?filter=...

sonarqubecloud · 2025-02-26T13:53:47Z

Quality Gate passed

refactor: query in-memory filtering as predicate [DHIS2-18041]

c7c9e12

jbee self-assigned this Feb 26, 2025

netroms approved these changes Feb 26, 2025

View reviewed changes

david-mackessy approved these changes Feb 26, 2025

View reviewed changes

jbee merged commit 185f088 into master Feb 26, 2025
17 checks passed

jbee deleted the DHIS2-18041-in-mem-filter branch February 26, 2025 14:15