perf: increase efficiency by using coroutines #322

Axelen123 · 2024-11-10T19:28:46Z

This PR contains ideas I got while looking at discussions and code related to the perf/parallel-patch-execution branch.

Changes

Fingerprint matching and class searches are now done in parallel
IO operations are now performed on IO threads
Changed the Patch API to use coroutines

TODOs

Fix resource patch data races
Improve API
Cleanup

LisoUseInAIKyrios · 2024-11-10T19:52:44Z

So everything is still in order and single threaded, except when each patch is resolving a fingerprint it's that single fingerprint resolving that runs in parallel?

This should prevent all concurrency issues and the patches code won't need any changes or need to care about writing proper multi-threaded code.

What is the resource patch data race issue?

LisoUseInAIKyrios · 2024-11-10T23:40:36Z

I'm trying this locally, but it looks like the patches are running in parallel?

I'm seeing the patches are not finishing in alphabetical order. If the patching is done in parallel then patching errors can occur non-deterministically (ie: at random), because multiple patches can modify the same method in an interleaving fashion. Where one patch is getting an insert index, and then another patch adds code that changes that insertion index, and the first patch then uses the wrong index which is now pointing to the wrong instruction.

I'm only seeing a minor speedup with this change.

Without this change, it takes 70 seconds total patching time from start to finish (including installing the patched app).
With this change I'm seeing 65 seconds total time.

5 seconds of improvement is less than a 10% improvement and seems too little to notice. Slower devices would see a larger speedup, but it still would not be substantial.

LisoUseInAIKyrios · 2024-11-10T23:50:23Z

An unrelated idea to speed up patching, is to use KMP for the opcode pattern matching.

KMP is ideal since it changes searching to O(N) instead of the current worse case O(N^2). But it really shines when the search target has a limited set of characters and many repeating patterns, such as the repeating and common opcode patterns generated by the compiler.

That alone would most likely give even better performance than trying to parallelize patching.

oSumAtrIX and others added 7 commits November 9, 2024 05:17

perf: Execute patches in parallel

57620fd

fix: Add synchronization to proxy pool

ce57c4f

fix: Update failing test

d33f199

fix logging indentation

07c3ae4

dont synchronize when not necessary

708e91e

refactor

2ecec37

parallelism PoC

a899a52

Axelen123 mentioned this pull request Nov 10, 2024

perf: increase efficiency by using coroutines ReVanced/revanced-patches#3885

Draft

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: increase efficiency by using coroutines #322

perf: increase efficiency by using coroutines #322

Axelen123 commented Nov 10, 2024

LisoUseInAIKyrios commented Nov 10, 2024 •

edited

Loading

LisoUseInAIKyrios commented Nov 10, 2024

LisoUseInAIKyrios commented Nov 10, 2024

perf: increase efficiency by using coroutines #322

Are you sure you want to change the base?

perf: increase efficiency by using coroutines #322

Conversation

Axelen123 commented Nov 10, 2024

Changes

TODOs

LisoUseInAIKyrios commented Nov 10, 2024 • edited Loading

LisoUseInAIKyrios commented Nov 10, 2024

LisoUseInAIKyrios commented Nov 10, 2024

LisoUseInAIKyrios commented Nov 10, 2024 •

edited

Loading