perf(tree): worker pooling for storage in multiproof generation #18883

yongkangc · 2025-10-07T03:05:52Z

This PR aims to add worker pooling to our existing proof generation. This is the first pr to perform this migration.

Core logic:

Determine workers based on parallelism
Spawn x workers each with its own tx
Queue storage proofs via crossbeam channels to the workers

Changes:

Use tokio blocking pool
Replaced ProofTaskManager with a new new_proof_task_handle function for improved task management. We removed ProofTaskManager here as it was an intemediary that is not needed with direct wiring.

Issues closed:

References:

Overall POC: perf(trie): proofmanager optimisation WIP #18829
Previously we had a PR up by Rhovian. However the issue was that this is a larger refactors with significant back and forth, and therefore we had to bring this refactor in internally.

Next Steps:

Moving Account and Blinded nodes to adopt the same infrastructure

- Introduced constants for maximum and minimum storage/account proof workers to manage memory usage. - Implemented `default_proof_workers` function to calculate optimal worker counts based on available CPU cores. - Updated `TreeConfig` to include storage and account proof worker counts, with defaults derived from CPU detection. - Modified `ProofTaskManager` initialization to accept storage worker count. - Adjusted tests to accommodate new proof worker configurations.

- Simplified the `ProofTaskManager` by implementing a transaction pool using `crossbeam` channels for better concurrency. - Replaced the previous transaction management logic with a pre-initialized pool of transactions, improving efficiency in handling proof tasks. - Updated task dispatching to utilize the new transaction pool, ensuring smoother execution and resource management. - Enhanced error handling and logging for transaction operations to improve debugging capabilities.

- Changed `account_nodes` and `storage_nodes` from `usize` to `AtomicUsize` to support concurrent updates from multiple workers. - Implemented a custom `Clone` for `ProofTaskMetrics` to handle atomic values correctly. - Updated the `record` method to load atomic values with `Ordering::Relaxed` for improved performance.

- Replaced `ProofTaskManager` with a new `new_proof_task_handle` function for improved task management. - Updated related code to utilize the new proof task handle, enhancing clarity and reducing complexity. - Adjusted tests to reflect changes in proof task initialization and handling.

- Updated the `spawn` method in `payload_processor` to return a `ProviderResult`, improving error handling during task initialization. - Refactored related code in `payload_validator` and `multiproof` to handle the new result type, ensuring consistent error management. - Adjusted tests to validate the changes in task spawning and error handling mechanisms.

- Consolidated the initialization of `StateProviderBuilder` in the `bench_state_root` function for improved readability. - Streamlined the `spawn` method calls in `payload_validator` and `payload_processor` to enhance code clarity. - Adjusted formatting and structure in `proof_task` to maintain consistency across the codebase.

- Added logic to drain receivers for accounts not touched by the walker, allowing workers to deliver results without encountering closed channels. - Enhanced error handling for closed channels during storage proof reception, ensuring robustness in parallel processing.

Copilot

Pull Request Overview

This PR refactors proof generation to use worker pooling for improved performance. It replaces the existing ProofTaskManager with a new new_proof_task_handle function that creates a pool of pre-warmed database transactions.

Replaced ProofTaskManager with direct worker pooling via crossbeam channels
Added atomic metrics tracking for concurrent access
Introduced storage and account worker count configuration with auto-detection based on CPU cores

Reviewed Changes

Copilot reviewed 9 out of 10 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
crates/trie/parallel/src/proof_task_metrics.rs	Updated metrics to use atomic types for thread-safe concurrent access
crates/trie/parallel/src/proof_task.rs	Complete refactor replacing ProofTaskManager with worker pool architecture
crates/trie/parallel/src/proof.rs	Updated to use crossbeam channels and handle orphaned receivers
crates/trie/parallel/Cargo.toml	Added crossbeam-channel dependency
crates/engine/tree/src/tree/payload_validator.rs	Added error handling for proof task creation
crates/engine/tree/src/tree/payload_processor/multiproof.rs	Updated test to use new proof task handle creation
crates/engine/tree/src/tree/payload_processor/mod.rs	Updated to use new proof task handle and added error handling
crates/engine/tree/benches/state_root_task.rs	Added error handling for payload processor spawn
crates/engine/primitives/src/config.rs	Added worker count configuration with CPU-based auto-detection

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-07T04:45:19Z

crates/trie/parallel/src/proof_task.rs

+    let max_concurrency = max_concurrency.max(1);
+    let storage_worker_count = storage_worker_count.max(1);
+    let queue_capacity = max_concurrency;
+    let worker_count = storage_worker_count.min(max_concurrency);


The relationship between max_concurrency, storage_worker_count, and the final worker_count is unclear. The logic suggests worker_count is capped by max_concurrency, but the purpose of having both parameters is not evident from the code or documentation.

crates/trie/parallel/src/proof_task.rs

crates/engine/primitives/src/config.rs

Co-authored-by: Copilot <[email protected]>

- Updated the documentation for `new_proof_task_handle` to clarify its purpose and functionality. - Refined the logic for determining `queue_capacity` and `worker_count` to ensure proper initialization of the worker pool. - Removed outdated comments regarding transaction pool resource management for clarity.

- Eliminated constants and functions related to storage and account proof worker counts to simplify configuration. - Updated `TreeConfig` and related functions to remove references to proof worker counts, streamlining the codebase. - Adjusted proof task handling to utilize half of the queue capacity for storage proofs, enhancing resource management.

yongkangc · 2025-10-07T05:48:18Z

crates/trie/parallel/src/proof_task.rs

-/// A task that manages sending multiproof requests to a number of tasks that have longer-running
-/// database transactions
-#[derive(Debug)]
-pub struct ProofTaskManager<Factory: DatabaseProviderFactory> {


debated really hard about removing prooftask manager in this pr - thought it makes things much easier to wire up rn

because the prooftasksmanager had too many previous abstractions that were uncessary and only used by storage proof, it was easier to remove that now

Copilot

Pull Request Overview

Copilot reviewed 8 out of 9 changed files in this pull request and generated 2 comments.

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

crates/trie/parallel/src/proof_task.rs

Co-authored-by: Copilot <[email protected]>

yongkangc · 2025-10-07T06:14:23Z

initial benching at commit 6d117dd

yongkangc · 2025-10-07T06:43:19Z

moved over to #18887

yongkangc · 2025-10-07T07:23:43Z

Latest commit seems to have improved slightly after messing with the workers config

github-project-automation bot added this to Reth Tracker Oct 7, 2025

github-project-automation bot moved this to Backlog in Reth Tracker Oct 7, 2025

yongkangc changed the title ~~perf: worker pooling for multiproof~~ perf: worker pooling for storage in multiproof generation Oct 7, 2025

yongkangc changed the title ~~perf: worker pooling for storage in multiproof generation~~ perf(tree): worker pooling for storage in multiproof generation Oct 7, 2025

yongkangc self-assigned this Oct 7, 2025

yongkangc added 6 commits October 7, 2025 03:17

add crossbeam deps

ea20159

comments

f2dab42

yongkangc force-pushed the yk/worker_pool_manager branch from c4cf477 to 3ee7c73 Compare October 7, 2025 03:20

yongkangc added 5 commits October 7, 2025 03:38

clamp to 1

476e853

comments

b507803

yongkangc requested a review from Copilot October 7, 2025 04:44

yongkangc moved this from Backlog to In Review in Reth Tracker Oct 7, 2025

Copilot AI reviewed Oct 7, 2025

View reviewed changes

yongkangc and others added 6 commits October 7, 2025 12:49

Update crates/trie/parallel/src/proof_task.rs

46bc3ea

Co-authored-by: Copilot <[email protected]>

readding span guard

79485d7

fmt

50fd5a0

worker clamp

d486048

yongkangc commented Oct 7, 2025

View reviewed changes

yongkangc requested a review from Copilot October 7, 2025 05:49

Copilot AI reviewed Oct 7, 2025

View reviewed changes

crates/trie/parallel/src/proof_task.rs Outdated Show resolved Hide resolved

crates/trie/parallel/src/proof_task.rs Outdated Show resolved Hide resolved

Apply suggestion from @Copilot

34ed23b

Co-authored-by: Copilot <[email protected]>

yongkangc added 2 commits October 7, 2025 05:57

comment

6d117dd

comment

f30a04c

yongkangc closed this Oct 7, 2025

github-project-automation bot moved this from In Review to Done in Reth Tracker Oct 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf(tree): worker pooling for storage in multiproof generation #18883

perf(tree): worker pooling for storage in multiproof generation #18883

Uh oh!

yongkangc commented Oct 7, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 7, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yongkangc Oct 7, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

yongkangc commented Oct 7, 2025 •

edited

Loading

Uh oh!

yongkangc commented Oct 7, 2025

Uh oh!

yongkangc commented Oct 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

perf(tree): worker pooling for storage in multiproof generation #18883

perf(tree): worker pooling for storage in multiproof generation #18883

Uh oh!

Conversation

yongkangc commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yongkangc Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

yongkangc commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yongkangc commented Oct 7, 2025

Uh oh!

yongkangc commented Oct 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yongkangc commented Oct 7, 2025 •

edited

Loading

yongkangc Oct 7, 2025 •

edited

Loading

yongkangc commented Oct 7, 2025 •

edited

Loading