Making force merge threadpool 1/8th of total cores #17255

Open
gbbafna wants to merge 1 commit into main
Conversation

gbbafna (Collaborator) commented Feb 5, 2025

Description

Currently the force merge thread pool is bounded to a size of 1, irrespective of the total cores available. This makes force merges very slow and prevents them from scaling with the number of cores.

This PR increases the thread count to 1/8th of the total cores, allowing force merges to run faster while still being capped at 12.5% of the overall available CPU.
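For illustration, here is a minimal sketch of the sizing rule described above; the helper name and placement are assumptions for readability, not the exact diff in this PR:

```java
// Illustrative sketch only; the helper name is hypothetical, not the exact PR code.
static int oneEighthAllocatedProcessors(final int allocatedProcessors) {
    // 8 cores  -> 1 thread (unchanged from today's default)
    // 16 cores -> 2 threads
    // 64 cores -> 8 threads, still only ~12.5% of the available CPU
    return Math.max(1, allocatedProcessors / 8);
}
```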

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

  • Functionality includes testing.
  • API changes companion pull request created, if applicable.
  • Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

github-actions bot (Contributor) commented Feb 5, 2025

✅ Gradle check result for abc0a90: SUCCESS

codecov bot commented Feb 5, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 72.45%. Comparing base (302a3fd) to head (4ec5931).
Report is 3 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff              @@
##               main   #17255      +/-   ##
============================================
- Coverage     72.47%   72.45%   -0.03%     
+ Complexity    65618    65551      -67     
============================================
  Files          5291     5291              
  Lines        304347   304331      -16     
  Branches      44182    44181       -1     
============================================
- Hits         220578   220503      -75     
- Misses        65670    65759      +89     
+ Partials      18099    18069      -30     


peternied (Member) left a comment

Nice! This will be a big improvement for customers that depend on force merges and have had trouble scaling them up.

Bukhtawar (Collaborator) left a comment

LGTM, much needed. Should we also make it dynamic?

gbbafna (Collaborator, Author) commented Feb 7, 2025

LGTM, much needed. Should we also make it dynamic?

OpenSearch threadpools are already dynamic starting 2.17. Reference
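For example, a runtime override might look like the following, assuming the dynamic threadpool support referenced above exposes the existing `thread_pool.force_merge.size` key through the cluster settings API (the exact key and mechanism are assumptions here, not confirmed by this thread):

```
PUT _cluster/settings
{
  "persistent": {
    "thread_pool.force_merge.size": 2
  }
}
```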

github-actions bot (Contributor) commented Feb 7, 2025

❌ Gradle check result for 846824f: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

sohami (Collaborator) commented Feb 7, 2025

LGTM, much needed. Should we also make it dynamic?

OpenSearch threadpools are already dynamic starting 2.17. Reference

With dynamic, I understand we can change it without a restart. But for setting the default behavior, do we have any benchmarks that show the improvement from increasing the threadpool size, in terms of disk IOPS utilization and force merge time reduction? My question would be: why 1/8th as the default, and not 1/4th or 1/16th?

gbbafna (Collaborator, Author) commented Feb 11, 2025

With dynamic, I understand we can change it without a restart. But for setting the default behavior, do we have any benchmarks that show the improvement from increasing the threadpool size, in terms of disk IOPS utilization and force merge time reduction? My question would be: why 1/8th as the default, and not 1/4th or 1/16th?

We want to start with 1/8th, as that will consume 12.5% of the CPU. 1/4th would be too high, as it would have a sizeable impact on indexing, and 1/16th would be too low.

We have seen many OpenSearch clusters with >=16 CPUs experiencing slow force merges, whereas clusters with more nodes but fewer cores per node perform them much better. This is why we want to make this a factor of the core count.
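To make the trade-off concrete, here is a small standalone illustration (assumed numbers only) comparing the candidate divisors on a 32-core node:

```java
// Illustration only: compare candidate force_merge pool sizes on a 32-core node.
public class ForceMergeSizingComparison {
    public static void main(String[] args) {
        int cores = 32;
        for (int divisor : new int[] { 16, 8, 4 }) {
            int threads = Math.max(1, cores / divisor);
            // Prints: 1/16 -> 2 threads (6.2%), 1/8 -> 4 threads (12.5%), 1/4 -> 8 threads (25.0%)
            System.out.printf("1/%d of %d cores -> %d threads (%.1f%% of CPU)%n",
                divisor, cores, threads, 100.0 * threads / cores);
        }
    }
}
```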

github-actions bot (Contributor)
✅ Gradle check result for 846824f: SUCCESS

jainankitk (Collaborator) commented
@gbbafna - This change would be fine if force_merge were the only merge running on the nodes, but the default Lucene merges also run during indexing. Ideally, the threshold should account for both and limit the behavior based on the number of threads already being consumed for merges.

Also, a merge operation is not just CPU bound but disk I/O bound as well. Allowing concurrent force_merge operations without considering I/O seems risky to me. In addition, a force_merge operation can intermittently consume up to 2x the disk space of the segments being merged. With a single thread, that validation is easy; I am not sure how we correctly validate it during concurrent force_merge operations.

github-actions bot (Contributor)
❌ Gradle check result for 4ec5931: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

gbbafna (Collaborator, Author) commented Feb 12, 2025

@gbbafna - This change would be fine if force_merge were the only merge running on the nodes, but the default Lucene merges also run during indexing. Ideally, the threshold should account for both and limit the behavior based on the number of threads already being consumed for merges.

Also, a merge operation is not just CPU bound but disk I/O bound as well. Allowing concurrent force_merge operations without considering I/O seems risky to me. In addition, a force_merge operation can intermittently consume up to 2x the disk space of the segments being merged. With a single thread, that validation is easy; I am not sure how we correctly validate it during concurrent force_merge operations.

You are right about this being risky. But I would rather have it as a default and advise users to have disk space and IO proportionate to their CPU, which they should have anyway. When users scale up CPU without scaling disk space and IO proportionately, both of those are going to be bottlenecks for other operations as well. If they are not proportionate, the advice would be to change the defaults via yml or not run force merges concurrently.
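For instance, an operator whose disk IOPS or free space are not provisioned in proportion to CPU could pin the pool back to a single thread via the existing node setting (illustrative snippet, not part of this PR):

```yaml
# opensearch.yml: keep the force_merge pool at one thread on nodes where
# disk space / IOPS are not provisioned in proportion to CPU.
thread_pool.force_merge.size: 1
```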

jainankitk (Collaborator) commented
You are right about this being risky. But I would rather have it as a default and advise users to have disk space and IO proportionate to their CPU, which they should have anyway. When users scale up CPU without scaling disk space and IO proportionately, both of those are going to be bottlenecks for other operations as well. If they are not proportionate, the advice would be to change the defaults via yml or not run force merges concurrently.

IMO, defaults should be the ones that work for most use cases. I am assuming this change is driven by the performance tuning needs of a few customers. There are many more customers for whom the default of 1 works and who don't need this to change. Also, given that OpenSearch threadpools are dynamic now, we should tune the number of force_merge threads for the customers that need more, and learn from those tunings before changing the default.

github-actions bot (Contributor)
✅ Gradle check result for 4ec5931: SUCCESS

gbbafna (Collaborator, Author) commented Feb 12, 2025

You are right about this being risky. But I would rather have it as a default and advise users to have disk space and IO proportionate to their CPU, which they should have anyway. When users scale up CPU without scaling disk space and IO proportionately, both of those are going to be bottlenecks for other operations as well. If they are not proportionate, the advice would be to change the defaults via yml or not run force merges concurrently.

IMO, defaults should be the ones that work for most use cases. I am assuming this change is driven by the performance tuning needs of a few customers. There are many more customers for whom the default of 1 works and who don't need this to change. Also, given that OpenSearch threadpools are dynamic now, we should tune the number of force_merge threads for the customers that need more, and learn from those tunings before changing the default.

I don't think it is working for most use cases. We have heard from numerous users that force merges are not scaling for them, despite a lot of capacity remaining unused on their clusters. For the majority of users, who use <16 cores, this change will not have any effect. For the rest, it only takes effect when there is a large amount of force merging to be done. Hence I think the pros outweigh the cons here. I do hear your concerns and will call them out in our documentation.

ashking94 (Member) left a comment

LGTM

Labels: backport 2.x (Backport to 2.x branch)
9 participants