[Enhancement] run auto analyze job with priority #55446

Merged on Jan 26, 2025 (3 commits)

murphyatwork (Contributor) commented Jan 26, 2025

Why I'm doing:

In a database with 10K+ tables, the auto analyze job can run for several hours, and it does not stop even after the running window has ended.

What I'm doing:

To address this issue, we introduce two optimizations:

  1. Check the running window before analyzing each table
  2. Sort jobs by priority, taking statistics health and statistics staleness into account
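The two optimizations can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `AutoAnalyzeSketch`, `TableJob`, `inWindow`, and `run` are hypothetical names, and the ordering rule (least healthy statistics first, ties broken by the most stale statistics) is an assumption about how health and staleness might be combined.

```java
import java.time.LocalTime;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;
import java.util.function.Supplier;

class AutoAnalyzeSketch {
    /** Hypothetical job descriptor; healthy is in [0, 1], lower means worse statistics. */
    static final class TableJob {
        final String table;
        final double healthy;
        final long staleSeconds;

        TableJob(String table, double healthy, long staleSeconds) {
            this.table = table;
            this.healthy = healthy;
            this.staleSeconds = staleSeconds;
        }
    }

    /** Assumed ordering: least healthy first, then most stale first. */
    static final Comparator<TableJob> BY_PRIORITY = (a, b) -> {
        int c = Double.compare(a.healthy, b.healthy);
        return c != 0 ? c : Long.compare(b.staleSeconds, a.staleSeconds);
    };

    /** True if now is in [start, end); a window with start after end wraps past midnight. */
    static boolean inWindow(LocalTime now, LocalTime start, LocalTime end) {
        if (start.isBefore(end)) {
            return !now.isBefore(start) && now.isBefore(end);
        }
        return !now.isBefore(start) || now.isBefore(end);
    }

    /** Sorts the jobs, then re-checks the window before each table and stops once outside it. */
    static List<String> run(List<TableJob> jobs, Supplier<LocalTime> clock,
                            LocalTime start, LocalTime end) {
        List<TableJob> sorted = new ArrayList<>(jobs);
        sorted.sort(BY_PRIORITY);
        List<String> analyzed = new ArrayList<>();
        for (TableJob job : sorted) {
            if (!inWindow(clock.get(), start, end)) {
                break; // out of the running window: leave the remaining tables for the next round
            }
            analyzed.add(job.table); // placeholder for actually analyzing the table
        }
        return analyzed;
    }

    public static void main(String[] args) {
        List<TableJob> jobs = List.of(
                new TableJob("orders", 0.9, 3600),
                new TableJob("events", 0.2, 600),
                new TableJob("users", 0.2, 7200));
        // Simulated clock: one reading per table check.
        Iterator<LocalTime> ticks = List.of(
                LocalTime.of(1, 0), LocalTime.of(1, 30), LocalTime.of(2, 10)).iterator();
        List<String> done = run(jobs, ticks::next, LocalTime.of(0, 0), LocalTime.of(2, 0));
        System.out.println(done);
    }
}
```

Because the window is only checked between tables, a table that is already being analyzed still runs to completion, which matches the second out-of-scope item below.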

Out of scope:

  1. Priority is not supported when analyzing external tables
  2. If analyzing a single table takes a long time, it cannot be interrupted immediately

Fixes #55447

2. interrupt the job if out of running window

Signed-off-by: Murphy <[email protected]>
SonarQube Cloud Quality Gate failed

Failed conditions
B Maintainability Rating on New Code (required ≥ A)

kangkaisen enabled auto-merge (squash) on January 26, 2025 06:11

[Java-Extensions Incremental Coverage Report]

pass : 0 / 0 (0%)

[FE Incremental Coverage Report]

pass : 38 / 43 (88.37%)

file detail

path                                                          covered_line  new_line  coverage  not_covered_line_detail
🔵 com/starrocks/statistic/ExternalAnalyzeJob.java            1             2         50.00%    [199]
🔵 com/starrocks/statistic/NativeAnalyzeJob.java              1             2         50.00%    [227]
🔵 com/starrocks/statistic/StatisticAutoCollector.java        4             5         80.00%    [110]
🔵 com/starrocks/statistic/StatisticsCollectJobFactory.java   12            14        85.71%    [519, 521]
🔵 com/starrocks/statistic/StatisticsCollectJob.java          20            20        100.00%   []

[BE Incremental Coverage Report]

pass : 0 / 0 (0%)

kangkaisen merged commit 4f8f233 into StarRocks:main on Jan 26, 2025
48 of 50 checks passed

@Mergifyio backport branch-3.4

github-actions bot removed the 3.4 label on Jan 26, 2025

@Mergifyio backport branch-3.3

@Mergifyio backport branch-3.2

mergify bot (Contributor) commented Jan 26, 2025

backport branch-3.4

✅ Backports have been created

mergify bot (Contributor) commented Jan 26, 2025

backport branch-3.3

✅ Backports have been created

mergify bot (Contributor) commented Jan 26, 2025

backport branch-3.2

✅ Backports have been created

mergify bot pushed a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
mergify bot pushed a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
mergify bot pushed a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/ExternalAnalyzeJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/NativeAnalyzeJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticAutoCollector.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
murphyatwork added a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)
Signed-off-by: Murphy <[email protected]>

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
murphyatwork added a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)
Signed-off-by: Murphy <[email protected]>

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
murphyatwork added a commit that referenced this pull request Jan 26, 2025
Signed-off-by: Murphy <[email protected]>
(cherry picked from commit 4f8f233)
Signed-off-by: Murphy <[email protected]>

# Conflicts:
#	fe/fe-core/src/main/java/com/starrocks/statistic/ExternalAnalyzeJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/NativeAnalyzeJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticAutoCollector.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJob.java
#	fe/fe-core/src/main/java/com/starrocks/statistic/StatisticsCollectJobFactory.java
#	fe/fe-core/src/test/java/com/starrocks/statistic/StatisticsCollectJobTest.java
Successfully merging this pull request may close these issues.

Enhance statistics collection for a large amount of tables
3 participants