HIVE-28341: Iceberg: Change Major QB Full Table Compaction to compact… #5328

Open
wants to merge 1 commit into
base: master

@difin (Contributor) commented Jun 27, 2024

… partition by partition

What changes were proposed in this pull request?

Change Major QB Full Table Compaction to compact partition by partition

Why are the changes needed?

Currently, Iceberg major compaction compacts a whole table in one step. If the table is partitioned and holds a lot of data, this operation can take a long time and risks write conflicts at the commit stage. This PR changes it to work partition by partition. In addition, each partition compaction creates one snapshot, instead of the two snapshots (truncate + IOW) currently created when the whole table is compacted in one step.
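To illustrate the difference in snapshot counts, here is a minimal sketch in plain Java. It is a toy model, not the code in this patch: `PartitionCompactionSketch`, `DataFile`, `groupByPartition`, `compactWholeTable`, and `compactByPartition` are hypothetical names, and snapshots are represented as plain strings rather than Iceberg objects.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative model only: none of these types are Hive or Iceberg classes.
final class PartitionCompactionSketch {

    /** Hypothetical data file: the partition it belongs to and its path. */
    static final class DataFile {
        final String partition;
        final String path;

        DataFile(String partition, String path) {
            this.partition = partition;
            this.path = path;
        }
    }

    /** Groups files by partition value, preserving first-seen partition order. */
    static Map<String, List<DataFile>> groupByPartition(List<DataFile> files) {
        Map<String, List<DataFile>> groups = new LinkedHashMap<>();
        for (DataFile file : files) {
            groups.computeIfAbsent(file.partition, key -> new ArrayList<>()).add(file);
        }
        return groups;
    }

    /** Whole-table compaction as the description states it: truncate, then insert overwrite (IOW). */
    static List<String> compactWholeTable(List<DataFile> files) {
        // The file list is not inspected: the whole table is rewritten in one step.
        List<String> snapshots = new ArrayList<>();
        snapshots.add("truncate table");
        snapshots.add("insert overwrite table");
        return snapshots;
    }

    /** Partition-by-partition compaction: one commit, hence one snapshot, per partition. */
    static List<String> compactByPartition(List<DataFile> files) {
        List<String> snapshots = new ArrayList<>();
        for (String partition : groupByPartition(files).keySet()) {
            snapshots.add("rewrite partition " + partition);
        }
        return snapshots;
    }

    public static void main(String[] args) {
        List<DataFile> files = new ArrayList<>();
        files.add(new DataFile("dept=1", "a.orc"));
        files.add(new DataFile("dept=1", "b.orc"));
        files.add(new DataFile("dept=2", "c.orc"));
        System.out.println("whole table: " + compactWholeTable(files));
        System.out.println("per partition: " + compactByPartition(files));
    }
}
```

In this model each per-partition commit touches only that partition's files, which is the property the description relies on to shorten each operation and narrow the window for write conflicts.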

Does this PR introduce any user-facing change?

No

Is the change a dependency upgrade?

No

How was this patch tested?

Added a new query test and updated the existing Iceberg compaction query tests with the new expected results.


sonarcloud bot commented Jun 28, 2024
