-
Notifications
You must be signed in to change notification settings - Fork 461
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child #8450
Conversation
Run Gluten Clickhouse CI on x86 |
Run Gluten Clickhouse CI on x86 |
It better to add some test cases for this scenario? |
Run Gluten ClickHouse CI on ARM |
1 similar comment
Run Gluten ClickHouse CI on ARM |
Run Gluten ClickHouse CI on ARM |
Run Gluten ClickHouse CI on ARM |
Run Gluten ClickHouse CI on ARM |
Run Gluten ClickHouse CI on ARM |
Run Gluten ClickHouse CI on ARM |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
…ild (apache#8450) * Try to remove grouping keys in arguments of count(distinct) * adjust child's output in CHHashAggregateExecTransformer * update * update
What changes were proposed in this pull request?
(Please fill in changes proposed in this fix)
Fixes: #8432
CH doesn' support duplicate columns in a block. Normally, there is no duplicate attributes in the output of an aggregate's child. There is one exception, when AQE is enabled, an shuffle resutl may be reused. If one of the aggregate function is
count(distinct)
and its arguments contain the grouping keys, duplicate attributes would be introduced. And we cannot erase the duplicate attributes by #8164, since the rule cannot be applied.How was this patch tested?
(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
manual tests
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)