Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child #8450

Merged
merged 4 commits into from
Jan 17, 2025

Conversation

lgbo-ustc
Copy link
Contributor

@lgbo-ustc lgbo-ustc commented Jan 7, 2025

What changes were proposed in this pull request?

(Please fill in changes proposed in this fix)

Fixes: #8432

CH doesn' support duplicate columns in a block. Normally, there is no duplicate attributes in the output of an aggregate's child. There is one exception, when AQE is enabled, an shuffle resutl may be reused. If one of the aggregate function is count(distinct) and its arguments contain the grouping keys, duplicate attributes would be introduced. And we cannot erase the duplicate attributes by #8164, since the rule cannot be applied.

How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)

manual tests

(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

Copy link

github-actions bot commented Jan 7, 2025

#8432

Copy link

github-actions bot commented Jan 7, 2025

Run Gluten Clickhouse CI on x86

@lgbo-ustc lgbo-ustc changed the title [GLUTEN-8432][CH]Remove duplicated output attributes of aggregate's child [GLUTEN-8432][CH]Remove duplicate output attributes of aggregate's child Jan 7, 2025
Copy link

github-actions bot commented Jan 7, 2025

Run Gluten Clickhouse CI on x86

@zzcclp
Copy link
Contributor

zzcclp commented Jan 7, 2025

It better to add some test cases for this scenario?

Copy link

Run Gluten ClickHouse CI on ARM

1 similar comment
Copy link

Run Gluten ClickHouse CI on ARM

Copy link

Run Gluten ClickHouse CI on ARM

Copy link

Run Gluten ClickHouse CI on ARM

Copy link

Run Gluten ClickHouse CI on ARM

Copy link

Run Gluten ClickHouse CI on ARM

Copy link

Run Gluten ClickHouse CI on ARM

Copy link
Contributor

@zzcclp zzcclp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@lgbo-ustc lgbo-ustc merged commit 17d017e into apache:main Jan 17, 2025
6 checks passed
baibaichen pushed a commit to baibaichen/gluten that referenced this pull request Feb 1, 2025
…ild (apache#8450)

* Try to remove grouping keys in arguments of count(distinct)

* adjust child's output in CHHashAggregateExecTransformer

* update

* update
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[CH] org.apache.gluten.exception.GlutenException: Not found column 8Ð'ï
2 participants