August 29, 2025: Weekly Status Update in Gluten #10586
GlutenPerfBot
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This weekly update is generated by LLMs. You're welcome to join our Github for in-depth discussions.
Overall Activity Summary
This week in the Gluten community has been highly productive, with a strong focus on advancing Flink integration, enhancing data lake write capabilities, and performing significant core code refactoring. Development on the Velox backend remains very active, with major strides in Iceberg support and shuffle performance. We also saw important build system improvements and continued efforts to simplify the codebase for better long-term maintainability.
Key Ongoing Projects
Several key initiatives are driving the project forward, thanks to our dedicated contributors:
--add-opens
options to MAVEN_OPTS for Java 17 compatibility #10572 by @KevinyhZou.gluten-it
integration testing framework is being improved by @zhztheplayer, with work to support Delta tables in benchmarks ([VL] Gluten-it: Support using Delta tables in TPC-H and TPC-DS benchmarks #10562) and clean up Maven dependencies ([VL] Gluten-it: Clean up Maven dependency relationships #10563).Priority Items
We encourage the community to review and provide feedback on these important pull requests:
ColumnarShuffleReader
to boost shuffle performance. Feedback on this architectural change is highly valuable.CharVarcharCodegenUtils
has seen extensive discussion and could use a final review to move it toward merging.Notable Discussions
Several important conversations are shaping the future of Gluten:
Emerging Trends
Based on this week's activity, we've identified several key trends:
Good First Issues
Looking to make your first contribution to Gluten? These issues are well-defined and a great way to get started:
date_from_unix_date
function in the ClickHouse backend.split_part
function for the ClickHouse backend.SparkPartitionID
function in the ClickHouse backend.MakeYMInterval
expression for the ClickHouse backend.These issues are excellent entry points for contributors with some C++ and Scala/Java experience. They involve implementing a single, well-scoped function, allowing you to get familiar with Gluten's expression framework and contribution process without needing to understand the entire system. Welcome to the community
Beta Was this translation helpful? Give feedback.
All reactions