Adding Gluten (on Velox) to ClickBench #10465
Replies: 2 comments
-
@Iskander14yo, |
Beta Was this translation helpful? Give feedback.
-
I once looked at the ClickBench workload and yes Gluten had some issues working with that dataset. ClickBench doesn't check the correctness of query results so we need to be cautious on the result we get now :) That said, the work so far is much appreciated. And we'll look to improve the results in the future. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi!
I recently opened a ClickHouse/ClickBench#575 to add Gluten (on Velox) to ClickBench - one of the popular benchmarks for analytical workloads.
I thought it might be useful to let Velox/Gluten community know about this PR. And while results yet to be measured I'd also appreciate any feedback on whether my configuration and setup are correct.
My notes (besides those in README):
I couldn’t get Gluten to run with pyspark==3.5.5 as the docs recommend; I hit warnings/errors (referenced in the README).
I didn’t find a reliable, single grep-able pattern to detect operators/functions where Gluten falls back to Spark because warnings vary significantly. For example, below are logs for a last query:
Beta Was this translation helpful? Give feedback.
All reactions