Rapids integration #278
Replies: 5 comments 1 reply
-
GPU support seems to be added to Spark 3. "NVIDIA has also created a new Spark shuffle implementation that optimizes the data transfer between Spark processes. This shuffle implementation is built upon GPU-accelerated communication libraries, including UCX, RDMA, and NCCL." |
Beta Was this translation helpful? Give feedback.
-
They are also using CuDF at GPUs: |
Beta Was this translation helpful? Give feedback.
-
they seem to be implemented shuffling with scala at spark-rapids: |
Beta Was this translation helpful? Give feedback.
-
The BLINK paper is about heterogeneous environments where there can be slow and fast connections between different GPUs. So it doesn't have a connection with the task-based execution. https://proceedings.mlsys.org/paper/2020/file/43ec517d68b6edd3015b3edc9a11367b-Paper.pdf |
Beta Was this translation helpful? Give feedback.
-
Attaching the file we drew in the discussion |
Beta Was this translation helpful? Give feedback.
-
We discussed about two approaches for integrating with Rapids
Beta Was this translation helpful? Give feedback.
All reactions