Set correct threads_per_warp to 64 for AMD GPU #170

yaomingamd · 2025-04-14T16:38:39Z

In XLA 0.5.0, some algorithms depend on a warp size of 32, so threads_per_warp had been manually set to 32. This PR sets threads_per_warp to the correct value of 64 for current AMD GPUs and, at the same time, fixes the unit test failures and numerical issues caused by this change. I ran the JAX unit tests; they show far fewer failures and numerical issues than the original base branch rocm-jaxlib-v0.5.0. For example, pytest tests/pallas passes all unit tests:
=============== 1907 passed, 3307 skipped in 1281.18s (0:21:21) ================
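
To illustrate why an algorithm written for a warp size of 32 can produce numerical errors once the hardware warp holds 64 threads, here is a minimal CPU-side sketch. It is not code from XLA or from this PR: it assumes a generic shuffle-down tree reduction, and the function name SimulatedWarpReduceSum and its parameters are hypothetical. When the starting offset is derived from an assumed width of 32, lane 0 only ever accumulates the first 32 lanes of a 64-lane warp.

```cpp
#include <cassert>
#include <vector>

// CPU simulation of a shuffle-down tree reduction across one warp.
// `lanes` holds one value per hardware lane; `assumed_warp_size` is the
// width the algorithm was written for (hypothetical, for illustration).
// Returns the value lane 0 holds after the reduction.
inline int SimulatedWarpReduceSum(std::vector<int> lanes,
                                  int assumed_warp_size) {
  assert(assumed_warp_size > 0 &&
         (assumed_warp_size & (assumed_warp_size - 1)) == 0);
  const int n = static_cast<int>(lanes.size());
  for (int offset = assumed_warp_size / 2; offset > 0; offset /= 2) {
    std::vector<int> next = lanes;
    for (int lane = 0; lane < n; ++lane) {
      // Mimic a shuffle-down: lane i adds the value held by lane i + offset.
      // Lanes whose source would fall outside the warp keep their value.
      if (lane + offset < n) {
        next[lane] = lanes[lane] + lanes[lane + offset];
      }
    }
    lanes = next;
  }
  return lanes[0];
}
```

With 64 lanes that each hold 1, SimulatedWarpReduceSum(lanes, 64) covers all 64 lanes, while SimulatedWarpReduceSum(lanes, 32) stops at half of them. That is the kind of silent miscount a hard-coded width can cause.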

TODO: need to find a way to set it to 32 for CUDA/NVIDIA GPU

TODO: need to set warp_size to 32 for CUDA/NVIDIA GPU


yaomingamd added 9 commits April 13, 2025 20:39

Get correct ThreadsPerWarp for AMD GPU by call HIP API

4069dd5

set correct threads_per_warp 64 for ROCm

cafd14d

set correct threads_per_warp 64 for ROCm.

5646a91

TODO: need to find way setit to 32 for CUDA/NVIDIA GPU

set correct warp_size 64 for ROCm

dfce4f0

set correct warp_size 64 for AMD GPU

51086ec

TODO: need to set warp_size 32 for CUDA//NVIDIA GPU

consistent with warp_size 64, 2 warps will has same threads as 4 warps

b8295bd

current reduction algorithm only fit for warp_size 32, so that manua…

405067a

…lly set warp_size =32 for AMD GPU it is a temporary solution

temporary solution.

8d61ea3

fix typo

534511e
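
The warp-count adjustment described in commit b8295bd comes down to simple arithmetic: 2 warps of 64 threads cover the same 128 threads as 4 warps of 32. The sketch below only illustrates that arithmetic. The helper EquivalentNumWarps and the constant names are hypothetical and are not taken from the PR's diff.

```cpp
#include <cassert>

// Warp widths discussed in this PR: 32 on NVIDIA GPUs, 64 on the AMD GPUs
// targeted here. The constant names are illustrative only.
constexpr int kWarpSize32 = 32;
constexpr int kWarpSize64 = 64;

// Hypothetical helper: the number of warps of width `to_warp_size` needed
// to cover the threads of `num_warps` warps of width `from_warp_size`.
// Rounds up so that no thread is dropped.
constexpr int EquivalentNumWarps(int num_warps, int from_warp_size,
                                 int to_warp_size) {
  return (num_warps * from_warp_size + to_warp_size - 1) / to_warp_size;
}

// 4 warps x 32 threads and 2 warps x 64 threads are both 128 threads.
static_assert(EquivalentNumWarps(4, kWarpSize32, kWarpSize64) == 2,
              "4 warps of 32 threads map to 2 warps of 64 threads");
```

Keeping the total thread count fixed rather than the warp count means a launch configuration tuned for a warp size of 32 keeps the same block size when the warp size becomes 64.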

i-chaochen mentioned this pull request Apr 14, 2025

Rocm jaxlib v0.5.0 warpsize #169

Open

i-chaochen requested review from zoranjovanovic-ns and draganmladjenovic April 14, 2025 20:38
