[webgpu] Fix MatMulNBits prefill shader synchronization #23663

daijh · 2025-02-12T08:14:28Z

Description

This commit adds a workgroupBarrier to the MatMulNBits prefill shader to ensure proper synchronization between workgroup invocations, resolving a potential race condition.

Motivation and Context

See above.

This commit adds a `workgroupBarrier` to the MatMulNBits prefill shader to ensure proper synchronization between workgroup invocations, resolving a potential race condition.

daijh · 2025-02-12T08:15:45Z

@qjia7 @jchen10

daijh · 2025-02-12T08:16:01Z

@guschmue @fs-eire, please take a look.

guschmue · 2025-02-13T18:07:46Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

guschmue · 2025-02-13T18:07:53Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

guschmue · 2025-02-13T18:08:00Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-02-13T18:08:03Z

Azure Pipelines successfully started running 2 pipeline(s).

guschmue · 2025-02-13T18:08:06Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI

azure-pipelines · 2025-02-13T18:08:19Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-02-13T18:08:23Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-02-13T18:08:33Z

Azure Pipelines successfully started running 9 pipeline(s).

qjia7

LGTM, thanks!

guschmue

great catch!

[webgpu] Fix MatMulNBits prefill shader synchronization

5e00fbf

This commit adds a `workgroupBarrier` to the MatMulNBits prefill shader to ensure proper synchronization between workgroup invocations, resolving a potential race condition.

guschmue added the ep:WebGPU ort-web webgpu provider label Feb 13, 2025

qjia7 approved these changes Feb 14, 2025

View reviewed changes

guschmue approved these changes Feb 14, 2025

View reviewed changes

guschmue merged commit d07ea64 into microsoft:main Feb 14, 2025
76 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[webgpu] Fix MatMulNBits prefill shader synchronization #23663

[webgpu] Fix MatMulNBits prefill shader synchronization #23663

daijh commented Feb 12, 2025

daijh commented Feb 12, 2025

daijh commented Feb 12, 2025

guschmue commented Feb 13, 2025

guschmue commented Feb 13, 2025

guschmue commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

guschmue commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

qjia7 left a comment

guschmue left a comment

[webgpu] Fix MatMulNBits prefill shader synchronization #23663

[webgpu] Fix MatMulNBits prefill shader synchronization #23663

Conversation

daijh commented Feb 12, 2025

Description

Motivation and Context

daijh commented Feb 12, 2025

daijh commented Feb 12, 2025

guschmue commented Feb 13, 2025

guschmue commented Feb 13, 2025

guschmue commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

guschmue commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

azure-pipelines bot commented Feb 13, 2025

qjia7 left a comment

Choose a reason for hiding this comment

guschmue left a comment

Choose a reason for hiding this comment