[Proton][Dialect] Add Proton Device Memory Buffer Init and Allocate Pass #5606

CRobeck · 2025-01-14T17:53:05Z

Add the initialization and allocation of the Proton dialect device buffer, which can be used in place of the shared memory buffer. The device buffer is a module-local, zero-initialized stack buffer in address space 1.

third_party/amd/backend/compiler.py

lib/Conversion/TritonToTritonGPU/TritonToTritonGPUPass.cpp

third_party/amd/lib/TritonAMDGPUToLLVM/TritonGPUToLLVM.cpp

third_party/nvidia/backend/compiler.py

third_party/nvidia/lib/TritonNVIDIAGPUToLLVM/TritonGPUToLLVM.cpp

third_party/proton/dialect/include/Dialect/Proton/IR/ProtonOps.td

fywkevin · 2025-01-25T19:39:13Z

third_party/proton/dialect/include/TritonProtonToLLVM/PatternTritonProtonOpToLLVM.h

@@ -10,6 +10,11 @@ void populateRecordOpToLLVMPattern(LLVMTypeConverter &typeConverter,
                                   RewritePatternSet &patterns,
                                   const TargetInfoBase &targetInfo,
                                   PatternBenefit benefit);
+void populateInitDeviceBufferOpToLLVMPattern(LLVMTypeConverter &typeConverter,


Once we have our own LLVM lowering conversion pass, let's move this to proton/dialect/lib/...

third_party/proton/dialect/lib/TritonProtonToLLVM/InitDeviceBufferOpToLLVM.cpp

fywkevin · 2025-01-25T19:43:08Z

third_party/proton/test/test_device_buffer.py

+
+
+@triton.jit
+def softmax_kernel(output_ptr, input_ptr, input_row_stride, output_row_stride, n_rows, n_cols, BLOCK_SIZE: tl.constexpr,


For end-to-end testing, you could manually construct a TTGIR module with the buffer_alloc_op, read from and write to the buffer, and finally write it back to gmem to check its value in Python.

Right, I think we'll want to go through this in another PR and add all the end-to-end testing at once to make sure we have the code coverage we want.
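As a rough illustration of the last step in the flow suggested above (checking the written-back buffer in Python), here is a minimal sketch of the host-side comparison only. It is not taken from this PR: `first_mismatch`, `BUFFER_WORDS`, and `MARKER` are hypothetical names, the buffer size and marker value are made up, and the kernel launch and device-to-host copy are replaced by a plain Python list.

```python
from typing import Sequence


def first_mismatch(readback: Sequence[int], expected: Sequence[int]) -> int:
    """Return the index of the first differing word, or -1 if all words match."""
    if len(readback) != len(expected):
        raise ValueError(
            f"length mismatch: got {len(readback)}, want {len(expected)}"
        )
    for index, (got, want) in enumerate(zip(readback, expected)):
        if got != want:
            return index
    return -1


# Hypothetical scenario (an assumption, not from the PR): an 8-word buffer that
# starts zero-initialized, where the kernel stores a marker into word 3 before
# copying the whole buffer back to global memory.
BUFFER_WORDS = 8
MARKER = 0xABCD

expected = [0] * BUFFER_WORDS
expected[3] = MARKER

# Stand-in for the host copy of the data the kernel wrote back to gmem.
readback = list(expected)

assert first_mismatch(readback, expected) == -1
```

Returning the index of the first mismatch rather than a bare boolean makes a failing test easier to diagnose, since it points at the word that was not zero-initialized or not written as expected.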


CRobeck added 20 commits January 10, 2025 22:02

temp

afab47b

temp

05adc3e

update

0b6947c

update

0a286b2

update

2ce6944

update

66b78f5

temp

1320379

temp

18e05e3

temp

e188eb8

temp

b22610d

clean up

6c1b80e

clean up

6cc88d0

temp

6c2f237

temp

01be85e

temp

6fb8b59

temp

5e0b4ac

temp

3642a3b

temp

93f1ae1

update

1f3fa88

update

c4b173a

CRobeck changed the base branch from main to proton-dev January 23, 2025 03:55

CRobeck changed the title ~~[WIP][Proton][Dialect] Add Initial Infrastructure For Proton Shared Memory Buffer~~ [Proton][Dialect] Add Infrastructure For Proton Device Memory Buffer Jan 23, 2025

update

73d584d

CRobeck force-pushed the proton_buffer branch from 708cb2c to 73d584d Compare January 23, 2025 21:58

CRobeck added 3 commits January 23, 2025 22:21

update

303a53a

update

755e30d

update

51754f9

CRobeck marked this pull request as ready for review January 23, 2025 22:26

CRobeck requested review from antiagainst and zhanglx13 as code owners January 23, 2025 22:26

CRobeck requested a review from ptillet as a code owner January 23, 2025 22:26

CRobeck added 2 commits January 23, 2025 18:05

Merge branch 'proton-dev' into proton_buffer

1f314eb

update

efd3faa

CRobeck mentioned this pull request Jan 24, 2025

[Dialect] Implement Proton device buffer init and alloc ops #5689

Open

CRobeck added 3 commits January 24, 2025 02:21

update

97fc32a

update

e86281e

update

b6c0b85

CRobeck changed the title ~~[Proton][Dialect] Add Infrastructure For Proton Device Memory Buffer~~ [Proton][Dialect] Add Infrastructure For Proton Device Memory Buffer Pass Jan 24, 2025

CRobeck changed the title ~~[Proton][Dialect] Add Infrastructure For Proton Device Memory Buffer Pass~~ [Proton][Dialect] Add Proton Device Memory Buffer Pass Jan 24, 2025

CRobeck changed the title ~~[Proton][Dialect] Add Proton Device Memory Buffer Pass~~ [Proton][Dialect] Add Proton Device Memory Buffer Init and Allocate Pass Jan 25, 2025

fywkevin self-assigned this Jan 25, 2025

fywkevin requested changes Jan 25, 2025


CRobeck added 4 commits January 26, 2025 01:12

update

f5c8fe2

Merge branch 'proton_buffer' of github.com:CRobeck/triton into proton…

892f8ec

…_buffer

update

7c64da9

update naming

7be389b

CRobeck force-pushed the proton_buffer branch 3 times, most recently from 1c9d9ca to 7be389b Compare January 29, 2025 14:23

CRobeck added 6 commits January 29, 2025 09:29

Merge branch 'proton-dev' into proton_buffer

00d7fb7

replace TritonGPUToLLVM/Utility.h macros with TritonLLVMOpBuilder

b298ecd

update ops

9f97404

remove pass from other backends

e56471f

update ops

4472d8f

update

ca3b294

CRobeck requested a review from fywkevin January 29, 2025 16:39
