[TKW] Work on buffer ops #492

Hardcode84 · 2025-02-11T19:06:31Z

Split read/write ops indexing on thread-dependent and thread-independent.
Use symbolic vals for strides instead of extracting them from memref.
For now only replace gather/scatter ops with buffer ops.

harsh-nod

Some minor requests for comments and refactors, but overall looks good!

harsh-nod · 2025-02-13T18:11:00Z

iree/turbine/kernel/wave/codegen/read_write.py

@@ -75,15 +76,29 @@ def _get_start_indices(
    return start_indices


+def _split_index(src: IndexExpr) -> tuple[IndexExpr, IndexExpr]:


Could you add a comment explaining why it is not sufficient to just substitute in subs_th , why we need to compute the difference and the need for safe_subs as well?

harsh-nod · 2025-02-13T19:00:49Z

iree/turbine/kernel/wave/codegen/read_write.py

+        IndexingContext.current(), symbolic_shape, allow_mixed_shapes=True
+    )
+    if (
+        emitter.params.get("use_buffer_load_ops", False)


Can you refactor this into a variable/function so something like

use_buffer_ops = emitter.params.get("use_buffer_load_ops", False) has_integer_strides = all(isinstance(s, int) for s in strides) can_emit_buffer_ops = use_buffer_ops and has_integer_strides and is_gather if can_emit_buffer_ops: ...

harsh-nod · 2025-02-13T19:31:22Z

iree/turbine/kernel/wave/codegen/read_write.py

    elements_per_thread: int,
    mask: Optional[Value],
    offsets_vec: Optional[Value],
 ) -> Value:
    if mask is None and offsets_vec is None:
        return vector_d.load(vector_type, mem, start_indices)

+    is_gather = offsets_vec is not None


Don't we also need to check the address space since we can have gathers from shared memory? In this case, we cannot use buffer ops right? (Also same for scatters from shared memory)

Signed-off-by: Ivan Butygin <[email protected]>

This reverts commit cfef902. Signed-off-by: Ivan Butygin <[email protected]>

Signed-off-by: Ivan Butygin <[email protected]>

harsh-nod

lgtm!

* Split read/write ops indexing on thread-dependent and thread-independent. * Use symbolic vals for strides instead of extracting them from memref. * For now only replace gather/scatter ops with buffer ops. --------- Signed-off-by: Ivan Butygin <[email protected]> Signed-off-by: xintin <[email protected]>

Hardcode84 requested review from harsh-nod and raikonenfnu February 11, 2025 19:06

Hardcode84 force-pushed the buffer-ops-indexing branch from 6355a62 to eac4567 Compare February 12, 2025 01:34

Hardcode84 marked this pull request as ready for review February 12, 2025 19:05

harsh-nod requested changes Feb 13, 2025

View reviewed changes

harsh-nod reviewed Feb 13, 2025

View reviewed changes

Hardcode84 added 16 commits February 14, 2025 14:36

split index

9964644

Signed-off-by: Ivan Butygin <[email protected]>

split index

cac0cdb

Signed-off-by: Ivan Butygin <[email protected]>

safe subs

b21873e

Signed-off-by: Ivan Butygin <[email protected]>

extend att test

d17fda1

Signed-off-by: Ivan Butygin <[email protected]>

strides

4ff5cd5

Signed-off-by: Ivan Butygin <[email protected]>

simplify

d126508

Signed-off-by: Ivan Butygin <[email protected]>

strides

cd4c9b3

Signed-off-by: Ivan Butygin <[email protected]>

dump

05140a4

Signed-off-by: Ivan Butygin <[email protected]>

debug

a908e34

Signed-off-by: Ivan Butygin <[email protected]>

Revert "debug"

8fc57b4

This reverts commit cfef902. Signed-off-by: Ivan Butygin <[email protected]>

heuristic

d85c091

Signed-off-by: Ivan Butygin <[email protected]>

remove debug code

7e1f951

Signed-off-by: Ivan Butygin <[email protected]>

fix offset calculation

83b13ab

Signed-off-by: Ivan Butygin <[email protected]>

fix lit

2d9d3aa

Signed-off-by: Ivan Butygin <[email protected]>

refac

2007ad2

Signed-off-by: Ivan Butygin <[email protected]>

check global mem

f0cacae

Signed-off-by: Ivan Butygin <[email protected]>

Hardcode84 force-pushed the buffer-ops-indexing branch from 670e275 to f0cacae Compare February 14, 2025 14:04

comment

6c9c1bf

Signed-off-by: Ivan Butygin <[email protected]>

harsh-nod approved these changes Feb 14, 2025

View reviewed changes

Hardcode84 merged commit 3ccd679 into iree-org:main Feb 14, 2025
10 checks passed

Hardcode84 deleted the buffer-ops-indexing branch February 14, 2025 17:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TKW] Work on buffer ops #492

[TKW] Work on buffer ops #492

Hardcode84 commented Feb 11, 2025 •

edited

Loading

harsh-nod left a comment

harsh-nod Feb 13, 2025

Hardcode84 Feb 14, 2025

harsh-nod Feb 14, 2025

harsh-nod Feb 13, 2025

Hardcode84 Feb 14, 2025

harsh-nod Feb 13, 2025

Hardcode84 Feb 14, 2025

harsh-nod left a comment

		@@ -75,15 +76,29 @@ def _get_start_indices(
		return start_indices


		def _split_index(src: IndexExpr) -> tuple[IndexExpr, IndexExpr]:

[TKW] Work on buffer ops #492

[TKW] Work on buffer ops #492

Conversation

Hardcode84 commented Feb 11, 2025 • edited Loading

harsh-nod left a comment

Choose a reason for hiding this comment

harsh-nod Feb 13, 2025

Choose a reason for hiding this comment

Hardcode84 Feb 14, 2025

Choose a reason for hiding this comment

harsh-nod Feb 14, 2025

Choose a reason for hiding this comment

harsh-nod Feb 13, 2025

Choose a reason for hiding this comment

Hardcode84 Feb 14, 2025

Choose a reason for hiding this comment

harsh-nod Feb 13, 2025

Choose a reason for hiding this comment

Hardcode84 Feb 14, 2025

Choose a reason for hiding this comment

harsh-nod left a comment

Choose a reason for hiding this comment

Hardcode84 commented Feb 11, 2025 •

edited

Loading