Data-Tiling: Migrate round_dims_to to iteration_sizes after encoding specialization is on by default #19897
Labels: codegen, enhancement ➕, good first issue 🌱
The `round_dims_to` field in the encoding was useful for the data-tiling late-materialization path because it provides a hint to both the host and the device. The host side can allocate the storage buffer based on the hint, and the device gets the limit of the padding space (otherwise, the device could access the buffer out of bounds). However, it is not the ideal solution, because the device could request larger tile sizes in some cases (e.g., matvec), which leads to an inefficient strategy. Also, the host could allocate a huge buffer that is not fully used by the device. Sometimes the device just needs a little more storage, rather than every dimension being unconditionally padded to a large size.
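To make the over-allocation concrete, here is an illustrative C++ sketch (not IREE's actual API; the helper names and the 16-element padding multiple are assumptions used only to show the arithmetic) of how a host would size a buffer from such a hint:

```cpp
// Illustrative sketch: how a `round_dims_to`-style hint inflates the
// host-side allocation. With round_dims_to = [16, 16], the 2000x1 RHS of
// a matvec is padded to 2000x16, so the host allocates 16x more storage
// than the device actually needs.
#include <cstdint>
#include <cstdio>
#include <vector>

int64_t roundUp(int64_t size, int64_t multiple) {
  return ((size + multiple - 1) / multiple) * multiple;
}

int64_t paddedNumElements(const std::vector<int64_t> &shape,
                          const std::vector<int64_t> &roundDimsTo) {
  int64_t numElements = 1;
  for (size_t i = 0; i < shape.size(); ++i)
    numElements *= roundUp(shape[i], roundDimsTo[i]);
  return numElements;
}

int main() {
  // Matvec operand of shape 2000x1, padded per the hint.
  std::printf("padded: %lld, actual: %lld\n",
              (long long)paddedNumElements({2000, 1}, {16, 16}),
              (long long)(2000 * 1));  // padded: 32000, actual: 2000
}
```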
Today, we have encoding specialization, which is not yet on by default. The encoding implements the interface methods, so the request can be propagated from the executable target to the host; i.e., the host can allocate the exact storage buffer for the encoded tensor. Once we turn the pass on by default, we no longer need the `round_dims_to` field in the encoding. The next question is what information we want to encode in the encodings. I think the answer is the iteration size of each dimension. On CPU, we can generate more efficient code if we recognize that there is a narrow matrix (e.g., matvec/vecmat/etc.). Today, we abuse the `round_dims_to` field to provide such information (which is bad). If we are going to deprecate the `round_dims_to` field, we'll need to introduce an `iteration_sizes` field to carry the information.

Note: this task depends on the encoding specialization pass. We should implement it after the encoding specialization pass is on by default.
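For illustration, a backend could consume such a field roughly as follows. This is a hypothetical C++ sketch; the names, the narrow-dimension check, and the tile sizes are assumptions for this issue, not the actual implementation:

```cpp
// Hypothetical sketch of how an `iteration_sizes` field could drive tile
// selection: shrink the tile along a narrow dimension (e.g. N == 1 for a
// matvec) instead of padding that dimension up to the full tile size.
#include <array>
#include <cstdint>
#include <cstdio>

constexpr int64_t kDynamic = -1;  // placeholder for an unknown iteration size

struct TileSizes { int64_t m, n, k; };

TileSizes chooseMatmulTiles(std::array<int64_t, 3> iterationSizes) {
  TileSizes tiles{16, 16, 16};
  auto [m, n, k] = iterationSizes;
  if (m != kDynamic && m < tiles.m) tiles.m = m;  // vecmat: M == 1
  if (n != kDynamic && n < tiles.n) tiles.n = n;  // matvec: N == 1
  (void)k;  // K is left untouched in this sketch
  return tiles;
}

int main() {
  TileSizes t = chooseMatmulTiles({2000, 1, 512});  // a matvec
  std::printf("m=%lld n=%lld k=%lld\n",
              (long long)t.m, (long long)t.n, (long long)t.k);  // m=16 n=1 k=16
}
```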