RFE: Take contiguity caching into nvFuser #4043

csarofeen · 2025-03-07T12:46:14Z

At the moment the caching of contiguity changes is done in python which has some latency associated with it. It's not as bad with Ivan's PR: Lightning-AI/lightning-thunder@56b922a

However, this seems to be something that nvFuser should take ownership of in its caching system. Today we require a new fusion definition on any contiguity changes, as it can be a valuable optimization within a kernel. One idea is to put this aspect in the concretization pass in nvFuser if the latency of that would be tolerable.

This would prevent having any caching logic within python for nvfuser execution.

Original discussion in:
Lightning-AI/lightning-thunder#1840

The text was updated successfully, but these errors were encountered:

csarofeen added the enhancement New feature or request label Mar 7, 2025

csarofeen changed the title ~~Take contiguity caching into nvFuser~~ RFE: Take contiguity caching into nvFuser Mar 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFE: Take contiguity caching into nvFuser #4043

RFE: Take contiguity caching into nvFuser #4043

csarofeen commented Mar 7, 2025

RFE: Take contiguity caching into nvFuser #4043

RFE: Take contiguity caching into nvFuser #4043

Comments

csarofeen commented Mar 7, 2025