-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extend IdModel to map DIDs for certain patterns. #3987
Comments
In this case, IIUC, what we want is to let IdModel recognize, for a given iter domain of
and
then:
Am I right that |
I don't think so. We only need |
If |
According to https://github.com/NVIDIA/Fuser/blob/main/doc/math/integer-division.md Theorem 2.11:
is unconditionally true if both |
BTW, I really hope we formally write down a mathematical proof in this section: |
One important mathematical difference of outer split vs inner split I would like to highlight is: |
The following patterns came out from DID loop split (#2563).
Case 1: split reshape (before SdpwFwd)
We may want to do inner split by
d
at some point. See case 3. I don't think whether inner or outer will make a whole lot of difference for IdModel.Case 2: merge reshape (after SdpaFwd)
Case 3: slice (used after the QKV linear in GPT)
There are several ways to represent that slice as mentioned in http://nv/ezS. One of the ways that I think is promising is:
Case 4: cat (backprop of the above slice)
In all above cases, d, a, h* are static.
cc @naoyam who requested me to write this down
The text was updated successfully, but these errors were encountered: