Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[backend] NFC: Split architecture dependant and independant parts of FMA dot conversion #5655

Merged

Conversation

binarman
Copy link
Contributor

This PR splits FMA dot conversion from Triton GPU to LLVM in two parts:

  • Common code with iteration across M/N dim
  • Architecture dependant scalar multiplication of vectos across K dim

This PR do not introduce any test, because it does not fix any bugs or introduce new functionality, it refactors code to simplify later changes: #4594

This PR splits FMA dot conversion from Triton GPU to LLVM in two parts:
- Common code with iteration across M/N dim
- Architecture dependant scalar multiplication of vectos across K dim
@antiagainst antiagainst changed the title Split architecture dependant and independant parts of FMA dot conversion [backend] NFC: Split architecture dependant and independant parts of FMA dot conversion Jan 25, 2025
@antiagainst antiagainst marked this pull request as ready for review January 25, 2025 01:22
@antiagainst antiagainst requested a review from ptillet as a code owner January 25, 2025 01:22
- remove redundant namespaces
- improve comments
- use more general op builder class
@binarman binarman requested a review from zhanglx13 as a code owner January 29, 2025 18:11
@antiagainst antiagainst merged commit 0753712 into triton-lang:main Jan 29, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants