Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimized PTX IntrinsicMath implementation to use LibDevice. #1151

Closed
wants to merge 10 commits into from

Conversation

MoFtZ
Copy link
Collaborator

@MoFtZ MoFtZ commented Jan 17, 2024

Depends on #1148.

Restructured ILGPU and ILGPU.Algorithms, moving the IntrinsicMath implementations into ILGPU itself. XMath has other functions that are not part of IntrisicMath, so it will stay as-is for now.

CLMath in ILGPU.Algorithms only needed to support Rcp and Log(x,y). These have been moved into ILGPU, and CLMath has been removed.

PTXMath in ILGPU.Algorithms provided a number of math functions using Cordic implemenentations. Now that pre-generated LibDevice is available in ILGPU, switched all the IntrinsicMath functions to call LibDevice for Cuda GPUs. The pre-generated LibDevice PTX code only works on >= SM_60, so the Cordic functions in ILGPU.Algorithms have been modified to only register for < SM_60. Otherwise, they are no longer used.

Unit Tests for IntrinsicMath have not been implemented. These are currently running via the ILGPU.Algorithms unit tests.

@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch from 37af1f7 to 7f5b781 Compare January 21, 2024 12:20
@MoFtZ MoFtZ marked this pull request as draft January 21, 2024 20:37
@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch 2 times, most recently from 12de7ec to 38184e8 Compare January 22, 2024 09:55
@MoFtZ MoFtZ marked this pull request as ready for review January 22, 2024 10:15
@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch from 38184e8 to bb287e9 Compare January 23, 2024 10:20
@MoFtZ MoFtZ marked this pull request as draft January 23, 2024 10:35
@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch from bb287e9 to c3a1db2 Compare January 23, 2024 10:49
@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch from dc44edc to 1f16948 Compare January 31, 2024 10:28
@MoFtZ MoFtZ force-pushed the feature/intrinsic-math branch from 1f16948 to e87d746 Compare January 31, 2024 11:19
@MoFtZ
Copy link
Collaborator Author

MoFtZ commented Jan 31, 2024

Abandoned. To be combined with #1148.

@MoFtZ MoFtZ closed this Jan 31, 2024
@MoFtZ MoFtZ deleted the feature/intrinsic-math branch February 1, 2024 01:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant