GPU implementation of hamming distance #541

Open
wants to merge 160 commits into
base: main

felixpetschko
Collaborator

@felixpetschko felixpetschko commented Aug 19, 2024

Hamming distance implementation with numba.cuda for GPU support.
This is built on top of the changes in Hamming distance implementation with Numba #512

@grst:

Close #479

felixpetschko and others added 30 commits April 29, 2024 13:28
@Intron7
Member

Intron7 commented Feb 14, 2025

For a large Matrix in the Hamming-Kernel have you checked that int is enough to cover the indexing of data?

@felixpetschko
Collaborator Author

> For a large Matrix in the Hamming-Kernel have you checked that int is enough to cover the indexing of data?

That's a good point, but I think int should be enough for the kernels for our purposes. The most problematic part is probably the indptr array of the CSR matrices, so I added some int32 overflow checks. If such an overflow occurs, the user can retry with a smaller cutoff so that fewer result values need to be stored in the CSR matrix.
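
As a rough illustration of the kind of check described above (a minimal sketch, not the code in this PR): the helper name `build_indptr_checked` and the idea of starting from per-row result counts are assumptions made for the example. The row pointers are accumulated in int64 first, so the total can be compared against the int32 limit before anything is narrowed.

```python
import numpy as np

# Largest value representable in a signed 32-bit integer.
INT32_MAX = np.iinfo(np.int32).max


def build_indptr_checked(row_counts):
    """Hypothetical helper: build a CSR indptr from per-row result counts.

    Accumulates in int64 and raises if the total number of stored values
    would not fit into an int32 indptr.
    """
    counts = np.asarray(row_counts, dtype=np.int64)
    indptr = np.zeros(counts.size + 1, dtype=np.int64)
    # indptr[i + 1] is the number of stored values in rows 0..i.
    np.cumsum(counts, out=indptr[1:])
    if indptr[-1] > INT32_MAX:
        raise OverflowError(
            f"{indptr[-1]} result values exceed the int32 limit of {INT32_MAX}; "
            "a smaller cutoff would produce fewer values to store"
        )
    return indptr.astype(np.int32)
```

Doing the cumulative sum in int64 is what makes the check reliable: an int32 cumulative sum would wrap around silently and could pass a naive comparison.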

Labels
run-gpu-ci runs GPU CI
Projects
Status: In progress
Development

Successfully merging this pull request may close these issues.

Large dataset tutorial
3 participants