Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

latest changes on pm calculation #8

Open
wants to merge 128 commits into
base: master
Choose a base branch
from
Open

latest changes on pm calculation #8

wants to merge 128 commits into from

Conversation

nianyic7
Copy link
Collaborator

@nianyic7 nianyic7 commented Nov 4, 2024

Mark the changes to petapm. Will keep developing things and eventually create a working version which we can merge into master.

nianyic7 and others added 30 commits October 7, 2024 14:15
pfft to cufftMp first attempt
The standard does not guarantee that the merges are atomic, it seems.
This new code seems to improve memory safety even on intel, so let's
make it unconditional. It will make the sort a bit slower but we can
live with that.
We should use system malloc by default for large allocations so that we
can get the CUDA memory management. The removes the VALGRIND flag.
mymalloc calls now use the cuda allocation, but tamalloc continue to use
a private cpu heap.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants