-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ep2 w4 #80
Ep2 w4 #80
Conversation
… be included here
Hi Olivier, thanks a lot, very interesting and very good idea to open an issue to document it! Not sure, maybe keep it in progress for the time being, and we do more tests trying to understand this better? It seems that it is not a game changer. But maybe, if we do find other game changers (cuda streams/graph...?), maybe we shoudl reevaluate this and we will see larger effects. Just brainstorming. How do you produce the plots of registers actualy used by the way? Interesting plots. Thanks Andrea |
I open a PR.
Such that we have a place attached to the code to report finding on this idea.
So this branch implements the idea to replace the W[6] of the wavefuntions by a W[4]
and retrieving from global memory (hoping from L1 cache) the removed information:
Current point of concern with the current implementation:
[ ] need to investigate if the convention for the 4 momenta is general enough
[ ] is this a smart idea