Performance opt eurohack #318

s-mayani · 2024-10-15T13:40:00Z

During the Eurohack24 hackathon at CSCS, we analysed the performance of our full PIC code by profiling the Alpine mini-app Landau Damping on the NVIDIA A100 GPU cluster at PSI.

The results showed that the scatter operation can be sped up considerably by sorting the particles so that those in the same cell are contiguous in memory. This increases data locality for the atomic adds in the interpolation and avoids repeatedly evicting grid data from the cache. Furthermore, contributions can be summed locally within a cell, followed by a single atomic add to each field grid point the cell touches.
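The idea above can be illustrated with a minimal serial sketch. It is not the code in this PR: it assumes a 1D uniform grid with linear (cloud-in-cell) weights, and the names `cellIndex`, `sortByCell` and `sortedScatter` are hypothetical. On the GPU the two final additions per cell would be atomic adds (e.g. `Kokkos::atomic_add`); here they are plain additions because the loop is serial.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: cell index of a particle at position x on a uniform
// 1D grid with spacing h and ncells cells.
std::size_t cellIndex(double x, double h, std::size_t ncells) {
    assert(x >= 0.0 && h > 0.0 && ncells > 0);
    std::size_t c = static_cast<std::size_t>(x / h);
    return c < ncells ? c : ncells - 1;  // clamp particles on the upper boundary
}

// Counting sort by cell. Returns offsets (size ncells + 1) and fills perm so
// that perm[offsets[c]] .. perm[offsets[c + 1] - 1] are the particles in cell c.
std::vector<std::size_t> sortByCell(const std::vector<double>& x, double h,
                                    std::size_t ncells,
                                    std::vector<std::size_t>& perm) {
    std::vector<std::size_t> offsets(ncells + 1, 0);
    for (double xi : x) ++offsets[cellIndex(xi, h, ncells) + 1];
    for (std::size_t c = 0; c < ncells; ++c) offsets[c + 1] += offsets[c];

    std::vector<std::size_t> cursor(offsets.begin(), offsets.end() - 1);
    perm.assign(x.size(), 0);
    for (std::size_t p = 0; p < x.size(); ++p)
        perm[cursor[cellIndex(x[p], h, ncells)]++] = p;
    return offsets;
}

// Scatter charges q onto the ncells + 1 grid nodes, one cell at a time.
std::vector<double> sortedScatter(const std::vector<double>& x,
                                  const std::vector<double>& q, double h,
                                  std::size_t ncells) {
    assert(x.size() == q.size());
    std::vector<std::size_t> perm;
    const std::vector<std::size_t> offsets = sortByCell(x, h, ncells, perm);

    std::vector<double> rho(ncells + 1, 0.0);
    for (std::size_t c = 0; c < ncells; ++c) {
        double left = 0.0, right = 0.0;  // local sums for the two nodes of cell c
        for (std::size_t k = offsets[c]; k < offsets[c + 1]; ++k) {
            const std::size_t p = perm[k];
            const double w = x[p] / h - static_cast<double>(c);
            left += q[p] * (1.0 - w);
            right += q[p] * w;
        }
        rho[c] += left;       // a single (atomic, on the GPU) add per node
        rho[c + 1] += right;  // instead of one per particle
    }
    return rho;
}
```

With this layout the number of writes to the grid scales with the number of cells rather than the number of particles, which is the effect the sort is meant to achieve.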

The motivation and results are shown in the attached slides. The zip file contains the NVIDIA Nsight Systems reports for the runs we did to test the performance improvements.

sorted_scatter_PR.pdf
final_reports.zip

This still needs some clean-up, namely improving the design (e.g. by making a class for the sort) and improving the sorting algorithm itself, which is not yet optimal. Additionally, the next step to improve performance would be to use a Kokkos team policy and scatter many cells per team.
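As a rough illustration of the "many cells per team" idea, the serial sketch below partitions the cells into contiguous chunks, lets each chunk accumulate into a small local buffer, and then flushes that buffer to the grid once. This is only one possible reading of the plan, under several assumptions: a 1D grid with linear weights, particles already sorted by cell (given as `perm` and `offsets`), and a hypothetical function name `teamScatter`. In a Kokkos version each outer iteration would presumably map to one team of a `Kokkos::TeamPolicy`, the local buffer to team scratch memory, and the flush to atomic adds.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Serial stand-in for a team-based scatter. perm lists particle indices sorted
// by cell; offsets[c] .. offsets[c + 1] is the range of perm belonging to cell c.
std::vector<double> teamScatter(const std::vector<double>& x,
                                const std::vector<double>& q,
                                const std::vector<std::size_t>& perm,
                                const std::vector<std::size_t>& offsets,
                                double h, std::size_t cellsPerTeam) {
    assert(cellsPerTeam > 0 && !offsets.empty() && h > 0.0);
    const std::size_t ncells = offsets.size() - 1;
    std::vector<double> rho(ncells + 1, 0.0);

    // Each iteration of this loop plays the role of one team.
    for (std::size_t first = 0; first < ncells; first += cellsPerTeam) {
        const std::size_t last = std::min(first + cellsPerTeam, ncells);

        // Stand-in for team scratch memory: the nodes touched by this chunk.
        std::vector<double> local(last - first + 1, 0.0);

        for (std::size_t c = first; c < last; ++c) {
            for (std::size_t k = offsets[c]; k < offsets[c + 1]; ++k) {
                const std::size_t p = perm[k];
                const double w = x[p] / h - static_cast<double>(c);
                local[c - first] += q[p] * (1.0 - w);
                local[c - first + 1] += q[p] * w;
            }
        }

        // Flush once per node; neighbouring teams share a boundary node, so
        // these adds would need to be atomic in a parallel version.
        for (std::size_t n = 0; n < local.size(); ++n)
            rho[first + n] += local[n];
    }
    return rho;
}
```

The chunk size `cellsPerTeam` is a tuning parameter in this sketch: larger chunks mean fewer flushes to the grid but a larger local buffer per team.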

s-mayani added 7 commits October 7, 2024 15:20

add nvtx ranges in leapfrog for performance tools

4bc3c40

remove atomic add in scatter for performance analysis

4bbb3da

implement a particle cell index finder

d092d9b

add sorting and permuting

fb44278

remove the naive addition, put back atomic add

7ecc1b8

make sort and permute performant, add new scatter to use the sort

865ec5d

remove testing of normal add instead of atomic

53c94e4
