The smoothed particle hydrodynamics (SPH) technique is a purely Lagrangian method. SPH discretizes a fluid into a set of interpolation points (SPH particles) whose distribution follows the mass density of the fluid and whose evolution relies on a weighted interpolation over close neighboring particles.
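As an illustration, in standard SPH formulations the density at the position of particle i is estimated as a kernel-weighted sum over its neighbors j, where W is the smoothing kernel and h_i the smoothing length:

$$\rho_i = \sum_j m_j \, W\left(\lvert \mathbf{r}_i - \mathbf{r}_j \rvert, h_i\right)$$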
Parallelizing SPH codes is not trivial due to their boundless nature (the particles are not confined to a fixed spatial domain) and the absence of a structured grid.
SPH-EXA is a C++20 simulation code for hydrodynamics simulations (with gravity and other physics), parallelized with MPI, OpenMP, CUDA, and HIP.
SPH-EXA is built with high performance, scalability, portability, and resilience in mind. Its SPH implementation is based on SPHYNX, ChaNGa, and SPH-flow, three SPH codes selected in the PASC SPH-EXA project to act as parent and reference codes to SPH-EXA.
The performance of standard codes is negatively impacted by factors such as imbalanced multi-scale physics, individual time-stepping, halo exchange, and long-range forces. Therefore, the goal is to distill the common basic SPH features of these codes and consolidate them in a fully optimized, Exascale-ready, MPI+X SPH code: SPH-EXA.
Check our wiki for more details
SPH-EXA
├── README.md
├── docs
├── domain - Cornerstone library: octree building and domain decomposition
│ ├── include
│ │ └── cstone
│ │ ├── CMakeLists.txt
│ │ ├── cuda
│ │ ├── domain
│ │ ├── findneighbors.hpp
│ │ ├── halos
│ │ ├── primitives
│ │ ├── sfc
│ │ ├── tree
│ │ └── util
│ └── test - Cornerstone unit, performance, and integration tests
│ ├── integration_mpi
│ ├── performance
│ ├── unit
│ └── unit_cuda
├── ryoanji - Ryoanji: N-body solver for gravity
│ ├── src
│ └── test - demonstrator app and unit tests
│
├── sph - SPH implementation
│ ├── include
│ │ └── sph
│ └── test - SPH kernel unit tests
│
└── main/src
├── init - initial conditions for test cases
├── io - file output functionality
└── sphexa - SPH main application front-end
The code requires a C++20 compiler for both the CPU and GPU parts.
- GCC 12 and later
- Clang 16 and later
- CUDA 12 and later
- ROCm 6 and later. ROCm 5 compiles, but has bugs preventing the reliable use of GPU-aware MPI (a quick way to check the toolchain versions is sketched below)
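As an optional sanity check before configuring, the compiler and GPU toolkit versions can be queried on the command line; the driver names below are only examples and may differ on your system:

g++ --version       # should report GCC 12 or later
clang++ --version   # Clang 16 or later, if building with Clang
nvcc --version      # CUDA 12 or later, for the CUDA (NVIDIA) build
hipcc --version     # ROCm 6 or later, for the HIP (AMD) build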
Minimal CMake configuration:
mkdir build
cd build
cmake <GIT_SOURCE_DIR>
Compilation at sciCORE (UniBas):
ml HDF5/1.14.2-gompi-2022a-zen2
ml CMake/3.23.1-GCCcore-11.3.0
ml CUDA/11.8.0
mkdir build
cd build
cmake <GIT_SOURCE_DIR>
CMake configuration on Daint on Alps (CUDA 12.6 + GCC 13.3):
uenv image pull prgenv-gnu/24.11:v1
uenv start prgenv-gnu/24.11:v1 --view=default
mkdir build
cd build
CC=mpicc CXX=mpicxx cmake -DCMAKE_CUDA_ARCHITECTURES=90 -DCSTONE_WITH_GPU_AWARE_MPI=ON -S <GIT_SOURCE_DIR>
Module and CMake configuration on LUMI (ROCm 6.2.2):
module swap PrgEnv-cray PrgEnv-gnu
module load CrayEnv buildtools craype-accel-amd-gfx90a rocm cray-hdf5-parallel
export MPICH_GPU_SUPPORT_ENABLED=1
cmake -DCMAKE_CXX_COMPILER=CC -DCMAKE_HIP_ARCHITECTURES=gfx90a -DCSTONE_WITH_GPU_AWARE_MPI=ON -S <GIT_SOURCE_DIR>
Build everything: make -j
The main sphexa application (and sphexa-cuda, if GPUs are available) can either start a simulation by reading initial conditions from a file or generate an initial configuration for a named test case.
Self-gravity will be activated automatically based on the named test-case choice, or if the HDF5 initial configuration file has an HDF5 attribute with a non-zero value for the gravitational constant.
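To inspect which attributes an HDF5 initial-conditions file carries (for instance, to verify whether a gravitational constant is set), the standard HDF5 command-line tools can be used; the file name below is only a placeholder:

h5dump -A my_initial_conditions.h5    # print all attributes of the file, without dataset data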
Arguments:

- --init CASE/FILE : use a case name as listed below, or provide an HDF5 file with initial conditions
- --glass FILE : template glass block for IC generation (see the note on obtaining glass blocks below)
- -n NUM : run the simulation with NUM^3 (NUM cubed) particles (for named test cases; this might vary with the test case)
- -s NUM : run the simulation for NUM iterations (time-steps) if NUM is an integer, or until the specified physical time if NUM is real (see the example after this list)
- -w NUM : dump particle data every NUM iterations (time-steps) if NUM is an integer, or at the specified physical time if NUM is real
- -f FIELDS : comma-separated list of particle fields for file output dumps; see the list of common output fields below
- --quiet : do not print any output to stdout
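For illustration, a sketch of the real-valued forms of -s and -w (the numbers are arbitrary): the run below advances the Sedov case until physical time 0.05 and writes a particle dump at physical time 0.01:

OMP_NUM_THREADS=4 ./sphexa --init sedov -n 50 -s 0.05 -w 0.01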
Implemented cases:

- --sedov : spherical blast wave
- --noh : spherical implosion
- --evrard : gravitational collapse of an isothermal cloud
- --turbulence : subsonic turbulence in a box
- --kelvin-helmholtz : development of the subsonic Kelvin-Helmholtz instability in a thin slice
Only the Sedov test case supports running without providing a glass block (--glass), but for accurate simulation results, a glass block is nevertheless strongly recommended.
Common output fields:

- x, y, z : position
- vx, vy, vz : velocity
- h : smoothing length
- rho : density
- c : speed of sound
- p : pressure
- temp : temperature
- u : internal energy
- nc : number of neighbors
- divv : magnitude of the divergence of the velocity field
- curlv : magnitude of the curl of the velocity field
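To check which of these fields a given output file actually contains (for example after restricting them with -f), the dump can be listed with the HDF5 command-line tools; the file name is only a placeholder:

h5ls -r dump.h5    # recursively list the groups and datasets stored in the dump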
Example usage:

- OMP_NUM_THREADS=4 ./sphexa --init sedov -n 100 -s 1000 -w 10 -f x,y,z,rho,p
  Runs Sedov with 100^3 particles for 1000 iterations (time-steps) with 4 OpenMP threads and dumps particle xyz-coordinates, density and pressure data every 10 iterations.
- OMP_NUM_THREADS=4 ./sphexa-cuda --init sedov -n 100 -s 1000 -w 10 -f x,y,z,rho,p
  Runs Sedov with 100^3 particles for 1000 iterations (time-steps) with 4 OpenMP threads. Uses the GPU for most of the compute work.
- OMP_NUM_THREADS=4 mpiexec -np 2 ./sphexa --init noh -n 100 -s 1000 -w 10
  Runs Noh with 100^3 particles for 1000 iterations (time-steps) with 2 MPI ranks of 4 OpenMP threads each. Works when using MPICH; for OpenMPI, use mpirun instead.
- OMP_NUM_THREADS=12 srun -Cgpu -A<your account> -n<nnodes> -c12 --hint=nomultithread ./sphexa-cuda --init sedov -n 100 -s 1000 -w 10
  Optimal runtime configuration on Piz Daint for nnodes GPU compute nodes. Launches 1 MPI rank with 12 OpenMP threads per node.
- ./sphexa-cuda --init evrard --glass 50c.h5 -s 2000 -w 100 -f x,y,z,rho,p,vx,vy,vz
  Runs SPH-EXA, initializing particle data from an input file (e.g. for the Evrard collapse). Includes gravitational forces between particles. The angle-dependent accuracy parameter theta can be specified with --theta <value>; the default is 0.5.
If output to file is enabled and the -f option is not provided, sphexa will output all conserved particle fields, which allows restoring the simulation to the exact state at the time of writing the output. This includes the fields x_m1, y_m1, z_m1, du_m1.
In order to save disk space, sphexa can be instructed to omit these fields by setting the -f option, e.g. -f x,y,z,m,h,temp,alpha,vx,vy,vz. If one wants to restart the simulation from an output file where the _m1 fields were omitted, it is necessary to add them back first. We provide an example script that can be used to achieve this:
./scripts/add_m1.py <hdf5-output-file>
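For example, one might first check whether the _m1 fields are present and, if not, add them with the script above; the file name is only a placeholder:

h5ls -r output.h5 | grep _m1    # no output means the _m1 fields are missing
./scripts/add_m1.py output.h5   # add the _m1 fields so the file can be used to restart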
Cornerstone octree comes with an extensive suite of unit, integration and regression tests, see README.
SPH kernel unit tests:
./sph/test/hydro_ve
./sph/test/hydro_std
Some tests require template blocks with glass-like (Voronoi tessellated) particle distributions; these can be obtained here:
Ryoanji is a high-performance GPU N-body solver for gravity. It relies on the cornerstone octree framework for tree construction, EXAFMM multipole kernels, and a warp-aware tree-traversal inspired by the Bonsai GPU tree-code.
- Ruben Cabezon (PI)
- Aurelien Cavelan
- Florina Ciorba (PI)
- Jonathan Coles
- Jose Escartin
- Jean M. Favre
- Sebastian Keller (lead dev)
- Noah Kubli
- Lucio Mayer (PI)
- Jg Piccinali
- Tom Quinn
- Darren Reed
- Lukas Schmid
- Osman Seckin Simsek
- Yiqing Zhu
- Keller, S., Cavelan, A., Cabezon, R. M., Mayer L., Ciorba, F. M. (2023) Cornerstone: Octree Construction Algorithms for Scalable Particle Simulations. (PASC 23)
- Cavelan, A., Cabezon, R. M., Grabarczyk, M., Ciorba, F. M. (2020). A Smoothed Particle Hydrodynamics Mini-App for Exascale. (PASC 20)
This project is licensed under the MIT License - see the LICENSE file for details