CosmosSimulationWithCuda

Real-time N-body algorithm for a billion particles (2.3GB memory per 100M particles required), accelerated with CUDA.

Dependency: Single header-only project with OpenCV(for demo) and CUDA(for compute) APIs. Uses Vcpkg to load OpenCV.
Multi-GPU work distribution: particles are computed only in their own GPUs.
GPU-GPU communication: partially overlaps with computations to hide latency.
Render: frames are generated asynchronously and buffered to the user's thread, further hiding latency.

Algorithm:

Mass values of particles are projected onto a lattice of 2048x2048 cells (this Constants::N value can be changed from header)
The lattice is sent to convolution operation.
- FFT for infinite ranged forces (filter weights = 1 / r)
- deconvolution of mass-scatter kernel for short-ranged forces (to undo self-pull)
Then both results are summed elementwise to have a total potential
Gradient of the potential is sampled by each particle and used as force acting on them
Euler integration is used for velocity and position updates (when other parts are optimized for higher accuracy, this will become Verlet Integration)
Complexity: O(N Log(N)) with a low constant cost so that it can run 120 FPS for 20 million particles using a mainstream gpu.

FFT Convolution:

Particle mass lattice is 2D
First, FFT of all rows are computed.
Then FFT of all columns are computed.
Repeated for both mass-lattice and gravity-lattice
Element-wise (complex-value) multiplication of both results --> this is FFT of convolution
Inverse-FFT of the output is computed --> convolution complete

Gravity Lattice:

This is another 2D array with its center at (N/2,N/2) both in indexing of array and 1/r calculations of cells.
1/r is used as the gravitational potential.

Particle lattice:

Gradient:

Similar to the mass projection, but opposite
Gradient at each point is computed
Each particle samples or multisamples (depends on accuracy mode) gradients and computes the force acting on it
Force is divided by the mass of particle to compute movement during that time step (Euler Integration), together with velocity updates.

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
CosmosCuda.cuh		CosmosCuda.cuh
CosmosSimulationWithCuda.sln		CosmosSimulationWithCuda.sln
CosmosSimulationWithCuda.vcxproj		CosmosSimulationWithCuda.vcxproj
LICENSE.txt		LICENSE.txt
README.md		README.md
binary_star1.png		binary_star1.png
binary_star2.png		binary_star2.png
binary_star3.png		binary_star3.png
binary_star4.png		binary_star4.png
cuda-streams.png		cuda-streams.png
galaxy-brush.png		galaxy-brush.png
main.cu		main.cu
t1.png		t1.png
t2.png		t2.png
t3.png		t3.png
testc.png		testc.png
timeline.png		timeline.png
vcpkg.json		vcpkg.json

Provide feedback