Awesome 3D Gaussian Splatting Resources

A curated list of papers and open-source resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months. If you have any additions or suggestions, feel free to contribute. Additional resources like blog posts, videos, etc. are also welcome.

Added 18 papers: Z-Splat, Dual-Camera, StylizedGS, Hash3D, Revisiting Densification, Gaussian Pancakes, 3D-aware Deformable Gaussians, SpikeNVS, Zero-shot PC completion, SplatPose, DreamScene360, RealmDreamer, Gaussian-ILC, Reinforcement Learning with GGS, GoMAvatar, OccGaussian, LoopGaussian, Review

April 11, 2024

Code release of latentSplat

April 9, 2024

Added 1 paper: EgoLifter

April 8, 2024

Added 3 papers: Robust Gaussian Splatting, SC4D, and MM-Gaussian

April 5, 2024

Added 5 papers: Surface Reconstruction, TCLC-GS, GaSpCT, OmniGS, and Per-Gaussian Embedding
Fixes

April 2, 2024

Added 11 papers: HO, SGD, HGS, Snap-it, InstantSplat, 3DGSR, MM3DGS, HAHA, CityGaussian, Mirror-3DGS, and Feature Splatting

March 30, 2024

Added 8 papers: Modeling uncertainty, GRM, Gamba, CoherentGS, TOGS, SA-GS, and GaussianCube

March 27, 2024

Added Other Implementation: 360-gaussian-splatting
CVPR '24 labels added
Added 5 papers: Comp4D, DreamPolisher, DN-Splatter, 2D GS, and Octree-GS

March 26, 2024

Added 13 papers: latentSplat, GS on the Move, RadSplat, Mini-Splatting, SyncTweedies, HAC, STAG4D, EndoGSLAM, Pixel-GS, Semantic Gaussians, Gaussian in the Wild, CG-SLAM, and GSDF

March 24, 2024:

Added paper: Gaussian Frosting

March 20, 2024:

Added 4 papers: GVGEN, HUGS, RGBD GS-ICP SLAM, and High-Fidelity SLAM

March 19, 2024:

Added Pointrix
Added 3DGS tutorial by the original authors
Added GauStudio
Added 23 papers: Touch-GS, GGRt, FDGaussian, SWAG, Den-SOFT, Gaussian-Flow, View-Consistent 3D Editing, BAGS, GeoGaussian, GS-Pose, Analytic-Splatting, Seamless 3D Maps, Texture-GS, Recent Advances in 3DGS, Compact 3DGS for Dense Visual SLAM, BrightDreamer, 3DGS-Reloc, Beyond Uncertainty, Motion-Aware 3DGS, Fed3DGS, GaussNav, 3DGS-Calib, and NEDS-SLAM

March 17, 2024:

Updated repo name and link for 3DGS.cpp (originally VulkanSplatting)

March 16, 2024:

Added SplatTV
Added 6 papers: GaussianGrasper, new splitting algorithm, Controllable Text-to-3D Generation, Spring-Mass 3DGS, Hyper-3DGS, and DreamScene

March 14, 2024:

Added 6 papers: SemGauss, StyleGaussian, Gaussian Splatting in Style, GaussCtrl, GaussianImage, and RAIN-GS

March 8, 2024:

Tutorial: How to capture images for 3DGS
Added 6 papers: SplattingAvatar, DNGaussian, Radiative Gaussians, BAGS, GSEdit, and ManiGaussian

March 8, 2024:

Added 3DGStream Viewer

March 6, 2024:

1 paper added: Splat-Nav

March 5, 2024:

1 paper added: 3DGStream
Code releases
New viewer added

March 2, 2024:

1 paper added: 3D Gaussian Model for Animation and Texturing
New section: Courses that also teach 3DGS.

February 28, 2024:

Added VastGaussian

February 27, 2024:

2 papers added: Spec-Gaussian and GEA
SC-GS code released

February 24, 2024:

2 papers added: Identifying unnecessary Gaussians and Gaussian Pro

February 23, 2024:

Corrected Authors and updated abstract for EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting

February 21, 2024:

Added one paper: Reshaping SLAM: a Survey

February 20, 2024:

GaussianObject code released
Added one paper: GaussianHair

February 19, 2024:

Blog post added: NeRFs vs. 3DGS.

February 16, 2024:

2 papers added: IM-3D and GES
GaMeS code released

February 14, 2024:

Added viewer: VulkanSplatting - cross-platform, high performance 3DGS renderer in C++ and Vulkan Compute

February 13, 2024:

Code releases: (16th Jan 2024) Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
3 papers added: 3DGala, ImplicitDeepFake, and 3D Gaussians as a New Vision Era.

February 9, 2024:

1 paper added: HeadStudio

February 8, 2024:

3 papers added: Rig3DGS, Mesh-based GS, and LGM

February 6, 2024:

Added 2 papers: SGS-SLAM and 4D Gaussian Splatting

February 5, 2024:

Moved SWAGS to Dynamics and Deformation section
Added 2 papers: GaussianObject and GaMeSh
GS++ renamed to Optimal Projection

February 2, 2024:

Added 6 papers: VR-GS, Segment Anything, Gaussian Splashing, GS++, 360-GS, and StopThePop
TRIPS code release

January 30, 2024:

Code changes: GaussianAvatars code changed to private

January 29, 2024:

Added 2 papers: LIV-GaussMap and TIP-Editor

January 26, 2024:

Removed retracted paper: Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions
3 papers added: EndoGaussians, PSAvatar, and GauU-Scene

January 25, 2024:

Added viewer: Splatapult - 3d gaussian splatting renderer in C++ and OpenGL, works with OpenXR for tethered VR

January 24, 2024:

Added utility: GSOPs (Gaussian Splat Operators) for SideFX Houdini
Code releases: GaussianAvatars

January 23, 2024:

3 papers added: Amortized Gen3D, Deformable Endoscopic Tissues, Fast dynamic 3D Object Generation
Code releases: Animatable Avatars, Compressed 3D Gaussians, GaussianAvatar

January 13, 2024:

4 papers added: CoSSegGaussians, TRIPS, Gaussian Shadow Casting for Neural Characters and DISTWAR

January 9, 2024:

1 paper added: A Survey on 3D Gaussian Splatting (The first survey)

January 8, 2024:

4 papers added: SWAGS (a 2023 paper I forgot to add before), first review paper, compressed 3DGS, and an application paper for Characterizing Satellite Geometry.

January 7, 2024:

1 open-source implementation added: taichi-splatting - work originally derived from Taichi 3D Gaussian Splatting, with significant re-organisation and changes.

January 5, 2024:

3 papers added: FMGS, PEGASUS, and Repaint123.

January 2, 2024:

1 paper added: Street Gaussians.

January 2, 2024:

Deblurring Gaussians paper link updated.
SAGA code released.
2 papers from 2023 added: Text2Immersion and 2D-Guided 3DG Segmentation.
Mathematical supplement of gsplat lib.
Add years in categories.
GSM code released.

December 29, 2023:

1 paper added (apparently missed that one before): Gaussian-Head-Avatar.
Blog post head avatars added.

December 29, 2023:

3 papers added: DreamGaussian4D, 4DGen, and Spacetime Gaussian.

December 27, 2023:

3 papers added: LangSplat, Deformable 3DGS, and Human101.
Blog post added: Comprehensive Review of 3DGS.

December 25, 2023:

Efficient 3D Gaussian Representation for Monocular/Multi-view Dynamic Scenes code released.
GPS-Gaussian code released.

December 24, 2023:

2 papers added: Self-Organization Gaussian Grids and Gaussian Splitting.
Added repo for enhancing Gaussian rendering to model more complex scenes.

December 21, 2023:

3 papers added: Splatter Image, pixelSplat, and align your gaussians.
Gaussian Grouping code released.

December 19, 2023:

2 papers added: GAvatar and GauFRe.

December 18, 2023:

Added utility: SpectacularAI - Conversion scripts for different 3DGS conventions.
SuGaR code released.

December 16, 2023:

Added WebGL viewer 3: Gauzilla.

December 15, 2023:

4 papers added: DrivingGaussian, iComMa, Triplane, and 3DGS-Avatar.
Relightable Gaussians code released.

December 13, 2023:

5 papers added: Gaussian-SLAM, CoGS, ASH, CF-GS, and Photo-SLAM.

December 11, 2023:

2 papers added: Gaussian Splatting SLAM and Denoising Scores for 3D Generation.
ScaffoldGS code released.

December 8, 2023:

2 papers added: EAGLES and MonoGaussianAvatar.

December 7, 2023:

LucidDreamer code released.
9 papers added: GauHuman, HeadGaS, HiFi4G, Gaussian-Flow, Feature-3DGS, Gaussian-Avatar, FlashAvatar, Relightable, and Deblurring Gaussians.

December 5, 2023:

9 papers added: NeuSG, GaussianHead, GaussianAvatars, GPS-Gaussian, Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction, SplaTAM, MANUS, Segment Any, and Language embedded 3D Gaussians.

December 4, 2023:

8 papers added: Gaussian Grouping, MD Splatting, DynMF, Scaffold-GS, SparseGS, FSGS, Control4D, and SC-GS.

December 1, 2023:

4 papers added: Compact3D, GaussianShader, Periodic Vibration Gaussian and Gaussian Shell Maps for Efficient 3D Human Generation.
Created Table of contents for each category and added line breaks.

November 30, 2023:

Added Unreal game engine implementation.
6 papers added: LightGaussian, FisherRF, HUGS, HumanGaussian, CG3D, and Multi Scale 3DGS.

November 29, 2023:

Added two papers: Point and Move and IR-GS.

November 28, 2023:

Added five papers: GaussianEditor, Relightable Gaussians, GART, Mip-Splatting, HumanGaussian.

November 27, 2023:

Added two papers: Gaussian Editing and Compact 3D Gaussians.

November 25, 2023:

Animatable Gaussians project added (paper not yet released).

November 22, 2023:

3 new GS papers added: Animatable, Depth-Regularized, and Monocular/Multi-view 3DGS.
Added some classic papers.
Added another GS paper also called LucidDreamer.

November 21, 2023:

3 new GS papers added: GaussianDiffusion, LucidDreamer, PhysGaussian.
2 more GS papers added: SuGaR, PhysGaussian.

November 21, 2023:

Added the paper GS-SLAM

November 17, 2023:

Added PlayCanvas implementation to Game Engines section.

November 16, 2023:

Deformable 3D Gaussians code released.
Drivable 3D Gaussian Avatars paper added.

November 8, 2023:

Some notes about the 3DGS implementation and universal format discussion.

November 4, 2023:

Added 2D gaussian splatting.
Added very detailed (technical) blog post explaining 3D gaussian splatting.

October 28, 2023:

Added Utilities Section.
Added 3DGS Converter for editing 3DGS .ply files in Cloud Compare to Utilities.
Added Kapture (for bundler to colmap model conversion) and Kapture image cropper script with conversion instructions to Utilities.

October 23, 2023:

Added python WebGL viewer 2.
Added Intro to gaussian splatting (and Unity viewer) video blog.

October 21, 2023:

Added python OpenGL viewer.
Added typescript WebGPU viewer.

October 20, 2023:

Made abstracts readable (removed hyphenations).
Added Windows tutorial.
Other minor text fixes.
Added Jupyter notebook viewer.

October 19, 2023:

Added Github page link for Real-time Photorealistic Dynamic Scene Representation.
Re-ordered headings.
Added other unofficial implementations.
Moved Nerfstudio gsplat and fast: C++/CUDA to Unofficial Implementations.
Added Nerfstudio, Blender, WebRTC, iOS & Metal viewers.

October 17, 2023:

GaussianDreamer code released.
Added Real-time Photorealistic Dynamic Scene Representation.

October 16, 2023:

Added Deformable 3D Gaussians paper.
Dynamic 3D Gaussians code released.

October 15, 2023:

Initial list with first 6 papers.

Seminal Paper introducing 3D Gaussian Splatting:

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Authors: Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, George Drettakis

Abstract

Radiance Field methods have recently revolutionized novel-view synthesis of scenes captured with multiple photos or videos. However, achieving high visual quality still requires neural networks that are costly to train and render, while recent faster methods inevitably trade off speed for quality. For unbounded and complete scenes (rather than isolated objects) and 1080p resolution rendering, no current method can achieve real-time display rates. We introduce three key elements that allow us to achieve state-of-the-art visual quality while maintaining competitive training times and importantly allow high-quality real-time (≥ 30 fps) novel-view synthesis at 1080p resolution. First, starting from sparse points produced during camera calibration, we represent the scene with 3D Gaussians that preserve desirable properties of continuous volumetric radiance fields for scene optimization while avoiding unnecessary computation in empty space; Second, we perform interleaved optimization/density control of the 3D Gaussians, notably optimizing anisotropic covariance to achieve an accurate representation of the scene; Third, we develop a fast visibility-aware rendering algorithm that supports anisotropic splatting and both accelerates training and allows real-time rendering. We demonstrate state-of-the-art visual quality and real-time rendering on several established datasets.
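The "anisotropic covariance" mentioned in the abstract is commonly built from a per-Gaussian scale vector and rotation quaternion as Σ = R S Sᵀ Rᵀ, which keeps Σ symmetric and positive semi-definite during optimization. The following is a minimal pure-Python sketch of that factorization; the function names and the (w, x, y, z) quaternion ordering are illustrative assumptions, not taken from the reference implementation.

```python
import math

def quat_to_rotmat(q):
    """Convert a quaternion (w, x, y, z) into a 3x3 rotation matrix.

    The quaternion is normalized first, so any non-zero input is accepted.
    """
    w, x, y, z = q
    n = math.sqrt(w * w + x * x + y * y + z * z)
    w, x, y, z = w / n, x / n, y / n, z / n
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]

def covariance(q, scales):
    """Build Sigma = R S S^T R^T with S = diag(scales)."""
    R = quat_to_rotmat(q)
    # M = R @ S: scaling by a diagonal matrix multiplies each column of R.
    M = [[R[i][j] * scales[j] for j in range(3)] for i in range(3)]
    # Sigma = M @ M^T, symmetric by construction.
    return [
        [sum(M[i][k] * M[j][k] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]

# Identity rotation: the covariance is just the squared scales on the diagonal.
sigma = covariance((1.0, 0.0, 0.0, 0.0), (1.0, 2.0, 3.0))
```

Optimizing the scales and quaternion instead of the nine matrix entries means gradient steps cannot produce an invalid covariance.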

3D Object Detection:

2024:

1. 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Authors: Yang Cao, Yuanliang Jv, Dan Xu

Abstract

Neural Radiance Fields (NeRF) are widely used for novel-view synthesis and have been adapted for 3D Object Detection (3DOD), offering a promising approach to 3D object detection through view-synthesis representation. However, NeRF faces inherent limitations: (i) It has limited representational capacity for 3DOD due to its implicit nature, and (ii) it suffers from slow rendering speeds. Recently, 3D Gaussian Splatting (3DGS) has emerged as an explicit 3D representation that addresses these limitations with faster rendering capabilities. Inspired by these advantages, this paper introduces 3DGS into 3DOD for the first time, identifying two main challenges: (i) Ambiguous spatial distribution of Gaussian blobs – 3DGS primarily relies on 2D pixel-level supervision, resulting in unclear 3D spatial distribution of Gaussian blobs and poor differentiation between objects and background, which hinders 3DOD; (ii) Excessive background blobs – 2D images often include numerous background pixels, leading to densely reconstructed 3DGS with many noisy Gaussian blobs representing the background, negatively affecting detection. To tackle the challenge (i), we leverage the fact that 3DGS reconstruction is derived from 2D images, and propose an elegant and efficient solution by incorporating 2D Boundary Guidance to significantly enhance the spatial distribution of Gaussian blobs, resulting in clearer differentiation between objects and their background (see Fig. 1). To address the challenge (ii), we propose a Box-Focused Sampling strategy using 2D boxes to generate object probability distribution in 3D spaces, allowing effective probabilistic sampling in 3D to retain more object blobs and reduce noisy background blobs. 
Benefiting from the proposed Boundary Guidance and Box-Focused Sampling, our final method, 3DGS-DET, achieves significant improvements (+5.6 on mAP@0.25, +3.7 on mAP@0.5) over our basic pipeline version, without introducing any additional learnable parameters. Furthermore, 3DGS-DET significantly outperforms the state-of-the-art NeRF-based method, NeRF-Det, achieving improvements of +6.6 on mAP@0.25 and +8.1 on mAP@0.5 for the ScanNet dataset, and impressive +31.5 on mAP@0.25 for the ARKITScenes dataset. Codes and models are publicly available at: https://github.com/yangcaoai/3DGS-DET.

📄 Paper | 💻 Code (not yet)
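Detection gains of this kind are reported as mean average precision at a fixed 3D IoU threshold: a predicted box counts as a match when its intersection-over-union with a ground-truth box reaches the threshold. Below is a minimal sketch of that overlap test for axis-aligned boxes. The box layout `(xmin, ymin, zmin, xmax, ymax, zmax)` and the helper names are illustrative assumptions, not taken from the 3DGS-DET code, and oriented boxes would need a more involved intersection.

```python
def box_iou_3d(a, b):
    """IoU of two axis-aligned boxes given as (xmin, ymin, zmin, xmax, ymax, zmax)."""
    inter = 1.0
    for axis in range(3):
        lo = max(a[axis], b[axis])
        hi = min(a[axis + 3], b[axis + 3])
        if hi <= lo:
            # No overlap along this axis means no overlap at all.
            return 0.0
        inter *= hi - lo
    vol_a = (a[3] - a[0]) * (a[4] - a[1]) * (a[5] - a[2])
    vol_b = (b[3] - b[0]) * (b[4] - b[1]) * (b[5] - b[2])
    return inter / (vol_a + vol_b - inter)

def is_match(pred, gt, threshold=0.25):
    """A prediction matches a ground-truth box when IoU reaches the threshold."""
    return box_iou_3d(pred, gt) >= threshold

# Two 2x2x2 boxes shifted by 1 along x overlap in a 1x2x2 slab.
pred = (0.0, 0.0, 0.0, 2.0, 2.0, 2.0)
gt = (1.0, 0.0, 0.0, 3.0, 2.0, 2.0)
loose = is_match(pred, gt, threshold=0.25)
strict = is_match(pred, gt, threshold=0.5)
```

The same pair of boxes can pass the looser threshold and fail the stricter one, which is why results at both thresholds are usually listed side by side.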

Autonomous Driving:

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications in broader scenarios. To tackle these issues, we present HumanSplat that predicts the 3D Gaussian Splatting properties of any human from a single input image in a generalizable manner. In particular, HumanSplat comprises a 2D multi-view diffusion model and a latent reconstruction transformer with human structure priors that adeptly integrate geometric priors and semantic features within a unified framework. A hierarchical loss that incorporates human semantic information is further designed to achieve high-fidelity texture modeling and better constrain the estimated multiple views. Comprehensive experiments on standard benchmarks and in-the-wild images demonstrate that HumanSplat surpasses existing state-of-the-art methods in achieving photorealistic novel-view synthesis. Project page: https://humansplat.github.io/.

📄 Paper | 🌐 Project Page

Classic work:

1. A Generalization of Algebraic Surface Drawing

Authors: James F. Blinn

Comment: First paper rendering 3D gaussians.

Abstract

The mathematical description of three-dimensional surfaces usually falls into one of two classifications: parametric and implicit. An implicit surface is defined to be all points which satisfy some equation F (x, y, z) = 0. This form is ideally suited for image space shaded picture drawing; the pixel coordinates are substituted for x and y, and the equation is solved for z. Algorithms for drawing such objects have been developed primarily for first- and second-order polynomial functions, a subcategory known as algebraic surfaces. This paper presents a new algorithm applicable to other functional forms, in particular to the summation of several Gaussian density distributions. The algorithm was created to model electron density maps of molecular structures, but it can be used for other artistically interesting shapes.

📄 Paper
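The idea in this abstract, substituting pixel coordinates for x and y and solving F(x, y, z) = 0 for z when F is a sum of Gaussian densities, can be sketched as below. This is a minimal illustration only: the blob parameterization `(center, strength, falloff)`, the threshold, and the march-then-bisect root finder are assumptions made for the example, not the algorithm from the paper.

```python
import math

def density(p, blobs):
    """Sum of isotropic Gaussian densities at point p.

    Each blob is (center, strength b, falloff a) and contributes b * exp(-a * r^2).
    """
    total = 0.0
    for (cx, cy, cz), b, a in blobs:
        r2 = (p[0] - cx) ** 2 + (p[1] - cy) ** 2 + (p[2] - cz) ** 2
        total += b * math.exp(-a * r2)
    return total

def solve_z(x, y, blobs, threshold, z_near, z_far, steps=200, iters=40):
    """Return the first z in [z_near, z_far] where density reaches threshold, or None."""
    def f(z):
        return density((x, y, z), blobs) - threshold

    dz = (z_far - z_near) / steps
    z0, f0 = z_near, f(z_near)
    for i in range(1, steps + 1):
        z1 = z_near + i * dz
        f1 = f(z1)
        if f0 < 0 <= f1:
            # Sign change bracketed: refine the crossing by bisection.
            lo, hi = z0, z1
            for _ in range(iters):
                mid = 0.5 * (lo + hi)
                if f(mid) < 0:
                    lo = mid
                else:
                    hi = mid
            return 0.5 * (lo + hi)
        z0, f0 = z1, f1
    return None

# One unit-strength blob at the origin; the iso-surface at exp(-1) is the unit sphere.
blobs = [((0.0, 0.0, 0.0), 1.0, 1.0)]
z_hit = solve_z(0.0, 0.0, blobs, math.exp(-1.0), -3.0, 3.0)
```

Adding more blobs to the list blends their surfaces smoothly wherever the summed density stays above the threshold.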

2. Approximate Differentiable Rendering with Algebraic Surfaces

Authors: Leonid Keselman, Martial Hebert

Comment: First paper to do differentiable rendering optimization of 3D gaussians.

Abstract

Differentiable renderers provide a direct mathematical link between an object’s 3D representation and images of that object. In this work, we develop an approximate differentiable renderer for a compact, interpretable representation, which we call Fuzzy Metaballs. Our approximate renderer focuses on rendering shapes via depth maps and silhouettes. It sacrifices fidelity for utility, producing fast runtimes and high-quality gradient information that can be used to solve vision tasks. Compared to mesh-based differentiable renderers, our method has forward passes that are 5x faster and backwards passes that are 30x faster. The depth maps and silhouette images generated by our method are smooth and defined everywhere. In our evaluation of differentiable renderers for pose estimation, we show that our method is the only one comparable to classic techniques. In shape from silhouette, our method performs well using only gradient descent and a per-pixel loss, without any surrogate losses or regularization. These reconstructions work well even on natural video sequences with segmentation artifacts.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

Authors: Jan U. Müller, Michael Weinmann, Reinhard Klein

Comment: Builds 2D screen-space gaussians from underlying 3D representations.

Abstract

We propose an efficient and GPU-accelerated sampling framework which enables unbiased gradient approximation for differentiable point cloud rendering based on surface splatting. Our framework models the contribution of a point to the rendered image as a probability distribution. We derive an unbiased approximative gradient for the rendering function within this model. To efficiently evaluate the proposed sample estimate, we introduce a tree-based data-structure which employs multi-pole methods to draw samples in near linear time. Our gradient estimator allows us to avoid regularization required by previous methods, leading to a more faithful shape recovery from images. Furthermore, we validate that these improvements are applicable to real-world applications by refining the camera poses and point cloud obtained from a real-time SLAM system. Finally, employing our framework in a neural rendering setting optimizes both the point cloud and network parameters, highlighting the framework’s ability to enhance data driven approaches.

📄 Paper | 💻 Code

4. Generating and Real-Time Rendering of Clouds

Authors: Petr Man

Comment: Splatting of anisotropic gaussians. Basically a non-differentiable implementation of 3DGS.

Abstract

This paper presents a method for generation and real-time rendering of static clouds. Perlin noise function generates three dimensional map of a cloud. We also present a two-pass rendering algorithm that performs physically based approximation. In the first preprocessed phase it computes multiple forward scattering. In the second phase first order anisotropic scattering at runtime is evaluated. The generated map is stored as voxels and is unsuitable for the real-time rendering. We introduce a more suitable inner representation of cloud that approximates the original map and contains much less information. The cloud is then represented by a set of metaballs (spheres) with parameters such as center positions, radii and density values. The main contribution of this paper is to propose a method, that transforms the original cloud map to the inner representation. This method uses the Radial Basis Function (RBF) neural network.

📄 Paper

Compression:

3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static 3D scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing for very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, e.g. on resource constrained devices. In this paper, we introduce a compact scene representation organizing the parameters of 3D Gaussian Splatting (3DGS) into a 2D grid with local homogeneity, ensuring a drastic reduction in storage requirements without compromising visual quality during rendering. Central to our idea is the explicit exploitation of perceptual redundancies present in natural scenes. In essence, the inherent nature of a scene allows for numerous permutations of Gaussian parameters to equivalently represent it. To this end, we propose a novel highly parallel algorithm that regularly arranges the high-dimensional Gaussian parameters into a 2D grid while preserving their neighborhood structure. During training, we further enforce local smoothness between the sorted parameters in the grid. The uncompressed Gaussians use the same structure as 3DGS, ensuring a seamless integration with established renderers. Our method achieves a reduction factor of 17x to 42x in size for complex scenes with no increase in training time, marking a substantial leap forward in the domain of 3D scene distribution and consumption.

📄 Paper | 🌐 Project Page | 💻 Code
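To illustrate why arranging Gaussian parameters into a 2D grid with local homogeneity helps, the sketch below measures how much neighbouring grid cells differ. The mean squared neighbour difference used here is an illustrative stand-in chosen for this example, not the loss from the paper, and the nested-list grid layout is likewise an assumption.

```python
def grid_smoothness(grid):
    """Mean squared difference between each cell and its right and lower neighbour.

    grid[r][c] is a list of floats holding one Gaussian's parameters.
    Lower values mean neighbouring cells hold more similar parameters.
    """
    rows, cols = len(grid), len(grid[0])
    total, count = 0.0, 0
    for r in range(rows):
        for c in range(cols):
            for dr, dc in ((0, 1), (1, 0)):
                rr, cc = r + dr, c + dc
                if rr < rows and cc < cols:
                    total += sum(
                        (u - v) ** 2 for u, v in zip(grid[r][c], grid[rr][cc])
                    )
                    count += 1
    return total / count if count else 0.0

# The same four one-dimensional parameters in two different arrangements.
ordered = [[[0.0], [1.0]], [[2.0], [3.0]]]
shuffled = [[[0.0], [3.0]], [[2.0], [1.0]]]
ordered_cost = grid_smoothness(ordered)
shuffled_cost = grid_smoothness(shuffled)
```

A grid whose neighbours are similar looks like a smooth image, which is the kind of signal standard image codecs compress well.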

Diffusion:

2024:

1. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Authors: Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat

Abstract

Given the growing need for automatic 3D content creation pipelines, various 3D representations have been studied to generate 3D objects from a single image. Due to its superior rendering efficiency, 3D Gaussian splatting-based models have recently excelled in both 3D reconstruction and generation. 3D Gaussian splatting approaches for image to 3D generation are often optimization-based, requiring many computationally expensive score-distillation steps. To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization. Utilizing an intermediate hybrid representation, AGG decomposes the generation of 3D Gaussian locations and other appearance attributes for joint optimization. Moreover, we propose a cascaded pipeline that first generates a coarse representation of the 3D data and later upsamples it with a 3D Gaussian super-resolution module. Our method is evaluated against existing optimization-based 3D Gaussian frameworks and sampling-based pipelines utilizing other 3D representations, where AGG showcases competitive generation abilities both qualitatively and quantitatively while being several orders of magnitude faster.

📄 Paper | 🌐 Project Page | 🎥 Short Presentation

2. Fast Dynamic 3D Object Generation from a Single-view Video

Authors: Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang

Abstract

Generating dynamic three-dimensional (3D) object from a single-view video is challenging due to the lack of 4D labeled data. Existing methods extend text-to-3D pipelines by transferring off-the-shelf image generation models such as score distillation sampling, but they are slow and expensive to scale (e.g., 150 minutes per object) due to the need for back-propagating the information-limited supervision signals through a large pretrained model. To address this limitation, we propose an efficient video-to-4D object generation framework called Efficient4D. It generates high-quality spacetime-consistent images under different camera views, and then uses them as labeled data to directly train a novel 4D Gaussian splatting model with explicit point cloud geometry, enabling real-time rendering under continuous camera trajectories. Extensive experiments on synthetic and real videos show that Efficient4D offers a remarkable 10-fold increase in speed when compared to prior art alternatives while preserving the same level of innovative view synthesis quality. For example, Efficient4D takes only 14 minutes to model a dynamic object.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Authors: Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Abstract

Reconstructing and rendering 3D objects from highly sparse views is of critical importance for promoting applications of 3D vision techniques and improving user experience. However, images from sparse views only contain very limited 3D information, leading to two significant challenges: 1) Difficulty in building multi-view consistency as images for matching are too few; 2) Partially omitted or highly compressed object information as view coverage is insufficient. To tackle these challenges, we propose GaussianObject, a framework to represent and render the 3D object with Gaussian splatting, that achieves high rendering quality with only 4 input images. We first introduce techniques of visual hull and floater elimination which explicitly inject structure priors into the initial optimization process for helping build multi-view consistency, yielding a coarse 3D Gaussian representation. Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined. We design a self-generating strategy to obtain image pairs for training the repair model. Our GaussianObject is evaluated on several challenging datasets, including MipNeRF360, OmniObject3D, and OpenIllumination, achieving strong reconstruction results from only 4 views and significantly outperforming previous state-of-the-art methods.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

Authors: Heng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Laszlo A Jeni, Sergey Tulyakov, Hsin-Ying Lee

Authors: Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

Abstract

Recent one image to 3D generation methods commonly adopt Score Distillation Sampling (SDS). Despite the impressive results, there are multiple deficiencies including multi-view inconsistency, over-saturated and over-smoothed textures, as well as the slow generation speed. To address these deficiencies, we present Repaint123 to alleviate multi-view bias as well as texture degradation and speed up the generation process. The core idea is to combine the powerful image generation capability of the 2D diffusion model and the texture alignment ability of the repainting strategy for generating high-quality multi-view images with consistency. We further propose visibility-aware adaptive repainting strength for overlap regions to enhance the generated image quality in the repainting process. The generated high-quality and multi-view consistent images enable the use of simple Mean Square Error (MSE) loss for fast 3D content generation. We conduct extensive experiments and show that our method has a superior ability to generate high-quality 3D content with multi-view consistency and fine textures in 2 minutes from scratch.

📄 Paper | 🌐 Project Page | 💻 Code (not yet)

Dynamics and Deformation:

Recently, 3D Gaussian, as an explicit 3D representation method, has demonstrated strong competitiveness over NeRF (Neural Radiance Fields) in terms of expressing complex scenes and training duration. These advantages signal a wide range of applications for 3D Gaussians in 3D understanding and editing. Meanwhile, the segmentation of 3D Gaussians is still in its infancy. The existing segmentation methods are not only cumbersome but also incapable of segmenting multiple objects simultaneously in a short amount of time. In response, this paper introduces a 3D Gaussian segmentation method implemented with 2D segmentation as supervision. This approach uses input 2D segmentation maps to guide the learning of the added 3D Gaussian semantic information, while nearest neighbor clustering and statistical filtering refine the segmentation results. Experiments show that our concise method can achieve comparable performances on mIOU and mAcc for multi-object segmentation as previous single-object segmentation methods.

📄 Paper

Language Embedding:

📄 Paper | 🌐 Project Page | 💻 Code (not yet)

Mesh Extraction and Physics:

Authors: Runfa Blark Li, Keito Suzuki, Bang Du, Ki Myung Brian Lee, Nikolay Atanasov, Truong Nguyen

Abstract

A signed distance function (SDF) is a useful representation for continuous-space geometry and many related operations, including rendering, collision checking, and mesh generation. Hence, reconstructing SDF from image observations accurately and efficiently is a fundamental problem. Recently, neural implicit SDF (SDF-NeRF) techniques, trained using volumetric rendering, have gained a lot of attention. Compared to earlier truncated SDF (TSDF) fusion algorithms that rely on depth maps and voxelize continuous space, SDF-NeRF enables continuous-space SDF reconstruction with better geometric and photometric accuracy. However, the accuracy and convergence speed of scene-level SDF reconstruction require further improvements for many applications. With the advent of 3D Gaussian Splatting (3DGS) as an explicit representation with excellent rendering quality and speed, several works have focused on improving SDF-NeRF by introducing consistency losses on depth and surface normals between 3DGS and SDF-NeRF. However, loss-level connections alone lead to incremental improvements. We propose a novel neural implicit SDF called “SplatSDF” to fuse 3DGS and SDF-NeRF at an architecture level with significant boosts to geometric and photometric accuracy and convergence speed. Our SplatSDF relies on 3DGS as input only during training, and keeps the same complexity and efficiency as the original SDF-NeRF during inference. Our method outperforms state-of-the-art SDF-NeRF models on geometric and photometric evaluation by the time of submission.

📄 Paper | 🌐 Project Page | 💻 Code

2023:

1. [CVPR '24] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Authors: Tianyi Xie, Zeshun Zong, Yuxin Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang

Abstract

We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principles. A defining characteristic of our method is the seamless integration between physical simulation and visual rendering: both components utilize the same 3D Gaussian kernels as their discrete representations. This negates the necessity for triangle/tetrahedron meshing, marching cubes, "cage meshes," or any other geometry embedding, highlighting the principle of "what you see is what you simulate (WS2)." Our method demonstrates exceptional versatility across a wide variety of materials--including elastic entities, metals, non-Newtonian fluids, and granular materials--showcasing its strong capabilities in creating diverse visual content with novel viewpoints and movements.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation
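
As a rough illustration of how a Gaussian kernel can carry a kinematic deformation, the sketch below applies a local deformation gradient `F` to a kernel's covariance using the standard rule for a Gaussian under an affine map, `F Σ Fᵀ`. The helper names and example values are hypothetical, the simulation step that would produce `F` is omitted, and this is not the authors' implementation.

```python
def matmul(a, b):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(m):
    """Transpose a 3x3 matrix given as nested lists."""
    return [[m[j][i] for j in range(3)] for i in range(3)]


def deform_covariance(cov, F):
    """Covariance of a Gaussian after the affine map x -> F x, i.e. F cov F^T."""
    return matmul(matmul(F, cov), transpose(F))


# Isotropic kernel stretched 2x along x by a hypothetical deformation gradient.
cov = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
F = [[2.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
deformed = deform_covariance(cov, F)
```

Because the same kernel is used for simulation and rendering, the deformed covariance can be passed straight to the splatting step without building a mesh.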

2. [CVPR '24] SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Authors: Antoine Guédon, Vincent Lepetit

Abstract

We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting. Gaussian Splatting has recently become very popular as it yields realistic rendering while being significantly faster to train than NeRFs. It is, however, challenging to extract a mesh from the millions of tiny 3D Gaussians, as these Gaussians tend to be unorganized after optimization and no method has been proposed so far. Our first key contribution is a regularization term that encourages the Gaussians to align well with the surface of the scene. We then introduce a method that exploits this alignment to sample points on the real surface of the scene and extract a mesh from the Gaussians using Poisson reconstruction, which is fast, scalable, and preserves details, in contrast to the Marching Cubes algorithm usually applied to extract meshes from Neural SDFs. Finally, we introduce an optional refinement strategy that binds Gaussians to the surface of the mesh, and jointly optimizes these Gaussians and the mesh through Gaussian Splatting rendering. This enables easy editing, sculpting, rigging, animating, compositing and relighting of the Gaussians using traditional software by manipulating the mesh instead of the Gaussians themselves. Retrieving such an editable mesh for realistic rendering is done within minutes with our method, compared to hours with the state-of-the-art methods on neural SDFs, while providing a better rendering quality.

📄 Paper | 🌐 Project Page | 💻 Code | 🎥 Short Presentation

3. NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance

Authors: Hanlin Chen, Chen Li, Gim Hee Lee

Abstract

Existing neural implicit surface reconstruction methods have achieved impressive performance in multi-view 3D reconstruction by leveraging explicit geometry priors such as depth maps or point clouds as regularization. However, the reconstruction results still lack fine details because of the over-smoothed depth map or sparse point cloud. In this work, we propose a neural implicit surface reconstruction pipeline with guidance from 3D Gaussian Splatting to recover highly detailed surfaces. The advantage of 3D Gaussian Splatting is that it can generate dense point clouds with detailed structure. Nonetheless, a naive adoption of 3D Gaussian Splatting can fail since the generated points are the centers of 3D Gaussians that do not necessarily lie on the surface. We thus introduce a scale regularizer to pull the centers close to the surface by enforcing the 3D Gaussians to be extremely thin. Moreover, we propose to refine the point cloud from 3D Gaussian Splatting with the normal priors from the surface predicted by neural implicit models instead of using a fixed set of points as guidance. Consequently, the quality of surface reconstruction improves from the guidance of the more accurate 3D Gaussian Splatting. By jointly optimizing the 3D Gaussian Splatting and the neural implicit model, our approach benefits from both representations and generates complete surfaces with intricate details. Experiments on Tanks and Temples verify the effectiveness of our proposed method.

📄 Paper
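
A minimal sketch of what a scale regularizer of this kind might look like, assuming each Gaussian has three positive axis scales and that the penalty is the mean of the smallest scale per Gaussian. The exact loss used in the paper may differ; `scale_regularizer` and the sample values are hypothetical.

```python
def scale_regularizer(scales):
    """Mean of the smallest axis scale per Gaussian.

    Driving this value toward zero flattens each Gaussian into a thin
    disc, which pulls its center toward the surface it approximates.
    """
    if not scales:
        return 0.0
    return sum(min(s) for s in scales) / len(scales)


# Two Gaussians with hypothetical per-axis scales.
scales = [(0.5, 0.4, 0.3), (1.0, 0.2, 0.1)]
loss = scale_regularizer(scales)
```

In a training loop this term would be added to the rendering loss with a small weight, so that thinness is encouraged without overriding photometric fit.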

Misc:

Modeling dynamic, large-scale urban scenes is challenging due to their highly intricate geometric structures and unconstrained dynamics in both space and time. Prior methods often employ high-level architectural priors, separating static and dynamic elements, resulting in suboptimal capture of their synergistic interactions. To address this challenge, we present a unified representation model, called Periodic Vibration Gaussian (PVG). PVG builds upon the efficient 3D Gaussian splatting technique, originally designed for static scene representation, by introducing periodic vibration-based temporal dynamics. This innovation enables PVG to elegantly and uniformly represent the characteristics of various objects and elements in dynamic urban scenes. To enhance temporally coherent representation learning with sparse training data, we introduce a novel flow-based temporal smoothing mechanism and a position-aware adaptive control strategy. Extensive experiments on Waymo Open Dataset and KITTI benchmarks demonstrate that PVG surpasses state-of-the-art alternatives in both reconstruction and novel view synthesis for both dynamic and static scenes. Notably, PVG achieves this without relying on manually labeled object bounding boxes or expensive optical flow estimation. Moreover, PVG exhibits 50/6000-fold acceleration in training/rendering over the best alternative.

📄 Paper | 🌐 Project Page | 💻 Code (not yet)
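
To make "periodic vibration-based temporal dynamics" concrete, here is a small sketch of one way a Gaussian's center could oscillate around a rest position over time. The sinusoidal form, the parameter names (`tau` for the reference time, `period` for the cycle length, `v` for the direction of motion) and the example values are assumptions for illustration, not details taken from the abstract above.

```python
import math


def vibrating_mean(mu, v, t, tau, period):
    """Center at time t: mu + (period / 2π) * sin(2π (t - tau) / period) * v.

    At t == tau the center sits at mu and moves with velocity v;
    the motion repeats every `period` time units.
    """
    phase = 2.0 * math.pi * (t - tau) / period
    amplitude = period / (2.0 * math.pi) * math.sin(phase)
    return tuple(m + amplitude * vi for m, vi in zip(mu, v))


# Hypothetical Gaussian moving along x, evaluated at its reference time.
pos = vibrating_mean((1.0, 2.0, 3.0), (1.0, 0.0, 0.0), t=0.5, tau=0.5, period=2.0)
```

A static element is the special case `v = (0, 0, 0)`, which is one reason a single representation can cover both static and dynamic content.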

Regularization and Optimization:

We present a method named iComMa to address the 6D pose estimation problem in computer vision. The conventional pose estimation methods typically rely on the target's CAD model or necessitate specific network training tailored to particular object classes. Some existing methods address mesh-free 6D pose estimation by employing the inversion of a Neural Radiance Field (NeRF), aiming to overcome the aforementioned constraints. However, they still suffer from adverse initializations. By contrast, we model the pose estimation as the problem of inverting the 3D Gaussian Splatting (3DGS) with both the comparing and matching loss. In detail, a render-and-compare strategy is adopted for the precise estimation of poses. Additionally, a matching module is designed to enhance the model's robustness against adverse initializations by minimizing the distances between 2D keypoints. This framework systematically incorporates the distinctive characteristics and inherent rationale of render-and-compare and matching-based approaches. This comprehensive consideration equips the framework to effectively address a broader range of intricate and challenging scenarios, including instances with substantial angular deviations, all while maintaining a high level of prediction accuracy. Experimental results demonstrate the superior precision and robustness of our proposed jointly optimized framework when evaluated on synthetic and complex real-world data in challenging scenarios.

📄 Paper | 💻 Code

Rendering:

Neural Radiance Fields (NeRFs) have demonstrated the remarkable potential of neural networks to capture the intricacies of 3D objects. By encoding the shape and color information within neural network weights, NeRFs excel at producing strikingly sharp novel views of 3D objects. Recently, numerous generalizations of NeRFs utilizing generative models have emerged, expanding their versatility. In contrast, Gaussian Splatting (GS) offers similar rendering quality with faster training and inference, as it does not need neural networks to work. We encode information about the 3D objects in a set of Gaussian distributions that can be rendered in 3D similarly to classical meshes. Unfortunately, GS models are difficult to condition since they usually require around a hundred thousand Gaussian components. To mitigate the caveats of both models, we propose a hybrid model that uses a GS representation of the 3D object's shape and NeRF-based encoding of color and opacity. Our model uses Gaussian distributions with trainable positions (i.e. means of Gaussian), shape (i.e. covariance of Gaussian), color and opacity, and a neural network that takes the Gaussian parameters and viewing direction to produce changes in color and opacity. Consequently, our model better describes shadows, light reflections, and transparency of 3D objects.

📄 Paper | 💻 Code

Reviews:

📄 Paper

SLAM:

The integration of neural rendering and the SLAM system recently showed promising results in joint localization and photorealistic view reconstruction. However, existing methods, fully relying on implicit representations, are so resource-hungry that they cannot run on portable devices, which deviates from the original intention of SLAM. In this paper, we present Photo-SLAM, a novel SLAM framework with a hyper primitives map. Specifically, we simultaneously exploit explicit geometric features for localization and learn implicit photometric features to represent the texture information of the observed environment. In addition to actively densifying hyper primitives based on geometric features, we further introduce a Gaussian-Pyramid-based training method to progressively learn multi-level features, enhancing photorealistic mapping performance. Extensive experiments with monocular, stereo, and RGB-D datasets show that our proposed system Photo-SLAM significantly outperforms current state-of-the-art SLAM systems for online photorealistic mapping, e.g., PSNR is 30% higher and rendering speed is hundreds of times faster in the Replica dataset. Moreover, Photo-SLAM can run at real-time speed on an embedded platform such as Jetson AGX Orin, showing its potential for robotics applications.

📄 Paper | 🌐 Project Page | 💻 Code

Sparse:

We introduce the Splatter Image, an ultra-fast approach for monocular 3D object reconstruction which operates at 38 FPS. Splatter Image is based on Gaussian Splatting, which has recently brought real-time rendering, fast training, and excellent scaling to multi-view reconstruction. For the first time, we apply Gaussian Splatting in a monocular reconstruction setting. Our approach is learning-based, a


MrNeRF/awesome-3D-gaussian-splatting



Table of contents

Seminal Paper introducing 3D Gaussian Splatting:

3D Gaussian Splatting for Real-Time Radiance Field Rendering

3D Object Detection:

2024:

1. 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Autonomous Driving:

2024:

1. Street Gaussians for Modeling Dynamic Urban Scenes

2. TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Surrounding Autonomous Driving Scenes

3. OmniRe: Omni Urban Scene Reconstruction

4. SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving

2023:

1. [CVPR '24] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

2. [CVPR '24] HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting

Avatars:

2024:

1. GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting

2. PSAvatar: A Point-based Morphable Shape Model for Real-Time Head Avatar Creation with 3D Gaussian Splatting

3. Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

4. HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

5. ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting

6. GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians

7. GVA: Reconstructing Vivid 3D Gaussian Avatars from Monocular Videos

8. [CVPR '24] SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting

9. SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface

10. HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

11. [CVPRW '24] Gaussian Splatting Decoder for 3D‑aware Generative Adversarial Networks

12. GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

13. OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

14. [CVPR '24] Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

15. [NeurIPS '24] Generalizable and Animatable Gaussian Head Avatar

16. [SIGGRAPH Asia '24] DualGS: Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos

17. [SIGGRAPH Asia '24] V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians

2023:

1. Drivable 3D Gaussian Avatars

2. SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

3. [CVPR '24] Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

4. [CVPR '24] GART: Gaussian Articulated Template Models

5. [CVPR '24] Human Gaussian Splatting: Real-time Rendering of Animatable Avatars

6. [CVPR '24] HUGS: Human Gaussian Splats

7. [CVPR '24] Gaussian Shell Maps for Efficient 3D Human Generation

8. GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation

9. [CVPR '24] GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians

10. [CVPR '24] GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis

11. GauHuman: Articulated Gaussian Splatting from Monocular Human Videos

12. HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting

13. [CVPR '24] HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting

14. [CVPR '24] GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

15. [CVPR '24] FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding

16. [CVPR '24] Relightable Gaussian Codec Avatars

17. MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar

18. [CVPR '24] ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

19. [CVPR '24] 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

20. [CVPR '24] GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

21. Deformable 3D Gaussian Splatting for Animatable Human Avatars

22. Human101: Training 100+FPS Human Gaussians in 100s from 1 View

23. [CVPR '24] Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

24. HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

Classic work:

1. A Generalization of Algebraic Surface Drawing

2. Approximate Differentiable Rendering with Algebraic Surfaces

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

4. Generating and Real-Time Rendering of Clouds

Compression:

2024:

1. [I3D '24] Reducing the Memory Footprint of 3D Gaussian Splatting

2. [CVPR '24] Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

3. HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

4. [ECCV '24] End-to-End Rate-Distortion Optimized 3D Gaussian Representation

5. 3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods

6. LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming

7. Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation