
[CVPR 2025] HRAvatar: High-Quality and Relightable Gaussian Head Avatar

Dongbin Zhang1,2, Yunfei Liu2, Lijian Lin2, Ye Zhu2, Kangjie Chen1, Minghan Qin1, Yu Li2†, Haoqian Wang1†
( 1Tsinghua University, 2International Digital Economy Academy )

🌐 Webpage | 📄 Full Paper | 🎥 Video

📌 Introduction

This repository contains the official author's implementation associated with the paper "HRAvatar: High-Quality and Relightable Gaussian Head Avatar".

HRAvatar is a 3DGS-based method that reconstructs high-fidelity, relightable 3D head avatars, achieving real-time rendering and realistic visual effects under varying lighting conditions.

Teaser image

Pipeline
Pipeline of HRAvatar

🖥️ Cloning the Repository

# SSH
git clone git@github.com:Pixel-Talk/HRAvatar.git

or

# HTTPS
git clone https://github.com/Pixel-Talk/HRAvatar.git

The components have been tested on Ubuntu 20.04. Instructions for setting up and running each of them are given in the sections below.

📂 Datasets preparation

Download the INSTA dataset (with masks already extracted) from the INSTA project. The dataset can be accessed here.

The HDTF videos we used can be downloaded from here.

🛠️ Setup

Optimizer

The optimizer uses PyTorch and CUDA extensions in a Python environment to produce trained models.

Hardware Requirements

  • CUDA-ready GPU with Compute Capability 7.0+
  • 24 GB VRAM (to train to paper evaluation quality)

Software Requirements

  • Conda (recommended for easy setup)
  • C++ Compiler for PyTorch extensions
  • CUDA SDK 11 for PyTorch extensions (we used 11.7)
  • C++ Compiler and CUDA SDK must be compatible

Environment Setup

Our default, provided install method is based on Conda package and environment management:

conda env create --file environment.yml
conda activate HRAvatar
cd submodules
git clone https://github.com/NVlabs/nvdiffrast.git
pip install ./nvdiffrast
pip install ./diff-gaussian-rasterization_c10
pip install ./simple-knn
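
After installation, a quick sanity check (a minimal sketch, assuming the HRAvatar environment created above is active) confirms that PyTorch sees the GPU and that nvdiffrast imports correctly:

# optional sanity check
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
python -c "import nvdiffrast.torch as dr; print('nvdiffrast OK')"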

🔧 Data Preprocessing

Data preprocessing for each video includes several steps: frame extraction, foreground extraction, keypoint estimation, and face tracking.

For the INSTA dataset, we directly use the provided masks.

# example script
bash preprocess/preprocess_shell/insta/bala_preprocess.sh

For the HDTF dataset or custom videos, you can run the following script:

# example script
bash preprocess/preprocess_shell/HDTF/marcia_preprocess.sh
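
If you want to inspect the frame-extraction step on your own footage before running the full script, it can be reproduced manually with ffmpeg (a hypothetical sketch; the frames/ folder name is illustrative and not necessarily the layout the preprocessing scripts expect):

# hypothetical example: dump video frames with ffmpeg
mkdir -p /path/to/subject/frames
ffmpeg -i /path/to/subject/video.mp4 /path/to/subject/frames/%05d.png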

Use Intrinsic Anything to extract albedo as pseudo-GT.

# example script
bash preprocess/preprocess_shell/extract_albedo.sh

For more details on data preprocessing, refer to Data_Preprocessing.

Environment map filtering is described in Filter_Envmap.

🎯 Training

For the INSTA dataset

# example script
CUDA_VISIBLE_DEVICES=0  python train.py --source_path /path/to/bala \
  --model_path outputs/insta/bala  --eval  --test_set_num 350  --epochs 15 

For the HDTF dataset

# example script
CUDA_VISIBLE_DEVICES=0  python train.py --source_path /path/to/marcia \
  --model_path outputs/HDTF/marcia  --eval  --test_set_num 500  --epochs 15 \
  --max_reflectance 0.8 --min_reflectance 0.04 --with_envmap_consist

For a custom dataset

# example script
# Note: Lower learning rates can lead to better geometry 
#       but may degrade quantitative metrics (e.g., PSNR, SSIM)
CUDA_VISIBLE_DEVICES=0  python train.py --source_path /path/to/subject \
  --model_path outputs/custom/subject  --eval  --test_set_num 500  --epochs 15 \
  --max_reflectance 0.8 --min_reflectance 0.04 --with_envmap_consist \
  --expression_dirs_lr 1e-7 --pose_dirs_lr 1e-7 --shape_dirs_lr 1e-8 \
  --position_lr_init 5e-5 --position_lr_final 5e-7
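
To train several subjects back to back, the same command can be wrapped in a shell loop (a sketch; the subject names and paths are placeholders):

# hypothetical batch script: train multiple custom subjects sequentially
for SUBJECT in subject_a subject_b; do
  CUDA_VISIBLE_DEVICES=0 python train.py --source_path /path/to/$SUBJECT \
    --model_path outputs/custom/$SUBJECT --eval --test_set_num 500 --epochs 15 \
    --max_reflectance 0.8 --min_reflectance 0.04 --with_envmap_consist
done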

🎨 Rendering

Render the training and testing results
(This is automatically done after training by default)

# example script
CUDA_VISIBLE_DEVICES=0 python render.py  --model_path outputs/insta/bala

To render other outputs, add the following arguments to render.py (a combined example follows this list):

--skip_test # Skip rendering self-reenactment test set results
--skip_train # Skip rendering self-reenactment training set results
--render_albedo # Render albedo component
--render_normal # Render normal component
--render_irradiance # Render irradiance component
--render_specular # Render specular component
--render_roughness  # Render roughness component
--render_reflectance # Render reflectance component
--render_depth  # Render depth map
--render_envmap # Visualize optimized environment map
--render_relighting # Perform relighting render
--with_relight_background # Use input environment map as background during relighting
--envmap_path assets/envmaps/cobblestone_street  # Filtered environment map for relighting
--render_material_editing # Render material editing results (gradually increase reflectance)
--corss_source_path  # Render cross-reenactment results (specify the processed data path of another subject)
--test_static_material_edting_idxs 100 # Apply material editing to a specific image
--test_static_relight_idxs 100  # Apply relighting to a specific image
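
For instance, a relighting render that also exports intrinsic components could combine the flags above as follows (a sketch; the environment map path is the one shipped under assets/envmaps):

# example: relighting plus intrinsic component renders for a trained model
CUDA_VISIBLE_DEVICES=0 python render.py --model_path outputs/insta/bala \
  --render_relighting --with_relight_background \
  --envmap_path assets/envmaps/cobblestone_street \
  --render_albedo --render_normal --render_roughness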

📊 Evaluation

(This is automatically done after training by default)

# example script
python metrics.py --model_path outputs/insta/bala

📖 BibTeX

If you find our work helpful, please cite:

@InProceedings{HRAvatar,
    author    = {Zhang, Dongbin and Liu, Yunfei and Lin, Lijian and Zhu, Ye and Chen, Kangjie and Qin, Minghan and Li, Yu and Wang, Haoqian},
    title     = {HRAvatar: High-Quality and Relightable Gaussian Head Avatar},
    booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {26285-26296}
}

🙏 Acknowledgement

We sincerely appreciate the contributions of the research community that made this work possible.

Our implementation is built upon the PyTorch framework for 3D Gaussian Splatting from GraphDeco-Inria. We thank the authors for their open-source efforts and inspiring research.

We also acknowledge the developers and maintainers of the following projects, which played a crucial role in our implementation:

  • nvdiffrast: NVIDIA's differentiable rasterizer, used for efficient shading.
  • diff-gaussian-rasterization: The differentiable Gaussian rasterization library, essential for rendering.
  • FLAME: A 3D head model that provided a strong foundation for our work.
  • SMIRK: For providing a valuable framework for FLAME parameter estimation.
  • INSTA and HDTF datasets, which we used for training and evaluation.

Finally, we thank our collaborators, colleagues, and the open-source community for their valuable discussions and continuous support.
