HAA4D

HAA4D introduces a new 3D+T human skeleton video action dataset and the first few-shot skeleton-based action recognition baseline.

"HAA4D" is a challenging human action recognition 3D+T dataset that is built on top of the HAA500 dataset. The dataset is clean, diverse, class balanced, and the choice of atomic actions makes annotation even easier as each video clip lasts for only a few seconds.

The paper can be downloaded here.

Project Website: https://cse.hkust.edu.hk/haa4d

Structure of the dataset

Currently, HAA4D consists of more than 3,300 4D skeletons in 300 human atomic action classes. The dataset includes four modalities of data:

  • RGB videos
  • 2D skeletal data
  • 3D skeletal data
  • Globally aligned skeletons

RGB videos can be downloaded from https://www.cse.ust.hk/haa/.

The 2D skeletal data, 3D skeletal data, and globally aligned skeletons can be downloaded here.

Here is the hierarchical structure of the dataset:

/dataset
├── video
├── images
├── skeletons_2d
├── skeletons_3d
├── processed_data
│   ├── globally_aligned_skeletons
│   │   ├── haa4d
│   │   └── nturgb+d
│   └── normalized_skeletons_3d
└── info.json
  • 2D skeletal data can be generated with AlphaPose [1] or by human labeling. For human labeling, we provide an annotation tool with interpolation techniques that speed up the annotation process (a sketch of such interpolation appears under Annotation Tool below). The shape of the 2D skeletal data is (num_joints, 2), with the last dimension being the (x, y) coordinates of a joint. A more detailed topology of the skeleton is shown in Figure 1.

  • For 3D skeletal data, we use a 3D lifting tool, built on the open-source EvoSkeleton [2], to lift the 2D joints to 3D. The shape of the 3D skeletal data is (num_joints, 3), with the last dimension being the (x, y, z) coordinates of a joint. A minimal loading sketch follows Figure 1.

Figure 1. HAA 3D+T Skeleton Topology
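As a quick sanity check after downloading, the arrays can be loaded and their shapes verified. A minimal sketch, assuming the skeletons ship as per-clip NumPy .npy files with a leading frame dimension; the file names and container format here are hypothetical, so adjust them to the actual release:

import numpy as np

# Hypothetical paths and file format -- adjust to the actual release layout.
skel_2d = np.load("dataset/skeletons_2d/abseiling_000.npy")  # assumed (num_frames, num_joints, 2)
skel_3d = np.load("dataset/skeletons_3d/abseiling_000.npy")  # assumed (num_frames, num_joints, 3)

assert skel_2d.shape[-1] == 2 and skel_3d.shape[-1] == 3
print(f"{skel_2d.shape[0]} frames, {skel_2d.shape[1]} joints per frame")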

Get Started

  1. Run get_HAA500.py to extract the raw images from the videos:
python get_HAA500.py -p action_name
  2. To view a skeleton example, run demo.py:
python demo.py
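demo.py is the authoritative viewer; as a rough stand-in, a single frame can be scattered with matplotlib. The path and array layout below are assumptions, as above:

import numpy as np
import matplotlib.pyplot as plt

# Hypothetical quick look at one frame; demo.py is the proper viewer.
joints = np.load("dataset/skeletons_3d/abseiling_000.npy")[0]  # assumed (num_joints, 3)

ax = plt.figure().add_subplot(projection="3d")
ax.scatter(joints[:, 0], joints[:, 1], joints[:, 2])
plt.show()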

Annotation Tool

Labelling UI

See the documentation here.
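The interpolation the tool actually uses is described in its documentation; for illustration only, here is a minimal sketch of plain linear interpolation between hand-labeled keyframes (not necessarily the tool's exact method):

import numpy as np

def interpolate_keyframes(keyframes, num_frames):
    """Linearly blend 2D joints between hand-labeled keyframes.

    keyframes: dict mapping frame index -> (num_joints, 2) array.
    Returns an array of shape (num_frames, num_joints, 2); frames outside
    the labeled range are left as zeros.
    """
    idx = sorted(keyframes)
    out = np.zeros((num_frames, keyframes[idx[0]].shape[0], 2))
    for a, b in zip(idx[:-1], idx[1:]):
        for t in range(a, b + 1):
            w = (t - a) / (b - a)  # 0 at keyframe a, 1 at keyframe b
            out[t] = (1 - w) * keyframes[a] + w * keyframes[b]
    return out

Labeling only every few frames and interpolating the rest is what makes annotation fast for short atomic-action clips.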

Global Alignment

See the documentation here.
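The alignment procedure HAA4D actually uses is defined in the paper and the linked documentation. For intuition only, one simple form of global alignment centers the skeleton at a root joint and rotates it about the vertical axis so the hip line faces a canonical direction; the joint indices below are illustrative assumptions, not the Figure 1 topology:

import numpy as np

def align_skeleton(joints, root=0, l_hip=1, r_hip=4):
    """Center a (num_joints, 3) skeleton at the root joint and rotate it
    about the y-axis so the hip line lies along the x-axis.
    Joint indices here are illustrative assumptions."""
    centered = joints - joints[root]
    hip = centered[r_hip] - centered[l_hip]
    theta = np.arctan2(hip[2], hip[0])  # hip-line angle in the x-z plane
    c, s = np.cos(theta), np.sin(theta)
    rot_y = np.array([[c, 0.0, s],
                      [0.0, 1.0, 0.0],
                      [-s, 0.0, c]])
    return centered @ rot_y.T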

References

[1] Fang HS, Xie S, Tai YW, Lu C. RMPE: Regional multi-person pose estimation. In Proceedings of the IEEE International Conference on Computer Vision, 2017 (pp. 2334-2343).

[2] Li S, Ke L, Pratama K, Tai YW, Tang CK, Cheng KT. Cascaded deep monocular 3D human pose estimation with evolutionary training data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.

Citation

To cite our dataset, please use the following BibTeX record:

@misc{tseng2022haa4d,
	title={HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment}, 
	author={Mu-Ruei Tseng and Abhishek Gupta and Chi-Keung Tang and Yu-Wing Tai},
	year={2022},
	eprint={2202.07308},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
}
