We provide the training and testing code for SAN, implemented in PyTorch. This project is inspired by supervision-by-registration.
- Download 300W-Style and AFLW-Style from Google Drive, and extract the downloaded files into `~/datasets/`.
- In the 300W-Style and AFLW-Style directories, the `Original` sub-directory contains the original images from 300-W and AFLW.
- The sketch, light, and gray style images are used to analyze the image style variance in facial landmark detection.
- For simplicity, we changed some file names, e.g., removing spaces or unifying the file extension.
If you cannot download the datasets via the above link, please try:
- 300W-Style : https://drive.google.com/open?id=1wy9ZUSWE4V2WdMbXS3Jkq7kQEYHSpLk1
- AFLW-Style : https://drive.google.com/open?id=1y5JrOd86NGHTPZLYLgeqRJzT2T4ACWJe
`300W-Style.tgz` should be extracted into `~/datasets/300W-Style` by typing `tar xzvf 300W-Style.tgz ; mv 300W-Convert 300W-Style`. It has the following structure:
```
--300W-Gray
  --300W ; afw ; helen ; ibug ; lfpw
--300W-Light
  --300W ; afw ; helen ; ibug ; lfpw
--300W-Sketch
  --300W ; afw ; helen ; ibug ; lfpw
--300W-Original
  --300W ; afw ; helen ; ibug ; lfpw
--Bounding_Boxes
  --*.mat
```
`AFLW-Style.tgz` should be extracted into `~/datasets/AFLW-Style` by typing `tar xzvf AFLW-Style.tgz ; mv AFLW-Convert AFLW-Style`. It has the following structure (`annotation` is generated by `python aflw_from_mat.py`):
```
--aflw-Gray
  --0 2 3
--aflw-Light
  --0 2 3
--aflw-Sketch
  --0 2 3
--aflw-Original
  --0 2 3
--annotation
  --0 2 3
```
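If you prefer to do the extraction and renaming from Python rather than the shell, the sketch below is equivalent to the `tar`/`mv` commands above (it assumes both archives were downloaded into `~/datasets/`); it is only an illustration, not part of the repository.
```python
import os
import tarfile

# Equivalent of: tar xzvf <archive>; mv <extracted_name> <target_name>
# (paths assume the archives were downloaded into ~/datasets/)
datasets = os.path.expanduser("~/datasets")
archives = {
    "300W-Style.tgz": ("300W-Convert", "300W-Style"),
    "AFLW-Style.tgz": ("AFLW-Convert", "AFLW-Style"),
}

for archive, (extracted_name, target_name) in archives.items():
    with tarfile.open(os.path.join(datasets, archive), "r:gz") as tar:
        tar.extractall(path=datasets)
    os.rename(os.path.join(datasets, extracted_name),
              os.path.join(datasets, target_name))
```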
Generate the data list files:
```
cd cache_data
python aflw_from_mat.py
python generate_300W.py
```
The generated list files will be saved into `./cache_data/lists/300W` and `./cache_data/lists/AFLW`.
```
python crop_pic.py
```
The above command pre-crops the face images and saves them into `./cache_data/cache/300W` and `./cache_data/cache/AFLW`.
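As a quick, optional sanity check (not part of the repository), you can count the cached crops produced by the step above; the directory names are the ones listed in that step.
```python
import glob
import os

# Count the pre-cropped face images written by crop_pic.py
# (directory names taken from the step above).
for name in ("300W", "AFLW"):
    cache_dir = os.path.join(".", "cache_data", "cache", name)
    n_files = len(glob.glob(os.path.join(cache_dir, "*")))
    print(f"{cache_dir}: {n_files} files")
```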
- Step-1 : cluster images into different groups, for example `bash scripts/300W/300W_Cluster.sh 0,1 GTB 3`.
- Step-2 : use `bash scripts/300W/300W_CYCLE_128.sh 0,1 GTB` or `bash scripts/300W/300W_CYCLE_128.sh 0,1 DET` to train SAN on 300-W. `GTB` means using the ground-truth face bounding box, and `DET` means using the face detection results from a pre-trained detector (these results are provided on the official 300-W website).
- Step-1 : cluster images into different groups, for example `bash scripts/AFLW/AFLW_Cluster.sh 0,1 GTB 3`.
- Step-2 : use `bash scripts/AFLW/AFLW_CYCLE_128.FULL.sh` or `bash scripts/AFLW/AFLW_CYCLE_128.FRONT.sh` to train SAN on AFLW.
You can download a pre-trained model (trained on 300-W) from the `snapshots` directory here. Put it in `snapshots` and use the following command to evaluate a single image. This command prints the location of each landmark and saves the visualization result to `temp_1.png`.
```
CUDA_VISIBLE_DEVICES=1 python san_eval.py --image ./cache_data/cache/test_1.jpg --model ./snapshots/SAN_300W_GTB_itn_cpm_3_50_sigma4_128x128x8/checkpoint_49.pth.tar --face 819.27 432.15 971.70 575.87 --save_path temp_1.png
```
The `image` parameter is the path of the image to be evaluated, `model` is the trained SAN model, and `face` denotes the coordinates of the face bounding box.
The ground-truth landmark annotation for `./cache_data/cache/test_1.jpg` is `./cache_data/cache/test_1.pts`.
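The `.pts` annotation files follow the standard 300-W layout: a short header (`version`, `n_points`) followed by one `x y` pair per line inside braces. Below is a minimal parser sketch under that assumption; `load_pts` is an illustrative helper, not part of the repository.
```python
import numpy as np

def load_pts(path):
    """Parse a 300-W style .pts file into an (n_points, 2) array.

    Assumes the usual layout:
        version: 1
        n_points: 68
        {
        x_1 y_1
        ...
        }
    """
    with open(path) as f:
        lines = [line.strip() for line in f if line.strip()]
    start = lines.index("{") + 1
    end = lines.index("}")
    points = [tuple(map(float, line.split())) for line in lines[start:end]]
    return np.asarray(points, dtype=np.float32)

# Example (ground-truth annotation mentioned above):
# landmarks = load_pts("./cache_data/cache/test_1.pts")
# print(landmarks.shape)  # e.g. (68, 2)
```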
Download the pre-trained model from the `snapshots` directory here (it is trained on 300-W). Put it in `snapshots` and use the following commands to evaluate on 300-W and AFLW, respectively:
```
bash scripts/300W/300W-EVAL.sh 0,1
bash scripts/AFLW/AFLW_CYCLE_128.FULL-EVAL.sh 0,1
```
In Table 1 of the paper, SDM / ESR / LBF / CFSS use the inter-pupil distance for normalization, while we use the inter-ocular distance. If their results were also normalized by the inter-ocular distance, they would obtain a smaller NME.
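To make the difference concrete, here is an illustrative NME computation on toy 68-point landmarks (not the repository's evaluation code); the eye-corner and eye-region indices below follow the common 68-point 300-W ordering and are assumptions on our part.
```python
import numpy as np

# Illustrative NME computation on toy 68-point landmarks.
# Index conventions (outer eye corners 36/45, eye regions 36-41 and 42-47)
# follow the common 68-point 300-W ordering -- an assumption here.
rng = np.random.default_rng(0)
gt = rng.uniform(0, 200, size=(68, 2))

# Lay the two eyes out at roughly face-like positions: indices 36/39 are the
# outer/inner corners of the left eye, 42/45 the inner/outer corners of the right.
gt[36:42] = [[50, 80], [58, 76], [68, 76], [80, 80], [68, 84], [58, 84]]
gt[42:48] = [[120, 80], [132, 76], [142, 76], [150, 80], [142, 84], [132, 84]]

pred = gt + rng.normal(0, 2, size=(68, 2))   # simulated predictions

point_error  = np.linalg.norm(pred - gt, axis=1).mean()
inter_ocular = np.linalg.norm(gt[36] - gt[45])                        # outer corners
inter_pupil  = np.linalg.norm(gt[36:42].mean(0) - gt[42:48].mean(0))  # eye centers

# The inter-ocular distance is larger than the inter-pupil distance on a face,
# so normalizing the same point error by it yields a smaller NME.
print("NME (inter-ocular):", point_error / inter_ocular)
print("NME (inter-pupil) :", point_error / inter_pupil)
```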
If this project helps your research, please cite the following papers:
```
@inproceedings{dong2018san,
  title={Style Aggregated Network for Facial Landmark Detection},
  author={Dong, Xuanyi and Yan, Yan and Ouyang, Wanli and Yang, Yi},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  pages={379--388},
  year={2018}
}
@inproceedings{dong2018sbr,
  title={{Supervision-by-Registration}: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors},
  author={Dong, Xuanyi and Yu, Shoou-I and Weng, Xinshuo and Wei, Shih-En and Yang, Yi and Sheikh, Yaser},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  pages={360--368},
  year={2018}
}
```
To ask questions or report problems, please open an issue on the issue tracker.