Consistency and Accuracy of CelebA Attribute Values

Figure 1: Examples of the three annotation options, unusable images, ambiguous images, and edge-case images for the Mouth Slightly Open (MSO) attribute.

TL;DR

This repository provides a version of CelebA with cleaned labels for the Mouth Slightly Open (MSO) attribute.

Paper details

Haiyu Wu, Grace Bezold, Manuel Günther, Terrance Boult, Michael C. King, Kevin W. Bowyer, "Consistency and Accuracy of CelebA Attribute Values", CVPRW, 2023, arXiv:2210.07356

Abstract

We report the first systematic analysis of the experimental foundations of facial attribute classification. Two annotators independently assigning attribute values shows that only 12 of 40 common attributes are assigned values with ≥ 95% consistency, and three (high cheekbones, pointed nose, oval face) have essentially random consistency. Of 5,068 duplicate face appearances in CelebA, attributes have contradicting values on from 10 to 860 of the 5,068 duplicates. Manual audit of a subset of CelebA estimates error rates as high as 40% for (no beard=false), even though the labeling consistency experiment indicates that no beard could be assigned with ≥ 95% consistency. Selecting the mouth slightly open (MSO) for deeper analysis, we estimate the error rate for (MSO=true) at about 20% and (MSO=false) at about 2%. A corrected version of the MSO attribute values enables learning a model that achieves higher accuracy than previously reported for MSO.

Citation

If you use any part of our code or data, please cite our paper.

@inproceedings{wu2023consistency,
  title={Consistency and accuracy of celeba attribute values},
  author={Wu, Haiyu and Bezold, Grace and G{\"u}nther, Manuel and Boult, Terrance and King, Michael C and Bowyer, Kevin W},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={3257--3265},
  year={2023}
}

Dataset Cleaning

Dataset statistics before and after cleaning are shown in the table below. Note that only the MSO label values are cleaned in this paper.

	Train	Val	Test	Info_not_vis	Unusable	NMSO	MSO
Original	162,770	19,867	19,962	-	-	104,657(51.7%)	97,942(48.3%)
Cleaned	161,982	19,741	19,913	797	166	73,701(36.6%)	127,932(63.4%)
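
As a quick sanity check on the table above, the following minimal sketch verifies that the number of images removed from the three splits equals the sum of the Info_not_vis and Unusable counts. The variable names are illustrative and are not part of this repository.

```python
# Split sizes copied from the table above.
original = {"train": 162_770, "val": 19_867, "test": 19_962}
cleaned = {"train": 161_982, "val": 19_741, "test": 19_913}

# Images dropped during cleaning, per category (from the table).
info_not_vis = 797
unusable = 166

# Number of images removed from each split, and in total.
removed = {split: original[split] - cleaned[split] for split in original}
total_removed = sum(removed.values())

print(removed)
print(total_removed, info_not_vis + unusable)
```

The two totals agree, so every image missing from the cleaned splits is accounted for by one of the two drop categories.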

You can download cleaned labels here.

Accuracy

Train/Val/Test	AFFACT	MOON	ResNet50	DenseNet121
Original/Original/Original	94.16	94.09	93.95	94.10
Original/Original/Cleaned	85.17	85.94	85.24	85.98
Original/Cleaned/Cleaned	86.17	86.49	86.54	85.69
Cleaned/Cleaned/Original	86.13	85.40	85.90	85.27
Cleaned/Cleaned/Cleaned	95.18	95.49	95.33	95.15
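
To make the effect of cleaning easier to read off the table, the short sketch below computes the per-model difference between the Cleaned/Cleaned/Cleaned and Original/Original/Original rows. It assumes the table values are accuracies in percent, so the differences are in percentage points. The variable names are illustrative only.

```python
# Accuracy values copied from the table above (assumed to be percentages).
models = ["AFFACT", "MOON", "ResNet50", "DenseNet121"]
original_all = [94.16, 94.09, 93.95, 94.10]  # Original/Original/Original
cleaned_all = [95.18, 95.49, 95.33, 95.15]   # Cleaned/Cleaned/Cleaned

# Difference per model, rounded to two decimals to hide float noise.
gains = {
    name: round(after - before, 2)
    for name, before, after in zip(models, original_all, cleaned_all)
}

print(gains)
```

All four models gain between roughly 1.0 and 1.4 points when trained, validated, and tested on the cleaned labels.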

Models are available in the model zoo.

Test

Data preparation

Download the cropped and aligned CelebA images, then run the following script to build the cleaned dataset.

python drop_noise_ims.py -s /path/to/celeba/ims -f ./cleaned_mso_labels/partition.csv -d ./dataset
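
The internals of drop_noise_ims.py are not described here. As a rough sketch of what such a step could look like, the snippet below copies images into per-split folders according to a partition file. The CSV layout (a `filename` column and a `split` column) and the function name `build_dataset` are assumptions made for illustration, and they may differ from the actual script and from partition.csv.

```python
import csv
import shutil
from pathlib import Path


def build_dataset(src_dir, partition_csv, dst_dir):
    """Copy every image listed in the partition file into dst_dir/<split>/.

    Hypothetical CSV layout: a header row with 'filename' and 'split' columns.
    Images that are not listed in the CSV are simply not copied.
    Returns the number of images copied.
    """
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    copied = 0
    with open(partition_csv, newline="") as f:
        for row in csv.DictReader(f):
            src = src_dir / row["filename"]
            if not src.is_file():
                continue  # listed in the CSV but absent from the source folder
            out_dir = dst_dir / row["split"]
            out_dir.mkdir(parents=True, exist_ok=True)
            shutil.copy2(src, out_dir / src.name)
            copied += 1
    return copied
```

Under these assumptions, calling `build_dataset("/path/to/celeba/ims", "./cleaned_mso_labels/partition.csv", "./dataset")` would produce folders such as ./dataset/test/, which matches the path used by the test command.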

Download the weights from the model zoo and run the test script.

python test.py -m /path/to/weights -i ./dataset/test/ -l ./cleaned_mso_labels/label_cleaned_mso.csv
