# Reviving Undersampling for Long-Tailed Learning

Authors: Hao Yu, Yingxiao Du, Jianxin Wu
## Introduction

This repository provides an implementation of the paper "Reviving Undersampling for Long-Tailed Learning", built on MiSLAS. We aim to improve the accuracy of the worst-performing categories and use the harmonic mean and geometric mean of per-class accuracies to assess model performance. We revive balanced undersampling, which yields a more equitable distribution of accuracy across categories, and devise a straightforward model-ensemble strategy that incurs no additional inference overhead and improves the harmonic and geometric means while maintaining the average accuracy. BTM is a simple and efficient framework for long-tailed recognition.
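For reference, the three metrics over per-class accuracies can be computed as below (a minimal sketch with function names of our own choosing, not code from this repo; it assumes strictly positive per-class accuracies so the harmonic and geometric means are defined):

```python
import numpy as np

def summarize_per_class_accuracy(acc):
    """Summarize per-class accuracies (values in (0, 1])."""
    acc = np.asarray(acc, dtype=np.float64)
    mean_acc = acc.mean()                    # the usual average accuracy
    harmonic = len(acc) / (1.0 / acc).sum()  # dominated by the worst classes
    geometric = np.exp(np.log(acc).mean())   # also sensitive to low-accuracy classes
    return mean_acc, harmonic, geometric

# Example: one near-zero class drags the harmonic mean far below the average.
print(summarize_per_class_accuracy([0.9, 0.8, 0.05]))
```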
## Requirements
- Python 3.8
- torchvision 0.13.0
- PyTorch 1.12.0
## Dataset Preparation

Change the `data_path` in `config/*/*.yaml` accordingly.
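For example, the relevant field looks like this (a hypothetical excerpt; the exact filename and surrounding keys may differ across configs):

```yaml
# config/imagenet/imagenet_resnet50_stage1_mixup.yaml (excerpt, hypothetical)
data_path: /your/path/to/ImageNet_LT
```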
## Stage-1

To get a Stage-1 model, you can download one directly from MiSLAS, or run:

```bash
python train_stage1.py --cfg ./config/DATASETNAME/DATASETNAME_ARCH_stage1_mixup.yaml
```
`DATASETNAME` can be selected from `imagenet`, `ina2018`, and `places`. `ARCH` can be `resnet50/101/152` for `imagenet`, `resnet50` for `ina2018`, and `resnet152` for `places`, respectively.
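For example, instantiating the pattern above for ImageNet-LT with ResNet-50:

```bash
python train_stage1.py --cfg ./config/imagenet/imagenet_resnet50_stage1_mixup.yaml
```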
## BTM

To train a model with undersampling, run:

```bash
python train_stage1_bl_10_classifier.py --cfg ./config/DATASETNAME/DATASETNAME_ARCH_stage1_mixup_bl_10_calssifier.yaml
```
Modify line 221, `train_loader = dataset.bl_train_10_0_instance`, to `bl_train_10_1_instance`, `bl_train_10_2_instance`, etc. to obtain models trained on different balanced subsets; a sketch of the sampling idea follows below.
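For intuition, here is a minimal sketch of balanced undersampling (our illustration, not the repo's dataset code): each subset keeps at most the same number of instances per class, and different seeds correspond to the different `bl_train_10_*` subsets.

```python
import random
from collections import defaultdict

def balanced_undersample(labels, per_class, seed=0):
    """Return indices of a class-balanced subset with `per_class` samples per class.

    Different seeds (0, 1, 2, ...) give different balanced subsets, analogous
    to the bl_train_10_0 / bl_train_10_1 / ... loaders.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    subset = []
    for idxs in by_class.values():
        rng.shuffle(idxs)
        subset.extend(idxs[:per_class])  # tail classes contribute all they have
    return subset
```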
Then run:

```bash
python merge.py
```

to get the fusion model. Modify lines 19-28 of `merge.py` to point to the real model checkpoint paths.
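A minimal sketch of this kind of parameter-space fusion (averaging the weights of the separately trained models, which keeps inference cost at a single forward pass; this is our reading of what `merge.py` does, not its verbatim code, and the paths and `state_dict` layout are assumptions):

```python
import torch

# Hypothetical paths -- point these at your real checkpoints,
# as done on lines 19-28 of merge.py.
paths = ["ckpt_bl_10_0.pth.tar", "ckpt_bl_10_1.pth.tar", "ckpt_bl_10_2.pth.tar"]

# Assuming MiSLAS-style checkpoints that store weights under "state_dict".
state_dicts = [torch.load(p, map_location="cpu")["state_dict"] for p in paths]

# Average every parameter/buffer key-by-key; the fused model costs exactly
# one forward pass at inference, hence no additional overhead.
fused = {
    k: torch.stack([sd[k].float() for sd in state_dicts]).mean(dim=0)
    for k in state_dicts[0]
}

torch.save({"state_dict": fused}, "fusion_model.pth.tar")
```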
## Stage-2

To train a Stage-2 model, run:

```bash
python train_stage2.py --cfg ./config/DATASETNAME/DATASETNAME_ARCH_stage2_mislas.yaml resume /path/to/checkpoint/BTM
```
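For instance, resuming from the fused BTM checkpoint on Places-LT with ResNet-152 (the checkpoint path is a placeholder):

```bash
python train_stage2.py --cfg ./config/places/places_resnet152_stage2_mislas.yaml resume /path/to/checkpoint/BTM
```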
The saved folder (including logs and checkpoints) is organized as follows:

```
MiSLAS
├── saved
│   ├── modelname_date
│   │   ├── ckps
│   │   │   ├── current.pth.tar
│   │   │   └── model_best.pth.tar
│   │   └── logs
│   │       └── modelname.txt
│   ...
```
## Evaluation

To evaluate a trained model, run:

```bash
python eval.py --cfg ./config/DATASETNAME/DATASETNAME_ARCH_stage1_mixup.yaml resume /path/to/checkpoint/stage1
python eval.py --cfg ./config/DATASETNAME/DATASETNAME_ARCH_stage2_mislas.yaml resume /path/to/checkpoint/stage2
```
## Citation

```
@article{yu2024reviving,
  title   = {Reviving Undersampling for Long-Tailed Learning},
  author  = {Yu, Hao and Du, Yingxiao and Wu, Jianxin},
  journal = {arXiv preprint arXiv:2401.16811},
  year    = {2024}
}
```
## Contact

If you have any questions about our work, feel free to contact us via email (Hao Yu: [email protected]) or GitHub issues.