EAN-efficient-attention-network

By Zhongzhan Huang, Senwei Liang, Mingfu Liang, Wei He and Haizhao Yang.

The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug'' [paper].

Introduction

Efficient Attention Network (EAN) is a framework to improve the efficiency for the existing attention modules in computer vision. In EAN, we leverage the sharing mechanism (Huang et al. 2020) to share the attention module within the backbone and search where to connect the shared attention module via reinforcement learning.

Requirement

Python 3.6 and PyTorch 1.0

Implementation

Our implementation is divided in three parts. First, we pre-train a supernet. Second, we use a policy-gradient-based method to search for an optimal connection scheme from the supernet. Last, we train from scratch a network searched by the second step.

Pretrain a Supernet

First, we pretrain a supernet and the checkpoint is saved in NAS_ckpts. For example, we train a SGE-supernet,

CUDA_VISIBLE_DEVICES=0,1,2,3 python train_imagenet/train_imagenet_ensemble_subset.py -a forward_config_share_sge_resnet50 -data /home/jovyan/ILSVRC2012_Data --checkpoint NAS_ckpts/ensemble_sge_train_on_subset

or train a DIA-supernet,

CUDA_VISIBLE_DEVICES=0,1,2,3 python train_imagenet/train_imagenet_ensemble_subset.py -a forward_dia_fbresnet50 -data /home/jovyan/ILSVRC2012_Data --checkpoint NAS_ckpts/ensemble_dia_train_on_subset

Search an Optimal Connection Scheme

Then, we search an optimal connection scheme from supernet.

For SGE,

python search_imagenet/run_code_search_sge.py

For DIA,

python search_imagenet/run_code_search_dia.py

Train a Network From Scratch

Last, we train from scracth the attention network with the connection scheme searched in the second step. Note that to train the attention network with the different scheme, we need to edit train_imagenet/run_codes_train_from_scratch.py

python train_imagenet/run_codes_train_from_scratch.py

The checkpoints will be save in NAS_ckpts.

Citation

If you find this paper helps in your research, please kindly cite

Acknowledgement

We would like to thank Taehoon Kim for his pytorch version of ENAS fromework and Xiang Li for his attention network framework.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
image		image
search_imagent		search_imagent
train_imagenet		train_imagenet
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EAN-efficient-attention-network

Introduction

Requirement

Implementation

Pretrain a Supernet

Search an Optimal Connection Scheme

Train a Network From Scratch

Citation

Acknowledgement

About

Releases

Packages

Contributors 3

Languages

License

gbup-group/EAN-efficient-attention-network

Folders and files

Latest commit

History

Repository files navigation

EAN-efficient-attention-network

Introduction

Requirement

Implementation

Pretrain a Supernet

Search an Optimal Connection Scheme

Train a Network From Scratch

Citation

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages