AutoSlim: Towards One-Shot Architecture Search for Channel Numbers with TF2

Implementation of Autoslim using Tensorflow2.
Slimmable layers are implemented by masking, so they can be run in static mode, which is much faster than eager execution.
WResNet, ResNet, and MobileNet-v2 are available, and you can find their trained parameter in each link.

Updates

Gradients aggregated SGD is implemented for memory efficiency.

Note that

This repositories hyper-parameter setting is different from the authors' one due to the limitation of the hardware.

Paper's abstract

We study how to set channel numbers in a neural network to achieve better accuracy under constrained resources (e.g., FLOPs, latency, memory footprint or model size). A simple and one-shot solution, named AutoSlim, is presented. Instead of training many network samples and searching with reinforcement learning, we train a single slimmable network to approximate the network accuracy of different channel configurations. We then iteratively evaluate the trained slimmable model and greedily slim the layer with minimal accuracy drop. By this single pass, we can obtain the optimized channel configurations under different resource constraints. We present experiments with MobileNet v1, MobileNet v2, ResNet-50 and RL-searched MNasNet on ImageNet classification. We show significant improvements over their default channel configurations. We also achieve better accuracy than recent channel pruning methods and neural architecture search methods. Notably, by setting optimized channel numbers, our AutoSlim-MobileNet-v2 at 305M FLOPs achieves 74.2% top-1 accuracy, 2.4% better than default MobileNet-v2 (301M FLOPs), and even 0.2% better than RL-searched MNasNet (317M FLOPs). Our AutoSlim-ResNet-50 at 570M FLOPs, without depthwise convolutions, achieves 1.3% better accuracy than MobileNet-v1 (569M FLOPs).

Requirements

Tensorflow > 2.0
Scipy
more than 5 GB GPU memory

Run

python train_w_slimming.py --arch "archtecture name" --slimmable True --arguments

Experimental results

I only use CIFAR10 dataset due to my low hardware performance.
Network configuration is different from the authors'. Therefore, baseline FLOPS and Params are different.
All the training configuration is probably not optimal.
All the numerical values and plots are the average of three results.
The target FLOPS rate is set to 0.5.

MobileNet-v2

python train_w_slimming.py --arch Mobilev2 --slimmable True --weight_decay 4e-5

	Accuracy	FLOPS (M)	Params (M)	Model
Baseline	92.84	82.56	2.27	download
Autoslim(50)	92.83	40.83	0.93	download
Autoslim(30)	92.47	24.34	0.52	download

An example of slimmed network via Autoslim.

WResNet40-4

python train_w_slimming.py --arch WResnet-40-4 --slimmable True --weight_decay 5e-4

	Accuracy	FLOPS (M)	Params (M)	Model
Baseline	95.72	1306.97	8.97	download
Autoslim(50)	95.58	646.74	5.68	download
Autoslim(30)	95.49	474.69	3.81	download

An example of slimmed network via Autoslim.

ResNet56

python train_w_slimming.py --arch Resnet-56 --slimmable True --weight_decay 5e-4

	Accuracy	FLOPS (M)	Params (M)	Model
Baseline	93.91	127.93	0.8600	download
Autoslim(50)	93.59	62.64	0.4964	download
Autoslim(30)	92.67	37.52	0.3208	download

An example of slimmed network via Autoslim.

Reference

@article{yu2019autoslim,
  title={AutoSlim: Towards One-Shot Architecture Search for Channel Numbers},
  author={Yu, Jiahui and Huang, Thomas},
  journal={arXiv preprint arXiv:1903.11728},
  volume={8},
  year={2019}
}

Original project page

https://github.com/JiahuiYu/slimmable_networks

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
figs		figs
nets		nets
README.md		README.md
dataloader.py		dataloader.py
op_util.py		op_util.py
slim_util.py		slim_util.py
test.py		test.py
train_w_slimming.py		train_w_slimming.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutoSlim: Towards One-Shot Architecture Search for Channel Numbers with TF2

Updates

Note that

Paper's abstract

Requirements

Run

Experimental results

MobileNet-v2

WResNet40-4

ResNet56

Reference

Original project page

About

Releases

Packages

Languages

sseung0703/Autoslim_TF2

Folders and files

Latest commit

History

Repository files navigation

AutoSlim: Towards One-Shot Architecture Search for Channel Numbers with TF2

Updates

Note that

Paper's abstract

Requirements

Run

Experimental results

MobileNet-v2

WResNet40-4

ResNet56

Reference

Original project page

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages