jpeg-dcgan-pytorch

JPEG圧縮を応用した画像生成プロジェクトです。

参考論文

Faster Neural Networks Straight from JPEG

Gueguen, L., Sergeev, A., Kadlec, B., Liu, R., & Yosinski, J. (2018). Faster neural networks straight from jpeg. In Advances in Neural Information Processing Systems (pp. 3933-3944).

JPEGのDCT係数を入力として画像分類をする。
分類精度を維持しながら高速化を達成した。

公式実装: uber-research / jpeg2dct

Toward Joint Image Generation and Compression using Generative Adversarial Networks

Kang, B., Tripathi, S., & Nguyen, T. Q. (2019). Toward Joint Image Generation and Compression using Generative Adversarial Networks. arXiv preprint arXiv:1901.07838.

高解像度画像は圧縮されているはずなのでGeneratorにJPEGデコード機構を追加する事で画像にJPEG的な特徴を付与する。
通常のモデルに追加する形なので高速化ではなくFIDスコアの向上が目的である。

Locally Connected Layer
入出力特徴平面上の小領域間でのみ結合があるような畳み込みを行う。

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Goyal, P., Dollár, P., Girshick, R., Noordhuis, P., Wesolowski, L., Kyrola, A., Tulloch, A., Jia, Y., & He, K. (2017). Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv preprint arXiv:1706.02677.

Linear Scaling
学習率をバッチサイズに比例させる。

cGANs with Projection Discriminator

Miyato, T., & Koyama, M. (2018). cGANs with projection discriminator. arXiv preprint arXiv:1802.05637.

Projection Discriminator
Discriminatorの最終層の特徴とクラスを埋め込んだベクトルの内積を取る。

GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., & Hochreiter, S. (2017). Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Advances in neural information processing systems (pp. 6626-6637).

Two Time-Scale Update Rule (TTUR)
GeneratorとDiscriminatorでそれぞれ異なる学習率を適用する。

Self-Attention Generative Adversarial Networks

Zhang, H., Goodfellow, I., Metaxas, D., & Odena, A. (2019, May). Self-attention generative adversarial networks. In International Conference on Machine Learning (pp. 7354-7363). PMLR.

GとDの両方にSelf-Attentionを導入して大域的な画素の関係を考慮できるようにする。

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

Anonymous authors (Paper under double-blind review). ICLR2021.

Skip-Layer channel-wise Excitation Module (SLE Module)
ResidualConnectionやSEモジュールのような勾配補強のレイヤー間接続を作成する。
Self-Supervised Discriminator
Dの中間特徴から元画像を復元できるようにDを訓練し正則化する。

非公式実装: lucidrains / lightweight-gan

参考記事など

参考ライブラリ

kornia (Documentation)
GPU上で動作するData Augmentationライブラリ

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
configs/params		configs/params
fonts		fonts
models		models
tests		tests
tools		tools
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
evaluate.py		evaluate.py
generate.py		generate.py
requirements.txt		requirements.txt
run.sh		run.sh
test.py		test.py
tool.py		tool.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

jpeg-dcgan-pytorch

参考論文

Faster Neural Networks Straight from JPEG

Toward Joint Image Generation and Compression using Generative Adversarial Networks

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

cGANs with Projection Discriminator

GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

Self-Attention Generative Adversarial Networks

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

参考記事など

参考ライブラリ

About

Releases

Packages

Languages

License

kthksgy/jpeg-dcgan-pytorch

Folders and files

Latest commit

History

Repository files navigation

jpeg-dcgan-pytorch

参考論文

参考記事など

参考ライブラリ

About

Topics

Resources

License

Stars

Watchers

Forks

Languages