Speech-Bubble-Aware-Automatic-Comic-Colorization

Speech bubble aware automatic comic colorization using ConvNet. Please see Report for more details.

Result on validation data

(a) Input gray-scale image, (b) Low-resolution colorized image, (c) Super-resolution color image, (d) Text detection, (e) Speech bubble segmentation, (f) Speech bubble aware colorization, (g) ground truth. Notice the difference in (c) and (f), without speech bubble awareness the colorization is likely to fail in speech bubble regions.

Usage

Required package

tensorflow (2.0.0a0), GPU version can also be used
opencv (4.1.0.25)
imutils (0.5.2)
beautifulsoup4 (4.7.1), can be skipped if you do not want to crawl the dataset

Installation

pip install -r requirements.txt

Reproduce examples

Run run_example.py to produce all results.

Model weights

Pre-trained model weights can be download here

Dataset

The dataset and preprocess methods we used is the same as Sungmin Kang's paper. The dataset we used is the comic Yumi's Cell (English version), from the first episode to episode 238. It can be viewed here. Since all the images are under the copyright of the artist Donggeon Lee, and we are unable to contact the artist. Please inform us if there is any copyright issues.

System pipeline

Our method consists of three parts, low resolution colorizer, polishing network, and speech bubble segmentation.

Low resolution colorizer

Polishing network

Speech bubble segmentation pipeline

(a) Scene text detection using EAST, (b) Bounding box clustering to eliminate lonely boxes, (c) Cluster the bounding boxes into groups, (d) Flooding filling, (e) Hole filling.

Failure cases

Failure cases of speech bubble segmentation

(a) Comic objects containing texts inside, (b) Text cluster too close, (c) Mis-detection result from EAST.

Generalization to unseen comics and characters

References

Satoshi Iizuka, Edgar Simo-Serra & Hiroshi Ishikawa. (2016) "Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification".
Chie Furusawa, Kazutuki Hiroshiba, Keisuke Ogaki & Yuri Odagiri. (2017) "Comicolorization: Semi-Automatic Manga Colorization".
Sungmin Kang, Jaegul Choo & Jaehyuk Chang. (2017) "Consistent Comic Colorization with Pixel-wise Background Classification".
Sergio Guadarrama, Ryan Dahl, David Bieber, Mohammad Norouzi, Jonathon Shlens, Kevin Murphy. (2017) "PixColor: Pixel Recursive Colorization".
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He & Jiajun Liang. (2017) "EAST: An Efficient and Accurate Scene Text Detector".

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Speech-Bubble-Aware-Automatic-Comic-Colorization

Result on validation data

Usage

Required package

Installation

Reproduce examples

Model weights

Dataset

System pipeline

Low resolution colorizer

Polishing network

Speech bubble segmentation pipeline

Failure cases

Failure cases of speech bubble segmentation

Generalization to unseen comics and characters

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

Speech-Bubble-Aware-Automatic-Comic-Colorization

Result on validation data

Usage

Required package

Installation

Reproduce examples

Model weights

Dataset

System pipeline

Low resolution colorizer

Polishing network

Speech bubble segmentation pipeline

Failure cases

Failure cases of speech bubble segmentation

Generalization to unseen comics and characters

References