Speech bubble aware automatic comic colorization using ConvNet. Please see Report for more details.
(a) Input gray-scale image, (b) Low-resolution colorized image, (c) Super-resolution color image, (d) Text detection, (e) Speech bubble segmentation, (f) Speech bubble aware colorization, (g) ground truth. Notice the difference in (c) and (f), without speech bubble awareness the colorization is likely to fail in speech bubble regions.
- tensorflow (2.0.0a0), GPU version can also be used
- opencv (4.1.0.25)
- imutils (0.5.2)
- beautifulsoup4 (4.7.1), can be skipped if you do not want to crawl the dataset
pip install -r requirements.txt
Run run_example.py
to produce all results.
Pre-trained model weights can be download here
The dataset and preprocess methods we used is the same as Sungmin Kang's paper. The dataset we used is the comic Yumi's Cell (English version), from the first episode to episode 238. It can be viewed here. Since all the images are under the copyright of the artist Donggeon Lee, and we are unable to contact the artist. Please inform us if there is any copyright issues.
Our method consists of three parts, low resolution colorizer, polishing network, and speech bubble segmentation.
(a) Scene text detection using EAST, (b) Bounding box clustering to eliminate lonely boxes, (c) Cluster the bounding boxes into groups, (d) Flooding filling, (e) Hole filling.
(a) Comic objects containing texts inside, (b) Text cluster too close, (c) Mis-detection result from EAST.
-
Satoshi Iizuka, Edgar Simo-Serra & Hiroshi Ishikawa. (2016) "Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification".
-
Chie Furusawa, Kazutuki Hiroshiba, Keisuke Ogaki & Yuri Odagiri. (2017) "Comicolorization: Semi-Automatic Manga Colorization".
-
Sungmin Kang, Jaegul Choo & Jaehyuk Chang. (2017) "Consistent Comic Colorization with Pixel-wise Background Classification".
-
Sergio Guadarrama, Ryan Dahl, David Bieber, Mohammad Norouzi, Jonathon Shlens, Kevin Murphy. (2017) "PixColor: Pixel Recursive Colorization".
-
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He & Jiajun Liang. (2017) "EAST: An Efficient and Accurate Scene Text Detector".