Update testing scripts and readme.

gmalivenko · gmalivenko · commit 98192bb09ae4 · 2022-06-16T23:01:06.000+03:00
diff --git a/README.md b/README.md
@@ -1,24 +1,19 @@
-## Replacing Mobile Camera ISP with a Single Deep Learning Model
-
-<br/>
+## PyNet-V2 Mobile: Efficient On-Device Photo Processing With Neural Networks
 
 <img src="http://people.ee.ethz.ch/~ihnatova/assets/img/pynet/pynet_teaser.jpg"/>
 
-<br/>
 
-#### 1. Overview [[Paper]](https://arxiv.org/pdf/2002.05509.pdf) [[PyTorch Implementation]](https://github.com/aiff22/PyNET-PyTorch) [[Project Webpage]](http://people.ee.ethz.ch/~ihnatova/pynet.html)
+#### 1. Overview [[Paper (in progress)]]() [[Project Webpage (in progress)]]()
 
-This repository provides the implementation of the RAW-to-RGB mapping approach and PyNET CNN presented in [this paper](https://arxiv.org/). The model is trained to convert **RAW Bayer data** obtained directly from mobile camera sensor into photos captured with a professional Canon 5D DSLR camera, thus replacing the entire hand-crafted ISP camera pipeline. The provided pre-trained PyNET model can be used to generate full-resolution **12MP photos** from RAW (DNG) image files captured using the Sony Exmor IMX380 camera sensor. More visual results of this approach for the Huawei P20 and BlackBerry KeyOne smartphones can be found [here](http://people.ee.ethz.ch/~ihnatova/pynet.html#demo).
+This repository provides an implementation of an improved version of the PyNET model originally presented in [this paper](https://arxiv.org/abs/2002.05509).
 
-<br/>
 
 #### 2. Prerequisites
 
 - Python: scipy, numpy, imageio and pillow packages
 - [TensorFlow 1.X](https://www.tensorflow.org/install/) + [CUDA cuDNN](https://developer.nvidia.com/cudnn)
 - Nvidia GPU
 
-<br/>
 
 #### 3. First steps
 
@@ -29,91 +24,36 @@ This repository provides the implementation of the RAW-to-RGB mapping approach a
   
   <sub>*Please note that Google Drive has a quota limiting the number of downloads per day. To avoid it, you can login to your Google account and press "Add to My Drive" button instead of a direct download. Please check [this issue](https://github.com/aiff22/PyNET/issues/4) for more information.* </sub>
 
-<br/>
-
-
-#### 4. PyNET CNN
-
-<br/>
 
-<img src="http://people.ee.ethz.ch/~ihnatova/assets/img/pynet/pynet_architecture_github.png" alt="drawing" width="1000"/>
 
-<br/>
+#### 4. PyNet-V2 Mobile CNN
 
-PyNET architecture has an inverted pyramidal shape and is processing the images at **five different scales** (levels). The model is trained sequentially, starting from the lowest 5th layer, which allows to achieve good reconstruction results at smaller image resolutions. After the bottom layer is pre-trained, the same procedure is applied to the next level till the training is done on the original resolution. Since each higher level is getting **upscaled high-quality features** from the lower part of the model, it mainly learns to reconstruct the missing low-level details and refines the results. In this work, we are additionally using one transposed convolutional layer (Level 0) on top of the model that upsamples the image to its target size.
+[WIP]
 
-<br/>
 
 #### 5. Training the model
 
-The model is trained level by level, starting from the lowest (5th) one:
-
-```bash
-python train_model.py level=<level>
-```
-
-Obligatory parameters:
-
->```level```: **```5, 4, 3, 2, 1, 0```**
-
-Optional parameters and their default values:
-
->```batch_size```: **```50```** &nbsp; - &nbsp; batch size [small values can lead to unstable training] <br/>
->```train_size```: **```30000```** &nbsp; - &nbsp; the number of training patches randomly loaded each 1000 iterations <br/>
->```eval_step```: **```1000```** &nbsp; - &nbsp; each ```eval_step``` iterations the accuracy is computed and the model is saved <br/>
->```learning_rate```: **```5e-5```** &nbsp; - &nbsp; learning rate <br/>
->```restore_iter```: **```None```** &nbsp; - &nbsp; iteration to restore (when not specified, the last saved model for PyNET's ```level+1``` is loaded)<br/>
->```num_train_iters```: **```5K, 5K, 20K, 20K, 35K, 100K (for levels 5 - 0)```** &nbsp; - &nbsp; the number of training iterations <br/>
->```vgg_dir```: **```vgg_pretrained/imagenet-vgg-verydeep-19.mat```** &nbsp; - &nbsp; path to the pre-trained VGG-19 network <br/>
->```dataset_dir```: **```raw_images/```** &nbsp; - &nbsp; path to the folder with **Zurich RAW to RGB dataset** <br/>
-
-</br>
-
-Below we provide the commands used for training the model on the Nvidia Tesla V100 GPU with 16GB of RAM. When using GPUs with smaller amount of memory, the batch size and the number of training iterations should be adjusted accordingly:
+The model is trained level by level, starting from the lowest one. The script below runs all training steps:
 
 ```bash
-python train_model.py level=5 batch_size=50 num_train_iters=5000
-python train_model.py level=4 batch_size=50 num_train_iters=5000
-python train_model.py level=3 batch_size=48 num_train_iters=20000
-python train_model.py level=2 batch_size=18 num_train_iters=20000
-python train_model.py level=1 batch_size=12 num_train_iters=35000
-python train_model.py level=0 batch_size=10 num_train_iters=100000
+./train.sh
 ```
 
-<br/>
 
 #### 6. Test the provided pre-trained models on full-resolution RAW image files
 
 ```bash
-python test_model.py level=0 orig=true
+python test_model_keras.py
 ```
 
 Optional parameters:
 
->```use_gpu```: **```true```**,**```false```** &nbsp; - &nbsp; run the model on GPU or CPU <br/>
->```dataset_dir```: **```raw_images/```** &nbsp; - &nbsp; path to the folder with **Zurich RAW to RGB dataset** <br/>
+>```--model```: **```model.h5```** &nbsp; - &nbsp; path to the Keras model checkpoint <br/>
+>```--inp_path```: **```raw_images/test/```** &nbsp; - &nbsp; path to the folder with **Zurich RAW to RGB dataset** <br/>
+>```--out_path```: **```.```** &nbsp; - &nbsp; path to the output images <br/>
 
-<br/>
 
-#### 7. Test the obtained model on full-resolution RAW image files
-
-```bash
-python test_model.py level=<level>
-```
-
-Obligatory parameters:
-
->```level```: **```5, 4, 3, 2, 1, 0```**
-
-Optional parameters:
-
->```restore_iter```: **```None```** &nbsp; - &nbsp; iteration to restore (when not specified, the last saved model for level=```<level>``` is loaded)<br/>
->```use_gpu```: **```true```**,**```false```** &nbsp; - &nbsp; run the model on GPU or CPU <br/>
->```dataset_dir```: **```raw_images/```** &nbsp; - &nbsp; path to the folder with **Zurich RAW to RGB dataset** <br/>
-
-<br/>
-
-#### 8. Folder structure
+#### 7. Folder structure
 
 >```models/```            &nbsp; - &nbsp; logs and models that are saved during the training process <br/>
 >```models/original/```   &nbsp; - &nbsp; the folder with the provided pre-trained PyNET model <br/>
@@ -123,44 +63,34 @@ Optional parameters:
 >```vgg-pretrained/```    &nbsp; - &nbsp; the folder with the pre-trained VGG-19 network <br/>
 
 >```load_dataset.py```    &nbsp; - &nbsp; python script that loads training data <br/>
->```model.py```           &nbsp; - &nbsp; PyNET implementation (TensorFlow) <br/>
->```train_model.py```     &nbsp; - &nbsp; implementation of the training procedure <br/>
->```test_model.py```      &nbsp; - &nbsp; applying the pre-trained model to full-resolution test images <br/>
+>```model.py```           &nbsp; - &nbsp; PyNET implementation (Keras) <br/>
+>```train_model_keras.py```     &nbsp; - &nbsp; implementation of the training procedure <br/>
+>```test_model_keras.py```      &nbsp; - &nbsp; applying the pre-trained model to full-resolution test images <br/>
 >```utils.py```           &nbsp; - &nbsp; auxiliary functions <br/>
 >```vgg.py```             &nbsp; - &nbsp; loading the pre-trained vgg-19 network <br/>
 
-<br/>
 
 #### 9. Bonus files
 
 These files can be useful for further experiments with the model / dataset:
 
 >```dng_to_png.py```            &nbsp; - &nbsp; convert raw DNG camera files to PyNET's input format <br/>
->```evaluate_accuracy.py```     &nbsp; - &nbsp; compute PSNR and MS-SSIM scores on Zurich RAW-to-RGB dataset for your own model <br/>
+>```ckpt2pb_keras.py```     &nbsp; - &nbsp; convert a Keras checkpoint to TFLite format <br/>
+>```evaluate_accuracy_tflite.py```     &nbsp; - &nbsp; compute PSNR and MS-SSIM scores on the Zurich RAW-to-RGB dataset for a TFLite model <br/>
 
-<br/>
 
 #### 10. License
 
-Copyright (C) 2020 Andrey Ignatov. All rights reserved.
+Copyright (C) 2022 Andrey Ignatov. All rights reserved.
 
 Licensed under the [CC BY-NC-SA 4.0 (Attribution-NonCommercial-ShareAlike 4.0 International)](https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode).
 
 The code is released for academic research use only.
 
-<br/>
 
 #### 11. Citation
+[WIP] 
 
-```
-@article{ignatov2020replacing,
-  title={Replacing Mobile Camera ISP with a Single Deep Learning Model},
-  author={Ignatov, Andrey and Van Gool, Luc and Timofte, Radu},
-  journal={arXiv preprint arXiv:2002.05509},
-  year={2020}
-}
-```
-<br/>
 
 #### 12. Any further questions?
 
diff --git a/model.py b/model.py
@@ -59,13 +59,6 @@ def PyNET(input, instance_norm=False, instance_norm_level_1=False):
         conv_l1_out = _conv_layer(conv_l1_out, 3, 3, 1, relu=False, instance_norm=False)
         output_l1 = tf.nn.tanh(conv_l1_out) * 0.58 + 0.5
 
-    with tf.name_scope("generator_0_"):
-        conv_l0 = _upsample_layer(conv_l1_d14, 32, 3, 2)
-        conv_l0_out = _conv_layer(conv_l0, 3 * k * k, 3, 1, relu=False, instance_norm=False)
-
-        # -> Output: Level 0
-        output_l0 = tf.nn.tanh(conv_l0_out) * 0.58 + 0.5
-
     return None, output_l1, output_l2, output_l3
 
 
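Note on the output mapping kept in `model.py`: each output level applies `tanh(x) * 0.58 + 0.5`. Because tanh is bounded in (-1, 1), the result lies in (-0.08, 1.08), slightly wider than the displayable [0, 1] range, so values must be clipped before conversion to 8-bit. A minimal sketch in plain Python, assuming scalar inputs for illustration; the helper names `scaled_tanh` and `to_uint8` are hypothetical and not part of the repository:

```python
import math


def scaled_tanh(x: float) -> float:
    """Mirror of the output mapping in model.py: tanh(x) * 0.58 + 0.5."""
    return math.tanh(x) * 0.58 + 0.5


def to_uint8(value: float) -> int:
    """Clip to [0, 1], scale to 0..255 and truncate the fractional part."""
    clipped = min(max(value, 0.0), 1.0)
    return int(clipped * 255.0)


if __name__ == "__main__":
    # Large negative / positive activations overshoot the [0, 1] range.
    for x in (-10.0, 0.0, 10.0):
        y = scaled_tanh(x)
        print(f"x={x:+.1f}  scaled={y:+.4f}  uint8={to_uint8(y)}")
```

The overshoot of 0.08 on each side lets the network reach pure black and pure white without saturating the tanh; the clip then discards the excess.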
diff --git a/test_model.py b/test_model.py
diff --git a/test_model_keras.py b/test_model_keras.py
@@ -0,0 +1,87 @@
+# Copyright 2022 by Andrey Ignatov. All Rights Reserved.
+
+import numpy as np
+import tensorflow.compat.v1 as tf
+import tensorflow_addons as tfa
+import imageio
+import sys
+import os
+import importlib
+import rawpy
+import cv2
+from tensorflow.keras.models import load_model
+import argparse
+
+from load_dataset import extract_bayer_channels
+
+IMAGE_HEIGHT, IMAGE_WIDTH = 1472, 1984
+DSLR_SCALE = 2
+
+
+dataset_dir = 'raw_images/'
+dslr_dir = 'fujifilm/'
+phone_dir = 'mediatek_raw/'
+
+
+def main():
+    """Test model"""
+    parser = argparse.ArgumentParser(
+        description='Test model',
+        formatter_class=argparse.ArgumentDefaultsHelpFormatter
+    )
+    parser.add_argument(
+        '--model', help='Path to model checkpoint.', type=str, default='model.h5')
+    parser.add_argument(
+        '--inp_path', help='Path to the input data.', type=str, default='raw_images/test')
+    parser.add_argument(
+        '--out_path', help='Path to the output images.', type=str, default='.')
+    args = parser.parse_args()
+
+
+    spec = importlib.util.spec_from_file_location('pynet.model', 'model.py')
+    module = importlib.util.module_from_spec(spec)
+    spec.loader.exec_module(module)
+    PyNET = module.PyNET
+
+    phone_ = tf.keras.Input(shape=(IMAGE_HEIGHT, IMAGE_WIDTH, 4))
+    # Build the PyNET graph; the pre-trained weights are copied in below
+    _, enhanced, _, _ = \
+        PyNET(phone_, instance_norm=True, instance_norm_level_1=False)
+
+
+    print("Initializing variables")
+
+    model = tf.keras.Model(inputs=phone_, outputs=enhanced)
+    prev_model = load_model(args.model, compile=False)
+    for i, layer in enumerate(prev_model.layers):
+        for k in model.layers:
+            if k.name == layer.name:
+                k.set_weights(layer.get_weights())
+    
+
+    # Processing full-resolution RAW images
+    test_dir = args.inp_path
+    test_photos = [f for f in os.listdir(test_dir) if os.path.isfile(os.path.join(test_dir, f))]
+
+    for photo in test_photos:
+        with rawpy.imread(os.path.join(test_dir, photo)) as raw:
+            I = extract_bayer_channels(raw.raw_image)
+            print("Processing image " + photo)
+
+            I = I[0:IMAGE_HEIGHT, 0:IMAGE_WIDTH, :]
+            I = np.reshape(I, [1, I.shape[0], I.shape[1], 4])
+
+            # Run inference
+
+            enhanced_tensor = model.predict([I])
+            enhanced_image = np.reshape(enhanced_tensor, [int(I.shape[1] * DSLR_SCALE), int(I.shape[2] * DSLR_SCALE), 3])
+
+            # Save the results as .png images
+            photo_name = photo.rsplit(".", 1)[0]
+            enhanced_image = cv2.cvtColor(enhanced_image, cv2.COLOR_RGB2BGR)
+            enhanced_image = np.uint8(np.clip(enhanced_image, 0.0, 1.0) * 255.0)
+            cv2.imwrite(os.path.join(args.out_path, photo_name + ".png"), enhanced_image)
+
+
+if __name__ == '__main__':
+    main()
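Note on input paths: the test script derives file paths from `--inp_path`. Joining with `os.path.join` gives the same result whether or not the directory is passed with a trailing separator, whereas plain string concatenation only works when the separator is present. A small standard-library sketch; the function name `list_input_files` is hypothetical and not part of the repository:

```python
import os
import tempfile


def list_input_files(input_dir: str) -> list:
    """Return the sorted names of regular files directly inside input_dir."""
    return sorted(
        name for name in os.listdir(input_dir)
        if os.path.isfile(os.path.join(input_dir, name))
    )


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Create two dummy files and one sub-directory that must be skipped.
        for name in ("a.dng", "b.dng"):
            open(os.path.join(tmp, name), "w").close()
        os.mkdir(os.path.join(tmp, "nested"))

        # Both calls list the same files, with or without a trailing separator.
        print(list_input_files(tmp))
        print(list_input_files(tmp + os.sep))
```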