##Installation
Works for python 3
- Clone the repo
pip install -r requirements.txt
- Install Pytorch from pyorch.org
The steps to follow for our approach are:
The explanation will be divided in these three parts, starting with generation.
Generating a dataset for image segmentation given background images and foreground images. Applied to satellite image segmentation. For this part (included in /generation) there are 2 important files in addition to the data folder, which contains the folder examples, that includes different backgrounds and tents. The files are the following.
Here, the main code for generation exists. This is the file used to generate. To get familiar with it you can run the help options (-help
). For a brief explanation:
-l
: number of outputs (def.: 10)-h
: number of tents (def.: random)-d
: battery shape dimensions (area where the tents will be) (write NNxMM) (if 0x0 adjusts to background; def.: random)-r
: rotation of all the batteries (range 0-180º) (def.: random)-s
: scale tent factor (how much should the tent image resize before putting it in the background) (def.: 0.8)-m
: output mask for individual tent(s) or mask for each input tent file (def.: for each tent)-hf
: input tent file (pass 1 filename or 1 folder) (def. data/examples/tent1.png)-bf
: input background file (pass 1 filename) (def. data/examples/background1.jpg)-date
: enter the date.-margin
: margin (in px) from one block to another (def. 5px)-help
: help
Recall that the date is a mandatory parameter.
Let's try an example:
python generate.py -hf data/examples/tent1.png -bf data/examples/background8.png -date 19970305 -l 5 -s 1.5 -d 0x0 -h 60 -margin 0 -m
This would generate 5 images with around 60 tents occupying all the background used. These would output in data/[date]/output, in addition with their masks in data/[date]/mask and a metadata file containing the output path, the mask path and the real number of tents for every image generated. If the -m
parameter is called or if there is no mask available for a tent, the number of tents will not be truthful.
Comments about the use:
- The images are expected to be .jpg or .png.
- The tent images are expected to have the same name as their masks without including "-mask" (i.e. "tent1.png and tent1-mask.png").
This file also relies on the auxiliar.py file, which contains additional functionalities. Another thing we can do is to input a folder of tents not only one type of tent. To use an example, we have included the folder data/examples/tents-example containing a group of tents with their masks respectively.
Here there is a collection of calls to the last file so that a lot of variation of parameters (like type of tents or backgrounds or rotation, etcetera.) can be made to actually generate a big dataset.
In order to train the model the data must be organized in the following way:
-
A folder with an arbitrary name. For example "TrainingFolder"
-
Inside TrainingFolder, there must be a file called metadata.txt which has, at least, two columns that represent input image and target image for each line. The columns must be separated in spaces. For example, the file deep-learning/training_data/201812091527/metadata.txt contains 3 columns, but only the first two will be rellevant.
-
Then, you must run
python convolutional_masking.py
Notice that inside this file, you must hardcode some parameters such as training and validation data folder. Validation data folder must have the same structure than the training folder.
In this file, you also have the definition of the model. There, you can costumise the model, training algorithm, batch size, etc... It has a DataLoader class which is used to load the data and form the batches.
You can train a model or use the one provided in the repo.
Once you have trained a model, you can use it to analyse images. There are to ways to do it: one by one or many at the same time:
-
detect_one.py
: It requires the path tp the model to use (hardcoded) and the image path (first argument). For example:python detect_one.py image.png
-
detect.py
: It can be used to to a analyse multiple images at once (by now, GPU-only). Following the same style of the training algorithm, it requires a metadata.txt file, with a column consisting of the paths of the images that must be analysed. The name of the model and the input folder must be hardcoded. For example:python detect.py
It also has a DataLoader class, used to load data and form batches.