Image Loading Benchmark: From JPG to RGB Numpy Arrays

This benchmark evaluates the efficiency of different libraries in loading JPG images and converting them into RGB numpy arrays, essential for neural network training data preparation. Inspired by the Albumentations library.

Important Note on Image Conversion

In the benchmark, it's crucial to standardize image formats for a fair comparison, despite different default formats used by OpenCV (BGR), torchvision, and TensorFlow (tensors). A conversion step to RGB numpy arrays is included for consistency. Note that in typical use cases, torchvision and TensorFlow do not require this conversion. Preliminary analysis shows that this extra step does not significantly impact the comparative performance of the libraries, ensuring that the benchmark accurately reflects realistic end-to-end image loading and preprocessing times.

Installation and Setup

Before running the benchmark, ensure your system is equipped with the necessary dependencies. Start by installing libturbojpeg:

sudo apt-get install libturbojpeg

Next, install all required Python libraries listed in requirements.txt:

sudo apt install requirements.txt

Note: If you want to update package versions in requirements.txt

pip install pip-tools

pip-compile requirements.in

this will create new requirements.txt file

pip install -r requirements.txt

to install latest versions

Running the Benchmark

To understand the benchmark's configuration options and run it according to your setup, use the following commands:

python imread_benchmark/benchmark.py -h

usage: benchmark.py [-h] [-d DIR] [-n N] [-r N] [--show-std] [-m] [-p] [-s] [-o OUTPUT_PATH]

Image reading libraries performance benchmark

options:
  -h, --help            show this help message and exit
  -d DIR, --data-dir DIR
                        path to a directory with images
  -n N, --num_images N  number of images for benchmarking (default: 2000)
  -r N, --num_runs N    number of runs for each benchmark (default: 5)
  --show-std            show standard deviation for benchmark runs
  -m, --markdown        print benchmarking results as a markdown table
  -p, --print-package-versions
                        print versions of packages
  -s, --shuffle         Shuffle the list of images.
  -o OUTPUT_PATH, --output_path OUTPUT_PATH
                        Path to save resulting dataframe.

python imread_benchmark/benchmark.py \
    --data-dir <path to image folder> \
    --num_images <num_images> \
    --num_runs <number of runs> \
    --show-std \
    --print-package-versions \
    --print-package-versions

Extra options: --print-package-versions - to print benchmarked libraries versions --print-package-versions - to shuffle images on every run --show-std - to show standard deviation for measurements

Hardware and Software Specifications

CPU: AMD Ryzen Threadripper 3970X 32-Core Processor

Results

	Library	Version	Performance (images/sec)
0	scikit-image	0.23.2	538.48 ± 6.86
1	imageio	2.34.1	538.58 ± 6.84
2	opencv-python-headless	4.10.0.82	631.46 ± 0.43
3	pillow	10.3.0	589.56 ± 8.79
4	jpeg4py	0.1.4	700.60 ± 0.88
5	torchvision	0.18.1	658.68 ± 0.78
6	tensorflow	2.16.1	704.43 ± 1.10
7	kornia-rs	0.1.1	682.95 ± 1.21

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.github		.github
images		images
imread_benchmark		imread_benchmark
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.in		requirements.in
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Loading Benchmark: From JPG to RGB Numpy Arrays

Important Note on Image Conversion

Installation and Setup

Running the Benchmark

Hardware and Software Specifications

Results

About

Releases

Sponsor this project

Packages

Contributors 2

Languages

License

ternaus/imread_benchmark

Folders and files

Latest commit

History

Repository files navigation

Image Loading Benchmark: From JPG to RGB Numpy Arrays

Important Note on Image Conversion

Installation and Setup

Running the Benchmark

Hardware and Software Specifications

Results

About

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 2

Languages

Packages