Commit 7091782

Author: Björn Barz
Commit message: Added README.md about duplicate lists
1 parent 1422b50, commit 7091782

1 file changed: +21 -0 lines changed

meta/README.md

Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
CIFAR Duplicates
================

The files in this directory contain lists of duplicate image pairs found in the [CIFAR-10 and CIFAR-100 datasets][1].

[`duplicates_cifar10.csv`](duplicates_cifar10.csv) and [`duplicates_cifar100.csv`](duplicates_cifar100.csv) list images from the test sets that have near-duplicates in the corresponding training set.
The columns of these CSV files have the following meanings:

- `TestID`: Index of the test image in the original CIFAR dataset (counting from 0).
- `TrainID`: Index of the training image in the original CIFAR dataset (counting from 0).
- `Distance`: The Euclidean distance between the two images in the L2-normalized CNN feature space.
- `Judgment`: Indicates the type of duplicate (assigned by manual annotation):
    - `0` = **exact duplicate**: Almost all pixels in the two images are approximately identical.
    - `1` = **near-duplicate**: The content of the images is exactly the same, i.e., both originated from the same camera shot. However, different post-processing might have been applied to the original shot, e.g., color shifts, translations, scaling, etc.
    - `2` = **very similar**: The contents of the two images are different but highly similar, so that the difference can only be spotted at second glance.
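
The column layout above can be read with Python's standard `csv` module. The following is a minimal sketch, not part of the original lists: it assumes the files are comma-separated and start with a header row naming the four columns, and the names `DuplicatePair`, `read_pairs`, `flagged_test_ids`, and `cosine_similarity` are hypothetical helpers introduced here. The sample rows are made up for illustration.

```python
import csv
import io
from typing import NamedTuple


class DuplicatePair(NamedTuple):
    test_id: int      # TestID: 0-based index of the test image
    train_id: int     # TrainID: 0-based index of the matched image
    distance: float   # Distance: Euclidean distance between L2-normalized features
    judgment: int     # Judgment: 0 = exact, 1 = near-duplicate, 2 = very similar


def read_pairs(csv_file):
    """Parse duplicate pairs from an open text file.

    Assumes a comma-separated file whose header row names the four columns.
    """
    return [
        DuplicatePair(
            int(row["TestID"]),
            int(row["TrainID"]),
            float(row["Distance"]),
            int(row["Judgment"]),
        )
        for row in csv.DictReader(csv_file)
    ]


def flagged_test_ids(pairs, judgments=(0, 1, 2)):
    """Return the set of test indices that have a pair with one of the given judgments."""
    return {p.test_id for p in pairs if p.judgment in judgments}


def cosine_similarity(distance):
    """Convert a Euclidean distance between unit-norm vectors to cosine similarity.

    For unit vectors a and b, ||a - b||^2 = 2 - 2 * cos(a, b).
    """
    return 1.0 - distance ** 2 / 2.0


# Made-up rows for illustration only; these are NOT entries from the real lists.
SAMPLE = """TestID,TrainID,Distance,Judgment
3,120,0.0,0
7,45,0.2,1
9,300,0.4,2
"""

pairs = read_pairs(io.StringIO(SAMPLE))
# Treat only exact and near-duplicates as flagged; keep "very similar" images.
leaked = flagged_test_ids(pairs, judgments={0, 1})
print(sorted(leaked))
print(cosine_similarity(pairs[0].distance))
```

To read one of the real files instead of the sample string, an open file handle can be passed in the same way, e.g. `with open("duplicates_cifar10.csv", newline="") as f: pairs = read_pairs(f)`. Which `Judgment` values to treat as duplicates is a choice left to the user of the lists.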

In contrast, [`duplicates_cifar10_test.csv`](duplicates_cifar10_test.csv) and [`duplicates_cifar100_test.csv`](duplicates_cifar100_test.csv) list duplicate image pairs within the test set.
The structure is identical to that of the other two files, but both `TestID` and `TrainID` now refer to images in the test set.
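
Because both columns of the within-test lists index the same set, the pairs can be chained into groups of mutually linked images. The sketch below shows one possible policy, keeping the smallest index of each group and marking the rest as redundant. It is an assumption made for illustration, not a procedure described by the author; the helper names and the sample pairs are hypothetical, and chaining pairs this way treats the duplicate relation as transitive, which may not hold for "very similar" pairs.

```python
def duplicate_groups(pairs):
    """Group indices connected by at least one listed pair (union-find).

    `pairs` is an iterable of (TestID, TrainID) tuples that both index the test set.
    Returns a dict mapping each group's smallest index to the set of its members.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # The smaller root stays the root, so it ends up as the representative.
            parent[max(root_a, root_b)] = min(root_a, root_b)

    groups = {}
    for x in list(parent):
        groups.setdefault(find(x), set()).add(x)
    return groups


def redundant_indices(pairs):
    """Return sorted indices to drop so that one image per group remains."""
    drop = set()
    for representative, members in duplicate_groups(pairs).items():
        drop |= members - {representative}
    return sorted(drop)


# Made-up index pairs for illustration only; NOT entries from the real lists.
sample_pairs = [(5, 2), (2, 9), (4, 7)]
print(duplicate_groups(sample_pairs))
print(redundant_indices(sample_pairs))
```

Dropping only one side of each listed pair would be a simpler alternative; the grouping step matters when one image appears in several pairs.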

[1]: https://www.cs.toronto.edu/~kriz/cifar.html
