The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features

Kristen Grauman and Trevor Darrell

Massachusetts Institute of Technology

Computer Science and Artificial Intelligence Laboratory

Cambridge, MA, USA

Introduction

Related Work

Pyramid Match Kernel

Satisfying Mercer’s Condition

Results

Conclusions

It is often useful to represent a single example by the collection of local features or parts that comprise it.

The image can be described by local features extracted from patches around salient interest points, or a shape may be described by local descriptors defined at edge points.

Set of Features in two images

To perform learning tasks like categorization or recognition with such representations is challenging in such cases.

Support Vector Machine (SVM) is a widely used approach to discriminative classification that finds the optimal separating hyperplane between two classes.

Kernel functions, which measure similarity between inputs, introduce non-linearities to the decision functions; the kernel non-linearly maps two examples from the input space to the inner product in some feature space.

Conventional kernel-based algorithms are designed to operate on fixed-length vector inputs and hence commonly used general-purpose kernels defined on n inputs (e.g., Gaussian RBF, polynomial) are not applicable in the space of vector sets.

The pyramid match kernel – a new kernel function over unordered feature sets that allows them to be used effectively and efficiently in kernel-based learning methods. Each feature set is mapped to a multiresolution histogram that preserves the individual features’ distinctness at the finest level.

To perform learning tasks like categorization or recognition with such representations is challenging in such cases.

Pyramid Match Kernel

Pyramid matching: an efficient method that maps unordered feature sets to multi-resolution histograms .

Computes a weighted histogram intersection to find implicit correspondences based on finest resolution histogram cell where a matched pair first appears .

Approximates similarity measured by optimal correspondences between feature sets of unequal cardinality .

1GraumanandDarrell, The PyramidMatchKernel:DiscriminativeClassificationwithSetsofImageFeatures, IEEEICCV2005,Vol2, pp.1458–1465

histogram pyramids

number of newly matched pairs at level i

measure of difficulty of a match at level i

Pyramid Match Kernel

Weights inversely proportional to bin size

Normalize kernel values to avoid favoring large sets

Histogram Intersection

matches at this level

matches at previous level

Difference in histogram intersections across levels counts number of new pairs matched

Histogram Intersection