Publications

Stats

View publication

Title Automatic Weight Selection for Multi-metric Distances
Authors Juan Manuel Barrios, Benjamin Bustos
Publication date 2011
Abstract Content-Based Multimedia Information Retrieval retrieves multimedia
documents based on their content (colors, edges, textures, etc.). The
content of a whole multimedia document is represented by a global
descriptor. The similarity of two multimedia documents can be defined as
the distance between their descriptors. A multi-metric function that
combines distances from many descriptors usually outperforms the
effectiveness of any single descriptor. In this case, a different weight
is assigned to each descriptor representing its relative importance in the
combination. Usually, these sets of weights are fixed manually or by
performing many effectiveness evaluations. In this work, we present three
novel techniques for weighting multi-metrics: alpha-normalization, which
is a generalization of the normalization by maximum distance that uses the
histogram of distances, MID weighting which selects weights that maximize
intrinsic dimensionality, and MID-alpha-weighting that combines the two
previous techniques. These techniques enable the selection of a set of
weights with satisfactory effectiveness without performing any
effectiveness evaluation. Thus, they are suitable when a ground truth does
not exist or when it is expensive to perform an evaluation. We tested
their effectiveness on a content-based copy detection corpus, and we
analyzed the behavior of effectiveness and efficiency in a multi-metric
space. We conclude that MID-alpha-weighting outperforms the widely used
maximum distance normalization, and that it can be used as an automatic
weight selection for further manual adjustment.
Downloaded 12 times
Pages 61-68
Conference name International Workshop on Similarity Search and Applications
Publisher IEEE Computer Society Press (Los Alamitos, CA, USA)
PDF View PDF
Reference URL View reference page