Reducing the dimensionality of the SIFT descriptor and increasing its effectiveness and efficiency in image retrieval via bag-of-features

  • Glauco Vitor Pedrosa USP
  • Solange Oliveira Oliveira Rezende USP
  • Agma Juci Machado Traina USP

Resumo


The Bag-of-Features is a popular approach to describe multimedia information by using visual words. The SIFT (Scale Invariant Feature Transform) is one of the most utilized descriptor to model multimedia information in Bag of-Features. The data is described as a set of keypoints and a feature vector is assigned for each of the keypoints. This feature vector is composed of 128 values, which represent the region around each keypoint. In general, some of the detected keypoints are not relevant and can be discarded without losing the local discriminative power. In this paper, we propose a technique to reduce the detected keypoints by SIFT, as well as a technique to reduce the feature vector dimensionality. Experiments were made in order to analyze the performance of the proposed reduction techniques using two different image databases. The results demonstrated that the proposed techniques improve the performance of the image retrieval by reducing up to 50% the feature vector dimensionality of SIFT and at the same time providing a gain of computational time of modeling an image employing Bag-of-Features.
Publicado
15/10/2012
PEDROSA, Glauco Vitor; OLIVEIRA REZENDE, Solange Oliveira; TRAINA, Agma Juci Machado. Reducing the dimensionality of the SIFT descriptor and increasing its effectiveness and efficiency in image retrieval via bag-of-features. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 18. , 2012, São Paulo. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2012 . p. 139-142.