Query Expansion based on Local Distributional Thesauri

  • Fabiano Tavares da Silva UECE
  • José Everardo Bessa Maia UECE


This work proposes and evaluates an approach to query expansion in Information Retrieval that applies Local Context Analysis over a distributional semantic representation. Across datasets from different application domains, the approach generally outperformed query expansion using non-distributional, local or global techniques.
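
The general idea of expanding a query from a local distributional view can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the top-ranked documents for the query are already available, builds simple co-occurrence count vectors from those documents only (the "local" part), and ranks candidate terms by cosine similarity to the query's context vector. The function names (`cooccurrence_vectors`, `cosine`, `expand_query`) and the parameters `window` and `n_terms` are hypothetical, and raw counts stand in for whatever distributional representation the paper actually uses.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(docs, window=2):
    """Map each term to counts of the terms seen within `window` positions of it."""
    vectors = defaultdict(Counter)
    for doc in docs:
        tokens = doc.lower().split()
        for i, term in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[term][tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def expand_query(query, top_docs, n_terms=2, window=2):
    """Append the `n_terms` candidates whose contexts best match the query's.

    Assumption: `top_docs` are the documents returned by an initial retrieval
    run, so the vectors reflect only the local context of this query.
    """
    vectors = cooccurrence_vectors(top_docs, window)
    query_terms = query.lower().split()
    # Represent the query as the sum of its terms' context vectors.
    query_vec = Counter()
    for term in query_terms:
        query_vec.update(vectors.get(term, Counter()))
    scores = {
        term: cosine(query_vec, vec)
        for term, vec in vectors.items()
        if term not in query_terms
    }
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return query_terms + ranked[:n_terms]
```

A call such as `expand_query("car", top_docs, n_terms=1)` returns the original query terms followed by the best-scoring candidate. A practical system would at least remove stopwords and weight the counts before ranking; the sketch omits both to stay short.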


DA SILVA, Fabiano Tavares; MAIA, José Everardo Bessa. Query Expansion based on Local Distributional Thesauri. In: ENCONTRO NACIONAL DE INTELIGÊNCIA ARTIFICIAL E COMPUTACIONAL (ENIAC), 15., 2018, São Paulo. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2018. p. 924-932. ISSN 2763-9061. DOI: https://doi.org/10.5753/eniac.2018.4479.