Exploiting Popularity to Improve Blog Search

  • Luiz Guilherme P. Santos UFMG
  • Marcos André Gonçalves UFMG
  • Alberto H. F. Laender UFMG

Resumo


The blogosphere is a highly dynamic and interconnected subset of the Web that has triggered a lot of interest due to its social and personal nature. We present a study of an important social aspect of blogs, namely popularity. This study, based on the most popular blogs from four important blog domains in Brazil, shows that, popularity has been underexploited by at least the most popular search engines in the context of blog search. In our experiments, queries specifically formulated for retrieving these popular blogs were not capable of ranking them at the top positions (top 100) by the most popular search engines. We also provide evidence that explicitly incorporating popularity into the search engine algorithm has the potential to significantly improve the final rankings.

Referências

Ali-Hasan, N. and Adamic, L. A. (2007). Expressing social relationships on the blog through links and comments. In ICWSM’07.

Baehni, S., Guerraoui, R., Koldehofe, B., and Monod, M. (2007). Towards fair event dissemination. In Proc. ICDCSW’07, page 63.

Bao, S., Xue, G., Wu, X., Yu, Y., Fei, B., and Su, Z. (2007). Optimizing web search using social annotations. In Proc. WWW’07, pages 501–510.

Duarte, F., Mattos, B., Bestavros, A., Almeida, V., and Almeida, J. (2007). Traffic characteristics and communication patterns in blogosphere. In ICWSM’07.

Goncalves, M. A., Almeida, J. M., dos Santos, L. G., Laender, A. H., and Almeida, V. (2010). On popularity in the blogosphere. IEEE Internet Computing, 14:42–49.

Järvelin, K. and Kekäläinen, J. (2000). IR evaluation methods for retrieving highly relevant documents. In Proc. SIGIR’00, pages 41–48.

Macdonald, C. and Ounis, I. (2006). The trec blogs06 collection : Creating and analysing a blog test collection. DCS Technical Report Series.

Mislove, A., Gummadi, K. P., and Druschel, P. (2006). Exploiting social networks for internet search. In Proc. 5th HotNets-II, California, USA.
Publicado
20/07/2010
SANTOS, Luiz Guilherme P.; GONÇALVES, Marcos André; LAENDER, Alberto H. F.. Exploiting Popularity to Improve Blog Search. In: CONCURSO DE TESES E DISSERTAÇÕES (CTD), 23. , 2010, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2010 . p. 49-56. ISSN 2763-8820.