AirBSet: A Dataset with Brazilian Properties from Airbnb and their Reviews

  • Jonatas Freire Federal Institute of Minas Gerais (IFMG)
  • Luis Henrique Ferreira Costa Federal University of Santa Catarina (UFSC)
  • Carina F. Dorneles Federal University of Santa Catarina (UFSC)
  • Michele A. Brandão Federal Institute of Minas Gerais (IFMG)

Abstract


Airbnb is an online platform with more than 6.6 million listings and 1.4 billion guests from different locations. With so many users, the platform generates a large volume of data that a wide range of applications can use. This work therefore presents AirBSet, a dataset of Brazilian Airbnb properties and their respective reviews. The dataset is described and characterized to facilitate its use in other studies.
Keywords: Data collection, Airbnb listings, Comment analysis

Published
2023-09-25
FREIRE, Jonatas; COSTA, Luis Henrique Ferreira; DORNELES, Carina F.; BRANDÃO, Michele A. AirBSet: A Dataset with Brazilian Properties from Airbnb and their Reviews. In: DATASET SHOWCASE WORKSHOP (DSW), 5., 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023. p. 79-86. DOI: https://doi.org/10.5753/dsw.2023.233296.