Automated Content Moderation in a Brazilian Marketplace

Ana Claudia Zandavalle; Victor do Nascimento; Carolina Gadelha; Tatiana Gama; Fernando Zagatti; Lucas Nildaimon; João Gabriel Melo Barbirato; Livy Real

doi:10.5753/webmedia_estendido.2022.WiP01

Ana Claudia Zandavalle Americanas S.A.
Victor do Nascimento Americanas S.A.
Carolina Gadelha Americanas S.A.
Tatiana Gama Americanas S.A.
Fernando Zagatti Americanas S.A.
Lucas Nildaimon Americanas S.A.
João Gabriel Melo Barbirato Americanas S.A.
Livy Real Americanas S.A.

DOI: https://doi.org/10.5753/webmedia_estendido.2022.WiP01

Resumo

Clarifying doubts can become decisive when shopping on e-commerces platforms. Considering the relevance of user generated content, this work aimed to develop an internal hybrid system, composed of machine learning models along-side a rule-based module, to moderate customers’ questions and sellers’ answers in one of the biggest marketplaces in Brazil.

Palavras-chave: content moderation, Portuguese, questions and answers, user generated content, e-commerce, marketplace

Referências

Shrabastee Banerjee, Chrysanthos Dellarocas, and Georgios Zervas. 2021. Interacting user-generated content technologies: How questions and answers affect consumer reviews. Journal of Marketing Research 58, 4 (2021), 742–761.

Yahui Chen, Dongsheng Liu, Yanni Liu, Yiming Zheng, Bing Wang, and Yi Zhou. 2022. Research on user generated content in Q&A system and online comments based on text mining. Alexandria Engineering Journal 61, 10 (2022), 7659–7668.

Erik Choi and Chirag Shah. 2016. User motivations for asking questions in online Q&A services. Journal of the Association for Information Science and Technology 67, 5 (2016), 1182–1197.

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186. https://doi.org/10.18653/v1/N19-1423

Marcos Menon José, Marcelo Archanjo José, Denis Deratani Mauá, and Fábio Gagliardi Cozman. 2022. Integrating Question Answering and Text-to-SQL in Portuguese. In Computational Processing of the Portuguese Language, Vládia Pinheiro, Pablo Gamallo, Raquel Amaro, Carolina Scarton, Fernando Batista, Diego Silva, Catarina Magro, and Hugo Pinto (Eds.). Springer International Publishing, Cham, 278–287.

Warut Khern-am nuai, Hossein Ghasemkhani, and Karthik Kannan. 2017. How questions and answers shape online marketplaces: The Case of Amazon answer. 50th Hawaii International Conference on System Sciences 50, 1 (2017), 853–862.

Ashish Kulkarni, Kartik Mehta, Shweta Garg, Vidit Bansal, Nikhil Rasiwasia, and Srinivasan Sengamedu. 2019. ProductQnA: Answering User Questions on E-Commerce Product Pages. In Companion Proceedings of The 2019 World Wide Web Conference (San Francisco, USA) (WWW ’19). Association for Computing Machinery, New York, NY, USA, 354–360. https://doi.org/10.1145/3308560.3316597

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2020. Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 2 (2020), 318–327. https://doi.org/10.1109/TPAMI.2018.2858826

Xiao-ping Liu and Wen-xiang Deng. 2018. The Researches on the Impact of Community Q&A Information Quality on Consumers’ Purchase Intention. Journal of Mathematics and Informatics 14 (08 2018), 45–52. https://doi.org/10.22457/jmi.v14a6

Alex Serban, Koen van der Blom, Holger Hoos, and Joost Visser. 2020. Adoption and Effects of Software Engineering Best Practices in Machine Learning. In Proceedings of the 14th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) (Bari, Italy) (ESEM ’20). Association for Computing Machinery, New York, NY, USA, Article 3, 12 pages. https://doi.org/10.1145/3382494.3410681

Fábio Souza, Rodrigo Nogueira, and Roberto Lotufo. 2020. BERTimbau: Pretrained BERT Models for Brazilian Portuguese. In Intelligent Systems, Ricardo Cerri and Ronaldo C. Prati (Eds.). Springer International Publishing, Cham, 403–417.