Enhancing Aspect-Based Sentiment Analysis for Portuguese Using Instruction Tuning

Gabriel Pereira; Luciano Barbosa; Johny Moreira; Tiago Melo; Altigran Silva

doi:10.5753/eniac.2024.245109

Gabriel Pereira (UFPE)
Luciano Barbosa (UFPE)
Johny Moreira (UFAM)
Tiago Melo (UEA)
Altigran Silva (UFAM)

Abstract

This study explores instruction tuning of open-source small language models for Portuguese End-to-End Aspect-Based Sentiment Analysis (E2E-ABSA), focusing on restaurant reviews. Using a diverse dataset drawn from sources such as Google Reviews, TripAdvisor, Instagram, and iFood, the research compares PTT5 Base, a T5 model pretrained on Portuguese data, against two multilingual models, FLAN-T5 Base and mT0 Small. PTT5 Base performs best on E2E-ABSA, achieving an F1 score of 0.60, precision of 0.61, and recall of 0.59. These findings underline the importance of language-specific pretraining when analyzing customer opinions with ABSA.
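For readers unfamiliar with how precision, recall, and F1 are typically obtained in E2E-ABSA, the following is a minimal sketch, not the authors' evaluation code. It assumes that each review is annotated with a set of (aspect, polarity) pairs, that a prediction counts as correct only on an exact match of both elements, and that scores are micro-averaged over all reviews; the function names and the toy Portuguese examples are hypothetical.

```python
def f1_from(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def prf1(gold, pred):
    """Micro-averaged precision, recall and F1 over exact-match pairs.

    gold, pred: one list of (aspect, polarity) tuples per review.
    Assumption: a predicted pair is a true positive only if the same
    aspect term with the same polarity appears in the gold annotation.
    """
    tp = n_pred = n_gold = 0
    for gold_pairs, pred_pairs in zip(gold, pred):
        g, p = set(gold_pairs), set(pred_pairs)
        tp += len(g & p)      # pairs correct in both aspect and polarity
        n_pred += len(p)
        n_gold += len(g)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    return precision, recall, f1_from(precision, recall)


# Hypothetical toy data: two reviews, three gold pairs in total.
gold = [[("comida", "positive"), ("atendimento", "negative")],
        [("preço", "negative")]]
pred = [[("comida", "positive"), ("atendimento", "positive")],
        [("preço", "negative")]]

precision, recall, f1 = prf1(gold, pred)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
```

Under this definition, the harmonic mean of the reported precision (0.61) and recall (0.59) is about 0.60, which agrees with the reported F1 score. If the paper averages per class or per review instead, the relation between the three numbers need not hold exactly.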

Keywords: Generative AI, Small Language Models, Natural Language Processing, Transformers

