Machine Learning Integration in Usability Evaluation: Advances, Challenges, and Future Applications
Abstract
Introduction: Usability evaluation is a well-established step without interface development, but it still relies heavily on manual processes that require time and expertise. With the advancement of Machine Learning (ML) techniques, interest in automated approaches that allow usability problems to be identified through the analysis of large volumes of data is growing. However, the landscape of these applications is fragmented, making it difficult to understand the current state of research on usability and ML. Objective: Given this context, this work investigates automated methods for ML-based usability evaluation. Methodology: To achieve this objective, a Systematic Literature Mapping (SLM) was performed. Results: The search of relevant databases yielded 238 studies, of which 47 were selected for qualitative analysis. The results show that the most common methods combine interaction metrics with supervised models such as SVM, Random Forest, CNNs, and KNN, applied to the analysis of Think-Aloud-type verbalizations, usage patterns, images, videos, and interface elements. The literature highlights technical advances, but also limitations regarding the generalization of models and integration with experts in HCI and UX. Future studies can explore hybrid approaches and expand the application contexts, especially in emerging technologies such asaugmented reality, conversational interfaces, and adaptive systems.
References
Assila, A., Marçal de Oliveira, K., e Ezzedine, H. (2016). An environment for integrating subjective and objective usability findings based on measures. In 2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS), pages 1–12. IEEE.
Asthana, S. e Singh, P. (2015). Maareech: Usability testing tool for voice response system using xmlbased user models. In Design, User Experience, and Usability: Design Discourse 4th International Conference, DUXU 2015, Held as Part of HCI International 2015, Los Angeles, CA, USA, August 2-7, 2015, Proceedings, Part I, volume 9186, pages 101–112. Springer.
Bakaev, M., Khvorostov, V., e Laricheva, T. (2017a). Assessing subjective quality of web interaction with neural network as context of use model. In Communications in Computer and Information Science, volume 745, pages 185–195. Springer Verlag.
Bakaev, M., Mamysheva, T., e Gaedke, M. (2017b). Current trends in automating usability evaluation of websites: Can you manage what you can’t measure? In Proceedings - 2016 11th International Forum on Strategic Technology, IFOST 2016, pages 510–514. Institute of Electrical and Electronics Engineers Inc.
Barbosa, S. e Silva, B. (2010). Interação humano-computador. Elsevier Brasil.
Basili, V. e Rombach, H. (1988). The tame project: towards improvement-oriented software environments. IEEE Transactions on Software Engineering, 14(6):758–773.
Cassino, R., Tucci, M., Vitiello, G., e Francese, R. (2015). Empirical validation of an automatic usability evaluation method. Journal of Visual Languages and Computing, 28:1–22.
Castro, J. W., Garnica, I., e Rojas, L. A. (2022). Automated tools for usability evaluation: a systematic mapping study. In International Conference on Human-Computer Interaction, pages 28–46. Springer.
del Gobbo, E., Guarino, A., Cafarelli, B., Grilli, L., e Limone, P. (2023). Automatic evaluation of open-ended questions for online learning. a systematic mapping. Studies in Educational Evaluation, 77:101258.
Devyat, V. V. e Tipsin, E. A. (2019). Automating usability evaluation of visual user interfaces in the tile logic. In Multi Conference on Computer Science and Information Systems, MCCSIS 2019 - Proceedings of the International Conferences on Interfaces and Human Computer Interaction 2019, Game and Entertainment Technologies 2019 and Computer Graphics, Visualization, Computer Vision and Image Processing 2019, pages 254–262. IADIS Press.
Dhengre, S., Mathur, J., Oghazian, F., Tan, X., e McComb, C. (2020). Towards enhanced creativity in interface design through automated usability evaluation. In ICCC, pages 366–369.
Elfaki, A. O. e Bassfar, Z. (2019). Auto-measuring usability method based on runtime user’s behavior: Case study for governmental web-based information systems. Journal of Theoretical and Applied Information Technology, 97:3559–3573.
Fan, M., Li, Y., e Truong, K. N. (2020). Automatic detection of usability problem encounters in think-aloud sessions. ACM Transactions on Interactive Intelligent Systems, 10.
Feijo Filho, J., Prata, W., e Oliveira, J. (2016). Where-how-what am i feeling: User context logging in automated usability tests for mobile software. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 9748, pages 14–23. Springer.
Ferre, X., Villalba, E., Julio, H., e Zhu, H. (2017). Extending mobile app analytics for usability test logging. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 10515 LNCS, pages 114–131. Springer Verlag.
Filho, J. F., Prata, W., e Valle, T. (2015). Emotions logging in automated usability tests for mobile devices. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 9186, pages 428–435. Springer Verlag.
Geng, R. e Tian, J. (2015). Improving web navigation usability by comparing actual and anticipated usage. IEEE Transactions on Human-Machine Systems, 45:84–94.
Goncalves, C. e Dias (2018). Improving usability evaluation by automating a standardized usability questionnaire. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 10918 LNCS, pages 379–395. Springer Verlag.
Goncalves, L. F., Vasconcelos, L. G., Munson, E. V., e Baldochi, L. A. (2016). Supporting adaptation of web applications to the mobile environment with automated usability evaluation. In Proceedings of the ACM Symposium on Applied Computing, volume 04-08-April-2016, pages 787–794. Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States.
Grigera, J., Garrido, A., e Rossi, G. (2017). Kobold: Web usability as a service. In IEEE International Conference on Automated Software Engineering (ASE), pages 990–995. IEEE.
Guimarães, A. A., de Sales, A. B., Santos, B. A., e Palmeira, E. G. (2022). Avaliação de características de usabilidade em jogos sérios em interação humano-computador. In Anais do XXXIII Simpósio Brasileiro de Informática na Educação, pages 505–516. SBC.
Gupta, S., Epiphaniou, G., e Maple, C. (2023). Ai-augmented usability evaluation framework for software requirements specification in cyber physical human systems. Internet of Things (Netherlands), 23.
Harms, P. (2019). Automated usability evaluation of virtual reality applications. ACM Transactions on Computer-Human Interaction, 26.
Hertzum, M. (2022). Usability testing: A practitioner’s guide to evaluating the user experience. Springer Nature.
Hwang, H. e Lee, Y. (2021). Usability problem identification based on explainable neural network in asynchronous testing environment. Interacting with Computers, 33:155–166.
Issa, T. e Isaias, P. (2022). Usability and human–computer interaction (hci). In Sustainable design: HCI, usability and environmental concerns, pages 23–40. Springer.
Janiesch, C., Zschech, P., e Heinrich, K. (2021). Machine learning and deep learning. Electronic Markets, 31(3):685–695.
Jarraya, M. e Moussa, F. (2018). Proxy oriented approach for evaluating usability of a resilient life-critical interactive systems. In International Conference on Advanced Information Networking and Applications (AINA), pages 464–471. IEEE.
Kaminska, D., Zwolinski, G., e Laska-Lesniewicz, A. (2022). Usability testing of virtual reality applications-the pilot study. SENSORS, 22.
Khasnis, S. S., Raghuram P., S., Aditi, A., Samrakshini, R., e Namratha, M. (2019). Analysis of automation in the field of usability evaluation. In 2019 1st International Conference on Advanced Technologies in Intelligent Control, Environment, Computing & Communication Engineering (ICATIECE), pages 85–91. IEEE.
Kitchenham, B. (2004). Procedures for performing systematic reviews. Keele, UK, Keele University, 33(2004):1–26.
Kitchenham, B. e Charters, S. (2007). Guidelines for performing systematic literature reviews in software engineering.
Koch, J. e Oulasvi, A. (2016). Computational layout perception using gestalt laws. In Conference on Human Factors in Computing Systems - Proceedings, volume 07-12-May-2016, pages 1423–1429. Association for Computing Machinery.
Kuang, E. (2023). Crafting human-ai collaborative analysis for user experience evaluation. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery.
Kuang, E., Jahangirzadeh Soure, E., Fan, M., Zhao, J., e Shinohara, K. (2023). Collaboration with conversational ai assistants for ux evaluation: Questions and how to ask them (voice vs. text). In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery.
Kuang, E., Li, M., Fan, M., e Shinohara, K. (2024). Enhancing ux evaluation through collaboration with conversational ai assistants: Effects of proactive dialogue and timing. In Conference on Human Factors in Computing Systems - Proceedings, pages 1–16. Association for Computing Machinery.
Lecaros, A., Moquillaza, A., Falconi, F., Aguirre, J., Ramos, C., e Paz, F. (2024). Automation of granollers heuristic evaluation method using a developed support system: A case study. In Communications in Computer and Information Science, volume 1877 CCIS, pages 93–108. Springer Science and Business Media Deutschland GmbH.
Li, Y., Feng, Y., Hao, R., e Chen, Z. (2023). Human-machine collaborative testing for android applications. In IEEE International Conference on Software Quality, Reliability and Security, QRS, pages 440–451. Institute of Electrical and Electronics Engineers Inc.
Lu, J., Schmidt, M., Lee, M., e Huang, R. (2022). Usability research in educational technology: A state-of-the-art systematic review. Educational technology research and development, 70(6):1951–1992.
Malan, K. M., Eloff, J. H., e de Bruin, J. A. (2018). Semi-automated usability analysis through eye tracking. South African Computer Journal, 30:66–84.
Maqbool, B. e Herold, S. (2024). Potential effectiveness and efficiency issues in usability evaluation within digital health: A systematic literature review. Journal of Systems and Software, 208:111881.
Marenkov, J., Robal, T., e Kalja, A. (2017). A tool for design-time usability evaluation of web user interfaces. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 10509 LNCS, pages 394–407. Springer Verlag.
Marenkov, J., Robal, T., e Kalja, A. (2018). Guideliner: a tool to improve web ui development for better usability. In WIMS ’18: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, page 9. Association for Computing Machinery.
Matos Claro, S., Ruiz de la Peña, J., Lamoth Borrero, L., e Snoeck, M. (2024). Technology for automatic usability evaluation using model driven engineering. In International Conference on Business Process Modeling, Development and Support, pages 191–200. Springer.
Moore, S., Costello, E., Nguyen, H. A., e Stamper, J. (2024). An automatic question usability evaluation toolkit. arXiv preprint arXiv:2405.20529.
Namoun, A., Alrehaili, A., e Tufail, A. (2021). A review of automated website usability evaluation tools: Research issues and challenges. In International Conference on Human-Computer Interaction, pages 292–311. Springer.
Nielsen, J. (1994). Usability engineering. Morgan Kaufmann.
Novák, J. Š., Masner, J., Benda, P., Šimek, P., e Merunka, V. (2024). Eye tracking, usability, and user experience: A systematic review. International Journal of Human–Computer Interaction, 40(17):4484–4500.
Ntoa, S., Margetis, G., Antona, M., e Stephanidis, C. (2018). Uxami observer: An automated user experience evaluation tool for ambient intelligence environments. In Advances in Intelligent Systems and Computing, volume 868, pages 1350–1370. Springer Verlag.
Padovani, S. e Schlemmer, A. (2021). Ensaio de interação ou teste de usabilidade... afnal, do que estamos falando? In CONGRESSO INTERNACIONAL DE DESIGN DA INFORMAÇÃO, volume 10, pages 1154–1171.
Petticrew, M. e Roberts, H. (2008). Systematic Reviews in the Social Sciences: A Practical Guide. Wiley.
Qasrawi, A., Vicunapolo, S., e Qasrawi, R. (2021). User experience and performance evaluation of palestinian universities websites. In Proceedings - 2021 International Conference on Promising Electronic Technologies, ICPET 2021, pages 73–77. Institute of Electrical and Electronics Engineers Inc.
Santos, F. d. S., Treviso, M. V., Gama, S. P., e de Mattos Fortes, R. P. (2022). A framework to semi-automated usability evaluations processing considering users’ emotional aspects. In Human-Computer Interaction. Theoretical Approaches and Design Methods: Thematic Area, HCI 2022, Held as Part of the 24th HCI International Conference, HCII 2022, Virtual Event, June 26–July 1, 2022, Proceedings, Part I, volume 13302, pages 419–438. Springer.
Schaffer, S., Schleicher, R., e Möller, S. (2015). Modeling input modality choice in mobile graphical and speech interfaces. International Journal of Human Computer Studies, 75:21–34.
Sergieieva, K., Bitchou, T. M. N., e Meixner, G. (2020). Task identifcation framework to automatically detect anomalies in users’ interactions with mobile application to support usability evaluation. In Advances in Intelligent Systems and Computing, volume 1018, pages 421–427. Springer Verlag.
Sodhar, I. N., Mirani, A. A., e Sodhar, A. N. (2019). Automated usability evaluation of government and private sector educational websites of pakistan. Information Sciences Letters, 8:51–55.
Torres-Molina, R. e Seyam, M. (2023). The intersection of usability evaluation and machine learning in software systems. In 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI), pages 122–127. IEEE Computer Society.
Zytek, A., Liu, D., Vaithianathan, R., e Veeramachaneni, K. (2021). Sibyl: Understanding and addressing the usability challenges of machine learning in high-stakes decision making. IEEE Transactions on Visualization and Computer Graphics, 28(1):1161–1171.
