Generating facial expressions from emotion and speech

Arthur William Dórea Melo; Caio Vasconcelos Araújo Figueiredo; Pedro Moreira Guerra de Almeida; Renan Silva Ferreira; Guilherme Vieira Moraes; Victor Flávio de Andrade Araujo

doi:10.5753/sbgames_estendido.2025.14708

Arthur William Dórea Melo UNIT
Caio Vasconcelos Araújo Figueiredo UNIT
Pedro Moreira Guerra de Almeida UNIT
Renan Silva Ferreira UNIT
Guilherme Vieira Moraes UNIT
Victor Flávio de Andrade Araujo UNIT / INCT-SANI

Abstract

Introduction: This paper presents a method for converting audio files into facial expressions. Objective: To transform emotive audio recordings into the corresponding facial expressions. Methodology or Steps: The work uses interpolation, C#, Unity, Google Colab, and Adobe Firefly to carry out the audio-to-facial-expression conversions. Results: Audio files were successfully converted into equivalent facial expressions.

Keywords: AI, audio-to-image, Unity, Face, Emotion
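
The abstract names interpolation as one of the techniques but does not say which kind is used or what is being interpolated. As a minimal sketch only, assuming plain linear interpolation between two sets of facial-expression weights, the idea could look like the Python below. The weight names (`brow_raise`, `mouth_smile`, `jaw_open`), the `NEUTRAL` and `HAPPY` presets, and the helper functions are hypothetical and are not taken from the paper.

```python
from typing import Dict


def lerp(a: float, b: float, t: float) -> float:
    """Linearly interpolate between a and b for t in [0, 1]."""
    return a + (b - a) * t


def blend_expressions(
    start: Dict[str, float], end: Dict[str, float], t: float
) -> Dict[str, float]:
    """Blend two expression weight sets; t is clamped to [0, 1].

    A weight missing from one set is treated as 0.0 (assumption).
    """
    t = max(0.0, min(1.0, t))
    keys = sorted(set(start) | set(end))
    return {k: lerp(start.get(k, 0.0), end.get(k, 0.0), t) for k in keys}


# Hypothetical expression presets; the values are illustrative only.
NEUTRAL = {"brow_raise": 0.0, "mouth_smile": 0.0, "jaw_open": 0.0}
HAPPY = {"brow_raise": 0.3, "mouth_smile": 1.0, "jaw_open": 0.2}

# Expression halfway through a neutral-to-happy transition.
halfway = blend_expressions(NEUTRAL, HAPPY, 0.5)
```

Stepping `t` from 0 to 1 over successive frames would produce a gradual transition between the two expressions. A curve-based interpolation could replace `lerp` without changing the rest of the sketch.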
