An Approach for Automatic Description of Characters for Blind People
Abstract: Audio description (AD), also called video description, is a vital accessibility resource for blind and visually impaired people. Automating it is difficult because it involves several subproblems, such as describing the scene, actions, emotions, and characters. This paper presents an approach for automatically describing characters in a video or image by combining Deep Learning (DL), face detection, facial expression detection, and audio synthesizers. The proposal runs the detection tools, applies DL models to the detected data, and generates an audio description. To evaluate its feasibility, we developed a proof of concept and ran computational experiments on it.
Keywords: accessibility, deep learning, blind people, audio description
ROCHA FILHO, Itamar; HONORATO, Felipe; LUCENA, J. Wallace; TEIXEIRA, J. Pedro; MARITAN, Tiago. An Approach for Automatic Description of Characters for Blind People. In: SIMPÓSIO BRASILEIRO DE SISTEMAS MULTIMÍDIA E WEB (WEBMEDIA), 1., 2021, Minas Gerais. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021. p. 53-56.
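
The abstract outlines a pipeline in which detection outputs are turned into a description that is then synthesized as audio. The sketch below illustrates only the middle step, turning per-face results into a sentence, and is not the authors' implementation. The `FaceObservation` type, its fields, and the functions `position_label` and `describe_characters` are hypothetical names introduced for illustration. The face detector, the expression classifier, and the audio synthesizer are assumed to exist upstream and downstream and are not modeled here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FaceObservation:
    """Hypothetical per-face output of the detection stages (an assumption)."""
    x_center: float  # horizontal face centre, normalised to the range [0, 1]
    expression: str  # label from an expression classifier, e.g. "happy"


def position_label(x_center: float) -> str:
    """Map a normalised horizontal position to a coarse spoken location."""
    if x_center < 1 / 3:
        return "on the left"
    if x_center < 2 / 3:
        return "in the center"
    return "on the right"


def describe_characters(faces: List[FaceObservation]) -> str:
    """Build a plain-text description to hand to an audio synthesizer."""
    if not faces:
        return "No characters are visible."
    # Describe characters from left to right so the order is predictable.
    ordered = sorted(faces, key=lambda face: face.x_center)
    count = len(ordered)
    noun = "character" if count == 1 else "characters"
    verb = "is" if count == 1 else "are"
    parts = [f"{count} {noun} {verb} visible."]
    for face in ordered:
        location = position_label(face.x_center)
        parts.append(f"The character {location} looks {face.expression}.")
    return " ".join(parts)
```

In a full system, the returned string would be passed to a text-to-speech engine. Keeping this step as plain text makes it testable without any model or audio dependency.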