An Approach for Automatic Description of Characters for Blind People
Abstract
Audio Description (AD), also called Video Description, is a vital accessibility resource in the lives of blind and visually impaired people. Automating it is difficult because it involves several subproblems, such as describing the scene, actions, emotions, and characters. This paper presents an approach to automatically describe characters in a video or image by combining Deep Learning (DL), face detection, facial expression detection, and audio synthesis. The proposal runs the detection tools, applies DL models to the analyzed data, and generates an audio description. To evaluate its feasibility, we developed a proof of concept of the solution and ran computational experiments on it.
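The step between detection and audio synthesis can be illustrated with a minimal sketch. It assumes the detection stage yields, for each face, a rough position and an expression label, and it builds the sentence that would be handed to a speech synthesizer. The names `CharacterObservation` and `describe_characters` are hypothetical and do not come from the paper; the actual models and sentence templates used by the authors are not stated in this abstract.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CharacterObservation:
    """Hypothetical output of the detection stage for one detected face."""
    position: str    # assumed labels such as "left", "center", "right"
    expression: str  # assumed labels such as "happy", "sad", "neutral"


def describe_characters(observations: List[CharacterObservation]) -> str:
    """Build one sentence describing the detected characters.

    The returned text is what a speech synthesizer would read aloud;
    the synthesis step itself is outside this sketch.
    """
    if not observations:
        return "No characters are visible."
    count = len(observations)
    noun = "character" if count == 1 else "characters"
    verb = "is" if count == 1 else "are"
    parts = [f"one at the {o.position} looks {o.expression}" for o in observations]
    return f"{count} {noun} {verb} visible: " + "; ".join(parts) + "."
```

For example, two observations (`"left"`, `"happy"`) and (`"right"`, `"sad"`) produce a single sentence that names both characters. Keeping text generation separate from detection and synthesis lets each stage be replaced independently.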
Keywords:
accessibility, deep learning, blind people, audio description
Published
05/11/2021
How to Cite
ROCHA FILHO, Itamar; HONORATO, Felipe; LUCENA, J. Wallace; TEIXEIRA, J. Pedro; MARITAN, Tiago.
An Approach for Automatic Description of Characters for Blind People. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 1., 2021, Minas Gerais. Proceedings [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021. p. 53-56.