An Approach for Automatic Description of Characters for Blind People

  • Itamar Rocha Filho UFPB
  • Felipe Honorato UFPB
  • J. Wallace Lucena UFPB
  • J. Pedro Teixeira UFPB
  • Tiago Maritan UFPB

Abstract


Audio Description (AD), also called Video Description, is a vital accessibility resource in the lives of blind and visually impaired people. Automating it is difficult because it involves several subproblems, such as describing the scenario, actions, emotions, and characters. This paper presents an approach to automatically describe characters in a video or image by combining Deep Learning (DL), face detection, facial expression detection, and audio synthesizers. Our proposal runs the detection tools, applies DL models to the analyzed data, and generates an audio description. To assess its feasibility, we developed a proof of concept of the solution and ran computational experiments on it.
Keywords: accessibility, deep learning, blind people, audio description
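The flow outlined in the abstract (detect faces, classify expressions, turn the result into text, hand the text to a speech synthesizer) can be sketched as below. This is only an illustrative outline under stated assumptions, not the authors' implementation: `FaceObservation`, `describe_characters`, `run_pipeline`, the expression labels, and the sentence template are all hypothetical, and the detector and synthesizer are passed in as plain callables rather than tied to any particular library.

```python
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass
class FaceObservation:
    """One detected face (hypothetical detector output)."""
    expression: str  # assumed label such as "happy" or "sad"


def describe_characters(faces: List[FaceObservation]) -> str:
    """Turn detector output into one sentence suitable for speech synthesis."""
    if not faces:
        return "No characters detected."
    noun = "character" if len(faces) == 1 else "characters"
    details = "; ".join(
        f"character {i} looks {face.expression}"
        for i, face in enumerate(faces, start=1)
    )
    return f"{len(faces)} {noun} detected: {details}."


def run_pipeline(
    frame: Any,
    detect: Callable[[Any], List[FaceObservation]],
    synthesize: Callable[[str], None],
) -> str:
    """Detect faces in a frame, build the description, and send it to audio."""
    faces = detect(frame)        # assumed: face + expression detection models
    text = describe_characters(faces)
    synthesize(text)             # assumed: any text-to-speech backend
    return text


if __name__ == "__main__":
    # Stub detector and synthesizer stand in for real models in this sketch.
    def fake_detect(_frame: Any) -> List[FaceObservation]:
        return [FaceObservation("happy"), FaceObservation("sad")]

    run_pipeline(frame=None, detect=fake_detect, synthesize=print)
```

Keeping the detector and synthesizer as injected callables means the description step can be checked on its own, without loading a model or an audio device.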
Published
05/11/2021
ROCHA FILHO, Itamar; HONORATO, Felipe; LUCENA, J. Wallace; TEIXEIRA, J. Pedro; MARITAN, Tiago. An Approach for Automatic Description of Characters for Blind People. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 1. , 2021, Minas Gerais. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021. p. 53-56.