Automatic Text Recognition in Web Images

  • Rodolfo Valiente USP
  • José C. Gutiérrez USP
  • Marcelo T. Sadaike USP
  • Graça Bressan USP

Resumo


Web images play an important role in delivering multimedia content on the Web. The text embedded in web images carry semantic information related to layout and content of the pages. Statistics show that there is a significant need to detect and recognize text from web images. This paper presents an architecture that efficiently integrates localization, extraction and recognition algorithms applied to text recognition in web images. In the recognition step is proposed a procedure based on super-resolution and an iterative method for improving the performance. The approach is implemented and evaluated using Matlab and cloud computing, making the system flexible, scalable and robust in detecting texts from complex web images with different orientations, dimensions and colors. Competitive results are presented, both in precision and recognition rate, when compared with other systems in the existing literature.
Publicado
17/10/2017
VALIENTE, Rodolfo; GUTIÉRREZ, José C.; SADAIKE, Marcelo T.; BRESSAN, Graça. Automatic Text Recognition in Web Images. In: SIMPÓSIO BRASILEIRO DE SISTEMAS MULTIMÍDIA E WEB (WEBMEDIA), 23. , 2017, Gramado. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2017 . p. 241-244.

Artigos mais lidos do(s) mesmo(s) autor(es)