skip to main content
10.1145/3472301.3484338acmotherconferencesArticle/Chapter ViewAbstractPublication PagesihcConference Proceedingsconference-collections
research-article

Methodology based on Computer Vision and Machine Learning to guide the Design of Augmentative and Alternative Communication Systems using Personalized Gestural Interaction

Published:18 October 2021Publication History

ABSTRACT

People who have motor disabilities associated with some difficulty in speech demand alternative means to interact with other people and the environment in which they are inserted. Augmentative and Alternative Communication (AAC) refers to all communication forms that can complement or replace speech. The practice of AAC mediated by computational applications represents a very attractive alternative. For people with motor and speech difficulties, gestural interaction can be a way to make interaction with AAC systems feasible. In this thesis, a methodology called MyPGI (Methodology to yield Personalized Gestural Interaction) was developed and validated to guide the design of AAC systems for people with motor and speech difficulties in order to promote greater autonomy for them in handling computer systems and for monitoring by their caregivers. The methodology uses Computer Vision and Machine Learning techniques to enable non-invasive and personalized gestural interaction using low-cost devices. MyPGI was applied and evaluated in real case studies with people with disabilities, informing the design of an interactive system named PGCA (Personal Gesture Communication Assistant) to allow the creation and use of personalized gestural languages for people with severe motor and speech difficulties. This thesis, defended at the Federal University of Parana's Graduate Program in Informatics, shows conceptual, methodological, and technical contributions, with publications in high-level vehicles, registered software, and social technology available for free and open use.

References

  1. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, et al. 2016. Tensor-flow: a system for large-scale machine learning.. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Vol. 16. 265--283.Google ScholarGoogle Scholar
  2. Ethem Alpaydin. 2020. Introduction to machine learning. MIT Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Rúbia E O Schultz Ascari, Roberto Pereira, and Luciano Silva. 2018. Mobile Interaction for Augmentative and Alternative Communication: a Systematic Mapping. SBC Journal on 3D Interactive Systems 9, 2 (2018), 105--118.Google ScholarGoogle Scholar
  4. Rúbia E O Schultz Ascari, Roberto Pereira, and Luciano Silva. 2018. Towards a Methodology to Support Augmentative and Alternative Communication by means of Personalized Gestural Interaction. In Proceedings of the 17th Brazilian Symposium on Human Factors in Computing Systems (IHC 2018). ACM, 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Rúbia E O Schultz Ascari, Roberto Pereira, and Luciano Silva. 2020. Computer vision-based methodology to improve interaction for people with motor and speech impairment. ACM Transactions on Accessible Computing (TACCESS) 1, 1 (2020), 1--33.Google ScholarGoogle Scholar
  6. Rúbia E O Schultz Ascari, Roberto Pereira, and Luciano Silva. 2020. Personalized Gestural Interaction Applied in a Gesture Interactive Game-based Approach for Disabled People. In Proceedings of the 25th International Conference on Intelligent User Interfaces (IUI 2020). ACM, 1--11.Google ScholarGoogle Scholar
  7. Rúbia E O Schultz Ascari, Roberto Pereira, and Luciano Silva. 2021. Computer vision applied to improve interaction and communication of people with motor disabilities: A systematic mapping. Technology and Disability 33, 1 (2021), 1--18.Google ScholarGoogle Scholar
  8. Rúbia E O Schultz Ascari, Luciano Silva, and Roberto Pereira. 2019. Personalized Interactive Gesture Recognition Assistive Technology. In Proceedings of the 18th Brazilian Symposium on Human Factors in Computing Systems (IHC 2019). ACM, 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Cecilia Calani Baranauskas, Clarisse S. de Souza, and Roberto Pereira. 2015. GranDIHC-BR-Grand Research Challenges for Human-Computer Interaction in Brazil. Human-Computer Interaction Special Committee (CEIHC) of the Brazilian Computer Society (SBC) (2015).Google ScholarGoogle Scholar
  10. Samit Bhattacharya, Debasis Samanta, and Anupam Basu. 2008. Performance models for automatic evaluation of virtual scanning keyboards. IEEE Transactions on Neural Systems and Rehabilitation Engineering 16, 5 (2008), 510--519.Google ScholarGoogle ScholarCross RefCross Ref
  11. Chi-hau Chen. 2015. Handbook of Pattern Recognition and Computer Vision. World Scientific.Google ScholarGoogle Scholar
  12. Louis Cohen, Lawrence Manion, and Keith Morrison. 2000. Research methods in education [5 th edn] London: Routledge Falmer. Teaching in higher education 41 (2000), 21.Google ScholarGoogle Scholar
  13. Wikimedia Commons. 2017. CommonsWikimedia. https://commons.wikimedia.org/wiki/File:Turtle_clip_art.svg. Acessado em 12/05/2020.Google ScholarGoogle Scholar
  14. Sibanjan Das. 2016. Classification methods. In Data Science Using Oracle Data Miner and Oracle R Enterprise. Springer, 189--237.Google ScholarGoogle Scholar
  15. Sky Box-Sunny Day. 2015. OpenGameArt.org. https://opengameart.org. Acessado em 12/05/2020.Google ScholarGoogle Scholar
  16. Bradley Efron. 1992. Bootstrap methods: another look at the jackknife. In Breakthroughs in statistics. Springer, 569--593.Google ScholarGoogle Scholar
  17. S Yu Eroshkin, NA Kameneva, DV Kovkov, and AI Sukhorukov. 2017. Conceptual system in the modern information management. Procedia Computer Science 103 (2017), 609--612.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ivo Everts, Nicu Sebe, Graeme A Jones, et al. 2007. Cooperative Object Tracking with Multiple PTZ Cameras. In International Conference on Image Analysis and Processing (ICIAP 2007), Vol. 7. 323--330.Google ScholarGoogle Scholar
  19. Seymour Geisser. 1975. The predictive sample reuse method with applications. J. Amer. Statist. Assoc. 70, 350 (1975), 320--328.Google ScholarGoogle ScholarCross RefCross Ref
  20. Cindy Gevarter et al. 2013. Comparisons of intervention components within augmentative and alternative communication systems for individuals with developmental disabilities: A review of the literature. Research in developmental disabilities 34, 12 (2013), 4404--4414.Google ScholarGoogle Scholar
  21. Google. 2019. Google Image. http://images.google.com. Acessado em 12/11/2019.Google ScholarGoogle Scholar
  22. Shaun K Kane, Amy Hurst, Erin Buehler, Patrick A Carrington, and Michele A Williams. 2014. Collaboratively designing assistive technology. Interactions 21, 2 (2014), 78--81.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Ron Kohavi. 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of Fourteenth International Joint Conference on Artificial Intelligence (IJCAI), Vol. 14. Montreal, Canada, 1137--1145.Google ScholarGoogle Scholar
  24. Denis Lalanne, Laurence Nigay, Peter Robinson, et al. 2009. Fusion engines for multimodal input: a survey. In Proceedings of the 2009 International Conference on Multimodal Interfaces. ACM, 153--160.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Larry Laudan. 1978. Progress and its problems: Towards a theory of scientific growth. Vol. 282. Univ of California Press.Google ScholarGoogle Scholar
  26. Bruce D Lucas and Takeo Kanade. 1981. An iterative image registration technique with an application to stereo vision. In Proceedings of the 7th International Joint Conference on Artificial Intelligence. Vancouver, BC, Canada.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Sharynne McLeod. 2004. Speech pathologists' application of the ICF to children with speech impairment. Advances in Speech Language Pathology 6, 1 (2004), 75--81.Google ScholarGoogle ScholarCross RefCross Ref
  28. Laura Montanini, Enea Cippitelli, Ennio Gambi, and Susanna Spinsante. 2015. Low complexity head tracking on portable android devices for real time message composition. Journal on Multimodal User Interfaces 9, 2 (2015), 141--151.Google ScholarGoogle ScholarCross RefCross Ref
  29. Johann Mouton. 1996. Understanding social research. Van Schaik Publishers.Google ScholarGoogle Scholar
  30. Hoang D Nguyen and Danny Chiang Choon Poo. 2017. Unified Structured Framework for mHealth Analytics: Building an Open and Collaborative Community. In International Conference on Social Computing and Social Media. Springer, 440--450.Google ScholarGoogle ScholarCross RefCross Ref
  31. Antti Oulasvirta and Kasper Hornbæk. 2016. HCI research as problem-solving. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 4956--4967.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Roger Pressman and Bruce Maxim. 2016. Engenharia de Software - 8a Edição. McGraw Hill Brasil.Google ScholarGoogle Scholar
  33. Olga Russakovsky, Jia Deng, Hao Su, et al. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211--252.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Sancho Salcedo-Sanz, José Luis Rojo-Álvarez, Manel Martínez-Ramón, and Gustavo Camps-Valls. 2014. Support vector machines in engineering: an overview. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 4, 3 (2014), 234--267.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2818--2826.Google ScholarGoogle ScholarCross RefCross Ref
  36. Martin A Tanner and Wing Hung Wong. 1987. The calculation of posterior distributions by data augmentation. Journal of the American statistical Association 82, 398 (1987), 528--540.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Methodology based on Computer Vision and Machine Learning to guide the Design of Augmentative and Alternative Communication Systems using Personalized Gestural Interaction

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          IHC '21: Proceedings of the XX Brazilian Symposium on Human Factors in Computing Systems
          October 2021
          523 pages
          ISBN:9781450386173
          DOI:10.1145/3472301

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 October 2021

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          IHC '21 Paper Acceptance Rate29of77submissions,38%Overall Acceptance Rate331of973submissions,34%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader