Deep Deterministic Policy Gradient and Computer Vision for Autonomous Wheelchair Navigation in Human-Shared Environments

  • Paulo De Almeida Afonso UFPel
  • Paulo Roberto Ferreira UFPel

Resumo


We present an innovative approach to autonomous navigation for motorized wheelchairs in shared indoor spaces with human presence, combining Deep Reinforcement Learning and Computer Vision. The primary aim of this study is to enhance the well-being of individuals with disabilities who rely on such assistance for their mobility needs. Our results demonstrate that the Deep Deterministic Policy Gradient (DDPG) algorithm combined with computer vision techniques outperforms the Deep Q-Network (DQN) in both efficiency and stability. Throughout all analyzed stages, DDPG achieved higher average success rates: 98% (Stage 01), 89% (Stage 02), 86% (Stage 03), and 86% (Stage 04), showcasing generalization capabilities and consistently superior performance in diverse settings. In contrast, DQN struggled to avoid collisions, resulting in significantly lower average success rates: 3% (Stage 02), 14% (Stage 03), and 29% (Stage 04). These findings highlight the promising potential of our proposed solution and contribute to the advancement of research in this field.
Palavras-chave: Training, Computer vision, Navigation, Wheelchairs, Deep reinforcement learning, Stability analysis, Indoor environment, Mobile robots, Autonomous robots, Testing, Deep Reinforcement Learning, Robotic wheelchair, Autonomous navigation, Computer Vision
Publicado
13/11/2024
AFONSO, Paulo De Almeida; FERREIRA, Paulo Roberto. Deep Deterministic Policy Gradient and Computer Vision for Autonomous Wheelchair Navigation in Human-Shared Environments. In: SIMPÓSIO BRASILEIRO DE ROBÓTICA E SIMPÓSIO LATINO AMERICANO DE ROBÓTICA (SBR/LARS), 16. , 2024, Goiânia/GO. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 103-108.