Robotic Action Planning Using Large Language Models

César Bastos Da Silva; Juan Luis Barraza Ramírez; Niederauer Mastelari; Roberto Lotufo; Jayr Pereira; Eric Rohmer

César Bastos Da Silva UNICAMP
Juan Luis Barraza Ramírez UNICAMP
Niederauer Mastelari UNICAMP
Roberto Lotufo NeuralMind.ai
Jayr Pereira UFCA
Eric Rohmer UNICAMP

Resumo

This study explores the integration of artificial intelligence (AI) and large language models (LLMs) in robotics, focusing on task planning and execution. We implemented a Reasoning and Acting (ReAct) system within a simulated environment, utilizing a humanoid robot equipped with various tools for searching, locomotion, vision, manipulation, and communication. The robot operates based on natural language prompts and utilizes the LangChain framework to facilitate interaction with the LLM. We conducted experiments to evaluate the robot’s performance on tasks requiring short-term, medium-term, and long-term memory. Short-term memory tasks involved single-step actions, medium-term memory tasks needed the completion of two-step sequences, and long-term memory tasks involved three or more steps. The results demonstrated a high success rate for short-term tasks, while performance for medium and long-term tasks varied depending on the number of steps involved. Our findings highlight both the challenges and potential of using AI and LLMs in robotic task planning. The results demonstrate the promise of enhancing robotic capabilities to perform complex tasks through natural language instructions. The work is available on: https://github.com/cesarbds/LLM_Planner

Palavras-chave: Industries, Service robots, Large language models, Natural languages, Memory management, Humanoid robots, Cognition, Planning, Prompt engineering, Robots, ReAct, Task Planning, LLM, AI, Robotics Actions