TY - JOUR
T1 - Advanced Human-Robot Interaction with a Mobile Robot
T2 - International Neural Network Society Workshop on Deep Learning Innovations and Applications, IJCNN 2025
AU - Triana, Elio D.R.
AU - Loureiro, Matheus
AU - Machado, Fabiana
AU - Mello, Ricardo
AU - Jimenez H, Mario F.
AU - Frizera-Neto, Anselmo
N1 - Publisher Copyright:
© 2025 The Authors. Published by Elsevier B.V.
PY - 2025
Y1 - 2025
N2 - Advancements in assistive robotic systems demand sophisticated Human-Robot Interaction (HRI) strategies, particularly in multilingual environments. This study introduces an HRI framework that integrates Natural Language Processing (NLP) via two Large Language Models (LLMs) accessed through backend applications interfacing with OpenAI and Deepgram APIs. The system employs a responsive chat-style interface that is usable across multiple devices, supporting Brazilian Portuguese, a notable improvement over similar projects. An experimental protocol was conducted with 20 native Portuguese-speaking volunteers to evaluate the system's interpretive performance and user perception. Participants guided a mobile robot using voice commands to complete a predefined trajectory in an obstacle-free environment. Usability evaluations using the System Usability Scale yielded consistently high scores throughout the experiment, ranging from "good"to "excellent". However, users noted slightly reduced accuracy perception in fully autonomous LLM mode compared to a pre-programmed mode, alongside increased frustration. These findings validate the feasibility of integrating LLMs into multilingual robotic systems, emphasizing both the promise and limitations of NLP in HRI. Future work could focus on improving the handling of ambiguous user requests and enhance feedback mechanisms to improve user experience.
AB - Advancements in assistive robotic systems demand sophisticated Human-Robot Interaction (HRI) strategies, particularly in multilingual environments. This study introduces an HRI framework that integrates Natural Language Processing (NLP) via two Large Language Models (LLMs) accessed through backend applications interfacing with OpenAI and Deepgram APIs. The system employs a responsive chat-style interface that is usable across multiple devices, supporting Brazilian Portuguese, a notable improvement over similar projects. An experimental protocol was conducted with 20 native Portuguese-speaking volunteers to evaluate the system's interpretive performance and user perception. Participants guided a mobile robot using voice commands to complete a predefined trajectory in an obstacle-free environment. Usability evaluations using the System Usability Scale yielded consistently high scores throughout the experiment, ranging from "good"to "excellent". However, users noted slightly reduced accuracy perception in fully autonomous LLM mode compared to a pre-programmed mode, alongside increased frustration. These findings validate the feasibility of integrating LLMs into multilingual robotic systems, emphasizing both the promise and limitations of NLP in HRI. Future work could focus on improving the handling of ambiguous user requests and enhance feedback mechanisms to improve user experience.
UR - https://www.scopus.com/pages/publications/105013955699
UR - https://www.scopus.com/inward/citedby.url?scp=105013955699&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2025.07.139
DO - 10.1016/j.procs.2025.07.139
M3 - Conference article
AN - SCOPUS:105013955699
SN - 1877-0509
VL - 264
SP - 280
EP - 289
JO - Procedia Computer Science
JF - Procedia Computer Science
Y2 - 30 June 2025 through 5 July 2025
ER -