AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments

Bibliographic Details
Title: AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments
Authors: Duricic, Tomislav; Müllner, Peter; Weidinger, Nicole; ElSayed, Neven; Kowald, Dominik; Veas, Eduardo
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Human-Computer Interaction, Computer Science - Information Retrieval
Description: Many industrial sectors rely on well-trained employees who are able to operate complex machinery. In this work, we demonstrate an AI-powered immersive assistance system that supports users in performing complex tasks in industrial environments. Specifically, our system leverages a VR environment that resembles a juice mixer setup. This digital twin of a physical setup simulates complex industrial machinery used to mix preparations or liquids (e.g., as in the pharmaceutical industry) and includes various containers, sensors, pumps, and flow controllers. This setup demonstrates our system's capabilities in a controlled environment while acting as a proof-of-concept for broader industrial applications. The core components of our multimodal AI assistant are a large language model and a speech-to-text model that process a video and audio recording of an expert performing the task in a VR environment. The video and speech input extracted from this recording enables the assistant to provide step-by-step guidance that supports users in executing complex tasks. This demonstration showcases the potential of our AI-powered assistant to reduce cognitive load, increase productivity, and enhance safety in industrial environments.
Comment: 3 pages, 2 figures, Demo Paper accepted at the 50th European Conference on Artificial Intelligence
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2407.09147
Accession Number: edsarx.2407.09147
Database: arXiv