A Machine Learning Framework for Real-Time Gesture and Skeleton-Based Action Recognition in Unit: Exploring Human-Compute-Interaction in Game Design and Interaction
2024 (Engelska)Självständigt arbete på avancerad nivå (masterexamen), 20 poäng / 30 hp
Studentuppsats (Examensarbete)
Abstract [en]
This master thesis presents a machine learning framework for real-time gesture and skeleton-based action recognition, integrated with the Unity game engine. The system aims to enhance human-computer interaction (HCI) in gaming and 3D related applications through natural movement recognition, by training a model on skeleton tracking data. The framework is trained to accurately categorize and identify gestures such as kicks and punches, enabling a more immersive gaming experience not existing in traditional controllers.
After studying the evolution of HCI and how machine learning has transformed and reshaped the interaction paradigm, the prototype system is built through data collection, augmenting, and preprocessing, followed by training and evaluating a Long Short-Term Memory (LSTM) neural network model for gesture classification. The model is integrated into Unity via Unity Sentis using Open Neural Network Exchange (ONNX) format, enabling efficient real-time action recognition in 3D space. Each component of the pipeline is available and adaptable for future custom- ization and needs, skeleton tracking and Unity integration is built using the ZED 2i camera and ZED SDK.
Experimental results demonstrate that the system presented can achieve over 90% accuracy in identifying predefined gestures. As a bridging solution tailored for Unity, this framework offers a practical solution to action recognition that could be found useful in future applications. This work contributes to advancing human-computer interaction and offers a foundation for further development in gesture-based Unity game design.
Ort, förlag, år, upplaga, sidor
2024. , s. 33
Serie
UMNAD ; 1489
Nyckelord [en]
Machine Learning Framework, Real-time Gesture Recognition, Skeleton-based Action Recognition, Unity Game Engine Integration, Human-Computer Interaction (HCI), Natural Movement Recognition, Skeleton Tracking Data
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:umu:diva-227161OAI: oai:DiVA.org:umu-227161DiVA, id: diva2:1877220
Externt samarbete
CLAYSTUDIO AB
Presentation
2024-05-31, MIT, UNIVERSITETSTORGET 4, 901 87 Umeå, 14:21 (Engelska)
Handledare
Examinatorer
2024-06-262024-06-252024-06-26Bibliografiskt granskad