A Machine Learning Framework for Real-Time Gesture and Skeleton-Based Action Recognition in Unit: Exploring Human-Compute-Interaction in Game Design and Interaction
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
This master thesis presents a machine learning framework for real-time gesture and skeleton-based action recognition, integrated with the Unity game engine. The system aims to enhance human-computer interaction (HCI) in gaming and 3D related applications through natural movement recognition, by training a model on skeleton tracking data. The framework is trained to accurately categorize and identify gestures such as kicks and punches, enabling a more immersive gaming experience not existing in traditional controllers.
After studying the evolution of HCI and how machine learning has transformed and reshaped the interaction paradigm, the prototype system is built through data collection, augmenting, and preprocessing, followed by training and evaluating a Long Short-Term Memory (LSTM) neural network model for gesture classification. The model is integrated into Unity via Unity Sentis using Open Neural Network Exchange (ONNX) format, enabling efficient real-time action recognition in 3D space. Each component of the pipeline is available and adaptable for future custom- ization and needs, skeleton tracking and Unity integration is built using the ZED 2i camera and ZED SDK.
Experimental results demonstrate that the system presented can achieve over 90% accuracy in identifying predefined gestures. As a bridging solution tailored for Unity, this framework offers a practical solution to action recognition that could be found useful in future applications. This work contributes to advancing human-computer interaction and offers a foundation for further development in gesture-based Unity game design.
Place, publisher, year, edition, pages
2024. , p. 33
Series
UMNAD ; 1489
Keywords [en]
Machine Learning Framework, Real-time Gesture Recognition, Skeleton-based Action Recognition, Unity Game Engine Integration, Human-Computer Interaction (HCI), Natural Movement Recognition, Skeleton Tracking Data
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-227161OAI: oai:DiVA.org:umu-227161DiVA, id: diva2:1877220
External cooperation
CLAYSTUDIO AB
Presentation
2024-05-31, MIT, UNIVERSITETSTORGET 4, 901 87 Umeå, 14:21 (English)
Supervisors
Examiners
2024-06-262024-06-252024-06-26Bibliographically approved