Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Machine Learning Framework for Real-Time Gesture and Skeleton-Based Action Recognition in Unit: Exploring Human-Compute-Interaction in Game Design and Interaction
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

This master thesis presents a machine learning framework for real-time gesture and skeleton-based action recognition, integrated with the Unity game engine. The system aims to enhance human-computer interaction (HCI) in gaming and 3D related applications through natural movement recognition, by training a model on skeleton tracking data. The framework is trained to accurately categorize and identify gestures such as kicks and punches, enabling a more immersive gaming experience not existing in traditional controllers.

After studying the evolution of HCI and how machine learning has transformed and reshaped the interaction paradigm, the prototype system is built through data collection, augmenting, and preprocessing, followed by training and evaluating a Long Short-Term Memory (LSTM) neural network model for gesture classification. The model is integrated into Unity via Unity Sentis using Open Neural Network Exchange (ONNX) format, enabling efficient real-time action recognition in 3D space. Each component of the pipeline is available and adaptable for future custom- ization and needs, skeleton tracking and Unity integration is built using the ZED 2i camera and ZED SDK.

Experimental results demonstrate that the system presented can achieve over 90% accuracy in identifying predefined gestures. As a bridging solution tailored for Unity, this framework offers a practical solution to action recognition that could be found useful in future applications. This work contributes to advancing human-computer interaction and offers a foundation for further development in gesture-based Unity game design.

Place, publisher, year, edition, pages
2024. , p. 33
Series
UMNAD ; 1489
Keywords [en]
Machine Learning Framework, Real-time Gesture Recognition, Skeleton-based Action Recognition, Unity Game Engine Integration, Human-Computer Interaction (HCI), Natural Movement Recognition, Skeleton Tracking Data
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-227161OAI: oai:DiVA.org:umu-227161DiVA, id: diva2:1877220
External cooperation
CLAYSTUDIO AB
Presentation
2024-05-31, MIT, UNIVERSITETSTORGET 4, 901 87 Umeå, 14:21 (English)
Supervisors
Examiners
Available from: 2024-06-26 Created: 2024-06-25 Last updated: 2024-06-26Bibliographically approved

Open Access in DiVA

fulltext(3717 kB)459 downloads
File information
File name FULLTEXT01.pdfFile size 3717 kBChecksum SHA-512
acffeb540b99ddcbc17b80fcd80d0f538c2d8b434ab51ad4bf8a18a31b4955a7c78923e35da18264a4359ea1e9828e79b5071c0f1d7fd050d7d757fc7e8aa714
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Moeini, Arian
By organisation
Department of Computing Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 459 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 547 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf