Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Continuous Control of an Underground Loader Using Deep Reinforcement Learning
Algoryx Simulation AB, Umeå, Sweden.
Algoryx Simulation AB, Umeå, Sweden.
Umeå University, Faculty of Science and Technology, Department of Physics. Algoryx Simulation AB, Umeå, Sweden. (Digital Physics)ORCID iD: 0000-0002-4748-0086
Umeå University, Faculty of Science and Technology, Department of Physics. Algoryx Simulation AB, Umeå, Sweden. (Digital Physics)ORCID iD: 0000-0002-0787-4988
Show others and affiliations
2021 (English)In: Machines, E-ISSN 2075-1702, Vol. 9, no 10, article id 216Article in journal (Refereed) Published
Abstract [en]

The reinforcement learning control of an underground loader was investigated in a simulated environment by using a multi-agent deep neural network approach. At the start of each loading cycle, one agent selects the dig position from a depth camera image of a pile of fragmented rock. A second agent is responsible for continuous control of the vehicle, with the goal of filling the bucket at the selected loading point while avoiding collisions, getting stuck, or losing ground traction. This relies on motion and force sensors, as well as on a camera and lidar. Using a soft actor–critic algorithm, the agents learn policies for efficient bucket filling over many subsequent loading cycles, with a clear ability to adapt to the changing environment. The best results—on average, 75% of the max capacity—were obtained when including a penalty for energy usage in the reward.

Place, publisher, year, edition, pages
MDPI, 2021. Vol. 9, no 10, article id 216
Keywords [en]
autonomous excavation, bucket filling, deep reinforcement learning, mining robotics, simulation, wheel loader
National Category
Robotics Computer Sciences Applied Mechanics
Research subject
Physics; Computer Science
Identifiers
URN: urn:nbn:se:umu:diva-187947DOI: 10.3390/machines9100216ISI: 000717124100001Scopus ID: 2-s2.0-85116361998OAI: oai:DiVA.org:umu-187947DiVA, id: diva2:1597655
Funder
Vinnova, 2019-04832Available from: 2021-09-27 Created: 2021-09-27 Last updated: 2023-09-05Bibliographically approved

Open Access in DiVA

fulltext(3119 kB)333 downloads
File information
File name FULLTEXT01.pdfFile size 3119 kBChecksum SHA-512
b1f4c74e56c33549e52263a321a6a4bde5b7af8b81490efcfb3f5c492456500ea16066876789809b013faf8df771bd9d970458e5adcccab0711e76c38ed8e6cf
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Bodin, KennethServin, Martin

Search in DiVA

By author/editor
Bodin, KennethServin, Martin
By organisation
Department of Physics
RoboticsComputer SciencesApplied Mechanics

Search outside of DiVA

GoogleGoogle Scholar
Total: 339 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 390 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf