Sim-to-real transfer of active suspension control using deep reinforcement learning
Umeå University, Faculty of Science and Technology, Department of Physics. Algoryx Simulation AB, Umeå, Sweden. (Digital Physics) ORCID iD: 0000-0001-6565-3123
Umeå University, Faculty of Science and Technology, Department of Physics. (Digital Physics) ORCID iD: 0000-0001-6266-4740
Umeå University, Faculty of Science and Technology, Department of Physics. ORCID iD: 0009-0000-9267-1140
Skogforsk (the Forestry Research Institute of Sweden), Uppsala, Sweden.
2024 (English). In: Robotics and Autonomous Systems, ISSN 0921-8890, E-ISSN 1872-793X, Vol. 179, article id 104731. Article in journal (Refereed). Published.
Abstract [en]

We explore sim-to-real transfer of deep reinforcement learning controllers for a heavy vehicle with active suspensions designed for traversing rough terrain. While related research primarily focuses on lightweight robots with electric motors and fast actuation, this study uses a forestry vehicle with a complex hydraulic driveline and slow actuation. We simulate the vehicle using multibody dynamics and apply system identification to find an appropriate set of simulation parameters. We then train policies in simulation using various techniques to mitigate the sim-to-real gap, including domain randomization, action delays, and a reward penalty to encourage smooth control. In reality, the policies trained with action delays and a penalty for erratic actions perform nearly at the same level as in simulation. In experiments on level ground, the motion trajectories closely overlap when turning to either side, as well as in a route tracking scenario. When faced with a ramp that requires active use of the suspensions, the simulated and real motions are in close alignment. This shows that the actuator model together with system identification yields a sufficiently accurate model of the actuators. We observe that policies trained without the additional action penalty exhibit fast switching or bang–bang control. These present smooth motions and high performance in simulation but transfer poorly to reality. We find that policies make marginal use of the local height map for perception, showing no indications of predictive planning. However, the strong transfer capabilities entail that further development concerning perception and performance can be largely confined to simulation.
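The three mitigation techniques named in the abstract (action delays, a penalty on erratic actions, and domain randomization) can be illustrated with a minimal sketch. This is not the authors' implementation: the class and function names, the squared-difference form of the penalty, the weight, and the delay range are all assumptions chosen for illustration.

```python
import random
from collections import deque


class DelayedActionBuffer:
    """Holds each action for `delay_steps` control steps before it takes effect.

    Hypothetical helper: a crude stand-in for slow actuation.
    """

    def __init__(self, delay_steps, neutral_action):
        self.delay_steps = delay_steps
        # Pre-fill with a neutral action so the first outputs are defined.
        self.queue = deque(list(neutral_action) for _ in range(delay_steps))

    def push(self, action):
        """Queue the new action and return the one that is applied now."""
        self.queue.append(list(action))
        return self.queue.popleft()


def smoothness_penalty(action, previous_action, weight=0.1):
    """Assumed penalty form: weight times the squared change between
    consecutive actions. Fast switching (bang-bang control) makes it large."""
    return weight * sum((a - b) ** 2 for a, b in zip(action, previous_action))


def sample_delay(rng, low=1, high=4):
    """Domain randomization over the delay: draw a per-episode delay,
    in control steps, uniformly from [low, high] (assumed range)."""
    return rng.randint(low, high)
```

In a training loop, one would draw a delay at the start of each episode with `sample_delay`, route every policy output through `DelayedActionBuffer.push` before passing it to the simulator, and subtract `smoothness_penalty(action, previous_action)` from the task reward. Under this assumed penalty, a policy that alternates between the actuator limits pays the maximum cost at every step, which is the behaviour the abstract reports as transferring poorly to the real vehicle.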

Place, publisher, year, edition, pages
Elsevier, 2024. Vol. 179, article id 104731
National Category
Electrical Engineering, Electronic Engineering, Information Engineering; Other Physics Topics
Research subject
Physics; computer and systems sciences
Identifiers
URN: urn:nbn:se:umu:diva-226893
DOI: 10.1016/j.robot.2024.104731
ISI: 001260733600001
Scopus ID: 2-s2.0-85196769514
OAI: oai:DiVA.org:umu-226893
DiVA, id: diva2:1875672
Projects
Mistra Digital Forest
Funder
Mistra - The Swedish Foundation for Strategic Environmental Research, Grant DIA 2017/14 #6
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2024-06-23. Created: 2024-06-23. Last updated: 2025-04-24. Bibliographically approved.

Authors
Wiberg, Viktor; Wallin, Erik; Fälldin, Arvid; Rossander, Morgan; Wadbro, Eddie; Servin, Martin