Umeå University's logo

umu.sePublications
Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Predicting Gripability Heatmaps using Conditional GANs
Umeå University, Faculty of Science and Technology, Department of Physics. (Digital Physics)ORCID iD: 0000-0002-6811-2776
Umeå University, Faculty of Science and Technology, Department of Physics. (Digital Physics)ORCID iD: 0000-0002-0787-4988
2021 (English)Report (Other academic)
Abstract [en]

The feasibility of using conditional GANs (Generative Adversarial Networks) to predict gripability in log piles is investigated. This is done by posing gripability heatmap prediction from RGB-D data as an image-to-image translation problem. Conditional GANs have previously achieved impressive results on several image-to-image translation tasks predicting physical properties and adding details not present in the input images. Here, piles of logs modelled as sticks or rods are generated in simulation, and groundtruth gripability maps are created using a simple algorithm streamlining the datacollection process. A modified SSIM (Structural Similarity Index) is used to evaluate the quality of the gripability heatmap predictions. The results indicate promising model performance on several different datasets and heatmap designs, including using base plane textures from a real forest production site to add realistic noise in the RGB data. Including a depth channel in the input data is shown to increase performance compared to using pure RGB data. The implementation is based on the general Pix2Pix network developed by Isola et al. in 2017. However, there is potential to increase performance and model generalization, and the adoption of more advanced loss functions and network architectures are suggested. Next steps include using terrains reconstructed from highdensity laser scans in physics-based simulation for data generation. A more in-depth discussion regarding the level of sophistication required in the gripability heatmaps should also be carried out, along with discussions regarding other specifications that will be required for future deployment. This will enable derivation of a tailored gripability metric for ground-truth heatmap generation, and method evaluation on less ideal data.

Place, publisher, year, edition, pages
Umeå: Umeå universitet , 2021. , p. 29
Series
Technical Report
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Computerized Image Analysis
Identifiers
URN: urn:nbn:se:umu:diva-190758OAI: oai:DiVA.org:umu-190758DiVA, id: diva2:1622970
Available from: 2021-12-27 Created: 2021-12-27 Last updated: 2022-02-18Bibliographically approved

Open Access in DiVA

fulltext(4143 kB)509 downloads
File information
File name FULLTEXT01.pdfFile size 4143 kBChecksum SHA-512
0aae3fad79f896530356b9412b983ec970f0ce393f08b773bedc5dfceacb84011c4ed02fca47e84152f6b98dfa84ace733d46e892a1fe91a4be08dc0b1491b52
Type fulltextMimetype application/pdf

Authority records

Andersson, JenniferServin, Martin

Search in DiVA

By author/editor
Andersson, JenniferServin, Martin
By organisation
Department of Physics
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 509 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 390 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf