Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rigidity preserving image transformations and equivariance in perspective
Chalmers University of Technology, Department of Electrical Engineering, Gothenburg, Sweden.
Chalmers University of Technology, Department of Electrical Engineering, Gothenburg, Sweden.
Umeå University, Faculty of Science and Technology, Department of Mathematics and Mathematical Statistics.
Chalmers University of Technology, Department of Electrical Engineering, Gothenburg, Sweden.
2023 (English)In: Image analysis: 23rd Scandinavian conference, SCIA 2023, Sirkka, Finland, April 18–21, 2023, proceedings, part II / [ed] Rikke Gade; Michael Felsberg; Joni-Kristian Kämäräinen, Cham: Springer Nature, 2023, Vol. 2, p. 59-76Conference paper, Published paper (Refereed)
Abstract [en]

We characterize the class of image plane transformations which realize rigid camera motions and call these transformations ‘rigidity preserving’. It turns out that the only rigidity preserving image transformations are homographies corresponding to rotating the camera. In particular, 2D translations of pinhole images are not rigidity preserving. Hence, when using CNNs for 3D inference tasks, it can be beneficial to modify the inductive bias from equivariance w.r.t. translations to equivariance w.r.t. rotational homographies. We investigate how equivariance with respect to rotational homographies can be approximated in CNNs, and test our ideas on 6D object pose estimation. Experimentally, we improve on a competitive baseline.

Place, publisher, year, edition, pages
Cham: Springer Nature, 2023. Vol. 2, p. 59-76
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 13886
Keywords [en]
Pinhole camera model, deep learning, equivariance
National Category
Computer Vision and Robotics (Autonomous Systems)
Identifiers
URN: urn:nbn:se:umu:diva-207837DOI: 10.1007/978-3-031-31438-4_5Scopus ID: 2-s2.0-85161446684ISBN: 978-3-031-31438-4 (electronic)ISBN: 978-3-031-31437-7 (print)OAI: oai:DiVA.org:umu-207837DiVA, id: diva2:1754444
Conference
23rd Scandinavian Conference on Image Analysis, SCIA 2023, Sirkka, Finland, April 18–21, 2023
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Conference series: SCIA: Scandinavian Conference on Image Analysis

Available from: 2023-05-03 Created: 2023-05-03 Last updated: 2023-06-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Flinth, Axel

Search in DiVA

By author/editor
Flinth, Axel
By organisation
Department of Mathematics and Mathematical Statistics
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 158 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf