Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimization dynamics of equivariant and augmented neural networks
Umeå University, Faculty of Science and Technology, Department of Mathematics and Mathematical Statistics.
Umeå University, Faculty of Science and Technology, Umeå Mathematics Education Research Centre (UMERC). Umeå University, Faculty of Science and Technology, Department of Mathematics and Mathematical Statistics.ORCID iD: 0000-0002-3165-6999
Umeå University, Faculty of Science and Technology, Department of Mathematics and Mathematical Statistics.
2025 (English)In: Transactions on Machine Learning Research, E-ISSN 2835-8856Article in journal (Refereed) Published
Abstract [en]

We investigate the optimization of neural networks on symmetric data, and compare the strategy of constraining the architecture to be equivariant to that of using data augmentation. Our analysis reveals that the relative geometry of the admissible and the equivariant layers, respectively, plays a key role. Under natural assumptions on the data, network, loss, and group of symmetries, we show that compatibility of the spaces of admissible layers and equivariant layers, in the sense that the corresponding orthogonal projections commute, implies that the sets of equivariant stationary points are identical for the two strategies. If the linear layers of the network also are given a unitary parametrization, the set of equivariant layers is even invariant under the gradient flow for augmented models. Our analysis however also reveals that even in the latter situation, stationary points may be unstable for augmented training although they are stable for the manifestly equivariant models.

Place, publisher, year, edition, pages
2025.
Keywords [en]
Equivariance, data augmentation, neural networks, dynamical systems
National Category
Computer Sciences Computational Mathematics
Research subject
Mathematics
Identifiers
URN: urn:nbn:se:umu:diva-234734Scopus ID: 2-s2.0-85219534158OAI: oai:DiVA.org:umu-234734DiVA, id: diva2:1932370
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Submission number: 3153

Published: 2025-01-16

Available from: 2025-01-29 Created: 2025-01-29 Last updated: 2025-03-19Bibliographically approved

Open Access in DiVA

fulltext(2891 kB)78 downloads
File information
File name FULLTEXT01.pdfFile size 2891 kBChecksum SHA-512
e55ce8615a0aa2619fcdbd0fb53dd3ec5522689d354da59772567de558cb394e0f0fa5f9cee2fe7828d54a1bad4e886de1ddcd077d29bf698728ac362e1a963b
Type fulltextMimetype application/pdf

Other links

ScopusPublisher's full text

Authority records

Nordenfors, OskarOhlsson, FredrikFlinth, Axel

Search in DiVA

By author/editor
Nordenfors, OskarOhlsson, FredrikFlinth, Axel
By organisation
Department of Mathematics and Mathematical StatisticsUmeå Mathematics Education Research Centre (UMERC)
In the same journal
Transactions on Machine Learning Research
Computer SciencesComputational Mathematics

Search outside of DiVA

GoogleGoogle Scholar
Total: 78 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 565 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf