Umeå University's logo

umu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Convex formulations for training two-layer ReLU neural networks
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.
Department of Computing, Imperial College London, United Kingdom.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.ORCID-id: 0000-0001-7320-1506
2025 (engelsk)Inngår i: 13th International Conference on Learning Representations, ICLR 2025, Curran Associates, Inc., 2025, s. 30682-30704Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Solving non-convex, NP-hard optimization problems is crucial for training machine learning models, including neural networks. However, non-convexity often leads to black-box machine learning models with unclear inner workings. While convex formulations have been used for verifying neural network robustness, their application to training neural networks remains less explored. In response to this challenge, we reformulate the problem of training infinite-width two-layer ReLU networks as a convex completely positive program in a finite-dimensional (lifted) space. Despite the convexity, solving this problem remains NP-hard due to the complete positivity constraint. To overcome this challenge, we introduce a semidefinite relaxation that can be solved in polynomial time. We then experimentally evaluate the tightness of this relaxation, demonstrating its competitive performance in test accuracy across a range of classification tasks.

sted, utgiver, år, opplag, sider
Curran Associates, Inc., 2025. s. 30682-30704
Emneord [en]
copositive programming, semidefinite programming, neural networks
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-236599Scopus ID: 2-s2.0-105010230817ISBN: 979-8-3313-2085-0 (digital)OAI: oai:DiVA.org:umu-236599DiVA, id: diva2:1945107
Konferanse
International Conference on Learning Representations (ICLR), Singapore, April 24-28, 2025
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationSwedish Research CouncilTilgjengelig fra: 2025-03-17 Laget: 2025-03-17 Sist oppdatert: 2025-07-18bibliografisk kontrollert

Open Access i DiVA

Accepted paper(4944 kB)216 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 4944 kBChecksum SHA-512
e917859d9c98cf6d56cd2d6efeed9c2541ca3fdf4a689f46fabe75be9758c45207447f73b0dae74b50611a8176dc7e86750e54ba5d58cefcf9f2f2d204b9fe46
Type fulltextMimetype application/pdf

Andre lenker

ScopusAccepted paper

Person

Prakhya, KarthikYurtsever, Alp

Søk i DiVA

Av forfatter/redaktør
Prakhya, KarthikYurtsever, Alp
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 216 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 160 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf