Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
A unified model for high-resolution ODEs: new insights on accelerated methods
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.ORCID-id: 0000-0001-8251-2605
School of Mathematics, University of Edinburgh, Scotland.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.ORCID-id: 0000-0001-7320-1506
2025 (Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Abstract [en]

Recent work on high-resolution ordinary differential equations (HR-ODEs) captures fine nuances among different momentum-based optimization methods, leading to accurate theoretical insights. However, these HR-ODEs often appear disconnected, each targeting a specific algorithm and derived with different assumptions and techniques. We present a unifying framework by showing that these diverse HR-ODEs emerge as special cases of a general HR-ODE derived using the Forced Euler-Lagrange equation. Discretizing this model recovers a wide range of optimization algorithms through different parameter choices. Using integral quadratic constraints, we also introduce a general Lyapunov function to analyze the convergence of the proposed HR-ODE and its discretizations, achieving significant improvements across various cases, including new guarantees for the triple momentum method s HR-ODE and the quasi-hyperbolic momentum method, as well as faster gradient norm minimization rates for Nesterov s accelerated gradient algorithm, among other advances.

Ort, förlag, år, upplaga, sidor
2025.
Nationell ämneskategori
Beräkningsmatematik
Forskningsämne
matematik
Identifikatorer
URN: urn:nbn:se:umu:diva-248743DOI: 10.48550/arXiv.2503.15136OAI: oai:DiVA.org:umu-248743DiVA, id: diva2:2031147
Tillgänglig från: 2026-01-22 Skapad: 2026-01-22 Senast uppdaterad: 2026-01-22Bibliografiskt granskad
Ingår i avhandling
1. From accelerated first-order methods to structured nonconvex optimization: analysis and perspectives
Öppna denna publikation i ny flik eller fönster >>From accelerated first-order methods to structured nonconvex optimization: analysis and perspectives
2026 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Alternativ titel[sv]
Från accelererade första ordningens metoder till strukturerad icke-konvex optimering : analys och perspektiv
Abstract [en]

Modern technologies like machine learning and data-driven decision systems, depend on solving large and often highly complex optimization problems. These problems rarely come with simple shapes or smooth surfaces; instead, they can twist and bend in ways that make finding good solutions surprisingly difficult. This thesis explores two ideas that help us navigate such complexity more effectively. 

The first idea focuses on acceleration and investigates how optimization algorithms can reach good solutions faster while using only basic information such as function values and gradients. By studying these algorithms through a continuous-time analysis, we show that many of the fastest methods behave like carefully designed dynamical systems. This perspective not only clarifies why acceleration happens, but also allows us to design fast algorithms and understand how they behave in the presence of noisy information. 

The second idea focuses on a mathematical structural assumption called difference-of- convex (DC) programming, which captures a remarkably wide range of nonconvex problems. By leveraging this structure, we develop practical algorithms that avoid costly operations like projections or high dimensional gradient evaluations, making them efficient for large-scale applications. Through the connection between DC programming and the classical Expectation–Maximization (EM) algorithm, we develop more scalable EM variants and provide the first general performance guarantees for them. 

Ort, förlag, år, upplaga, sidor
Umeå: Umeå University, 2026. s. 43
Serie
Research report in mathematical statistics, ISSN 1653-0829 ; 80/26
Nyckelord
accelerated methods, convex optimization, nonconvex optimization, DC pro- gramming, EM algorithm, randomized DC algorithm
Nationell ämneskategori
Beräkningsmatematik Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:umu:diva-248866 (URN)978-91-8070-910-1 (ISBN)978-91-8070-911-8 (ISBN)
Disputation
2026-02-24, BIO.E.203 - Aula Biologica + Zoom, 08:30 (Engelska)
Opponent
Handledare
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Anmärkning

Link to participate via Zoom: https://umu.zoom.us/j/63834231447

Tillgänglig från: 2026-02-03 Skapad: 2026-01-22 Senast uppdaterad: 2026-01-22Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltext

Person

Maskan, HoomaanEftekhari, ArminYurtsever, Alp

Sök vidare i DiVA

Av författaren/redaktören
Maskan, HoomaanEftekhari, ArminYurtsever, Alp
Av organisationen
Institutionen för matematik och matematisk statistik
Beräkningsmatematik

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 23 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf