Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (7 of 7) Show all publications
Maskan, H. (2026). From accelerated first-order methods to structured nonconvex optimization: analysis and perspectives. (Doctoral dissertation). Umeå: Umeå University
Open this publication in new window or tab >>From accelerated first-order methods to structured nonconvex optimization: analysis and perspectives
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Från accelererade första ordningens metoder till strukturerad icke-konvex optimering : analys och perspektiv
Abstract [en]

Modern technologies like machine learning and data-driven decision systems, depend on solving large and often highly complex optimization problems. These problems rarely come with simple shapes or smooth surfaces; instead, they can twist and bend in ways that make finding good solutions surprisingly difficult. This thesis explores two ideas that help us navigate such complexity more effectively. 

The first idea focuses on acceleration and investigates how optimization algorithms can reach good solutions faster while using only basic information such as function values and gradients. By studying these algorithms through a continuous-time analysis, we show that many of the fastest methods behave like carefully designed dynamical systems. This perspective not only clarifies why acceleration happens, but also allows us to design fast algorithms and understand how they behave in the presence of noisy information. 

The second idea focuses on a mathematical structural assumption called difference-of- convex (DC) programming, which captures a remarkably wide range of nonconvex problems. By leveraging this structure, we develop practical algorithms that avoid costly operations like projections or high dimensional gradient evaluations, making them efficient for large-scale applications. Through the connection between DC programming and the classical Expectation–Maximization (EM) algorithm, we develop more scalable EM variants and provide the first general performance guarantees for them. 

Place, publisher, year, edition, pages
Umeå: Umeå University, 2026. p. 43
Series
Research report in mathematical statistics, ISSN 1653-0829 ; 80/26
Keywords
accelerated methods, convex optimization, nonconvex optimization, DC pro- gramming, EM algorithm, randomized DC algorithm
National Category
Computational Mathematics Probability Theory and Statistics
Identifiers
urn:nbn:se:umu:diva-248866 (URN)978-91-8070-910-1 (ISBN)978-91-8070-911-8 (ISBN)
Public defence
2026-02-24, BIO.E.203 - Aula Biologica + Zoom, 08:30 (English)
Opponent
Supervisors
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Link to participate via Zoom: https://umu.zoom.us/j/63834231447

Available from: 2026-02-03 Created: 2026-01-22 Last updated: 2026-01-22Bibliographically approved
Maskan, H., Zygalakis, K. C., Eftekhari, A. & Yurtsever, A. (2025). A unified model for high-resolution ODEs: new insights on accelerated methods.
Open this publication in new window or tab >>A unified model for high-resolution ODEs: new insights on accelerated methods
2025 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Recent work on high-resolution ordinary differential equations (HR-ODEs) captures fine nuances among different momentum-based optimization methods, leading to accurate theoretical insights. However, these HR-ODEs often appear disconnected, each targeting a specific algorithm and derived with different assumptions and techniques. We present a unifying framework by showing that these diverse HR-ODEs emerge as special cases of a general HR-ODE derived using the Forced Euler-Lagrange equation. Discretizing this model recovers a wide range of optimization algorithms through different parameter choices. Using integral quadratic constraints, we also introduce a general Lyapunov function to analyze the convergence of the proposed HR-ODE and its discretizations, achieving significant improvements across various cases, including new guarantees for the triple momentum method s HR-ODE and the quasi-hyperbolic momentum method, as well as faster gradient norm minimization rates for Nesterov s accelerated gradient algorithm, among other advances.

National Category
Computational Mathematics
Research subject
Mathematics
Identifiers
urn:nbn:se:umu:diva-248743 (URN)10.48550/arXiv.2503.15136 (DOI)
Available from: 2026-01-22 Created: 2026-01-22 Last updated: 2026-01-22Bibliographically approved
Maskan, H., Halvachi, P., Sra, S. & Yurtsever, A. (2025). Randomized block coordinate DC algorithm. EURO Journal on Computational Optimization, 13, Article ID 100123.
Open this publication in new window or tab >>Randomized block coordinate DC algorithm
2025 (English)In: EURO Journal on Computational Optimization, ISSN 2192-4406, Vol. 13, article id 100123Article in journal (Refereed) Published
Abstract [en]

We introduce an extension of the Difference of Convex Algorithm (DCA) in the form of a randomized block coordinate approach for problems with separable structure. For n coordinate-blocks and k iterations, our main result proves a non-asymptotic convergence rate of O(n/k) in expectation, with respect to a stationarity measure based on a Forward-Backward envelope. Furthermore, leveraging the connection between DCA and Expectation Maximization (EM), we propose a randomized block coordinate EM algorithm.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Block coordinate methods, Difference of convex, Nonconvex optimization
National Category
Computational Mathematics
Identifiers
urn:nbn:se:umu:diva-247450 (URN)10.1016/j.ejco.2025.100123 (DOI)2-s2.0-105023553755 (Scopus ID)
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2025-12-12 Created: 2025-12-12 Last updated: 2026-01-22Bibliographically approved
Maskan, H., Hou, Y., Sra, S. & Yurtsever, A. (2025). Revisiting Frank-Wolfe for structured nonconvex optimization. In: NeuriPS 2025: Downloads. Paper presented at NeuriPS 2025, The Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, USA, December 1, 2025.
Open this publication in new window or tab >>Revisiting Frank-Wolfe for structured nonconvex optimization
2025 (English)In: NeuriPS 2025: Downloads, 2025Conference paper, Poster (with or without abstract) (Refereed)
Abstract [en]

We introduce a new projection-free (Frank-Wolfe) method for optimizing structured nonconvex functions that are expressed as a difference of two convex functions. This problem class subsumes smooth nonconvex minimization, positioning our method as a promising alternative to the classical Frank-Wolfe algorithm. DC decompositions are not unique; by carefully selecting a decomposition, we can better exploit the problem structure, improve computational efficiency, and adapt to the underlying problem geometry to find better local solutions. We prove that the proposed method achieves a first-order stationary point in  iterations, matching the complexity of the standard Frank-Wolfe algorithm for smooth non-convex minimization in general. Specific decompositions can, for instance, yield a gradient-efficient variant that requires only  calls to the gradient oracle by reusing computed gradients over multiple iterations. Finally, we present numerical experiments demonstrating the effectiveness of the proposed method compared to other projection-free algorithms.

National Category
Computational Mathematics
Research subject
Mathematics
Identifiers
urn:nbn:se:umu:diva-248742 (URN)
Conference
NeuriPS 2025, The Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, USA, December 1, 2025
Available from: 2026-01-20 Created: 2026-01-20 Last updated: 2026-01-22Bibliographically approved
Maskan, H., Zygalakis, K. C. & Yurtsever, A. (2023). A variational perspective on high-resolution ODEs. In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023): . Paper presented at 37th Conference on Neural Information Processing Systems, NeurIPS 2023, New Orleans, USA, December 10-16, 2023. Neural information processing systems foundation
Open this publication in new window or tab >>A variational perspective on high-resolution ODEs
2023 (English)In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), Neural information processing systems foundation , 2023Conference paper, Published paper (Refereed)
Abstract [en]

We consider unconstrained minimization of smooth convex functions. We propose a novel variational perspective using forced Euler-Lagrange equation that allows for studying high-resolution ODEs. Through this, we obtain a faster convergence rate for gradient norm minimization using Nesterov's accelerated gradient method. Additionally, we show that Nesterov's method can be interpreted as a rate-matching discretization of an appropriately chosen high-resolution ODE. Finally, using the results from the new variational perspective, we propose a stochastic method for noisy gradients. Several numerical experiments compare and illustrate our stochastic algorithm with state of the art methods.

Place, publisher, year, edition, pages
Neural information processing systems foundation, 2023
Series
Advances in neural information processing systems, ISSN 1049-5258
National Category
Computational Mathematics
Identifiers
urn:nbn:se:umu:diva-223951 (URN)2-s2.0-85191188553 (Scopus ID)
Conference
37th Conference on Neural Information Processing Systems, NeurIPS 2023, New Orleans, USA, December 10-16, 2023
Available from: 2024-05-03 Created: 2024-05-03 Last updated: 2026-01-22Bibliographically approved
Maskan, H., Daei, S. & Kahaei, M. H. (2023). Demixing sines and spikes using multiple measurement vectors. Signal Processing, 203, Article ID 108786.
Open this publication in new window or tab >>Demixing sines and spikes using multiple measurement vectors
2023 (English)In: Signal Processing, ISSN 0165-1684, E-ISSN 1872-7557, Vol. 203, article id 108786Article in journal (Refereed) Published
Abstract [en]

We address the line spectral estimation problem with multiple measurement corrupted vectors. Such scenarios appear in many practical applications such as radar, optics, and seismic imaging in which the measurements can be modeled as the sum of a spectrally sparse and a block-sparse signal known as outlier. Our aim is to demix the two components and for this purpose, we design a convex problem whose objective function promotes both of the structures. Using the Positive Trigonometric Polynomials (PTP) theory, we reformulate the dual problem as a Semidefinite Program (SDP). Our theoretical results state that for a fixed number of measurements N and constant number of outliers, up to O(N) spectral lines can be recovered using our SDP problem as long as a minimum frequency separation condition is satisfied. Our simulation results also show that increasing the number of samples per measurement vectors reduces the minimum required frequency separation for successful recovery.

Place, publisher, year, edition, pages
Elsevier, 2023
Keywords
Atomic norm, Convex optimization, Demixing, Multiple measurement vector, Spectral super resolution
National Category
Signal Processing Mathematics
Identifiers
urn:nbn:se:umu:diva-200382 (URN)10.1016/j.sigpro.2022.108786 (DOI)000872558500013 ()2-s2.0-85139348869 (Scopus ID)
Available from: 2022-11-08 Created: 2022-11-08 Last updated: 2024-07-02Bibliographically approved
Jirhandeh, M. J., Maskan, H. & Kahaei, M. H. (2022). Super-resolution DOA estimation for wideband signals using non-uniform linear arrays with no focusing matrix. IEEE Wireless Communications Letters, 11(3), 641-644
Open this publication in new window or tab >>Super-resolution DOA estimation for wideband signals using non-uniform linear arrays with no focusing matrix
2022 (English)In: IEEE Wireless Communications Letters, ISSN 2162-2337, E-ISSN 2162-2345, Vol. 11, no 3, p. 641-644Article in journal (Other academic) Published
Abstract [en]

We focus on developing an effective Direction Of Arrival (DOA) estimation method for wideband sources based on the super-resolution concept. In this letter, we represent the data model of a wideband DOA estimation problem in a new form. Then, using this model, we propose an atomic norm minimization problem that leads to an accurate wideband DOA estimation method with no need for any focusing matrices. Moreover, this method requires no initial estimate and can be implemented by non-uniform linear arrays. Numerical simulations show that in comparison to some well-known techniques, the proposed method generates outstanding accuracy and higher robustness to noise. The effectiveness of the method is also verified in presence of close adjacent sources.

Place, publisher, year, edition, pages
IEEE, 2022
National Category
Telecommunications Signal Processing
Identifiers
urn:nbn:se:umu:diva-226116 (URN)10.1109/lwc.2021.3139568 (DOI)000766612300045 ()2-s2.0-85122565622 (Scopus ID)
Available from: 2024-06-13 Created: 2024-06-13 Last updated: 2024-07-02Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-8251-2605

Search in DiVA

Show all publications

Profile pages

Personal Webpage