Altmetric
A multilevel method for self-concordant minimization
File | Description | Size | Format | |
---|---|---|---|---|
s10957-024-02509-z.pdf | Published online version | 1.76 MB | Adobe PDF | View/Open |
Title: | A multilevel method for self-concordant minimization |
Authors: | Tsipinakis, N Parpas, P |
Item Type: | Journal Article |
Abstract: | The analysis of second-order optimization methods based either on sub-sampling, randomization or sketching has two serious shortcomings compared to the conventional Newton method. The first shortcoming is that the analysis of the iterates has only been shown to be scale-invariant only under specific assumptions on the problem structure. The second shortfall is that the fast convergence rates of second-order methods have only been established by making assumptions regarding the input data. In this paper, we propose a randomized Newton method for self-concordant functions to address both shortfalls. We propose a Self-concordant Iterative-minimization-Galerkin-based Multilevel Algorithm (SIGMA) and establish its super-linear convergence rate using the theory of self-concordant functions. Our analysis is based on the connections between multigrid optimization methods, and the role of coarse-grained or reduced-order models in the computation of search directions. We take advantage of the insights from the analysis to significantly improve the performance of second-order methods in machine learning applications. We report encouraging initial experiments that suggest SIGMA outperforms other state-of-the-art sub-sampled/sketched Newton methods for both medium and large-scale problems. |
Issue Date: | 14-Sep-2024 |
Date of Acceptance: | 2-Aug-2024 |
URI: | http://hdl.handle.net/10044/1/115618 |
DOI: | 10.1007/s10957-024-02509-z |
ISSN: | 0022-3239 |
Publisher: | Springer |
Journal / Book Title: | Journal of Optimization Theory and Applications |
Copyright Statement: | © The Author(s) 2024 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
Publication Status: | Published online |
Online Publication Date: | 2024-09-14 |
Appears in Collections: | Computing Faculty of Engineering |
This item is licensed under a Creative Commons License