Adaptive regularization algorithms with inexact evaluations for nonconvex optimization

Stefania Bellavia, Gianmarco Gurioli, Benedetta Morini, Philippe Toint

Research output: Contribution to journal › Article

Abstract

A regularization algorithm using inexact function values and inexact derivatives is proposed and its evaluation complexity analyzed. The algorithm is applicable to unconstrained problems and to problems with inexpensive constraints (that is, constraints whose evaluation and enforcement have negligible cost) under the assumption that the derivative of highest degree is β-Hölder continuous. It features a very flexible adaptive mechanism for determining the inexactness allowed, at each iteration, when computing objective function values and derivatives. The complexity analysis covers arbitrary optimality order and arbitrary degree of available approximate derivatives. It extends the results of Cartis, Gould and Toint (2018) on evaluation complexity to the inexact case: if a qth-order minimizer is sought using approximations to the first p derivatives, it is proved that a suitable approximate minimizer within ε is computed by the proposed algorithm in at most O(ε^{-(p+β)/(p-q+β)}) iterations and at most O(|log ε| ε^{-(p+β)/(p-q+β)}) approximate evaluations. An algorithmic variant, although more rigid in practice, can be proved to find such an approximate minimizer in O(|log ε| + ε^{-(p+β)/(p-q+β)}) evaluations. While the proposed framework remains conceptual for high degrees and orders, it is shown to yield simple and computationally realistic inexact methods when specialized to the unconstrained and bound-constrained first- and second-order cases. The deterministic complexity results are finally extended to the stochastic context, yielding adaptive sample-size rules for the subsampling methods typical of machine learning.
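As a numerical illustration of the iteration bound stated in the abstract (a sketch, not taken from the paper): evaluating the exponent (p+β)/(p-q+β) in the Lipschitz-smooth case β = 1 recovers the familiar rates, e.g. O(ε^{-2}) for first-order stationarity with first-derivative models (p = q = 1) and O(ε^{-3/2}) with second-derivative (cubic-regularization-type) models (p = 2, q = 1). The function name below is hypothetical, introduced only for this illustration.

```python
# Iteration-complexity exponent from the bound O(eps^{-(p+beta)/(p-q+beta)}):
#   p    = degree of available (approximate) derivatives,
#   q    = order of optimality sought (q <= p),
#   beta = Hölder exponent of the pth derivative.
def complexity_exponent(p: int, q: int, beta: float) -> float:
    assert 1 <= q <= p and 0 < beta <= 1
    return (p + beta) / (p - q + beta)

# Lipschitz-smooth cases (beta = 1):
print(complexity_exponent(1, 1, 1.0))  # 2.0 -> O(eps^{-2}), gradient-type methods
print(complexity_exponent(2, 1, 1.0))  # 1.5 -> O(eps^{-3/2}), cubic regularization
print(complexity_exponent(2, 2, 1.0))  # 3.0 -> O(eps^{-3}), second-order points
```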
Original language: English
Pages (from-to): 2881-2915
Journal: SIAM Journal on Optimization
Volume: 29
Issue number: 4
Publication status: Published - 2 Jan 2020


Keywords

  • evaluation complexity
  • regularization methods
  • inexact functions and derivatives
  • subsampling methods
  • machine learning

Cite this

Bellavia, Stefania; Gurioli, Gianmarco; Morini, Benedetta; Toint, Philippe. Adaptive regularization algorithms with inexact evaluations for nonconvex optimization. In: SIAM Journal on Optimization. 2020; Vol. 29, No. 4. pp. 2881-2915.
@article{9b35c02c72fc426d85d352c68e65d452,
title = "Adaptive regularization algorithms with inexact evaluations for nonconvex optimization",
keywords = "evaluation complexity, regularization methods, inexact functions and derivatives, subsampling methods, machine learning",
author = "Stefania Bellavia and Gianmarco Gurioli and Benedetta Morini and Philippe Toint",
year = "2020",
month = "1",
day = "2",
language = "English",
volume = "29",
pages = "2881--2915",
journal = "SIAM Journal on Optimization",
issn = "1052-6234",
publisher = "Society for Industrial and Applied Mathematics",
number = "4",

}
