OFFO minimization algorithms for second-order optimality and their complexity

Serge Gratton, Philippe Toint

Research output: Contribution to journal › Article › peer-review


Abstract

An Adagrad-inspired class of algorithms for smooth unconstrained optimization
is presented in which the objective function is never evaluated and yet the
gradient norms decrease at least as fast as O(1/\sqrt{k+1}), while
second-order optimality measures converge to zero at least as fast as
O(1/(k+1)^{1/3}). This latter rate of convergence is shown to be
essentially sharp and is identical to that known for more standard algorithms
(such as trust-region or adaptive-regularization methods) that use both function
and derivative evaluations. A related "divergent stepsize" method is also
described, whose essentially sharp rate of convergence is slightly inferior. It
is finally discussed how to obtain weaker second-order optimality guarantees
at a (much) reduced computational cost.
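As an informal illustration of the objective-function-free (OFFO) idea described in the abstract, the following minimal Python sketch performs plain Adagrad-type updates in which the objective value is never computed, only its gradient. This is not the paper's algorithm (which additionally drives second-order optimality measures to zero); the stepsize `eta`, the toy quadratic, and all names are assumptions chosen here for illustration.

```python
import numpy as np

def adagrad_offo_step(x, grad, accum, eta=1.0, eps=1e-8):
    """One Adagrad-style step using only gradient information:
    the objective function itself is never evaluated, and the stepsize
    is scaled by the accumulated squared gradients (illustrative constants)."""
    accum = accum + grad**2                       # running sum of squared gradients
    x_new = x - eta * grad / (np.sqrt(accum) + eps)
    return x_new, accum

# Usage on a toy quadratic: the gradient of f(x) = ||x||^2 is known in
# closed form, so f is never called anywhere in the loop.
x = np.array([3.0, -2.0])
accum = np.zeros_like(x)
for k in range(1000):
    g = 2.0 * x                                   # gradient of ||x||^2
    x, accum = adagrad_offo_step(x, g, accum)
print(np.linalg.norm(2.0 * x))                    # gradient norm shrinks without any f-evaluation
```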
Original language: English
Pages (from-to): 573-607
Number of pages: 35
Journal: Computational Optimization and Applications
Volume: 84
Issue number: 2
DOIs
Publication status: Published - 15 Feb 2023

Keywords

  • Adagrad
  • Evaluation complexity
  • Global rate of convergence
  • Objective-function-free optimization (OFFO)
  • Second-order optimality
