69937 | ИПУ РАН

Автор(ы):

Данилова М. Ю. (ИПУ РАН, Лаборатория 07) НЕАКТУАЛЬНАЯ ЗАПИСЬ

Малиновский Г. С. (King Abdullah University of Science and Technology)

Автор(ов):

Параметры публикации

Тип публикации:

Статья в журнале/сборнике

Название:

Averaged heavy-ball method

ISBN/ISSN:

2076-7633

DOI:

10.20537/2076-7633-2022-14-2-277-308

Наименование источника:

Computer Research and Modeling

Обозначение и номер тома:

Volume 14, Issue 2

Город:

Ижевск

Издательство:

Институт компьютерных исследований, Университет Иннополис

Год издания:

2022

Страницы:

277-308

Аннотация

First-order optimization methods are work horses in a wide range of modern applications in economics, physics, biology, machine learning, control, and other fields. Among other first-order methods accelerated and momentum ones obtain special attention because of their practical efficiency. The heavy-ball method (HB) is one of the first momentum methods. The method was proposed in 1964 and the first analysis was conducted for quadratic strongly convex functions. Since then a number of variations of HB have been proposed and analyzed. In particular, HB is known for its simplicity in implementation and its performance on nonconvex problems. However, as other momentum methods, it has nonmonotone behavior, and for optimal parameters, the method suffers from the so-called peak effect. To address this issue, in this paper, we consider an averaged version of the heavy-ball method (AHB). We show that for quadratic problems AHB has a smaller maximal deviation from the solution than HB. Moreover, for general convex and strongly convex functions, we prove non-accelerated rates of global convergence of AHB, its weighted version WAHB, and for AHB with restarts R-AHB. To the best of our knowledge, such guarantees for HB with averaging were not explicitly proven for strongly convex problems in the existing works. Finally, we conduct several numerical experiments on minimizing quadratic and nonquadratic functions to demonstrate the advantages of using averaging for HB. Moreover, we also tested one more modification of AHB called the tail-averaged heavy-ball method (TAHB). In the experiments, we observed that HB with a properly adjusted averaging scheme converges faster than HB without averaging and has smaller oscillations.

Библиографическая ссылка:

Данилова М.Ю., Малиновский Г.С. Averaged heavy-ball method // Computer Research and Modeling. 2022. Volume 14, Issue 2. С. 277-308.