Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent

Published by David Barber in 2016 in Mathematical Statistics. Language: English.

Abstract in English

We present a unifying framework for adapting the update direction in gradient-based iterative optimization methods. As natural special cases we re-derive classical momentum and Nesterov's accelerated gradient method, lending a new intuitive interpretation to the latter algorithm. We show that a new algorithm, which we term Regularised Gradient Descent, can converge more quickly than either Nesterov's algorithm or the classical momentum algorithm.
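
For readers unfamiliar with the two classical methods named in the abstract, the following is a minimal sketch of their standard textbook update rules, not the paper's derivation. The abstract does not specify the Regularised Gradient Descent update, so it is not reproduced here. The test function (a diagonal quadratic), the helper names `grad`, `loss` and `run`, and the values of `lr`, `mu` and `steps` are all illustrative assumptions.

```python
# Sketch only: textbook classical momentum and Nesterov's accelerated gradient.
# The quadratic objective and all hyperparameters are assumed for illustration.

def grad(theta, curvatures):
    # Gradient of f(theta) = 0.5 * sum_i c_i * theta_i**2
    return [c * t for c, t in zip(curvatures, theta)]

def loss(theta, curvatures):
    return 0.5 * sum(c * t * t for c, t in zip(curvatures, theta))

def run(method, theta0, curvatures, lr=0.05, mu=0.9, steps=300):
    theta = list(theta0)
    v = [0.0] * len(theta)  # the update (velocity) vector
    for _ in range(steps):
        if method == "momentum":
            # Classical momentum: gradient at the current iterate.
            point = theta
        elif method == "nesterov":
            # Nesterov: gradient at the look-ahead point theta + mu * v.
            point = [t + mu * vi for t, vi in zip(theta, v)]
        else:
            raise ValueError(f"unknown method: {method}")
        g = grad(point, curvatures)
        v = [mu * vi - lr * gi for vi, gi in zip(v, g)]
        theta = [t + vi for t, vi in zip(theta, v)]
    return theta

curvatures = [1.0, 10.0]  # assumed, mildly ill-conditioned quadratic
theta0 = [1.0, 1.0]
final_momentum = run("momentum", theta0, curvatures)
final_nesterov = run("nesterov", theta0, curvatures)
print("momentum loss:", loss(final_momentum, curvatures))
print("nesterov loss:", loss(final_nesterov, curvatures))
```

The two methods differ only in where the gradient is evaluated: at the current iterate for classical momentum, and at the look-ahead point for Nesterov's method. Both share the same velocity and parameter updates.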
