On the Provable Suboptimality of Momentum SGD in Nonstationary Stochastic Optimization - Sharan Sahu, Cameron J. Hogan, Martin T. Wells | Arena