Accelerated stochastic gradient ..first-order optimization - Zeyuan Allen-Zhu