What 2-layer neural nets can we optimize?