The Landscape of Deep Learning Algorithms

l-layer NN: convergence rate $$O(r^{2l}\sqrt{d\log(l)}/\sqrt n)$$ with n samples.

Introduction

Going from the population risk to the empirical risk uses the idea of sub-Gaussian random variables.
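
To get a feel for how the rate quoted above scales, the following minimal sketch evaluates the expression inside the O(.) numerically. It is only an illustration: the function name `rate_bound` and the constant `c` are hypothetical, the constant hidden by the O-notation is unknown and is set to 1 here as an assumption, and r, l, d, n are treated as plain positive numbers.

```python
import math


def rate_bound(r: float, l: int, d: int, n: int, c: float = 1.0) -> float:
    """Evaluate c * r^(2l) * sqrt(d * log(l)) / sqrt(n).

    c stands in for the unknown constant hidden by the O-notation
    (assumed to be 1 by default, purely for illustration).
    """
    if l < 2:
        # log(1) = 0 would make the expression vanish, so require l >= 2.
        raise ValueError("l must be at least 2")
    if n <= 0 or d <= 0 or r <= 0:
        raise ValueError("r, d and n must be positive")
    return c * r ** (2 * l) * math.sqrt(d * math.log(l)) / math.sqrt(n)


if __name__ == "__main__":
    # The expression decays like 1/sqrt(n): 4x the samples halves it.
    for n in (100, 400, 1600):
        print(n, rate_bound(r=1.0, l=3, d=1000, n=n))

    # It grows like r^(2l) in depth whenever r > 1.
    for l in (2, 3, 4):
        print(l, rate_bound(r=1.5, l=l, d=1000, n=10_000))
```

Under these assumptions the two loops show the two dominant effects in the expression: the 1/sqrt(n) decay in the sample size, and the r^(2l) growth with depth when r exceeds 1.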