Some Statistical Results on Deep Learning: Interpolation, Optimality and Sparsity

This talk discusses three aspects of deep learning from a statistical perspective: interpolation, optimality and sparsity. The first one attempts to interpret the double descent phenomenon by precisely characterizing a U-shaped curve within the “over-fitting regime,” while the second one focuses on the statistical optimality of neural network classification in a student-teacher framework. This talk is concluded by proposing sparsity induced training of neural network with statistical guarantee.

Date

November 13, 2019

Speakers

Guang Cheng, Institute for Advanced Study

Affiliation

Purdue University; Member, School of Mathematics

School of Mathematics

Theoretical Machine Learning