Understanding Deep Learning Requires Rethinking Generalization, https://arxiv.org/abs/1611.03530
This paper discusses the generalization ability of deep neural networks.
Conventional wisdom attributes small generalization error either to properties of the model family, or to the regularization techniques used during training.
The authors designed several randomization tests to examine the effective capacity of neural networks. They trained on modified versions of the labels and input images: true labels, partially corrupted labels, fully random labels, shuffled pixels (one fixed permutation applied to every image), random pixels (an independent permutation per image), and Gaussian noise in place of the images (a sketch of these corruptions follows the list below). The experimental results were:
a) the learning rate schedule did not need to change; b) once the fitting starts, it converges quickly; c) the network converges to (over)fit the training set perfectly.
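A minimal sketch of these corruptions, assuming flattened images arrive as a numpy array `X` of shape `(n, d)` with integer labels `y`; the function names and signatures here are illustrative, not the authors' code:

```python
import numpy as np

rng = np.random.default_rng(0)

def corrupt_labels(y, num_classes, fraction):
    """Replace a given fraction of labels with uniformly random ones.
    fraction=0.0 keeps the true labels; fraction=1.0 gives fully random labels."""
    y = y.copy()
    mask = rng.random(len(y)) < fraction
    y[mask] = rng.integers(0, num_classes, size=mask.sum())
    return y

def shuffle_pixels(X):
    """Apply one fixed pixel permutation to every image."""
    perm = rng.permutation(X.shape[1])
    return X[:, perm]

def random_pixels(X):
    """Apply an independent pixel permutation to each image."""
    return np.stack([img[rng.permutation(X.shape[1])] for img in X])

def gaussian_pixels(X):
    """Replace the images entirely with Gaussian noise matching the data's mean/std."""
    return rng.normal(X.mean(), X.std(), size=X.shape)
```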
On corrupted data sets, deep neural networks take longer to converge.
The authors enabled and disabled regularization measures in deep neural networks to measure the role of regularization. Without regularization, the generalization error of deep neural networks is larger than with it. However, deep neural networks without regularization still achieve low generalization error.
The experiments with both explicit and implicit regularizers consistently suggest that regularizers, when properly tuned, can help improve generalization performance. However, it is unlikely that regularization is the fundamental reason for generalization, as the networks continue to perform well after all regularizers are removed.
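A hedged sketch of the enable/disable comparison in PyTorch; the architecture, hyperparameters, and `use_*` flags below are illustrative assumptions, not the paper's exact setup (the paper uses Inception, AlexNet, and MLP variants):

```python
import torch
import torch.nn as nn

def make_model(use_dropout: bool) -> nn.Module:
    # A toy MLP standing in for the paper's architectures.
    layers = [nn.Flatten(), nn.Linear(3 * 32 * 32, 512), nn.ReLU()]
    if use_dropout:
        layers.append(nn.Dropout(p=0.5))   # explicit regularizer: dropout
    layers.append(nn.Linear(512, 10))
    return nn.Sequential(*layers)

def make_optimizer(model: nn.Module, use_weight_decay: bool):
    # Explicit regularizer: weight decay (L2 penalty).
    wd = 5e-4 if use_weight_decay else 0.0
    return torch.optim.SGD(model.parameters(), lr=0.01,
                           momentum=0.9, weight_decay=wd)

# Train one configuration per on/off combination and compare test accuracy.
for use_dropout in (False, True):
    for use_weight_decay in (False, True):
        model = make_model(use_dropout)
        opt = make_optimizer(model, use_weight_decay)
        # ... standard training loop on the dataset, then evaluate ...
```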
Finite-sample Expressivity
The authors then examined the expressive power of neural networks on a finite sample of size $n$.
There exists a two-layer neural network with ReLU activations and $2n + d$ weights that can represent any function on a sample of size $n$ in $d$ dimensions.
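The construction behind this claim can be sketched directly, assuming numpy and variable names of my choosing: project the sample onto a random direction $a$ ($d$ weights), place the $n$ ReLU biases $b$ so they interleave the sorted projections, and solve the resulting lower-triangular system for the $n$ output weights $w$, giving $2n + d$ weights in total:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 20, 5
X = rng.normal(size=(n, d))          # n samples in d dimensions
y = rng.normal(size=n)               # arbitrary target values

a = rng.normal(size=d)               # random projection: the z_i are distinct a.s.
z = X @ a
order = np.argsort(z)
X, y, z = X[order], y[order], z[order]

# Choose b_1 < z_1 < b_2 < z_2 < ... so unit j is active exactly on samples i >= j.
b = np.empty(n)
b[0] = z[0] - 1.0
b[1:] = (z[:-1] + z[1:]) / 2.0

A = np.maximum(z[:, None] - b[None, :], 0.0)   # A[i, j] = ReLU(z_i - b_j)
w = np.linalg.solve(A, y)                      # lower triangular, nonzero diagonal

pred = np.maximum(X @ a[:, None] - b, 0.0) @ w  # the two-layer ReLU network
assert np.allclose(pred, y)                     # fits the sample exactly
```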
Regularization of Linear Models
The authors appealed to linear models to argue that the source of generalization is hard to pin down even in that simple setting: with more parameters than samples, infinitely many solutions fit the training data exactly, yet SGD initialized at zero stays in the span of the data points and converges to the minimum $\ell_2$-norm solution, a form of implicit regularization.
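A small sketch of that argument, with dimensions, step size, and iteration count as my own assumptions (full-batch gradient descent is used here; its updates, like SGD's, are linear combinations of the data rows, so the same span argument applies):

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 20, 100                       # more parameters than samples
X = rng.normal(size=(n, d))
y = rng.normal(size=n)

w = np.zeros(d)                      # zero init keeps w in the row span of X
lr = 1.0 / np.linalg.norm(X, ord=2) ** 2   # safe step: 1 / sigma_max^2
for _ in range(5000):
    w -= lr * X.T @ (X @ w - y)      # gradient of 0.5 * ||Xw - y||^2

w_min_norm = np.linalg.pinv(X) @ y   # minimum-norm interpolating solution
print(np.linalg.norm(w - w_min_norm))  # ~0: GD found the min-norm solution
```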
New Terms
- Non-parametric randomization test
- Early stopping
- Weight decay
- Rademacher complexity (see the definition after this list)
- Uniform stability
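Of these, Rademacher complexity is worth writing down. For a hypothesis class $\mathcal{H}$ and a sample $x_1, \dots, x_n$, the empirical Rademacher complexity is

$$
\hat{\mathfrak{R}}_n(\mathcal{H}) = \mathbb{E}_{\sigma}\left[ \sup_{h \in \mathcal{H}} \frac{1}{n} \sum_{i=1}^{n} \sigma_i h(x_i) \right],
$$

where the $\sigma_i$ are i.i.d. uniform $\pm 1$ random variables. Since the randomization tests show these networks can fit random labels, $\hat{\mathfrak{R}}_n(\mathcal{H}) \approx 1$ for the relevant classes, so standard Rademacher-based generalization bounds are vacuous here.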