Implicit regularization of dropout

Zhang, Zhongwang; Xu, Zhi-Qin John

arXiv:2207.05952 (cs)

[Submitted on 13 Jul 2022 (v1), last revised 10 Apr 2023 (this version, v2)]

Title: Implicit regularization of dropout

Authors: Zhongwang Zhang, Zhi-Qin John Xu

Abstract: It is important to understand how dropout, a popular regularization method, helps neural network training reach solutions that generalize well. In this work, we present a theoretical derivation of an implicit regularization of dropout, which is validated by a series of experiments. Additionally, we numerically study two implications of the implicit regularization, which intuitively rationalize why dropout helps generalization. Firstly, we find that the input weights of hidden neurons tend to condense on isolated orientations when trained with dropout. Condensation is a feature of the non-linear learning process that makes the network less complex. Secondly, we experimentally find that training with dropout leads the neural network to a flatter minimum than standard gradient descent training does, and that the implicit regularization is the key to finding flat solutions. Although our theory mainly focuses on dropout used in the last hidden layer, our experiments apply to general dropout in training neural networks. This work points out a distinct characteristic of dropout compared with stochastic gradient descent and serves as an important basis for fully understanding dropout.

Comments:	arXiv admin note: text overlap with arXiv:2111.01022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2207.05952 [cs.LG]
	(or arXiv:2207.05952v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2207.05952

Submission history

From: Zhongwang Zhang
[v1] Wed, 13 Jul 2022 04:09:14 UTC (1,674 KB)
[v2] Mon, 10 Apr 2023 08:26:42 UTC (2,865 KB)
