• Open Access

Reexamining the principle of mean-variance preservation for neural network initialization

Kyle Luther and H. Sebastian Seung
Phys. Rev. Research 2, 033135 – Published 24 July 2020
PDFHTMLExport Citation

Abstract

Before backpropagation training, it is common to randomly initialize a neural network so that mean and variance of activity are uniform across neurons. Classically these statistics were defined over an ensemble of random networks. Alternatively, they can be defined over a random sample of inputs to the network. We show analytically and numerically that these two formulations of the principle of mean-variance preservation are very different in deep networks using rectification nonlinearity (ReLU). We numerically investigate training speed after data-dependent initialization of networks to preserve sample mean and variance.

  • Figure
  • Figure
  • Figure
  • Received 30 January 2020
  • Accepted 23 June 2020

DOI:https://doi.org/10.1103/PhysRevResearch.2.033135

Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.

Published by the American Physical Society

Physics Subject Headings (PhySH)

  1. Research Areas
Interdisciplinary PhysicsStatistical Physics & ThermodynamicsNetworks

Authors & Affiliations

Kyle Luther1,2,* and H. Sebastian Seung2,3

  • 1Department of Physics, Princeton University, Princeton, New Jersey 08544, USA
  • 2Neuroscience Institute, Princeton University, Princeton, New Jersey 08544, USA
  • 3Department of Computer Science, Princeton University, Princeton, New Jersey 08544, USA

  • *kluther@princeton.edu

Article Text

Click to Expand

Supplemental Material

Click to Expand

References

Click to Expand
Issue

Vol. 2, Iss. 3 — July - September 2020

Subject Areas
Reuse & Permissions
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review Research

Reuse & Permissions

It is not necessary to obtain permission to reuse this article or its components as it is available under the terms of the Creative Commons Attribution 4.0 International license. This license permits unrestricted use, distribution, and reproduction in any medium, provided attribution to the author(s) and the published article's title, journal citation, and DOI are maintained. Please note that some figures may have been included with permission from other third parties. It is your responsibility to obtain the proper permission from the rights holder directly for these figures.

×

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×