Inversion dynamics of class manifolds in deep learning reveals tradeoffs underlying generalisation

Ciceri, Simone; Cassani, Lorenzo; Osella, Matteo; Rotondo, Pietro; Valle, Filippo; Gherardi, Marco

doi:10.1038/s42256-023-00772-9

arXiv:2303.05161 (cs)

[Submitted on 9 Mar 2023 (v1), last revised 23 Feb 2024 (this version, v2)]

Abstract: To achieve near-zero training error in a classification problem, the layers of a feed-forward network have to disentangle the manifolds of data points with different labels, to facilitate the discrimination. However, excessive class separation can lead to overfitting, since good generalisation requires learning invariant features, which involve some level of entanglement. We report on numerical experiments showing how the optimisation dynamics finds representations that balance these opposing tendencies with a non-monotonic trend. After a fast segregation phase, a slower rearrangement (conserved across data sets and architectures) increases the class entanglement. The training error at the inversion is stable under subsampling, and across network initialisations and optimisers, which characterises it as a property solely of the data structure and (very weakly) of the architecture. The inversion is the manifestation of tradeoffs elicited by well-defined and maximally stable elements of the training set, coined "stragglers", which are particularly influential for generalisation.
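
The non-monotonic trend described in the abstract can be illustrated with a toy sketch. This is not the paper's measure of entanglement: the distance-ratio proxy, the function names `entanglement` and `inversion_step`, and the sample points below are all assumptions made purely for illustration. The idea is to score a set of representations by how close same-class points are relative to different-class points, then locate the step at which that score stops decreasing and starts increasing.

```python
import math
from itertools import combinations


def entanglement(points, labels):
    """Illustrative proxy (an assumption, not the paper's metric):
    mean same-class distance divided by mean different-class distance.
    Lower values mean the classes are more segregated."""
    same, diff = [], []
    for (p, a), (q, b) in combinations(zip(points, labels), 2):
        d = math.dist(p, q)  # Euclidean distance between two representations
        (same if a == b else diff).append(d)
    return (sum(same) / len(same)) / (sum(diff) / len(diff))


def inversion_step(values):
    """Index of the minimum of a curve, i.e. the step where a
    decreasing trend turns into an increasing one."""
    return min(range(len(values)), key=values.__getitem__)


# Hypothetical 2-D representations of four training points, two per class.
labels = [0, 0, 1, 1]
separated = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
mixed = [(0.0, 0.0), (10.0, 1.0), (10.0, 0.0), (0.0, 1.0)]

# Hypothetical entanglement values recorded at successive training steps.
curve = [0.9, 0.4, 0.2, 0.3, 0.35]

print(entanglement(separated, labels) < entanglement(mixed, labels))
print(inversion_step(curve))
```

In this sketch the well-separated configuration scores lower than the mixed one, and the made-up curve has its turning point at index 2. In an actual experiment the curve would come from evaluating some entanglement measure on a layer's representations at successive training steps.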

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2303.05161 [cs.LG]
	(or arXiv:2303.05161v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.05161
Journal reference:	Nature Machine Intelligence, vol 6, 40-47 (2024)
Related DOI:	https://doi.org/10.1038/s42256-023-00772-9

Submission history

From: Marco Gherardi
[v1] Thu, 9 Mar 2023 10:35:40 UTC (1,984 KB)
[v2] Fri, 23 Feb 2024 17:21:40 UTC (3,429 KB)
