ExcelFormer: A Neural Network Surpassing GBDTs on Tabular Data

Chen, Jintai; Yan, Jiahuan; Chen, Danny Ziyi; Wu, Jian

arXiv:2301.02819 (cs)

[Submitted on 7 Jan 2023 (v1), last revised 24 Jan 2023 (this version, v3)]

Abstract: Although deep neural networks have achieved enormous success in various fields (e.g., computer vision) with supervised learning, they have so far trailed gradient boosting decision trees (GBDTs) on tabular data. Delving into this task, we determine that judicious handling of feature interactions and feature representation is crucial to the effectiveness of neural networks on tabular data. We develop a novel neural network called ExcelFormer, which alternates between two attention modules that handle feature interactions and feature representation updates, respectively. A bespoke training methodology is introduced alongside it to improve model performance. Specifically, because the parameters are initialized with minuscule values, these attention modules are attenuated when training begins, and the effects of feature interactions and representation updates grow progressively toward optimal levels as training proceeds, under the guidance of our proposed regularization schemes, Feat-Mix and Hidden-Mix. Experiments on 28 public tabular datasets show that ExcelFormer outperforms extensively tuned GBDTs, an unprecedented advance for deep neural networks on supervised tabular learning.
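
The attenuation effect described in the abstract can be illustrated with a toy example. The sketch below is not the ExcelFormer implementation: it assumes a residual form (output = input + update), uses a plain linear map as a stand-in for an attention module, and draws weights from a uniform distribution. None of these details are given in the abstract, and the helper names are hypothetical. It only shows that with minuscule initial weights the block starts out close to the identity, so the module's influence is small when training begins.

```python
import random

def init_linear(n_in, n_out, scale, rng):
    """Return an n_out x n_in weight matrix with entries drawn from U(-scale, scale).

    The uniform distribution is an assumption made for this illustration.
    """
    return [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]

def residual_block(x, weights):
    """Compute x + W x: an identity path plus an update from the module.

    A linear map stands in for an attention module here (assumption).
    """
    update = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    return [xi + ui for xi, ui in zip(x, update)]

def max_deviation(x, y):
    """Largest absolute elementwise difference between two vectors."""
    return max(abs(a - b) for a, b in zip(x, y))

rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(8)]  # one toy feature vector

tiny = init_linear(8, 8, scale=1e-4, rng=rng)   # minuscule initialization
usual = init_linear(8, 8, scale=1.0, rng=rng)   # larger scale, for comparison

dev_tiny = max_deviation(x, residual_block(x, tiny))
dev_usual = max_deviation(x, residual_block(x, usual))

print(f"deviation from identity, tiny init:   {dev_tiny:.2e}")
print(f"deviation from identity, larger init: {dev_usual:.2e}")
```

With the tiny scale, each output coordinate differs from its input by at most the scale times the sum of the absolute input values, so the block is nearly a pass-through until training grows the weights.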

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2301.02819 [cs.LG]
	(or arXiv:2301.02819v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.02819

Submission history

From: Jintai Chen
[v1] Sat, 7 Jan 2023 09:42:03 UTC (970 KB)
[v2] Fri, 13 Jan 2023 11:47:15 UTC (970 KB)
[v3] Tue, 24 Jan 2023 13:48:15 UTC (1,809 KB)
