Infusing theory into deep learning for interpretable reactivity prediction

Wang, Shih-Han; Pillai, Hemanth Somarajan; Wang, Siwen; Achenie, Luke E. K.; Xin, Hongliang

doi:10.1038/s41467-021-25639-8

Article
Open access
Published: 06 September 2021

Shih-Han Wang¹, Hemanth Somarajan Pillai¹, Siwen Wang¹, Luke E. K. Achenie¹ & Hongliang Xin¹ (ORCID: orcid.org/0000-0001-9344-1697)

Nature Communications volume 12, Article number: 5288 (2021)

Abstract

Despite recent advances in data acquisition and algorithm development, machine learning (ML) faces tremendous challenges to adoption in practical catalyst design, largely due to its limited generalizability and poor explainability. Herein, we develop a theory-infused neural network (TinNet) approach that integrates deep learning algorithms with the well-established d-band theory of chemisorption for reactivity prediction of transition-metal surfaces. With simple adsorbates (e.g., *OH, *O, and *N) at active site ensembles as representative descriptor species, we demonstrate that the TinNet is on par with purely data-driven ML methods in prediction performance while being inherently interpretable. Incorporation of scientific knowledge of physical interactions into learning from data sheds further light on the nature of chemical bonding and opens up new avenues for ML discovery of novel motifs with desired catalytic properties.

Introduction

Adsorption energies of simple molecules or their fragments at solid surfaces often serve as reactivity descriptors in heterogeneous catalysis¹. Rapid discovery of structural motifs with kinetics-favorable descriptor values, for example using quantum-chemical calculations, is appealing while remaining a daunting task due to the formidable computational cost in accurately solving the many-electron Schrödinger equation. In this aspect, the d-band theory of chemisorption pioneered by Hammer and Nørskov^2,3,4,5,6 has been widely used for understanding reactivity trends of d-block metals^7,8 and, to some extent, their compounds⁹. However, its quantitative prediction accuracy using individual d-band characteristics, e.g., the number of d-electrons¹⁰, d-band center², and d-band upper edge^6,11, is limited due to the perturbative nature of the theoretical framework¹² and a large variation of site properties in high-throughput catalyst screening.

In recent years, machine learning (ML) has emerged as an alternative approach to predicting the chemical reactivity of catalytic sites with either hand-crafted^{13,14,15,16,17,18,19,20} or algorithm-derived features^{21,22,23,24,25}. By learning correlated interactions of atoms, ions, or molecules with a substrate from a sufficient amount of ab initio data, it is possible to compute adsorption properties orders of magnitude faster than traditional practices and narrow down candidate materials prior to experimental tests^{13,14,16,17,18,22,25,26,27,28}. A major limitation of black-box ML models, particularly with the resurgent deep learning algorithms²⁹, is that it is easy to learn some correlates that look deceptively good on both training and test samples, but do not generalize well outside the labeled data. To alleviate the issue, active learning workflows guided by key performance indicators^17,30 and/or model uncertainties¹⁶ have been used to accelerate the exploration of the enormous, essentially infinite, size of the accessible design space. Nevertheless, the necessity of a very large amount of data samples for model development and difficulties in interpreting model prediction impose tremendous challenges toward its adoption for automated search of high-performance catalytic materials.

Herein, we present a theory-infused neural network (TinNet) approach to predicting chemical reactivity of transition-metal surfaces and, more importantly, to extracting physical insights into the nature of chemical bonding that can be translated into catalyst design strategies. Incorporation of scientific knowledge of physical interactions into data-driven ML methods is an emerging area of research in catalysis science^{13,18,19,23,24,31,32}. To the best of our knowledge, no such hybrid surrogate models of chemisorption were developed within a fully integrated ML framework that is reasonably accurate (~0.1−0.2 eV error) and transferable across diverse samples. By learning from ab initio adsorption properties with deep learning algorithms, e.g., convolutional neural networks, while respecting the well-established d-band theory of chemisorption in architecture design, the TinNet can be applied for a broad range of d-block metal sites and naturally encodes physical aspects of bonding interactions, inheriting the merits of both worlds. We demonstrate the approach using adsorbed hydroxyl (*OH) at {111}-terminated intermetallics and near-surface alloys as a representative descriptor species, such as in finding efficient electrocatalysts for metal-catalyzed O₂ reduction³³, CO₂ reduction³⁴, and H₂ oxidation in alkaline electrolytes³⁵. This framework can be straightforwardly applied to other adsorbates (e.g., *O) or active site ensembles of multiple bonding atoms as shown for *N adsorption at {100}-terminated metal surfaces. The TinNet not only achieves prediction performance on par with purely regression-based ML methods, especially for out-of-sample systems with unseen structural and electronic features but also enables physical interpretation, paving the path toward ML discovery of novel motifs with desired catalytic properties.

Results

Deep network architecture

As illustrated in Fig. 1, the TinNet framework contains two sequential components: a regression module and a theory module. The input into the regression module built with convolutional neural networks is the feature representation of the adsorbate–substrate system that encodes the atomic information and bonding interactions of each atom with its neighboring environment. The output units from the regression module then serve as unknown parameters in the theory module that is built upon the d-band theory of chemisorption for predicting adsorption properties of a d-metal site. To ensure model transferability, easily accessible graph features were used (see Fig. 1). In the graph-level scheme, each atom or node is represented by a binary vector, comprising nine properties of the atom, e.g., electron affinity, atomic volume, and electronegativity^26,36. Similarly, each connection or edge encodes the pair interaction between neighboring atoms, including the solid angles swept out by the shared face of Voronoi polyhedra²² and the kernelized distances³⁶. A surface at the optimized bulk geometry with the adsorbate attached to the site of interest is used³⁷, thus avoiding the time-consuming structural optimization in the exploration of new systems²². Neural nets with m convolution-pooling layers are connected to the feature representation sub-module. Within the convolutional layers, multi-dimensional feature arrays are iteratively updated by convolution (i.e., feature mapping) to extract high-level patterns and by pooling for feature subsampling. The 2D array is flattened into a vector, which can be fed into a fully connected network with k hidden layers and a certain number of hidden neurons at each layer to capture the complex mapping between the extracted features and output targets. 
Finally, the output vector from the regression is incorporated into the theory module as local parameters along with user-defined global parameters, if any, that are independent of input features.
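As a rough illustration of the "kernelized distances" used as edge features, the sketch below expands a scalar interatomic distance onto a grid of Gaussian kernels, in the spirit of CGCNN-style graph inputs. The function name `gaussian_expand`, the grid limits, the spacing, and the choice of kernel width are assumptions made for illustration; they are not taken from the TinNet code.

```python
import math

def gaussian_expand(distance, d_min=0.0, d_max=8.0, step=0.2):
    """Expand one interatomic distance (in Angstrom) onto Gaussian kernels.

    Grid limits, spacing, and width are illustrative assumptions.
    """
    n_centers = int(round((d_max - d_min) / step)) + 1
    centers = [d_min + k * step for k in range(n_centers)]
    width = step  # kernel width tied to the grid spacing (assumption)
    return [math.exp(-((distance - c) ** 2) / width ** 2) for c in centers]

# Edge feature for a hypothetical neighbor pair separated by 2.8 Angstrom
edge_feature = gaussian_expand(2.8)
peak_index = edge_feature.index(max(edge_feature))
print(len(edge_feature), peak_index)
```

The resulting vector is smooth in the distance, so nearby geometries map to nearby edge features, which is the practical reason for preferring a kernel expansion over the raw scalar.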

**Fig. 1: Schematic illustration of the theory-infused neural network (TinNet).**

The physical meaning of each output unit from the regression module is pre-assigned in the TinNet framework. Historically, many factors have been used to correlate with the chemical reactivity of d-block metals, e.g., atomic or bulk properties^10,38, coordination numbers^39,40, and d-band characteristics^2,6. Mapping physically relevant factors onto adsorption energies with ML algorithms has been previously explored with some success^{13,14,15,17,18,19,21,25,31,32,41}. Besides the ambiguity of physical interpretation inherent to highly non-linear regression techniques, another major criticism is that some of the hand-crafted features require fully optimized geometric and/or electronic structures of the clean adsorption site, adding computational overhead costs to reactivity prediction of new materials. Instead of purely mathematical regression, we resort to the d-band theory of chemisorption with Newns–Anderson-type Hamiltonians^31,42,43 for computing the adsorption properties of metal sites. The central idea of the approach is to employ the activation output from the regression module as unknown, albeit trainable, parameters in the theory module (see Fig. 1). According to the d-band theory of chemisorption, chemical bonding at transition-metal surfaces can be conceptually separated into two consecutive steps². First, the gas-phase adsorbate species, characterized by an orbital \(\left|a\right\rangle\) at \({\epsilon }_{{\mathrm {a}}}^{0}\), is embedded into the delocalized sp-states of the substrate, leading to a resonance state at ϵ_a with a Lorentzian line shape. Second, the adsorbate resonance interacts with a distribution of localized d-states ρ_d, shifting up in energies due to the orbital orthogonalization penalty for satisfying the Pauli exclusion principle (termed Pauli repulsion) and then hybridizing into bonding and antibonding states. 
The first-step interaction with the sp-band contributes the largest part of chemical bonding, albeit as a constant ΔE₀ for a given adsorbate and site type. The adsorption energy difference from one metal to the next is governed by the second step, ΔE_d, which consists of Pauli repulsion and orbital hybridization⁴⁴, as illustrated in Fig. 1. The orthogonalization cost of interacting orbitals \(\Delta E_{d}^{\mathrm{orth}}\) can be quantified simply as proportional to the coupling integral V and overlap integral S, which are related through S ≈ α∣V∣ (with α as the overlap coefficient)⁴⁴. V² can be conveniently written as \(\beta V_{ad}^{2}\), in which β denotes the coupling coefficient. \(V_{ad}^{2}\) represents the interatomic coupling integral squared when the atoms are aligned along the z-axis; its standard value for a d-metal relative to Cu has been estimated from the linear muffin-tin orbitals (LMTO) theory and is readily available in the solid-state table⁴⁵. To a first approximation, the d-band hybridization contribution \(\Delta E_{d}^{\mathrm{hyb}}\) can be obtained from one-electron eigenenergies using Green’s function approach⁴³ with the parameterized Hamiltonian and the density of d-states ρ_d as the input. The total adsorption energy ΔE is the sum of the energy contributions from the metal sp-states and d-states, ΔE₀ and ΔE_d, respectively. Another important piece of information from the d-band theory with the Newns–Anderson model is the density of states projected onto the adsorbate orbital ρ_a. Inclusion of multiple frontier orbitals 1 ⋯ i of an adsorbate while considering their degeneracies can be realized by stacking fully connected network sub-modules (see Fig. 1). A full account of the theoretical framework was recently presented to bridge the complexity of electronic descriptors in understanding reactivity trends of pristine transition-metal surfaces and their alloys³¹.
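To make the two-step picture concrete, the following sketch evaluates the standard Newns–Anderson expression for the adsorbate-projected density of states, ρ_a(ϵ) = (1/π) Δ / [(ϵ − ϵ_a − Λ)² + Δ²], for a semi-elliptic d-band. It assumes that the sp-band contributes a constant broadening Δ₀, that the d-band adds πV²ρ_d to the chemisorption function, and that Λ follows from the closed-form principal-value integral of the semi-ellipse. All function names and numerical values are hypothetical and are not taken from the TinNet repository.

```python
import math

def semi_ellipse(eps, center, half_width):
    """Normalized semi-elliptic d-band density (states per eV)."""
    x = eps - center
    if abs(x) >= half_width:
        return 0.0
    return 2.0 / (math.pi * half_width ** 2) * math.sqrt(half_width ** 2 - x ** 2)

def semi_ellipse_hilbert_pv(eps, center, half_width):
    """Principal-value integral of rho_d(e') / (eps - e') in closed form."""
    x = eps - center
    r2 = half_width ** 2
    if abs(x) <= half_width:
        return 2.0 * x / r2
    return 2.0 / r2 * (x - math.copysign(math.sqrt(x * x - r2), x))

def adsorbate_pdos(eps, eps_a, delta_0, v2, center, half_width):
    """Newns-Anderson density of states projected onto one adsorbate orbital."""
    # Chemisorption function: constant sp part plus d-band part (assumption)
    delta = delta_0 + math.pi * v2 * semi_ellipse(eps, center, half_width)
    # Level shift from the d-band; a constant delta_0 is taken to add none
    lam = v2 * semi_ellipse_hilbert_pv(eps, center, half_width)
    return delta / math.pi / ((eps - eps_a - lam) ** 2 + delta ** 2)

# Hypothetical parameters (eV, eV^2): orbital level, sp broadening, coupling, d-band
grid = [-10.0 + 0.01 * i for i in range(1501)]
pdos = [adsorbate_pdos(e, -5.0, 1.0, 2.0, -2.0, 2.5) for e in grid]
print(max(pdos))
```

With the coupling V² set to zero, the expression reduces to a Lorentzian of half-width Δ₀ centered at ϵ_a, which corresponds to the sp-broadened resonance of the first step.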

A TinNet model using the architecture in Fig. 1 can be considered as a complex function mapping the graph feature representation of an adsorbate-substrate system to adsorption properties, i.e., the adsorption energy ΔE, projected density of states onto the adsorbate frontier orbital(s) \({\rho }_{a}^{1}\,\) ⋯ \({\rho }_{a}^{i}\), and d-band moments μ₁ ⋯ μ_j of the adsorption site. Such mapping is parameterized by learnable weights of convolutional filters and neural connections in the regression module that is subsequently regularized by the theory module. The training of TinNet models can be performed by minimizing the sum-of-squares error loss function J between model-predicted properties and DFT-calculated ground truths in the output layer (see Fig. 1). In the current TinNet implementation, two low-order moments (μ₁, μ₂) are embedded in the network for constructing the semi-ellipse ρ_d, which is centered at ϵ_d (μ₁, the first moment of the distribution relative to the Fermi level) with a full-width W_d (\(4\sqrt{{\mu }_{2}}\), μ₂ is the second moment of the distribution relative to the center). This simplified distribution is sufficient in computing orbital hybridization energies compared with self-consistent, DFT-calculated density of d-states for transition-metal surfaces¹¹. Higher-order moments of a distribution can be included using moment methods if necessary^6,46. Using the backpropagation and stochastic gradient descent (SGD) algorithms, the constrained optimization can be performed. The PyTorch framework is used for implementing the hierarchical neural networks^26,36 in Fig. 1. In the optimization of ML models, the output activations from the fully connected layers in the regression module are directly passed into the theory module as a vector. Those vector elements are partitioned into different parts and assigned to the d-band moments of the site atoms and interaction parameters of individual adsorbate frontier orbitals with the metal sp- and d-states. 
The binding energy of the adsorbate and the projected density of states onto adsorbate orbitals can then be computed from the theory module. For comparison purposes, the fully connected neural network (FCNN) and crystal graph convolutional neural network (CGCNN)^26,36 models were developed using the Adaptive Moment Estimation algorithm with weight decay (AdamW); see the “Methods” section for details of input features and model optimization. The complete code, named TinNet, is publicly available in a GitHub repository at https://github.com/hlxin/tinnet.
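The mapping from the two low-order moments to the semi-elliptic ρ_d can be checked numerically. The sketch below builds the semi-ellipse from (μ₁, μ₂) using ϵ_d = μ₁ and W_d = 4√μ₂ as stated above, then recovers the moments by midpoint integration. The helper names and the example values are illustrative assumptions.

```python
import math

def d_band_from_moments(mu1, mu2):
    """Return (center, full width) of the semi-ellipse: eps_d = mu1, W_d = 4*sqrt(mu2)."""
    return mu1, 4.0 * math.sqrt(mu2)

def semi_ellipse(eps, center, full_width):
    """Normalized semi-elliptic density of d-states."""
    half = full_width / 2.0
    x = eps - center
    if abs(x) >= half:
        return 0.0
    return 2.0 / (math.pi * half ** 2) * math.sqrt(half ** 2 - x ** 2)

def numerical_moments(center, full_width, n=20000):
    """Midpoint-rule estimate of the norm, first moment, and second central moment."""
    lo = center - full_width / 2.0
    h = full_width / n
    grid = [lo + (i + 0.5) * h for i in range(n)]
    rho = [semi_ellipse(e, center, full_width) for e in grid]
    m0 = sum(rho) * h
    m1 = sum(e * r for e, r in zip(grid, rho)) * h / m0
    m2 = sum((e - m1) ** 2 * r for e, r in zip(grid, rho)) * h / m0
    return m0, m1, m2

# Hypothetical moments: center 2 eV below the Fermi level, variance 1.5 eV^2
center, width = d_band_from_moments(-2.0, 1.5)
m0, m1, m2 = numerical_moments(center, width)
print(center, width, m0, m1, m2)
```

The check works because a semi-ellipse of half-width r has variance r²/4, so a full width of 4√μ₂ reproduces the second moment μ₂.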

Model benchmark

A comparison of the TinNet with the purely regression-based FCNN and CGCNN on predicting the chemical reactivity of d-block metal surfaces is shown in Fig. 2a. The dataset corresponds to *OH at 748 {111}-terminated transition-metal surfaces with a wide variety of site compositions. Specifically, it includes intermetallics (A₃B) and near-surface alloys (A′@A_ML, A–B@A_ML, A₃B@A_ML, A@A₂B₂, and A@AB₃), where A (or A′) represents 10 fcc/hcp metals and B covers 26 d-metals across the periodic table; see the “Methods” section for computational details. OH is adsorbed at the atop site while the O−H bond is tilted toward the bridge site. The straight-up *OH adsorption configuration is less favorable than the tilted ones on transition metals because of the directional 1π-orbital interactions with metal d-states. In this study, we did not include other local minima of tilted *OH adsorption configurations. In the feature representation, bonding angles are also not included in the CGCNN/TinNet framework. Note that other developments that are built upon the CGCNN, e.g., iCGCNN⁴⁷ and ALIGNN⁴⁸, have implemented angle features, which will be useful if multiple local minima exist in the dataset. Compared with previous studies that include different surface terminations and adsorption sites^17,26, we focus on a relatively small but representative dataset^14,49,50. For *OH, we explicitly included the 3σ, 1π, and 4σ^* frontier molecular orbitals in the network design. To rigorously evaluate the prediction performance of ML models with a balanced bias/variance trade-off, we adopted k-fold cross-validation (k = 10) to optimize hyperparameters, including the learning rate, number of atomic features, number of convolution-pooling layers, number of hidden layers, and number of hidden neurons in each layer⁵¹. A validation set (10%) is randomly split off the training set for early stopping of the optimization process as a form of regularization to avoid overfitting. 
In Fig. 2a, we present the learning curves of the FCNN, CGCNN, and TinNet models, in which the mean absolute error (MAE) of prediction and its standard deviation are estimated by the nested 10-fold cross-validation approach⁵² (see Supplementary Table 1 for the hyperparameters of each model scheme). We include a diagram of the TinNet model architecture and hyperparameters in Supplementary Fig. 2 for *OH to further clarify the flow/mapping of graph features to target properties. In the data-scarce region, the FCNN showed a relatively accurate and stable prediction of *OH adsorption energies compared with the CGCNN and TinNet models because it employs physics-based features (e.g., orbitalwise coordination numbers¹³) rather than low-level graph features. As the number of training samples increases, the TinNet attains a 0.118 eV MAE of prediction with a 0.022 eV deviation, outperforming the FCNN (0.152 ± 0.015 eV) and on par with the CGCNN (0.114 ± 0.025 eV). Figure 2b shows a 2D histogram representing the TinNet-predicted *OH adsorption energies of all 10-fold test sets against DFT-calculated values. In graph representation, the strain and ligand effects on site reactivity can be captured by atomic features and neighboring information. For the TinNet framework, graph representation of the local coordination environment is naturally reflected by the output activations from the regression module, including (1) the d-band center (first moment) and width (second moment) of the site atoms and (2) interaction parameters of individual adsorbate frontier orbitals with the metal sp- and d-states, such as the orbital overlap and coupling coefficients, which are dependent on d-orbital radii, interatomic distances, and local electron densities based on the tight-binding theory⁴⁵. 
To make a clear benchmark comparison of the TinNet/CGCNN/FCNN models in this work and previously published ML models of *OH chemisorption on alloy surfaces, we have tabulated their feature type, learning algorithm, number of tuning parameters, number of samples, data range, and prediction errors (MAE and RMSE) in Table 1. In a comparison of those methods, FCNN and CGCNN models rely on data to learn the underlying correlations between a site structure and the adsorption energy of *OH in a pure regression fashion, while the TinNet embeds the well-established physics, i.e., the Newns–Anderson model within the d-band theory of chemisorption, into the network architecture. Compared to the Bayeschem model³¹ trained with pristine transition-metal data (Supplementary Fig. 7), the significant improvement of the prediction accuracy (MAEs, Bayeschem: 0.27 eV, TinNet: 0.118 eV) can be attributed to the design of the TinNet architecture, allowing the algorithms to learn local interaction parameters of individual adsorbate frontier orbitals with the metal sp- and d-states from data samples of diverse site coordination environments. In contrast to ML models with hand-crafted features^{13,14,21,25,31,41}, the electronic structure of test samples is not needed for prediction using the TinNet. This elaborate design of the network architecture, as seen in Fig. 1, further improves the transferability of the TinNet framework and signifies its potential as a robust ML approach for guiding catalyst design beyond labeled material structures.
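The data-splitting protocol behind these error estimates can be sketched as follows: the samples are divided into 10 outer folds, each fold serves once as the test set, and roughly 10% of the remaining data is held out as a validation set for early stopping. The function name, the seeding, and the interleaved fold assignment are assumptions made for illustration; the authors' actual splitting code may differ.

```python
import random

def nested_splits(n_samples, k=10, seed=0):
    """Yield (train, validation, test) index lists for each outer fold."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k outer folds of near-equal size
    for test_id in range(k):
        test = folds[test_id]
        rest = [i for j, fold in enumerate(folds) if j != test_id for i in fold]
        rng.shuffle(rest)
        inner = [rest[i::k] for i in range(k)]  # split the remaining ~90% again
        val_id = rng.randrange(k)               # one random inner fold for early stopping
        val = inner[val_id]
        train = [i for j, fold in enumerate(inner) if j != val_id for i in fold]
        yield train, val, test

# 748 samples, matching the size of the *OH dataset
splits = list(nested_splits(748))
for train, val, test in splits[:2]:
    print(len(train), len(val), len(test))
```

Every sample appears in exactly one test fold, so the per-fold MAEs can be averaged into a single estimate with a fold-to-fold standard deviation.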

Table 1 Benchmark comparison of ML models of *OH chemisorption on alloy surfaces.

Model validation with single-atom alloys

To test the prediction performance of those final models for unseen data, we chose single-atom alloys (SAAs)⁵³ as an out-of-sample material system that was not used in model training and cross-validation. This emerging type of material has received substantial interest in recent years because its structural simplicity allows us to control catalytic properties at the atomic level. Here, we calculated *OH adsorption at the atop site of SAAs with Cu, Ag, or Au as the host and 26 d-metals as the single-atom active site. Because of the limited overlap between the d-states wavefunction of an active d-metal and that of the inert host, most of those SAAs exhibit previously unseen free-atom-like d-states^54,55, resembling the localized electronic structure in homogeneous molecular catalysts. With the Cu₁/Ag(111) single-atom alloy as a specific case, recent spectroscopic measurements validated the formation of such peaky d-states and their effect on the surface reactivity of Cu₁ sites⁵⁵. Using the TinNet-predicted interaction parameters (\(\Delta_{0}^{i}\), \(\epsilon_{a}^{i}\), αⁱ, and βⁱ, where i represents an adsorbate frontier orbital) of Cu₁/Ag(111) from the regression module, Fig. 3a shows the model-constructed projected density of states onto the OH 3σ, 1π, and 4σ^* orbitals against DFT-calculated distributions. The d-states distribution ρ_d of a Cu₁ site and its Hilbert transform along with the adsorbate line \(y=(\epsilon -\epsilon_{a})/(\pi \beta V_{ad}^{2})\) for each orbital are plotted for the graphical solution of the Newns–Anderson model⁴³. The intersections in Fig. 3a represent either the adsorbate-substrate bonding and antibonding states (2 localized roots) for 1π or the resonance state (1 localized root) for 3σ and 4σ^*. 
Given the simplicity of the model, the clearly captured strong-coupling and weak-coupling signatures for 1π and 3σ/4σ^* orbitals, respectively, justify the use of the TinNet for qualitatively predicting the electronic structure of an adsorbate–substrate system. Separately, the comparison of model performance for predicting *OH adsorption energies between FCNN, CGCNN, and TinNet is shown in Fig. 3b and Supplementary Fig. 4. Using the 10-fold cross-validated final models, the TinNet (MAE: 0.161 ± 0.008 eV) achieves a lower prediction error than the FCNN (MAE: 0.193 ± 0.026 eV) and CGCNN (MAE: 0.179 ± 0.029 eV), particularly for the region involving highly active early transition metals. Supplementary Fig. 5 shows the DFT-calculated vs. model-predicted d-band center ϵ_d and full-width W_d (MAE: 0.13 and 0.37 eV, respectively) that were used to construct the semi-ellipse representing the projected d-states distribution ρ_d onto a metal site. As an additional metric of model performance, the MAEs of the TinNet-predicted \({\rho }_{a}^{i}\) are 0.0205, 0.0166, and 0.0187 eV⁻¹ for the OH 3σ, 1π, and 4σ^* orbitals, respectively. To better understand the origin of the improved generalization performance, we have re-trained the FCNN and CGCNN models using multi-task learning (MTL), i.e., including both the adsorption energy and the d-band moments of the adsorption site in the loss function. We found that the generalization error of the adsorption energy prediction of SAAs remains similar or slightly worsens for the FCNN (MAE: 0.198 ± 0.039 eV) and CGCNN (MAE: 0.185 ± 0.029 eV). The improved generalization performance can therefore be attributed to the solid physical basis of the TinNet framework for property prediction of out-of-sample systems with unseen structural and electronic features, rather than to access to more electronic structure information. 
It is important to note that optimizing hyperparameters in deep learning architectures and training deployable models with a rigorous validation procedure is quite expensive even with current GPU architectures (10²−10³ GPU hours). Future development of the TinNet framework should enable transfer learning of trained model parameters to other adsorbate systems. For adsorbates with an identical set of frontier orbitals, e.g., atomic p_x, p_y, and p_z orbitals of C, N, and O adatoms, it is natural to start from past fittings since the output vectors from the regression module have the same length and physical meaning of individual adsorbate frontier orbital interacting with the metal sp- and d-states. For adsorbates with a distinct set of frontier orbitals, e.g., O, OH, and OOH, it is generally accepted that the underlying physics or factors governing the interaction strength of those adsorbates with alloy surfaces are universal. In that scenario, convolution filter parameters that extract high-level feature representations of adsorption sites can be preloaded to speed up optimization processes.

**Fig. 3: Out-of-sample validation of the TinNet model.**

Discussion

A significant advantage of the TinNet framework is the model interpretability empowered by the theory module. To provide physical insights into the reactivity trend of *OH at transition-metal surfaces, we deconvolute the d-contributed adsorption energy ΔE_d into Pauli repulsion and orbital hybridization (see Fig. 4a). Not surprisingly, orbital hybridization dominates the overall trend of *OH adsorption energies, in agreement with the Bayesian chemisorption model developed for pure metals³¹. In the strong-binding region, the Pauli repulsion due to orbital orthogonalization involving less than half-filled d-shells is expected to be negligible, very well captured by the TinNet. However, it becomes prominently important for late transition metals with completely or nearly filled d-states^3,33. Although this phenomenon was recognized, leveraging this physical aspect of chemical bonding for catalyst design in addition to strain⁵ and ligand⁴ effects has not been realized. For the diverse sites considered here, neither the d-band center nor the upper edge is linearly correlated with the *OH adsorption energy (R²: 0.64 and 0.49, respectively) (see Supplementary Fig. 6). We argue that a linear descriptor of this kind might not exist for such a diverse dataset. Interestingly, the TinNet-predicted coupling integral squared V², i.e., \(\beta {V}_{ad}^{2}\), correlates very well with the orbital hybridization energies for 3σ (R² ~ 0.93), 1π (R² ~ 0.87), and 4σ^* (R² ~ 0.89) orbitals (see Fig. 4b). This result showcases the ability of the TinNet framework to provide a detailed physical interpretation of the reactivity trend of metal sites that is inaccessible with purely regression-based models.

**Fig. 4: Physical insights into chemical bonding.**

To demonstrate the approach for other adsorbates and facets, we developed the TinNet models for *O at the atop site of the {111}-terminated bimetallic alloy surfaces and *N at the hollow site of {100}-terminated ternary alloy surfaces, as shown in Fig. 5. The 10-fold cross-validated MAEs are 0.147 and 0.116 eV for *O and *N, respectively. We use the same set of alloy surfaces for *O as for the *OH models (748 total). For *N adsorbed at the four-fold hollow site, we used 329 {100}-terminated Pt-based ternary alloy surfaces (Pt₃M and Pt₂M₂ intermetallics with M′ dopants at different positions of the top two layers; see the “Methods” section for details). *N adsorption at metal sites represents an important reactivity descriptor for ammonia electro-oxidation as the anode reaction in direct ammonia fuel cells^56,57,58. We note that the surface has a coadsorbed *OH spectator species for all the samples. Our previous study has shown that *OH plays a crucial role in stabilizing *NH_x species under relevant operating conditions⁵⁹. The dataset showcases the inclusion of adsorbate–adsorbate interactions in developing ML models. In the current TinNet implementation, for an n-atom site ensemble, the regression module automatically allocates 2n output neurons for the first and second moments of the d-states distribution of site atoms. The d-states distribution of the adsorption site is represented by a superposition of individual d-DOS constructs, e.g., semi-elliptic functions. Other output neurons representing interaction parameters of the adsorbate frontier orbitals with the metal sp- and d-states have the same dimension and physical meanings for adsorption sites of different atom ensembles.
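The bookkeeping described above, 2n moment neurons for an n-atom site followed by a fixed block of interaction parameters per frontier orbital, can be sketched as a simple partition of the regression output vector. The ordering of the entries, the function name, and the use of four parameters per orbital (Δ₀, ϵ_a, α, β, following the parameter list given for the single-atom alloy analysis) are assumptions made for illustration and do not reproduce the layout in the TinNet code.

```python
def partition_outputs(outputs, n_site_atoms, orbitals):
    """Split a flat output vector into d-band moments and per-orbital parameters.

    Assumed layout: [mu1, mu2] for each site atom, then
    [delta_0, eps_a, alpha, beta] for each adsorbate frontier orbital.
    """
    n_moments = 2 * n_site_atoms
    expected = n_moments + 4 * len(orbitals)
    if len(outputs) != expected:
        raise ValueError(f"expected {expected} outputs, got {len(outputs)}")
    moments = [(outputs[2 * i], outputs[2 * i + 1]) for i in range(n_site_atoms)]
    params = {}
    for k, name in enumerate(orbitals):
        start = n_moments + 4 * k
        delta_0, eps_a, alpha, beta = outputs[start:start + 4]
        params[name] = {"delta_0": delta_0, "eps_a": eps_a, "alpha": alpha, "beta": beta}
    return moments, params

# *OH at an atop site: one site atom and three frontier orbitals (14 outputs)
oh_moments, oh_params = partition_outputs(list(range(14)), 1, ["3sigma", "1pi", "4sigma*"])
# *N at a four-fold hollow site: four site atoms, three p orbitals (20 outputs)
n_moments, n_params = partition_outputs(list(range(20)), 4, ["px", "py", "pz"])
print(oh_moments, n_moments)
```

Only the moment block grows with the number of site atoms, so the per-orbital parameters keep the same length and meaning across atop and hollow sites.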

**Fig. 5: TinNet models for other adsorbates/facets.**

This study highlights the importance of the frontier molecular orbital theory, electronic structure methods, and deep learning algorithms in developing interpretable ML models of chemical bonding. Infusing theory into ML fueled with ab initio adsorption properties will eventually lead us to better understand the fundamentals of linear energy relationships^60,61 and devise strategies to overcome such constraints in catalysis⁶². For example, electrolyte molecules or ions can exert an additional coupling term with the adsorbate energy level ϵ_a, often via hydrogen bonding^63,64, which could be leveraged to break the adsorption-energy scaling relations for hydrogen-containing species. Indeed, there is evidence that adding a co-solvent or ionic species into the bulk electrolyte does have a positive effect on stabilizing charge-transfer intermediates in metal-air batteries⁶⁵, ammonia synthesis⁶⁶, CO₂ reduction⁶⁷, and oxygen evolution⁶⁸. This physical aspect of chemical bonding can be built into the TinNet for screening improved catalytic systems with consideration of electrolyte choices. As a related note, all the structures used in this study are DFT-optimized local minima. Informing the learning algorithms of this physical information (forces are less than a threshold) in the spirit of incorporating physics, if the forces are accessible in the TinNet framework, can further constrain deep learning models and improve their transferability. Beyond a better estimation of adsorption energetics that is extensively explored in the field of catalysis, activation barriers, adsorbate–adsorbate interactions, and surface segregation energies are also important for predicting reaction kinetics and site stability prior to catalyst screening. The framework proposed here is a step toward that direction.

To conclude, the herein proposed theory-infused neural network (TinNet) represents a generalized ML approach to predicting the chemical reactivity of solid surfaces with atomically tailored active sites. Importantly, physical insights by learning from data come naturally with the TinNet, which cannot be obtained otherwise using purely regression-based methods, irrespective of feature representations. We demonstrate the approach using simple adsorbates (e.g., *OH, *O, and *N) at active site ensembles as specific cases, and it can also be transferred directly to other descriptor species and nanostructures of different site geometries or electronic complexities, e.g., metal compounds with strongly correlated d electrons, paving the path toward interpretable ML discovery of novel motifs with desired catalytic properties. This study encapsulates all of the important ingredients of the ML approach and can be straightforwardly extended to generic models or principles where the neuron representing parameters should be treated on a case-by-case basis.

Methods

DFT calculations

Spin-polarized DFT calculations of *OH and *O adsorption systems were performed through Quantum ESPRESSO⁶⁹ with ultrasoft pseudopotentials. The exchange-correlation was approximated within the generalized gradient approximation (GGA) with Perdew–Burke–Ernzerhof (PBE)⁷⁰. {111}-terminated metal surfaces were simulated using (2 × 2) supercells with 4 layers and a vacuum of 15 Å between two images. The bottom two layers were fixed while the top two layers and adsorbates were allowed to relax until a force criterion of 0.1 eV/Å was met. A plane-wave energy cutoff of 500 eV was used. The *N adsorption systems consist of {100}-terminated Pt-based bimetallic surfaces doped with a third element. They include Pt₃M and PtM bimetallics where M can be any of the transition metals, while the dopants cover 15 elements: Fe, Zn, Cu, Co, Ni, Rh, Pd, Ag, Ir, Pt, Au, Ru, Mo, Cr, and W. Spin-polarized DFT calculations were performed through the Vienna ab initio simulation package (VASP) with projector-augmented wave pseudopotentials. The exchange-correlation was approximated within the GGA with the revised Perdew–Burke–Ernzerhof (RPBE)⁷¹. A plane-wave energy cutoff of 450 eV was used. The {100}-terminated alloy surfaces were modeled using (2 × 2) supercells with 4 layers and a vacuum of 15 Å between two images. The bottom two layers were fixed while the top two layers and adsorbates were allowed to relax until a force criterion of 0.05 eV/Å was met. In order to consider the effect of aqueous solvation on adsorption energies, an implicit solvation model was employed through the VASPsol package⁷². All of the Pt-based alloy surfaces have coadsorbed *OH (θ_OH = 1/4 ML) as a spectator species. Doping is simulated by replacing one metal atom in the top two layers with a dopant metal. For both {111} and {100} terminations, a Monkhorst–Pack mesh of 6 × 6 × 1 was used to sample the Brillouin zone, while for molecules and radicals only the Gamma point was used. 
Methfessel–Paxton smearing scheme was used with a smearing parameter of 0.1 eV for adsorbate systems and 0.001 eV for molecules. Electronic energies are extrapolated to k_BT = 0 eV. The projected atomic and molecular density of states were obtained by projecting the eigenvectors of the full system at a denser k-point sampling (12 × 12 × 1) with an energy spacing of 0.01 eV onto the ones of the part, as determined by gas-phase calculations.
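To make the Brillouin-zone sampling concrete, the following Python sketch generates the fractional k-point coordinates of an unshifted Monkhorst–Pack grid from the standard definition u_r = (2r − q − 1)/(2q), r = 1, …, q. The function name `monkhorst_pack` is hypothetical, and the sketch ignores the symmetry reduction and any grid shifts that Quantum ESPRESSO or VASP may apply internally, so it should be read as an illustration of the mesh sizes rather than as the k-point sets actually used.

```python
from itertools import product


def monkhorst_pack(size):
    """Return fractional k-points of an unshifted Monkhorst-Pack grid.

    Along each reciprocal axis with q divisions the coordinates are
    u_r = (2r - q - 1) / (2q) for r = 1..q.
    """
    axes = [[(2 * r - q - 1) / (2 * q) for r in range(1, q + 1)] for q in size]
    return list(product(*axes))


kpts_slab = monkhorst_pack((6, 6, 1))    # mesh quoted for the slab calculations
kpts_dos = monkhorst_pack((12, 12, 1))   # denser mesh quoted for the DOS projection
print(len(kpts_slab), len(kpts_dos))     # 36 144
```

With a single division along the surface normal, every point has a zero third coordinate, which reflects the absence of dispersion across the vacuum layer.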

FCNN models

A fully connected neural network (FCNN) is the simplest type of artificial neural network; its node connections contain no cycles. The input features of the FCNN comprise atomic, surface, and bulk features, which characterize the adsorption site, the environment of the adsorption site, and the entire crystal, respectively. The "BulkFingerprintGenerator.bulk_average" module of the CatLearn package³⁷ is used to extract properties of the adsorption site, the first two surface layers, and the bulk as atomic, surface, and bulk features, respectively. All missing properties in the module are set to zero. In addition to these properties, the atomic features also contain the Pauling electronegativity (χ₀), \(V_{ad}^{2}\), and the atomic radius (r₀), while the surface features include the local Pauling electronegativity (χ) and the orbitalwise coordination numbers (CN^s and CN^d)⁴⁰.
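As a minimal illustration of the feed-forward structure described above, the following Python sketch assembles a feature vector (with missing entries set to zero) and passes it through fully connected layers. The function names, the feature values, the weights, and the choice of ReLU for the hidden layers are all assumptions made for illustration; this is not the model trained in this work.

```python
def assemble_features(atomic, surface, bulk):
    """Concatenate the three feature groups, replacing missing values (None) with 0.0."""
    return [0.0 if v is None else float(v) for v in atomic + surface + bulk]


def dense(x, weights, biases):
    """One fully connected layer: every input feeds every output neuron."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(values):
    return [max(0.0, v) for v in values]


def fcnn_forward(x, layers):
    """Feed-forward pass: information flows strictly from input to output (no cycles)."""
    for weights, biases in layers[:-1]:
        x = relu(dense(x, weights, biases))
    weights, biases = layers[-1]
    return dense(x, weights, biases)[0]  # single regression output


# Hypothetical inputs: the second atomic property is missing and becomes 0.0.
x = assemble_features(atomic=[2.28, None], surface=[1.5], bulk=[0.5])
layers = [
    ([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 1.0, -1.0]], [0.0, 0.0]),  # hidden layer, 2 neurons
    ([[0.5, -1.0]], [0.1]),                                       # output layer, 1 neuron
]
energy = fcnn_forward(x, layers)
```

In practice the weights would be learned by minimizing MSE(ΔE) rather than set by hand.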

Hyperparameter optimization

In this study, five hyperparameters, namely the learning rate (lr), the number of hidden layers (n_h), the number of neurons in each hidden layer (h_fea_len), the number of convolutional layers (n_conv), and the length of the atomic feature vector passed into the convolution (atom_fea_len), were tuned using the random search algorithm through the Ray package⁵¹. lr is sampled from a log-uniform distribution between 0.0001 and 1. atom_fea_len, n_conv, h_fea_len, and n_h are random integers in the ranges 16–112, 1–10, 32–224, and 1–10, respectively. For each model, 150 randomly selected combinations are used as hyperparameter sets for training.

For each hyperparameter set, regular 10-fold cross-validation (CV) is applied. The data set is first divided into 10 folds, and each fold is used in turn as the test set. The remaining 90% of the data set is divided into 10 folds again, and one randomly chosen fold is used as the validation set for early stopping of the training procedure. Supplementary Fig. 1 illustrates the hyperparameter optimization procedure. The AdamW optimization algorithm, the MSE loss function, and the Softplus, Sigmoid, and ReLU activation functions are used in the training. The batch size and weight decay are 64 and 0.0001, respectively. If the validation loss does not improve within 1000 epochs, training is stopped and the model with the minimal validation loss is selected as the final model of that fold.

For FCNN and CGCNN, the loss function contains only MSE(ΔE), whereas for TinNet the loss function is MSE(ΔE) + MSE(μ₁) + MSE(2\(\sqrt{\mu_{2}}\)) + λ[MSE(ρ_3σ) + MSE(ρ_1π) + MSE(\(\rho_{4\sigma^{*}}\))]. The energy contribution from the sp-electrons (ΔE₀) and the weight of the density-of-states terms (λ) are set to −2.69 eV and 0.01, respectively, as derived from Bayesian learning models³¹. The final loss of a hyperparameter set is the average of its 10 test losses. The optimized hyperparameter set with the minimal loss for each algorithm is shown in Supplementary Table 1. These hyperparameter sets are used for all subsequent ML optimization. Details of the CGCNN model set can be found in refs. ²²,³⁶.
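The search space and the composite TinNet loss described above can be sketched in plain Python as follows. This is an illustrative stand-in that uses only the standard library instead of Ray Tune and PyTorch; the function names, the dictionary keys, and the assumption that the integer bounds are inclusive are ours and are not taken from the TinNet code base.

```python
import random


def sample_hyperparameters(rng):
    """Draw one configuration from the search space described in the text."""
    return {
        # log-uniform between 1e-4 and 1: sample the exponent uniformly
        "lr": 10 ** rng.uniform(-4.0, 0.0),
        "atom_fea_len": rng.randint(16, 112),  # inclusive bounds assumed
        "n_conv": rng.randint(1, 10),
        "h_fea_len": rng.randint(32, 224),
        "n_h": rng.randint(1, 10),
    }


def mse(pred, target):
    """Mean squared error between two equally long sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def tinnet_loss(pred, target, lam=0.01):
    """MSE(dE) + MSE(mu1) + MSE(2*sqrt(mu2)) + lam * [three DOS terms]."""
    scalar_keys = ("dE", "mu1", "two_sqrt_mu2")
    dos_keys = ("rho_3sigma", "rho_1pi", "rho_4sigma_star")
    loss = sum(mse(pred[k], target[k]) for k in scalar_keys)
    loss += lam * sum(mse(pred[k], target[k]) for k in dos_keys)
    return loss


rng = random.Random(0)
configs = [sample_hyperparameters(rng) for _ in range(150)]  # 150 trial configurations
```

Sampling the exponent uniformly gives each decade of the learning rate the same probability, which is the purpose of a log-uniform prior. The small weight λ = 0.01 keeps the density-of-states terms from dominating the energy and moment terms.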

Learning curve

Nested 10-fold cross-validation with different proportions of the dataset (from 5% to 100% in 5% increments) was used to evaluate model performance. For each proportion, the dataset is divided into 10 folds. One fold is used as the test set, another fold is used as the validation set, and the remaining eight folds are used as the training set. Supplementary Fig. 3 illustrates the procedure for generating the learning curve with the nested 10-fold cross-validation approach. The 90 models whose test set differs from the validation set are used to evaluate model performance. The 10 models whose test set is also the validation set are used as final models for predicting unknown systems. For the different methods, the average wall time consumed to train a model for a given data split is shown in Supplementary Table 2.
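The bookkeeping of the nested scheme can be sketched as follows: every ordered pair of (test fold, validation fold) defines one model, giving 100 models per dataset proportion, of which 90 have distinct test and validation folds. The function name `nested_cv_splits`, the shuffling seed, and the example dataset size of 200 are hypothetical; the sketch only reproduces the counting described above, not the training itself.

```python
import random


def nested_cv_splits(n_samples, n_folds=10, seed=0):
    """Yield (test_fold, val_fold, train_idx, val_idx, test_idx) for all fold pairs."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    for t in range(n_folds):
        for v in range(n_folds):
            # Training data: every fold that is neither the test nor the validation fold
            train = [i for k, f in enumerate(folds) if k not in (t, v) for i in f]
            yield t, v, train, folds[v], folds[t]


splits = list(nested_cv_splits(200))
evaluation = [s for s in splits if s[0] != s[1]]  # used for the learning curve
final = [s for s in splits if s[0] == s[1]]       # kept for predicting unknown systems
print(len(evaluation), len(final))                # 90 10
```

When the test and validation folds coincide, nine folds remain for training, so the final models see slightly more data than the evaluation models.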

Data availability

The training and test data for the transition-metal alloy surfaces used in this study are available in the GitHub repository (https://github.com/hlxin/tinnet).

Code availability

TinNet: https://github.com/hlxin/tinnet

CGCNN: https://github.com/txie-93/cgcnn

CGCNN: https://github.com/ulissigroup/cgcnn

Ray Tune: https://docs.ray.io/en/latest/tune/index.html

References

Nørskov, J. K., Abild-Pedersen, F., Studt, F. & Bligaard, T. Density functional theory in surface chemistry and catalysis. Proc. Natl Acad. Sci. USA 108, 937–943 (2011).
Hammer, B. & Nørskov, J. K. Electronic factors determining the reactivity of metal surfaces. Surf. Sci. 343, 211–220 (1995).
Xin, H. & Linic, S. Communications: exceptions to the d-band model of chemisorption on metal surfaces: the dominant role of repulsion between adsorbate states and metal d-states. J. Chem. Phys. 132, 221101 (2010).
Kitchin, J. R., Nørskov, J. K., Barteau, M. A. & Chen, J. G. Role of strain and ligand effects in the modification of the electronic and chemical properties of bimetallic surfaces. Phys. Rev. Lett. 93, 156801 (2004).
Mavrikakis, M., Hammer, B. & Nørskov, J. K. Effect of strain on the reactivity of metal surfaces. Phys. Rev. Lett. 81, 2819–2822 (1998).
Xin, H., Vojvodic, A., Voss, J., Nørskov, J. K. & Abild-Pedersen, F. Effects of d-band shape on the surface reactivity of transition-metal alloys. Phys. Rev. B Condens. Matter 89, 115114 (2014).
Nørskov, J. K., Bligaard, T., Rossmeisl, J. & Christensen, C. H. Towards the computational design of solid catalysts. Nat. Chem. 1, 37–46 (2009).
Zhao, Z. -J. et al. Theory-guided design of catalytic materials using scaling relationships and reactivity descriptors. Nat. Rev. Mater. 4, 792–804 (2019).
Vojvodic, A., Hellman, A., Ruberto, C. & Lundqvist, B. I. From electronic structure to catalytic activity: a single descriptor for adsorption and reactivity on transition-metal carbides. Phys. Rev. Lett. 103, 146103 (2009).
Calle-Vallejo, F. et al. Number of outer electrons as descriptor for adsorption processes on transition metals and their oxides. Chem. Sci. 4, 1245–1249 (2013).
Vojvodic, A., Nørskov, J. K. & Abild-Pedersen, F. Electronic structure effects in transition metal surface chemistry. Top. Catal. 57, 25–32 (2014).
Nørskov, J. K. Covalent effects in the effective-medium theory of chemical binding: hydrogen heats of solution in the 3d metals. Phys. Rev. B 26, 2875–2885 (1982).
Ma, X., Li, Z., Achenie, L. E. K. & Xin, H. Machine-learning-augmented chemisorption model for CO₂ electroreduction catalyst screening. J. Phys. Chem. Lett. 6, 3528–3533 (2015).
Li, Z., Wang, S., Chin, W. S., Achenie, L. E. & Xin, H. High-throughput screening of bimetallic catalysts enabled by machine learning. J. Mater. Chem. A 5, 24131–24138 (2017).
Chowdhury, A. J. et al. Prediction of adsorption energies for chemical species on metal catalyst surfaces using machine learning. J. Phys. Chem. C 122, 28142–28150 (2018).
Li, Z., Achenie, L. E. K. & Xin, H. An adaptive machine learning strategy for accelerating discovery of perovskite electrocatalysts. ACS Catal. 10, 4377–4384 (2020).
Tran, K. & Ulissi, Z. W. Active learning across intermetallics to guide discovery of electrocatalysts for CO₂ reduction and H₂ evolution. Nat. Catal. 1, 696–703 (2018).
Montemore, M. M., Nwaokorie, C. F. & Kayode, G. O. General screening of surface alloys for catalysis. Catal. Sci. Technol. 10, 4467–4476 (2020).
Esterhuizen, J. A., Goldsmith, B. R. & Linic, S. Theory-guided machine learning finds geometric structure–property relationships for chemisorption on subsurface alloys. Chem 6, 3100–3117 (2020).
Mamun, O., Winther, K. T., Boes, J. R. & Bligaard, T. A Bayesian framework for adsorption energy prediction on bimetallic alloy catalysts. npj Comput. Mater. 6, 1–11 (2020).
Andersen, M., Levchenko, S. V., Scheffler, M. & Reuter, K. Beyond scaling relations for the description of catalytic materials. ACS Catal. 9, 2752–2759 (2019).
Back, S., Tran, K. & Ulissi, Z. W. Toward a design of active oxygen evolution catalysts: Insights from automated density functional theory calculations and machine learning. ACS Catal. 9, 7651–7659 (2019).
García-Muelas, R. & López, N. Statistical learning goes beyond the d-band model providing the thermochemistry of adsorbates on transition metals. Nat. Commun. 10, 4687 (2019).
Weng, B. et al. Simple descriptor derived from symbolic regression accelerating the discovery of new perovskite catalysts. Nat. Commun. 11, 3513 (2020).
Fung, V., Hu, G., Ganesh, P. & Sumpter, B. G. Machine learned features from density of states for accurate adsorption energy prediction. Nat. Commun. 12, 88 (2021).
Back, S. et al. Convolutional neural network of atomic surface structures to predict binding energies for high-throughput screening of catalysts. J. Phys. Chem. Lett. 10, 4401–4408 (2019).
Peterson, A. A. Acceleration of saddle-point searches with machine learning. J. Chem. Phys. 145, 074106 (2016).
Garrido Torres, J. A., Jennings, P. C., Hansen, M. H., Boes, J. R. & Bligaard, T. Low-scaling algorithm for nudged elastic band calculations using a surrogate machine learning model. Phys. Rev. Lett. 122, 156001 (2019).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Zhong, M. et al. Accelerated discovery of CO₂ electrocatalysts using active machine learning. Nature 581, 178–183 (2020).
Wang, S., Pillai, H. S. & Xin, H. Bayesian learning of chemisorption for bridging the complexity of electronic descriptors. Nat. Commun. 11, 6132 (2020).
Lamoureux, P. S. et al. Artificial intelligence real-time prediction and physical interpretation of atomic binding energies in nano-scale metal clusters. Preprint at https://arxiv.org/abs/2005.02572 (2020).
Xin, H., Holewinski, A. & Linic, S. Predictive structure–reactivity models for rapid screening of Pt-based multimetallic electrocatalysts for the oxygen reduction reaction. ACS Catal. 2, 12–16 (2012).
Tang, M. T., Peng, H., Lamoureux, P. S., Bajdich, M. & Abild-Pedersen, F. From electricity to fuels: descriptors for C₁ selectivity in electrochemical CO₂ reduction. Appl. Catal. B 279, 119384 (2020).
Strmcnik, D. et al. Improving the hydrogen oxidation reaction rate by promotion of hydroxyl adsorption. Nat. Chem. 5, 300–306 (2013).
Xie, T. & Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 120, 145301 (2018).
Hansen, M. H. et al. An atomistic machine learning package for surface science and catalysis. Preprint at http://arxiv.org/abs/1904.00904 (2019).
Trasatti, S. Work function, electronegativity, and electrochemical behaviour of metals. J. Electroanal. Chem. Interfacial Electrochem. 39, 163–184 (1972).
Calle-Vallejo, F., Martínez, J. I., García-Lastra, J. M., Sautet, P. & Loffreda, D. Fast prediction of adsorption properties for platinum nanocatalysts with generalized coordination numbers. Angew. Chem. Int. Ed. 53, 8316–8319 (2014).
Ma, X. & Xin, H. Orbitalwise coordination number for predicting adsorption properties of metal nanocatalysts. Phys. Rev. Lett. 118, 036101 (2017).
Li, Z., Ma, X. & Xin, H. Feature engineering of machine-learning chemisorption models for catalyst design. Catal. Today 280 (Part 2), 232–238 (2017).
Anderson, P. W. Localized magnetic states in metals. Phys. Rev. 124, 41 (1961).
Edwards, D. M. & Newns, D. M. Electron interaction in the band theory of chemisorption. Phys. Lett. A 24, 236–237 (1967).
Hammer, B., Morikawa, Y. & Nørskov, J. K. CO chemisorption at metal surfaces and overlayers. Phys. Rev. Lett. 76, 2141 (1996).
Harrison, W. A. Electronic Structure and the Properties of Solids: The Physics of the Chemical Bond (Dover Publications, 1989).
Rajan, A., Kuang, Y. C., Ooi, M. P. L., Demidenko, S. N. & Carstens, H. Moment-constrained maximum entropy method for expanded uncertainty evaluation. IEEE Access 6, 4072–4082 (2018).
Park, C. W. & Wolverton, C. Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery. Phys. Rev. Mater. 4, 063801 (2020).
DeCost, B. & Choudhary, K. Atomistic line graph neural network for improved materials property predictions. Preprint at http://arxiv.org/abs/2106.01829 (2021).
Schiros, T. et al. Structure and bonding of the water–hydroxyl mixed phase on Pt(111). J. Phys. Chem. C 111, 15003–15012 (2007).
Held, G., Clay, C., Barrett, S. D., Haq, S. & Hodgson, A. The structure of the mixed OH + H₂O overlayer on Pt{111}. J. Chem. Phys. 123, 64711 (2005).
Liaw, R. et al. Tune: a research platform for distributed model selection and training. Preprint at arXiv:1807.05118 [cs.LG] (2018).
Varma, S. & Simon, R. Bias in error estimation when using cross-validation for model selection. BMC Bioinforma. 7, 91 (2006).
Hannagan, R. T., Giannakakis, G., Flytzani-Stephanopoulos, M. & Sykes, E. C. H. Single-atom alloy catalysis. Chem. Rev. 120, 12044–12088 (2020).
Thirumalai, H. & Kitchin, J. R. Investigating the reactivity of single atom alloys using density functional theory. Top. Catal. 61, 462–474 (2018).
Greiner, M. T. et al. Free-atom-like d states in single-atom alloy catalysts. Nat. Chem. 10, 1008–1015 (2018).
Katsounaros, I. et al. On the mechanism of the electrochemical conversion of ammonia to dinitrogen on Pt (1 0 0) in alkaline environment. J. Catal. 359, 82–91 (2018).
Li, Y. et al. Ternary PtIrNi catalysts for efficient electrochemical ammonia oxidation. ACS Catal. 10, 3945–3957 (2020).
Li, Y. et al. High-performance ammonia oxidation catalysts for anion-exchange membrane direct ammonia fuel cells. Energy Environ. Sci. 14, 1449–1460 (2021).
Pillai, H. S. & Xin, H. New insights into electrochemical ammonia oxidation on Pt(100) from First Principles. Ind. Eng. Chem. Res. 58, 10819–10828 (2019).
Abild-Pedersen, F. et al. Scaling properties of adsorption energies for hydrogen-containing molecules on transition-metal surfaces. Phys. Rev. Lett. 99, 016105 (2007).
Wang, S. et al. Universal Brønsted–Evans–Polanyi relations for C–C, C–O, C–N, N–O, N–N, and O–O dissociation reactions. Catal. Lett. 141, 370–373 (2011).
Vojvodic, A. & Nørskov, J. K. New design paradigm for heterogeneous catalysts. Natl Sci. Rev. 2, 140–149 (2015).
Santos, E., Quaino, P. & Schmickler, W. Theory of electrocatalysis: hydrogen evolution and more. Phys. Chem. Chem. Phys. 14, 11224–11233 (2012).
Fortunelli, A. et al. Dramatic increase in the oxygen reduction reaction for platinum cathodes from tuning the solvent dielectric constant. Angew. Chem. Int. Ed. 53, 6669–6672 (2014).
Amin, H. M. A., Molls, C., Bawol, P. P. & Baltruschat, H. The impact of solvent properties on the performance of oxygen reduction and evolution in mixed tetraglyme-dimethyl sulfoxide electrolytes for Li–O₂ batteries: mechanism and stability. Electrochim. Acta 245, 967–980 (2017).
Kim, K. et al. Communication—electrochemical reduction of nitrogen to ammonia in 2-propanol under ambient temperature and pressure. J. Electrochem. Soc. 163, F610 (2016).
Rosen, B. A. et al. Ionic liquid-mediated selective conversion of CO₂ to CO at low overpotentials. Science 334, 643–644 (2011).
Li, G.-F., Divinagracia, M., Labata, M. F., Ocon, J. D. & Abel Chuang, P.-Y. Electrolyte-dependent oxygen evolution reactions in alkaline media: electrical double layer and interfacial interactions. ACS Appl. Mater. Interfaces 11, 33748–33758 (2019).
Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J. Phys. Condens. Matter 21, 395502 (2009).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996).
Hammer, B., Hansen, L. B. & Nørskov, J. K. Improved adsorption energetics within density-functional theory using revised Perdew–Burke–Ernzerhof functionals. Phys. Rev. B Condens. Matter 59, 7413–7421 (1999).
Mathew, K., Sundararaman, R., Letchworth-Weaver, K., Arias, T. A. & Hennig, R. G. Implicit solvation model for density-functional study of nanocrystal surfaces and reaction pathways. J. Chem. Phys. 140, 084106 (2014).

Acknowledgements

S.-H.W., H.S.P., S.W., L.E.K.A., and H.X. acknowledge partial financial support from the NSF CAREER program (CBET-1845531). The computational resources used in this work were provided by Advanced Research Computing at Virginia Polytechnic Institute and State University.

Author information

These authors contributed equally: Shih-Han Wang, Hemanth Somarajan Pillai.

Authors and Affiliations

Department of Chemical Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
Shih-Han Wang, Hemanth Somarajan Pillai, Siwen Wang, Luke E. K. Achenie & Hongliang Xin

Contributions

L.E.K.A. and H.X. supervised the research. S.-H.W., H.S.P., S.W., and H.X. conceived the idea and designed the general approach. S.-H.W., H.S.P., and S.W. conducted DFT calculations and coding. S.-H.W. and H.S.P. performed a detailed analysis. All authors revised the manuscript.

Corresponding author

Correspondence to Hongliang Xin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Bin Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wang, SH., Pillai, H.S., Wang, S. et al. Infusing theory into deep learning for interpretable reactivity prediction. Nat Commun 12, 5288 (2021). https://doi.org/10.1038/s41467-021-25639-8

Received: 09 June 2021
Accepted: 20 August 2021
Published: 06 September 2021
DOI: https://doi.org/10.1038/s41467-021-25639-8
