Issue 44, 2021

A machine learning approach using frequency descriptor for molecular property predictions

Abstract

Machine learning algorithms have proved effective in predicting the properties of molecules and materials. Recently, a new strategy, Δ-machine learning, which uses low-level calculations as a baseline for predicting properties at high levels of theory, has been proposed to further reduce computational costs. It has been successfully applied to predictions of potential energy surfaces, bandgaps and chemical shieldings. Here we introduce a new descriptor, the frequency descriptor (FD), which uses harmonic vibrational frequencies to predict molecular properties. Specifically, we used harmonic vibrational frequencies computed with several semi-empirical methods (PM6, PM7 and GFN2-xTB) as the descriptor in Δ-machine learning. The energies, enthalpies and HOMO–LUMO gaps of 6095 C7H10O2 isomers from high-level calculations were used as target properties to test the descriptor. We found that the FD generated by the GFN2-xTB method performs particularly well among the semi-empirical methods tested. Chemical accuracy can be achieved with a small training set when the FD is combined with single-point calculations at density functional theory levels. In addition, we included infrared intensities in the FD to give the FD-II, with which chemical accuracy for energies can be achieved with a small training set (3%), which represents the smallest sample size for the current dataset (C7H10O2 isomers). We expect that the FD and FD-II can also be used to accelerate other property predictions.
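The Δ-machine learning idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the regression model, the ordering of the frequencies or how descriptors of different length are handled, so everything below is an assumption made for illustration: frequencies are sorted and zero-padded, the correction is learned with kernel ridge regression using a Laplacian kernel, and the function names, hyperparameters and toy numbers are hypothetical rather than taken from the paper.

```python
import math

def frequency_descriptor(frequencies, length):
    """Sort harmonic frequencies (cm^-1) and zero-pad to a fixed length.
    Sorting and padding are illustrative assumptions, not the paper's recipe."""
    freqs = [float(f) for f in sorted(frequencies)]
    if len(freqs) > length:
        raise ValueError("descriptor length is shorter than the number of modes")
    return freqs + [0.0] * (length - len(freqs))

def laplacian_kernel(x, y, gamma):
    """k(x, y) = exp(-gamma * ||x - y||_1)."""
    return math.exp(-gamma * sum(abs(a - b) for a, b in zip(x, y)))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

class DeltaKRR:
    """Kernel ridge regression on the difference (high level - low level)."""

    def __init__(self, gamma=1e-3, reg=1e-8):
        self.gamma = gamma  # kernel width (hypothetical value)
        self.reg = reg      # ridge regularisation (hypothetical value)

    def fit(self, X, low, high):
        self.X = X
        deltas = [h - l for h, l in zip(high, low)]  # the Δ targets
        K = [[laplacian_kernel(a, b, self.gamma) + (self.reg if i == j else 0.0)
              for j, b in enumerate(X)] for i, a in enumerate(X)]
        self.alpha = solve(K, deltas)
        return self

    def predict(self, x, low_value):
        """High-level estimate = low-level baseline + learned correction."""
        correction = sum(a * laplacian_kernel(x, xi, self.gamma)
                         for a, xi in zip(self.alpha, self.X))
        return low_value + correction

# Toy, made-up numbers (not data from the paper): three "molecules" with
# low-level frequencies, a low-level property and a high-level property.
X_train = [frequency_descriptor(f, 4) for f in
           ([500.0, 1000.0, 3000.0], [2900.0, 600.0, 1200.0], [400.0, 1500.0, 3100.0])]
low_train = [-10.0, -12.0, -9.5]
high_train = [-11.2, -12.8, -11.0]

model = DeltaKRR().fit(X_train, low_train, high_train)
x_new = frequency_descriptor([550.0, 1100.0, 2950.0], 4)
print(model.predict(x_new, low_value=-11.0))
```

In this sketch the model only has to learn the (usually smoother) difference between the two levels of theory, which is the reason Δ-learning can work with small training sets. An FD-II-style variant would append the infrared intensities to the same vector, again under the assumption that simple concatenation is acceptable.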

Article information

Article type: Paper
Submitted: 04 Oct 2021
Accepted: 20 Oct 2021
First published: 20 Oct 2021

New J. Chem., 2021, 45, 20672-20680

J. Chen, W. Xu and R. Zhang, New J. Chem., 2021, 45, 20672, DOI: 10.1039/D1NJ04739F
