Issue 33, 2023

Deep learning of electrochemical CO2 conversion literature reveals research trends and directions

Abstract

Large-scale, openly available materials science databases are mainly composed of computer simulation results rather than experimental data. Examples include the Materials Project, the Open Quantum Materials Database, and Open Catalyst 2022. Building large-scale experimental databases remains challenging because locally distributed datasets are difficult to consolidate. In this work, focusing on the catalysis literature of CO2 reduction reactions (CO2RRs), we present a machine learning (ML)-based protocol for selecting highly relevant papers and extracting important experimental data. First, we report a document embedding method (Doc2Vec) for collecting the papers most relevant to the target domain, which yielded 3154 CO2RR-related papers from six publishers. Next, we developed named entity recognition (NER) models to extract twelve entities related to material names (catalyst, electrolyte, etc.) and catalytic performance (Faradaic efficiency, current density, etc.). Among the models tested, the MatBERT-based approach achieved the highest accuracy, with an average F1-score of 90.4% under standard evaluation and 95.2% under a boundary relaxation evaluation scheme. Accurate and accelerated NER-based data extraction from a large volume of catalysis literature enables temporal trend analyses of CO2RR catalysts, products, and performances, revealing the potentially effective material space in CO2RRs. Although this work demonstrates our ML-based text mining methods specifically on the CO2RR literature, the approach is applicable to other catalytic chemical reactions and may be used to accelerate their development.
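The difference between the two F1-scores quoted above can be illustrated with a small sketch. The abstract does not define the boundary relaxation scheme, so the code below assumes one common reading: a predicted entity counts as correct if it has the same label as a gold entity and overlaps it by at least one token, whereas the strict score requires identical boundaries. The BIO tagging format, the labels `CAT` and `FE`, and the function names are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# (start token index, end token index exclusive, entity label)
Span = Tuple[int, int, str]


def extract_spans(tags: List[str]) -> List[Span]:
    """Collect entity spans from a BIO tag sequence."""
    spans: List[Span] = []
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # trailing "O" closes the last span
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.append((start, i, label))
            start, label = None, None
        if tag.startswith("B-") or tag.startswith("I-"):
            start, label = i, tag[2:]
    return spans


def overlaps(a: Span, b: Span) -> bool:
    """True if two spans share a label and at least one token."""
    return a[2] == b[2] and a[0] < b[1] and b[0] < a[1]


def f1_score(gold: List[Span], pred: List[Span], relaxed: bool = False) -> float:
    """Entity-level F1; relaxed=True accepts partial boundary matches (assumed scheme)."""
    if relaxed:
        hit_pred = sum(any(overlaps(p, g) for g in gold) for p in pred)
        hit_gold = sum(any(overlaps(g, p) for p in pred) for g in gold)
    else:
        hit_pred = hit_gold = len(set(gold) & set(pred))
    precision = hit_pred / len(pred) if pred else 0.0
    recall = hit_gold / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy sentence with a two-token catalyst and a two-token Faradaic efficiency value.
gold_tags = ["B-CAT", "I-CAT", "O", "B-FE", "I-FE", "O", "O"]
# The prediction misses the first token of the catalyst name.
pred_tags = ["O", "B-CAT", "O", "B-FE", "I-FE", "O", "O"]

gold_spans = extract_spans(gold_tags)
pred_spans = extract_spans(pred_tags)

strict_f1 = f1_score(gold_spans, pred_spans)
relaxed_f1 = f1_score(gold_spans, pred_spans, relaxed=True)
print(f"strict F1 = {strict_f1:.2f}, relaxed F1 = {relaxed_f1:.2f}")
```

In this toy case the truncated catalyst span is a miss under strict matching but a hit under the relaxed rule, which is why a relaxed score is never lower than the strict one for the same predictions.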


  • This article is part of the themed collection: #MyFirstJMCA


Article information

Article type: Paper
Submitted: 11 May 2023
Accepted: 18 Jul 2023
First published: 19 Jul 2023

J. Mater. Chem. A, 2023, 11, 17628-17643


J. Choi, K. Bang, S. Jang, J. Choi, J. Ordonez, D. Buttler, A. Hiszpanski, T. Yong-Jin Han, S. S. Sohn, B. Lee, K. Lee, S. S. Han and D. Kim, J. Mater. Chem. A, 2023, 11, 17628, DOI: 10.1039/D3TA02780E

