Artificial intelligence system for identification of false-negative interpretations in chest radiographs

Hwang, Eui Jin; Park, Jongsoo; Hong, Wonju; Lee, Hyun-Ju; Choi, Hyewon; Kim, Hyungjin; Nam, Ju Gang; Goo, Jin Mo; Yoon, Soon Ho; Lee, Chang Hyun; Park, Chang Min

doi:10.1007/s00330-022-08593-x

Artificial intelligence system for identification of false-negative interpretations in chest radiographs

Chest
Published: 23 February 2022

Volume 32, pages 4468–4478, (2022)
Cite this article

European Radiology Aims and scope Submit manuscript

Eui Jin Hwang^1,2,
Jongsoo Park¹,
Wonju Hong¹,
Hyun-Ju Lee^1,2,
Hyewon Choi^1,3,
Hyungjin Kim^1,2,
Ju Gang Nam^1,2,
Jin Mo Goo^1,2,
Soon Ho Yoon^1,2,
Chang Hyun Lee^1,2 &
…
Chang Min Park ORCID: orcid.org/0000-0003-1884-3738^1,2,4

808 Accesses
7 Citations
Explore all metrics

Abstract

Objectives

To investigate the efficacy of an artificial intelligence (AI) system for the identification of false negatives in chest radiographs that were interpreted as normal by radiologists.

Methods

We consecutively collected chest radiographs that were read as normal during 1 month (March 2020) in a single institution. A commercialized AI system was retrospectively applied to these radiographs. Radiographs with abnormal AI results were then re-interpreted by the radiologist who initially read the radiograph (“AI as the advisor” scenario). The reference standards for the true presence of relevant abnormalities in radiographs were defined by majority voting of three thoracic radiologists. The efficacy of the AI system was evaluated by detection yield (proportion of true-positive identification among the entire examination) and false-referral rate (FRR, proportion of false-positive identification among all examinations). Decision curve analyses were performed to evaluate the net benefits of applying the AI system.

Results

A total of 4208 radiographs from 3778 patients (M:F = 1542:2236; median age, 56 years) were included. The AI system identified initially overlooked relevant abnormalities with a detection yield and an FRR of 2.4% and 14.0%, respectively. In the “AI as the advisor” scenario, radiologists detected initially overlooked relevant abnormalities with a detection yield and FRR of 1.2% and 0.97%, respectively. In a decision curve analysis, AI as an advisor scenario exhibited a positive net benefit when the cost-to-benefit ratio was below 1:0.8.

Conclusion

An AI system could identify relevant abnormalities overlooked by radiologists and could enable radiologists to correct their false-negative interpretations by providing feedback to radiologists.

Key Points

• In consecutive chest radiographs with normal interpretations, an artificial intelligence system could identify relevant abnormalities that were initially overlooked by radiologists.

• The artificial intelligence system could enable radiologists to correct their initial false-negative interpretations by providing feedback to radiologists when overlooked abnormalities were present.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Diagnostic effect of artificial intelligence solution for referable thoracic abnormalities on chest radiography: a multicenter respiratory outpatient diagnostic cohort study

Article Open access 01 January 2022

Multicentre external validation of a commercial artificial intelligence software to analyse chest radiographs in health screening environments with low disease prevalence

Article 10 January 2023

Diagnostic performance of artificial intelligence approved for adults for the interpretation of pediatric chest radiographs

Article Open access 17 June 2022

Abbreviations

AI:: Artificial intelligence
FRR:: False-referral rate
PPV:: Positive predictive value

References

Expert Panel on Thoracic Imaging, Jokerst C, Chung JH et al (2018) ACR Appropriateness criteria((R)) acute respiratory illness in immunocompetent patients. J Am Coll Radiol 15:S240–S251
Article Google Scholar
Expert Panel on Thoracic Imaging, Lee C, Colletti PM et al (2019) ACR appropriateness criteria(R) acute respiratory illness in immunocompromised patients. J Am Coll Radiol 16:S331–S339
Article Google Scholar
Expert Panel on Thoracic Imaging, McComb BL, Ravenel JG et al (2018) ACR Appropriateness criteria((R)) chronic dyspnea-noncardiovascular origin. J Am Coll Radiol 15:S291–S301
Article Google Scholar
Expert Panel on Thoracic Imaging, Olsen KM, Manouchehr-Pour S et al (2020) ACR appropriateness criteria(R) hemoptysis. J Am Coll Radiol 17:S148–S159
Article Google Scholar
Expert Panel on Thoracic Imaging, Ravenel JG, Chung JH et al (2017) ACR Appropriateness criteria((R)) imaging of possible tuberculosis. J Am Coll Radiol 14:S160–S165
Article Google Scholar
Berlin L (2014) Radiologic errors, past, present and future. Diagnosis (Berl) 1:79–84
Article Google Scholar
Donald JJ, Barnard SA (2012) Common patterns in 558 diagnostic radiology errors. J Med Imaging Radiat Oncol 56:173–178
Article Google Scholar
Miyashita N, Kawai Y, Tanaka T et al (2015) Detection failure rate of chest radiography for the identification of nursing and healthcare-associated pneumonia. J Infect Chemother 21:492–496
Article Google Scholar
Hwang EJ, Park CM (2020) Clinical implementation of deep learning in thoracic radiology: potential applications and challenges. Korean J Radiol 21:511–525
Article Google Scholar
Nagendran M, Chen Y, Lovejoy CA et al (2020) Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ 368:m689
Article Google Scholar
Nam JG, Park S, Hwang EJ et al (2019) Development and validation of deep learning-based automatic detection algorithm for malignant pulmonary nodules on chest radiographs. Radiology 290:218–228
Article Google Scholar
Hwang EJ, Park S, Jin KN et al (2019) Development and validation of a deep learning-based automated detection algorithm for major thoracic diseases on chest radiographs. JAMA Netw Open 2:e191095
Article Google Scholar
Hwang EJ, Park S, Jin KN et al (2019) Development and validation of a deep learning-based automatic detection algorithm for active pulmonary tuberculosis on chest radiographs. Clin Infect Dis 69:739–747
Article Google Scholar
Murphy K, Smits H, Knoops AJG et al (2020) COVID-19 on chest radiographs: a multireader evaluation of an artificial intelligence system. Radiology 296:E166–E172
Article Google Scholar
Sim Y, Chung MJ, Kotter E et al (2020) Deep convolutional neural network-based software improves radiologist detection of malignant lung nodules on chest radiographs. Radiology 294:199–209
Article Google Scholar
Sung J, Park S, Lee SM et al (2021) Added value of deep learning-based detection system for multiple major findings on chest radiographs: a randomized crossover study. Radiology 299:450–459
Article Google Scholar
Itri JN, Tappouni RR, McEachern RO, Pesch AJ, Patel SH (2018) Fundamentals of diagnostic error in imaging. Radiographics 38:1845–1865
Article Google Scholar
Hwang EJ, Hong JH, Lee KH et al (2020) Deep learning algorithm for surveillance of pneumothorax after lung biopsy: a multicenter diagnostic cohort study. Eur Radiol 30:3660–3671
Article Google Scholar
Hwang EJ, Nam JG, Lim WH et al (2019) Deep learning for chest radiograph diagnosis in the emergency department. Radiology 293:573–580
Article Google Scholar
Nam JG, Kim M, Park J et al (2021) Development and validation of a deep learning algorithm detecting 10 common abnormalities on chest radiographs. Eur Respir J 57:2003061
Article Google Scholar
Jang S, Song H, Shin YJ et al (2020) Deep Learning-based automatic detection algorithm for reducing overlooked lung cancers on chest radiographs. Radiology 296:652–661
Article Google Scholar
Nam JG, Hwang EJ, Kim DS et al (2020) Undetected lung cancer at posteroanterior chest radiography: potential role of a deep learning-based detection algorithm. Radiol Cardiothorac Imaging 2:e190222
Article Google Scholar
Hwang EJ, Lee JS, Lee JH et al (2021) Deep learning for detection of pulmonary metastasis on chest radiographs. Radiology. https://doi.org/10.1148/radiol.2021210578:210578
Degnan AJ, Ghobadi EH, Hardy P et al (2019) Perceptual and interpretive error in diagnostic radiology-causes and potential solutions. Acad Radiol 26:833–845
Article Google Scholar
Waite S, Scott J, Gale B, Fuchs T, Kolla S, Reede D (2017) Interpretive error in radiology. AJR Am J Roentgenol 208:739–749
Article Google Scholar
Bruno MA, Walker EA, Abujudeh HH (2015) Understanding and confronting our mistakes: the epidemiology of error in radiology and strategies for error reduction. Radiographics 35:1668–1676
Article Google Scholar
Van Calster B, Wynants L, Verbeek JFM et al (2018) Reporting and interpreting decision curve analysis: a guide for investigators. Eur Urol 74:796–804
Article Google Scholar
Fitzgerald M, Saville BR, Lewis RJ (2015) Decision curve analysis. JAMA 313:409–410
Article CAS Google Scholar
Hwang EJ, Kim H, Lee JH, Goo JM, Park CM (2020) Automated identification of chest radiographs with referable abnormality with deep learning: need for recalibration. Eur Radiol 30:6902–6912
Article Google Scholar

Download references

Acknowledgements

The present study was supported by the Seoul National University Hospital Research Fund (grant number: 03-2021-0270).

Funding

This study has received funding by from the Seoul National University Hospital Research Fund (grant number: 03-2021-0270).

Author information

Authors and Affiliations

Department of Radiology, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul, 03080, Korea
Eui Jin Hwang, Jongsoo Park, Wonju Hong, Hyun-Ju Lee, Hyewon Choi, Hyungjin Kim, Ju Gang Nam, Jin Mo Goo, Soon Ho Yoon, Chang Hyun Lee & Chang Min Park
Department of Radiology, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Korea
Eui Jin Hwang, Hyun-Ju Lee, Hyungjin Kim, Ju Gang Nam, Jin Mo Goo, Soon Ho Yoon, Chang Hyun Lee & Chang Min Park
Department of Radiology, Chung-Ang University Hospital, 102 Heukseok-ro, Dongjak-gu, Seoul, 06973, Korea
Hyewon Choi
Institute of Radiation Medicine, Seoul National University College of Medicine, 101 Daehak-ro, Jongno-gu, Seoul, 03080, Korea
Chang Min Park

Authors

Eui Jin Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Jongsoo Park
View author publications
You can also search for this author in PubMed Google Scholar
Wonju Hong
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-Ju Lee
View author publications
You can also search for this author in PubMed Google Scholar
Hyewon Choi
View author publications
You can also search for this author in PubMed Google Scholar
Hyungjin Kim
View author publications
You can also search for this author in PubMed Google Scholar
Ju Gang Nam
View author publications
You can also search for this author in PubMed Google Scholar
Jin Mo Goo
View author publications
You can also search for this author in PubMed Google Scholar
Soon Ho Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Chang Hyun Lee
View author publications
You can also search for this author in PubMed Google Scholar
Chang Min Park
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chang Min Park.

Ethics declarations

Guarantor

The scientific guarantor of this publication is Chang Min Park.

Conflict of interest

Eui Jin Hwang received a research grant from Lunit Inc. outside the present study. Hyungjin Kim received a research grant from Lunit Inc. outside the present study and holds stock of MedicalIP. Soon Ho Yoon works in MedicalIP as an unpaid chief medical officer and holds stock options of Medical IP. Ju Gang Nam received a research grant from Vuno, outside the present study. Chang Min Park received a research grant from Lunit Inc. outside the present study and holds stock of Promedius and stock options of Lunit Inc. and Coreline Soft.

Statistics and biometry

No complex statistical methods were necessary for this paper.

Informed consent

Written informed consent was waived by the Institutional Review Board.

Ethical approval

Institutional Review Board approval was obtained.

Methodology

• retrospective

• diagnostic or prognostic study

• performed at one institution

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hwang, E.J., Park, J., Hong, W. et al. Artificial intelligence system for identification of false-negative interpretations in chest radiographs. Eur Radiol 32, 4468–4478 (2022). https://doi.org/10.1007/s00330-022-08593-x

Download citation

Received: 24 November 2021
Revised: 04 January 2022
Accepted: 25 January 2022
Published: 23 February 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s00330-022-08593-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial intelligence system for identification of false-negative interpretations in chest radiographs