Reliability of molecular imaging diagnostics

Lalumera, Elisabetta; Fanti, Stefano; Boniolo, Giovanni

doi:10.1007/s11229-019-02419-y

Reliability of molecular imaging diagnostics

S.I.: Reliability
Published: 09 October 2019

Volume 198, pages 5701–5717, (2021)
Cite this article

Synthese Aims and scope Submit manuscript

354 Accesses
6 Citations
1 Altmetric
Explore all metrics

Abstract

Advanced medical imaging, such as CT, fMRI and PET, has undergone enormous progress in recent years, both in accuracy and utilization. Such techniques often bring with them an illusion of immediacy, the idea that the body and its diseases can be directly inspected. In this paper we target this illusion and address the issue of the reliability of advanced imaging tests as knowledge procedures, taking positron emission tomography (PET) in oncology as paradigmatic case study. After individuating a suitable notion of reliability, we argue that (1) PET is a highly theory-laden and non-immediate knowledge procedure, in spite of the photographic-like quality of the images it delivers; (2) the diagnostic conclusions based on the interpretation of PET images are population-dependent; (3) PET images require interpretation, which is inherently observer-dependent and therefore variable. We conclude with a three-step methodological proposal for enhancing the reliability of advanced medical imaging.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Quality and consistency of clinical practice guideline recommendations for PET/CT and PET: a systematic appraisal

Article 14 June 2023

FDG PET/CT in cancer: comparison of actual use with literature-based recommendations

Article Open access 30 October 2015

Standardization of Imaging Biomarkers: The FDG PET/CT Example

Notes

“Precision medicine” and “personalised medicine” are often used as synonyms; though with “precision medicine”, the stress is on targeting a specific disease or malfunction with treatments and tests, rather than a larger category of similar diseases (i.e. triple negative breast cancer versus breast cancer, see Wu et al. 2018), whereas “personalised medicine” refers to the consideration of patient-specific factors in diagnosis and treatment (Desmond-Hellmann et al. 2011; National Institute of Health 2018). We acknowledge this distinction though it is not essential to the argument of this paper, (see https://ghr.nlm.nih.gov/ Precision Medicine; see also https://www.nih.gov/research-training/allofus-research-program).
A notable exception is Megan Delehanty’s PhD’s dissertation on the epistemic credentials, and especially on the reliability, of PET images in clinical oncology. Delehanty focuses on the question of whether and how PET as a data-generating process produces reliable knowledge, while we enlarge the picture and consider the reliability of the technology together with the way it is usually employed within the medical community. In broadening our scope with such socio-epistemological question, we think our work completes Delehanty’s excellent analysis.
As we will see in a while, FDG stands for fludeoxyglucose, that is, the usual radiotracer used for PET neuroimaging and cancer patient management.
The non-immediacy of medical imaging in the philosophy of neuroscience has been studied extensively, see Bogen (2001).
For an introduction to the differences between PET and CT, see for example RSNA 2019. For a philosophical illustration of CT and “seeing styles”, see Friedrich (2010). We thank one of the reviewers for pressing us on this point.
See the position and the terminology suggested by the National Institute of Standards and Technology (https://www.nist.gov/pml/nist-technical-note-1297).
We believe that the distinction between reliability of process (first sense, epistemology), reliability of data (second sense, philosophy of science) and reliability as repeatability (third sense, scientific methodology) can be useful in the paper because different readers can be more familiar with one or the other of the three senses. We thank one of the reviewers for pressing us on this point.
Here we use personalised medicine since we are in the situation indicated as such by National Research Council in footnote 2.
Alongside campaigns promoted by scientific societies and institutional and private healthcare providers, there is a philosophical debate on overutilisation and medical futility, addressing both the definition of the phenomenon and ethical consequences. See Hofmann (2010).
The psychological allure of images is of course not unique to molecular imaging tests, of course, but is common to all medical imaging diagnostic tests (such as CT for example, see footnote 4 above). What is specific to molecular imaging, we believe, is that immediacy of images is additionally difficult to defend.
Nonetheless, medical images as numbers can be more easily read by software, and this is the basis of Radiomics. See Gillies et al. (2015).
As known, under this tradition in the philosophy of science, there was the hidden figure of Kant and of a form of neo- or post-Kantism, see Boniolo (2007).
The competence and expertise of an expert reader of medical images are a research field in itself, and it is especially debated now that artificial readers become increasingly available. See Krupinski (2010), Samei and Krupinski (2010) and Shiraishi et al. (2011).
To put it very simply, the interquartile range of a data set is where is a measure of where the bulk of the values lie.
The use of consensus conferences and Delphi procedures is widespread in the social and life sciences, whenever evidence underdetermines the answer to a given scientific or policy question, and experts disagree. For example, they are often employed in psychiatry, in order to decide whether a certain condition is to be considered a disease or not, and they were used in astronomy to assess the status of Pluto as a planet. In philosophy, their epistemic pedigree has been analysed by philosophers like Miriam Solomon and Jakob Stegenga. In advanced diagnostic imaging, they are often utilized in order to write and publish guidelines for the appropriate use of tests (see e.g. In philosophy of science, Miriam Solomon has studied the role of consensus conferences and Delphi studies in the making of medical knowledge (Solomon 2007, 2015).

References

Alavi, A., & Reivich, M. (2002). Guest editorial: The conception of FDG-PET imaging. Seminars in Nuclear Medicine, 32(1), 2–5.
Article Google Scholar
Ashcroft, R. (2004). Current epistemological problems in evidence-based medicine. Journal of Medical Ethics, 30, 131–135.
Article Google Scholar
Biggi, A., Gallamini, A., Chauvie, S., Hutchings, M., Kostakoglu, L., Gregianin, M., Meignan, M., Malkowski, B., Hofman, M. S., & Barrington, S. F. (2013). International validation study for interim PET in ABVD-treated, advanced-stage Hodgkin lymphoma: Interpretation criteria and concordance rate among reviewers. Journal of Nuclear Medicine, 54(5), 683–690.
Article Google Scholar
Boellaard, R., et al. (2015). FDG PET/CT: EANM procedure guidelines for tumour imaging: Version 2.0. European Journal of Nuclear Medicine and Molecular Imaging, 42(2), 328–354.
Article Google Scholar
Bogen, J. (2001). Functional imaging evidence: Some epistemic hot spots. In P. K. Machamer, R. Grush, & P. McLaughlin (Eds.), Theory and method in the neurosciences (pp. 173–199). Pittsburgh: University of Pittsburgh Press.
Google Scholar
Bogen, J. (2008). Experiment and observation. In P. Machamer & M. Silberstein (Eds.), The Blackwell guide to the philosophy of science (Vol. 19, pp. 128–148). Oxford: Blackwell.
Chapter Google Scholar
Boniolo, G. (2007). On scientific representation: From Kant to a new philosophy of science. Houndmills: Palgrave Macmillan.
Book Google Scholar
Boniolo, G., & Sanchini, V. (Eds.). (2016). Ethical counselling and medical decision-making in the era of personalized medicine. Heidelberg: Springer.
Google Scholar
Brown, J. (1979). Perception, theory and commitment: New philosophy of science. Chicago: University of Chicago Press.
Google Scholar
Burggraaff, C. N., Cornelisse, A. C., Hoekstra, O. S., Lugtenburg, P. J., De Keizer, B., Arens, A. I., et al. (2018). Interobserver agreement of interim and end-of-treatment 18F-FDG PET/CT in diffuse large B-cell lymphoma (DLBCL): Impact on clinical practice and trials. Journal of Nuclear Medicine, 59(12), 1831–1836.
Article Google Scholar
Daston, L., & Galison, P. (1992). The image of objectivity. Representations, 40, 81–128.
Article Google Scholar
Delehanty, M. (2010). Why images? Medicine Studies, 2(3), 161–173.
Article Google Scholar
Delehanty, M. C. (2005). Empiricism and the epistemic status of imaging technologies. Doctoral dissertation, University of Pittsburgh.
Desmond-Hellmann, S., Sawyers, C. L., Cox, D. R., Fraser-Liggett, C., Galli, S. J., Goldstein, D. B., et al. (2011). Toward precision medicine: Building a knowledge network for biomedical research and a new taxonomy of disease. Washington, DC: National Academy of Sciences.
Google Scholar
Eurostat. (2017). https://ec.europa.eu/eurostat/statistics-explained/index.php?title=File:Use_of_imaging_equipment_—_number_of_PET_scans,_2010_and_2015_(per_100_000_inhabitants)_HLTH17.png. Accessed October 18, 2018.
Ferretti, G., Linkeviciute, A., & Boniolo, G. (2017). Comprehending and communicating statistics in breast cancer screening. Ethical implications and potential solutions. In M. Gadebusch-Bondio, F. Spöring, & J.-S. Gordon (Eds.), Medical ethics, prediction and prognosis: Interdipliplinary perspectives (pp. 30–41). New York: Routledge.
Chapter Google Scholar
Fischer, B., Lassen, U., Mortensen, J., Larsen, S., Loft, A., Bertelsen, A., et al. (2009). Preoperative staging of lung cancer with combined PET–CT. New England Journal of Medicine, 361(1), 32–39.
Article Google Scholar
Friedrich, K. (2010). ‘Sehkollektiv’: Sight Styles in Diagnostic Computed Tomography. Medicine Studies, 2(3), 185–195.
Article Google Scholar
Gandhi, S., Mosleh, W., Shen, J., & Chow, C. M. (2018). Automation, machine learning, and artificial intelligence in echocardiography: A brave new world. Echocardiography, 35(9), 1402–1418.
Article Google Scholar
Gillies, R. J., Kinahan, P. E., & Hricak, H. (2015). Radiomics: Images are more than pictures, they are data. Radiology, 278(2), 563–577.
Article Google Scholar
Goldman, A. (1979). What is justified belief? In G. S. Pappas (Ed.), Justification and knowledge (pp. 1–25). Dordrecht: Reidel. Reprinted in A. I. Goldman (Ed.), Reliabilism and contemporary epistemology (pp. 29–49). New York: Oxford University Press, 2012.
Goldman, A., & Beddor, B. (2016). Reliabilist epistemology. In E. N. Zalta (Ed.), The Stanford encyclopedia of philosophy (Winter 2016 Edition). https://plato.stanford.edu/archives/win2016/entries/reliabilism/. Accessed October 20, 2018.
Gonzalez, S., Guedj, E., Fanti, S., Lalumera, E., Le Coz, P., & Taïeb, D. (2018). Delivering PET imaging results to cancer patients: Steps for handling ethical issues. European Journal of Nuclear Medicine and Molecular Imaging, 45(12), 2240–2241.
Article Google Scholar
Gould, M. K., Kuschner, W. G., Rydzak, C. E., Maclean, C. C., Demas, A. N., Shigemitsu, H., et al. (2003). Test performance of positron emission tomography and computed tomography for mediastinal staging in patients with non-small-cell lung cancer: A meta-analysis. Annals of Internal Medicine, 139(11), 879–892.
Article Google Scholar
Han, P. K., Klabunde, C. N., Noone, A. M., Earle, C. C., Ayanian, J. Z., Ganz, P. A., et al. (2013). Physicians’ beliefs about breast cancer surveillance testing are consistent with test overuse. Medical Care, 51(4), 315.
Article Google Scholar
Hanson, N. R. (1958). Observation. In N. R. Hanson (Ed.), Patterns of discovery: An inquiry into the conceptual foundations of science (pp. 4–30). Cambridge: Cambridge University Press.
Google Scholar
Hanson, N. R. (2001). Seeing and seeing as. In Y. Balashov & A. Rosenberg (Eds.), Philosophy of science: Contemporary readings (pp. 321–339). London: Routledge. Originally published in N. R. Hanson (Ed.), Perception and discovery: An introduction to scientific inquiry (pp. 91–110). San Francisco: Freeman, 1969.
Hendee, W. R., Becker, G. J., Borgstede, J. P., Bosma, J., Casarella, W. J., Erickson, B. A., et al. (2010). Addressing overutilization in medical imaging. Radiology, 257(1), 240–245.
Article Google Scholar
Hicks, R. J., Kalff, V., MacManus, M. P., Ware, R. E., Hogg, A., McKenzie, A. F., et al. (2001). 18F-FDG PET provides high-impact and powerful prognostic stratification in staging newly diagnosed non-small cell lung cancer. Journal of Nuclear Medicine, 42(11), 1596–1604.
Google Scholar
Hofman, M. S., & Hicks, R. J. (2016). How we read oncologic FDG PET/CT. Cancer Imaging, 16(1), 35.
Article Google Scholar
Hofmann, B. (2010). Too much of a good thing is wonderful? A conceptual analysis of excessive examinations and diagnostic futility in diagnostic radiology. Medicine, Health Care and Philosophy, 13(2), 139–148.
Article Google Scholar
Hosny, A., Parmar, C., Coroller, T. P., Grossmann, P., Zeleznik, R., Kumar, A., et al. (2018). Deep learning for lung cancer prognostication: A retrospective multi-cohort radiomics study. PLoS Medicine, 15(11), e1002711.
Article Google Scholar
Hunink, M. M., & Krestin, G. P. (2002). Study design for concurrent development, assessment, and implementation of new diagnostic imaging technology. Radiology, 222(3), 604–614.
Article Google Scholar
Jarvik, J. G. (2002). Study design for the new millennium: Changing how we perform research and practice medicine. Radiology, 222(3), 593–594.
Article Google Scholar
Joyce, K. A. (2008). Magnetic appeal: MRI and the myth of transparency. Ithaca: Cornell University Press.
Google Scholar
Kilani, R. K., Paxton, B. E., Stinnett, S. S., Barnhart, H. X., Bindal, V., & Lungren, M. P. (2011). Self-referral in medical imaging: A meta-analysis of the literature. Journal of the American College of Radiology, 8(7), 469–476.
Article Google Scholar
Kingma, E. (2007). What is it to be healthy? Analysis, 67, 128–133.
Article Google Scholar
Krupinski, E. A. (2010). Current perspectives in medical image perception. Attention, Perception, & Psychophysics, 72(5), 1205–1217.
Article Google Scholar
Kuhn, T. S. (1990). The road since structure. In PSA: Proceedings of the Biennial meeting of the philosophy of science association (Vol. 1990, pp. 3–13). Chicago: Philosophy of Science Association.
Lalumera, E., & Fanti, S. (2017). Randomized controlled trials for diagnostic imaging: Conceptual and practical problems. Topoi, 38(2), 395–400.
Article Google Scholar
Leplin, J. (2007). In defense of reliabilism. Philosophical Studies, 134(1), 31–42.
Article Google Scholar
Losee, A. (2001). Historical introduction to the philosophy of science. Oxford: Oxford University Press.
Google Scholar
Lysdahl, K. B., & Hofmann, B. M. (2009). What causes increasing and unnecessary use of radiological investigations? A survey of radiologists’ perceptions. BMC Health Services Research, 9(1), 155.
Article Google Scholar
Mitchell, J. M. (2008). Utilization trends for advanced imaging procedures: Evidence from individuals with private insurance coverage in California. Medical Care, 46(5), 460–466.
Article Google Scholar
National Institute of Health. (2018). What is precision medicine? https://ghr.nlm.nih.gov/primer/precisionmedicine/definition. Accessed September 30, 2018.
National Research Council (US) and Institute of Medicine (US) Committee on State of the Science of Nuclear Medicine. (2007). Advancing nuclear medicine through innovation. Washington, DC: National Academic Press.
Google Scholar
Oldroyd, D. (1986). The aArch of knowledge: An introductory study of the history of the philosophy and methodology of science. London: Routledge Kegan & Paul.
Google Scholar
Peters, M. D. J., Godfrey, C. M., McInerney, P., et al. (2015). Methodology for JBI scoping reviews. The Joanna Briggs Institute reviewers’ manual 2015. Adelaide: The Joanna Briggs Institute.
Google Scholar
Samei, E., & Krupinski, E. (Eds.). (2010). The handbook of medical image perception and techniques. Cambridge: Cambridge University Press.
Google Scholar
Shiraishi, J., Li, Q., Appelbaum, D., & Doi, K. (2011). Computer-aided diagnosis and artificial intelligence in clinical imaging. Seminars in Nuclear Medicine, 41(6), 449–462.
Article Google Scholar
Solomon, M. (2007). The social epistemology of NIH consensus conferences. In M. Solomon (Ed.), Establishing medical reality: Essays in the metaphysics and epistemology of biomedical science (pp. 167–177). Dordrecht: Springer.
Chapter Google Scholar
Solomon, M. (2015). Making medical knowledge. Oxford: Oxford University Press.
Book Google Scholar
Stegenga, J. (2018). Medical nihilism. Oxford: Oxford University Press.
Book Google Scholar
Stegenga, J., et al. (2016). New directions in philosophy of medicine. The Bloomsbury Companion to Contemporary Philosophy of Medicine, 343, 23.
Google Scholar
Taylor, P. M. (2007). A review of research into the development of radiologic expertise: Implications for computer-based training. Academic Radiology, 14(10), 1252–1263.
Article Google Scholar
Van Dijck, J. (2011). The transparent body: A cultural analysis of medical imaging. Seattle: University of Washington Press.
Google Scholar
van Westreenen, H. L., Heeren, P. A., Jager, P. L., van Dullemen, H. M., Groen, H., & Plukker, J. T. M. (2003). Pitfalls of positive findings in staging esophageal cancer with F-18-fluorodeoxyglucose positron emission tomography. Annals of Surgical Oncology, 10(9), 1100–1105.
Article Google Scholar
Warburg, O. (1956). On the origin of cancer cells. Science, 123(3191), 309–314.
Article Google Scholar
Waterstram-Rich, K. M., & Gilmore, D. (2016). Nuclear medicine and PET/CT E-Book: Technology and techniques. St. Louis, MO: Elsevier Health Sciences.
Google Scholar
Woodward, J. (2000). Data, phenomena, and reliability. Philosophy of Science, 67(3), S163–S179.
Article Google Scholar
Wu, N., Zhang, J., Zhao, J., Mu, K., Zhang, J., Jin, Z., et al. (2018). Precision medicine based on tumorigenic signaling pathways for triple–negative breast cancer. Oncology Letters, 16(4), 4984–4996.
Google Scholar
Yasunaga, H. (2008). Willingness to pay for mass screening for prostate cancer: A contingent valuation survey. International Journal of Urology, 15(1), 102–105.
Article Google Scholar
Yasunaga, H., Ide, H., Imamura, T., & Ohe, K. (2006). The measurement of willingness to pay for mass cancer screening with whole-body PET (positron emission tomography). Annals of Nuclear Medicine, 20(7), 457–462.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Psicologia, Milano-Bicocca University, Milan, Italy
Elisabetta Lalumera
Policlinico S. Orsola, University of Bologna, Bologna, Italy
Stefano Fanti
Dipartimento di Scienze Biomediche e Chirurgico Specialistiche, University of Ferrara, Ferrara, Italy
Giovanni Boniolo

Authors

Elisabetta Lalumera
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Fanti
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Boniolo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elisabetta Lalumera.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lalumera, E., Fanti, S. & Boniolo, G. Reliability of molecular imaging diagnostics. Synthese 198 (Suppl 23), 5701–5717 (2021). https://doi.org/10.1007/s11229-019-02419-y

Download citation

Received: 26 November 2018
Accepted: 30 September 2019
Published: 09 October 2019
Issue Date: October 2021
DOI: https://doi.org/10.1007/s11229-019-02419-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reliability of molecular imaging diagnostics

Abstract

Access this article

Similar content being viewed by others

Quality and consistency of clinical practice guideline recommendations for PET/CT and PET: a systematic appraisal

FDG PET/CT in cancer: comparison of actual use with literature-based recommendations

Standardization of Imaging Biomarkers: The FDG PET/CT Example

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Reliability of molecular imaging diagnostics

Abstract

Access this article

Similar content being viewed by others

Quality and consistency of clinical practice guideline recommendations for PET/CT and PET: a systematic appraisal

FDG PET/CT in cancer: comparison of actual use with literature-based recommendations

Standardization of Imaging Biomarkers: The FDG PET/CT Example

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation