Abstract
Advanced medical imaging, such as CT, fMRI and PET, has undergone enormous progress in recent years, both in accuracy and utilization. Such techniques often bring with them an illusion of immediacy, the idea that the body and its diseases can be directly inspected. In this paper we target this illusion and address the issue of the reliability of advanced imaging tests as knowledge procedures, taking positron emission tomography (PET) in oncology as paradigmatic case study. After individuating a suitable notion of reliability, we argue that (1) PET is a highly theory-laden and non-immediate knowledge procedure, in spite of the photographic-like quality of the images it delivers; (2) the diagnostic conclusions based on the interpretation of PET images are population-dependent; (3) PET images require interpretation, which is inherently observer-dependent and therefore variable. We conclude with a three-step methodological proposal for enhancing the reliability of advanced medical imaging.
Similar content being viewed by others
Notes
“Precision medicine” and “personalised medicine” are often used as synonyms; though with “precision medicine”, the stress is on targeting a specific disease or malfunction with treatments and tests, rather than a larger category of similar diseases (i.e. triple negative breast cancer versus breast cancer, see Wu et al. 2018), whereas “personalised medicine” refers to the consideration of patient-specific factors in diagnosis and treatment (Desmond-Hellmann et al. 2011; National Institute of Health 2018). We acknowledge this distinction though it is not essential to the argument of this paper, (see https://ghr.nlm.nih.gov/ Precision Medicine; see also https://www.nih.gov/research-training/allofus-research-program).
A notable exception is Megan Delehanty’s PhD’s dissertation on the epistemic credentials, and especially on the reliability, of PET images in clinical oncology. Delehanty focuses on the question of whether and how PET as a data-generating process produces reliable knowledge, while we enlarge the picture and consider the reliability of the technology together with the way it is usually employed within the medical community. In broadening our scope with such socio-epistemological question, we think our work completes Delehanty’s excellent analysis.
As we will see in a while, FDG stands for fludeoxyglucose, that is, the usual radiotracer used for PET neuroimaging and cancer patient management.
The non-immediacy of medical imaging in the philosophy of neuroscience has been studied extensively, see Bogen (2001).
For an introduction to the differences between PET and CT, see for example RSNA 2019. For a philosophical illustration of CT and “seeing styles”, see Friedrich (2010). We thank one of the reviewers for pressing us on this point.
See the position and the terminology suggested by the National Institute of Standards and Technology (https://www.nist.gov/pml/nist-technical-note-1297).
We believe that the distinction between reliability of process (first sense, epistemology), reliability of data (second sense, philosophy of science) and reliability as repeatability (third sense, scientific methodology) can be useful in the paper because different readers can be more familiar with one or the other of the three senses. We thank one of the reviewers for pressing us on this point.
Here we use personalised medicine since we are in the situation indicated as such by National Research Council in footnote 2.
Alongside campaigns promoted by scientific societies and institutional and private healthcare providers, there is a philosophical debate on overutilisation and medical futility, addressing both the definition of the phenomenon and ethical consequences. See Hofmann (2010).
The psychological allure of images is of course not unique to molecular imaging tests, of course, but is common to all medical imaging diagnostic tests (such as CT for example, see footnote 4 above). What is specific to molecular imaging, we believe, is that immediacy of images is additionally difficult to defend.
Nonetheless, medical images as numbers can be more easily read by software, and this is the basis of Radiomics. See Gillies et al. (2015).
As known, under this tradition in the philosophy of science, there was the hidden figure of Kant and of a form of neo- or post-Kantism, see Boniolo (2007).
To put it very simply, the interquartile range of a data set is where is a measure of where the bulk of the values lie.
The use of consensus conferences and Delphi procedures is widespread in the social and life sciences, whenever evidence underdetermines the answer to a given scientific or policy question, and experts disagree. For example, they are often employed in psychiatry, in order to decide whether a certain condition is to be considered a disease or not, and they were used in astronomy to assess the status of Pluto as a planet. In philosophy, their epistemic pedigree has been analysed by philosophers like Miriam Solomon and Jakob Stegenga. In advanced diagnostic imaging, they are often utilized in order to write and publish guidelines for the appropriate use of tests (see e.g. In philosophy of science, Miriam Solomon has studied the role of consensus conferences and Delphi studies in the making of medical knowledge (Solomon 2007, 2015).
References
Alavi, A., & Reivich, M. (2002). Guest editorial: The conception of FDG-PET imaging. Seminars in Nuclear Medicine, 32(1), 2–5.
Ashcroft, R. (2004). Current epistemological problems in evidence-based medicine. Journal of Medical Ethics, 30, 131–135.
Biggi, A., Gallamini, A., Chauvie, S., Hutchings, M., Kostakoglu, L., Gregianin, M., Meignan, M., Malkowski, B., Hofman, M. S., & Barrington, S. F. (2013). International validation study for interim PET in ABVD-treated, advanced-stage Hodgkin lymphoma: Interpretation criteria and concordance rate among reviewers. Journal of Nuclear Medicine, 54(5), 683–690.
Boellaard, R., et al. (2015). FDG PET/CT: EANM procedure guidelines for tumour imaging: Version 2.0. European Journal of Nuclear Medicine and Molecular Imaging, 42(2), 328–354.
Bogen, J. (2001). Functional imaging evidence: Some epistemic hot spots. In P. K. Machamer, R. Grush, & P. McLaughlin (Eds.), Theory and method in the neurosciences (pp. 173–199). Pittsburgh: University of Pittsburgh Press.
Bogen, J. (2008). Experiment and observation. In P. Machamer & M. Silberstein (Eds.), The Blackwell guide to the philosophy of science (Vol. 19, pp. 128–148). Oxford: Blackwell.
Boniolo, G. (2007). On scientific representation: From Kant to a new philosophy of science. Houndmills: Palgrave Macmillan.
Boniolo, G., & Sanchini, V. (Eds.). (2016). Ethical counselling and medical decision-making in the era of personalized medicine. Heidelberg: Springer.
Brown, J. (1979). Perception, theory and commitment: New philosophy of science. Chicago: University of Chicago Press.
Burggraaff, C. N., Cornelisse, A. C., Hoekstra, O. S., Lugtenburg, P. J., De Keizer, B., Arens, A. I., et al. (2018). Interobserver agreement of interim and end-of-treatment 18F-FDG PET/CT in diffuse large B-cell lymphoma (DLBCL): Impact on clinical practice and trials. Journal of Nuclear Medicine, 59(12), 1831–1836.
Daston, L., & Galison, P. (1992). The image of objectivity. Representations, 40, 81–128.
Delehanty, M. (2010). Why images? Medicine Studies, 2(3), 161–173.
Delehanty, M. C. (2005). Empiricism and the epistemic status of imaging technologies. Doctoral dissertation, University of Pittsburgh.
Desmond-Hellmann, S., Sawyers, C. L., Cox, D. R., Fraser-Liggett, C., Galli, S. J., Goldstein, D. B., et al. (2011). Toward precision medicine: Building a knowledge network for biomedical research and a new taxonomy of disease. Washington, DC: National Academy of Sciences.
Eurostat. (2017). https://ec.europa.eu/eurostat/statistics-explained/index.php?title=File:Use_of_imaging_equipment_—_number_of_PET_scans,_2010_and_2015_(per_100_000_inhabitants)_HLTH17.png. Accessed October 18, 2018.
Ferretti, G., Linkeviciute, A., & Boniolo, G. (2017). Comprehending and communicating statistics in breast cancer screening. Ethical implications and potential solutions. In M. Gadebusch-Bondio, F. Spöring, & J.-S. Gordon (Eds.), Medical ethics, prediction and prognosis: Interdipliplinary perspectives (pp. 30–41). New York: Routledge.
Fischer, B., Lassen, U., Mortensen, J., Larsen, S., Loft, A., Bertelsen, A., et al. (2009). Preoperative staging of lung cancer with combined PET–CT. New England Journal of Medicine, 361(1), 32–39.
Friedrich, K. (2010). ‘Sehkollektiv’: Sight Styles in Diagnostic Computed Tomography. Medicine Studies, 2(3), 185–195.
Gandhi, S., Mosleh, W., Shen, J., & Chow, C. M. (2018). Automation, machine learning, and artificial intelligence in echocardiography: A brave new world. Echocardiography, 35(9), 1402–1418.
Gillies, R. J., Kinahan, P. E., & Hricak, H. (2015). Radiomics: Images are more than pictures, they are data. Radiology, 278(2), 563–577.
Goldman, A. (1979). What is justified belief? In G. S. Pappas (Ed.), Justification and knowledge (pp. 1–25). Dordrecht: Reidel. Reprinted in A. I. Goldman (Ed.), Reliabilism and contemporary epistemology (pp. 29–49). New York: Oxford University Press, 2012.
Goldman, A., & Beddor, B. (2016). Reliabilist epistemology. In E. N. Zalta (Ed.), The Stanford encyclopedia of philosophy (Winter 2016 Edition). https://plato.stanford.edu/archives/win2016/entries/reliabilism/. Accessed October 20, 2018.
Gonzalez, S., Guedj, E., Fanti, S., Lalumera, E., Le Coz, P., & Taïeb, D. (2018). Delivering PET imaging results to cancer patients: Steps for handling ethical issues. European Journal of Nuclear Medicine and Molecular Imaging, 45(12), 2240–2241.
Gould, M. K., Kuschner, W. G., Rydzak, C. E., Maclean, C. C., Demas, A. N., Shigemitsu, H., et al. (2003). Test performance of positron emission tomography and computed tomography for mediastinal staging in patients with non-small-cell lung cancer: A meta-analysis. Annals of Internal Medicine, 139(11), 879–892.
Han, P. K., Klabunde, C. N., Noone, A. M., Earle, C. C., Ayanian, J. Z., Ganz, P. A., et al. (2013). Physicians’ beliefs about breast cancer surveillance testing are consistent with test overuse. Medical Care, 51(4), 315.
Hanson, N. R. (1958). Observation. In N. R. Hanson (Ed.), Patterns of discovery: An inquiry into the conceptual foundations of science (pp. 4–30). Cambridge: Cambridge University Press.
Hanson, N. R. (2001). Seeing and seeing as. In Y. Balashov & A. Rosenberg (Eds.), Philosophy of science: Contemporary readings (pp. 321–339). London: Routledge. Originally published in N. R. Hanson (Ed.), Perception and discovery: An introduction to scientific inquiry (pp. 91–110). San Francisco: Freeman, 1969.
Hendee, W. R., Becker, G. J., Borgstede, J. P., Bosma, J., Casarella, W. J., Erickson, B. A., et al. (2010). Addressing overutilization in medical imaging. Radiology, 257(1), 240–245.
Hicks, R. J., Kalff, V., MacManus, M. P., Ware, R. E., Hogg, A., McKenzie, A. F., et al. (2001). 18F-FDG PET provides high-impact and powerful prognostic stratification in staging newly diagnosed non-small cell lung cancer. Journal of Nuclear Medicine, 42(11), 1596–1604.
Hofman, M. S., & Hicks, R. J. (2016). How we read oncologic FDG PET/CT. Cancer Imaging, 16(1), 35.
Hofmann, B. (2010). Too much of a good thing is wonderful? A conceptual analysis of excessive examinations and diagnostic futility in diagnostic radiology. Medicine, Health Care and Philosophy, 13(2), 139–148.
Hosny, A., Parmar, C., Coroller, T. P., Grossmann, P., Zeleznik, R., Kumar, A., et al. (2018). Deep learning for lung cancer prognostication: A retrospective multi-cohort radiomics study. PLoS Medicine, 15(11), e1002711.
Hunink, M. M., & Krestin, G. P. (2002). Study design for concurrent development, assessment, and implementation of new diagnostic imaging technology. Radiology, 222(3), 604–614.
Jarvik, J. G. (2002). Study design for the new millennium: Changing how we perform research and practice medicine. Radiology, 222(3), 593–594.
Joyce, K. A. (2008). Magnetic appeal: MRI and the myth of transparency. Ithaca: Cornell University Press.
Kilani, R. K., Paxton, B. E., Stinnett, S. S., Barnhart, H. X., Bindal, V., & Lungren, M. P. (2011). Self-referral in medical imaging: A meta-analysis of the literature. Journal of the American College of Radiology, 8(7), 469–476.
Kingma, E. (2007). What is it to be healthy? Analysis, 67, 128–133.
Krupinski, E. A. (2010). Current perspectives in medical image perception. Attention, Perception, & Psychophysics, 72(5), 1205–1217.
Kuhn, T. S. (1990). The road since structure. In PSA: Proceedings of the Biennial meeting of the philosophy of science association (Vol. 1990, pp. 3–13). Chicago: Philosophy of Science Association.
Lalumera, E., & Fanti, S. (2017). Randomized controlled trials for diagnostic imaging: Conceptual and practical problems. Topoi, 38(2), 395–400.
Leplin, J. (2007). In defense of reliabilism. Philosophical Studies, 134(1), 31–42.
Losee, A. (2001). Historical introduction to the philosophy of science. Oxford: Oxford University Press.
Lysdahl, K. B., & Hofmann, B. M. (2009). What causes increasing and unnecessary use of radiological investigations? A survey of radiologists’ perceptions. BMC Health Services Research, 9(1), 155.
Mitchell, J. M. (2008). Utilization trends for advanced imaging procedures: Evidence from individuals with private insurance coverage in California. Medical Care, 46(5), 460–466.
National Institute of Health. (2018). What is precision medicine? https://ghr.nlm.nih.gov/primer/precisionmedicine/definition. Accessed September 30, 2018.
National Research Council (US) and Institute of Medicine (US) Committee on State of the Science of Nuclear Medicine. (2007). Advancing nuclear medicine through innovation. Washington, DC: National Academic Press.
Oldroyd, D. (1986). The aArch of knowledge: An introductory study of the history of the philosophy and methodology of science. London: Routledge Kegan & Paul.
Peters, M. D. J., Godfrey, C. M., McInerney, P., et al. (2015). Methodology for JBI scoping reviews. The Joanna Briggs Institute reviewers’ manual 2015. Adelaide: The Joanna Briggs Institute.
Samei, E., & Krupinski, E. (Eds.). (2010). The handbook of medical image perception and techniques. Cambridge: Cambridge University Press.
Shiraishi, J., Li, Q., Appelbaum, D., & Doi, K. (2011). Computer-aided diagnosis and artificial intelligence in clinical imaging. Seminars in Nuclear Medicine, 41(6), 449–462.
Solomon, M. (2007). The social epistemology of NIH consensus conferences. In M. Solomon (Ed.), Establishing medical reality: Essays in the metaphysics and epistemology of biomedical science (pp. 167–177). Dordrecht: Springer.
Solomon, M. (2015). Making medical knowledge. Oxford: Oxford University Press.
Stegenga, J. (2018). Medical nihilism. Oxford: Oxford University Press.
Stegenga, J., et al. (2016). New directions in philosophy of medicine. The Bloomsbury Companion to Contemporary Philosophy of Medicine, 343, 23.
Taylor, P. M. (2007). A review of research into the development of radiologic expertise: Implications for computer-based training. Academic Radiology, 14(10), 1252–1263.
Van Dijck, J. (2011). The transparent body: A cultural analysis of medical imaging. Seattle: University of Washington Press.
van Westreenen, H. L., Heeren, P. A., Jager, P. L., van Dullemen, H. M., Groen, H., & Plukker, J. T. M. (2003). Pitfalls of positive findings in staging esophageal cancer with F-18-fluorodeoxyglucose positron emission tomography. Annals of Surgical Oncology, 10(9), 1100–1105.
Warburg, O. (1956). On the origin of cancer cells. Science, 123(3191), 309–314.
Waterstram-Rich, K. M., & Gilmore, D. (2016). Nuclear medicine and PET/CT E-Book: Technology and techniques. St. Louis, MO: Elsevier Health Sciences.
Woodward, J. (2000). Data, phenomena, and reliability. Philosophy of Science, 67(3), S163–S179.
Wu, N., Zhang, J., Zhao, J., Mu, K., Zhang, J., Jin, Z., et al. (2018). Precision medicine based on tumorigenic signaling pathways for triple–negative breast cancer. Oncology Letters, 16(4), 4984–4996.
Yasunaga, H. (2008). Willingness to pay for mass screening for prostate cancer: A contingent valuation survey. International Journal of Urology, 15(1), 102–105.
Yasunaga, H., Ide, H., Imamura, T., & Ohe, K. (2006). The measurement of willingness to pay for mass cancer screening with whole-body PET (positron emission tomography). Annals of Nuclear Medicine, 20(7), 457–462.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lalumera, E., Fanti, S. & Boniolo, G. Reliability of molecular imaging diagnostics. Synthese 198 (Suppl 23), 5701–5717 (2021). https://doi.org/10.1007/s11229-019-02419-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11229-019-02419-y