skip to main content
research-article

Iterative Temporal-spatial Transformer-based Cardiac T1 Mapping MRI Reconstruction

Published:08 March 2024Publication History
Skip Abstract Section

Abstract

The precise reconstruction of accelerated magnetic resonance imaging (MRI) brings about notable advantages, such as enhanced diagnostic precision and decreased examination costs. In contrast, traditional cardiac MRI necessitates repetitive acquisitions across multiple heartbeats, resulting in prolonged acquisition times. Significant strides have been made in accelerating MRI through deep learning-based reconstruction methods. However, these existing methods encounter certain limitations: (1) The intricate nature of heart reconstruction involving multiple complex time-series data poses a challenge in exploring nonlinear dependencies between temporal contexts. (2) Existing research often overlooks weight sharing in iterative frameworks, impeding the effective capturing of non-local information and, consequently, limiting improvements in model performance. In order to improve cardiac MRI reconstruction, we propose a novel temporal-spatial transformer with a strategy in this study. Based on the multi-level encoder and decoder transformer architecture, we conduct multi-level spatiotemporal information feature aggregation over several adjacent views, that create nonlinear dependencies among features and efficiently learn important information among adjacent cardiac temporal frames. Additionally, in order to improve contextual awareness between neighboring views, we add cross-view attention for temporal information fusion. Furthermore, we introduce an iterative strategy for training weights during the reconstruction process, which improves feature fusion in critical locations and reduces the number of computations required to calculate global feature dependencies. Extensive experiments have demonstrated the substantial superiority of this procedure over the most advanced techniques, suggesting that it has broad potential for clinical use.

REFERENCES

  1. [1] Aggarwal Hemant K., Mani Merry P., and Jacob Mathews. 2018. MoDL: Model-based deep learning architecture for inverse problems. IEEE Transactions on Medical Imaging 38, 2 (2018), 394405.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Ahmed Abdul Haseeb, Zhou Ruixi, Yang Yang, Nagpal Prashant, Salerno Michael, and Jacob Mathews. 2020. Free-breathing and ungated dynamic mri using navigator-less spiral storm. IEEE Transactions on Medical Imaging 39, 12 (2020), 39333943.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Alamri Atif, Cha Jongeun, and Saddik Abdulmotaleb El. 2010. AR-REHAB: An augmented reality framework for poststroke-patient rehabilitation. IEEE Transactions on Instrumentation and Measurement 59, 10 (2010), 25542563.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Arnab Anurag, Dehghani Mostafa, Heigold Georg, Sun Chen, Lučić Mario, and Schmid Cordelia. 2021. Vivit: A video vision transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 68366846.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Carreira Joao and Zisserman Andrew. 2017. Quo vadis, action recognition? A new model and the kinetics dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 62996308.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Cong Yuren, Liao Wentong, Ackermann Hanno, Rosenhahn Bodo, and Yang Michael Ying. 2021. Spatial-temporal transformer for dynamic scene graph generation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 1637216382.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Du Jinglong, He Zhongshi, Wang Lulu, Gholipour Ali, Zhou Zexun, Chen Dingding, and Jia Yuanyuan. 2020. Super-resolution reconstruction of single anisotropic 3D MR images using residual convolutional neural network. Neurocomputing 392 (2020), 209220.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Saddik Abdulmotaleb El. 2007. The potential of haptics technologies. IEEE Instrumentation and Measurement Magazine 10, 1 (2007), 1017.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Saddik Abdulmotaleb El. 2018. Digital twins: The convergence of multimedia technologies. IEEE Multimedia 25, 2 (2018), 8792.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Feng Chun-Mei, Yan Yunlu, Fu Huazhu, Chen Li, and Xu Yong. 2021. Task transformer network for joint MRI reconstruction and super-resolution. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part VI 24. Springer, 307317.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Rui Guo, Hossam El-Rewaidy, Salah Assana, Xiaoying Cai, Amine Amyar, Kelvin Chow, Xiaoming Bi, Tuyen Yankama, Julia Cirillo, Patrick Pierce, Beth Goddu, Long Ngo, and Reza Nezafat. 2022. Accelerated cardiac T1 mapping in four heartbeats with inline MyoMapNet: a deep learning-based T1 estimation approach. Journal of Cardiovascular Magnetic Resonance 24, 1 (2022), 1–15.Google ScholarGoogle Scholar
  12. [12] Guo Xudong, Guo Xun, and Lu Yan. 2021. Ssan: Separable self-attention network for video representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1261812627.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Hara Kensho, Kataoka Hirokatsu, and Satoh Yutaka. 2017. Learning spatio-temporal features with 3d residual networks for action recognition. In Proceedings of the IEEE International Conference on Computer Vision Workshops. 31543160.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Ho Jonathan, Kalchbrenner Nal, Weissenborn Dirk, and Salimans Tim. 2019. Axial attention in multidimensional transformers. arXiv:1912.12180. Retrieved from https://arxiv.org/abs/1912.12180Google ScholarGoogle Scholar
  15. [15] Hossain M Shamim, Muhammad Ghulam, and Alamri Atif. 2019. Smart healthcare monitoring: A voice pathology detection paradigm for smart cities. Multimedia Systems 25 (2019), 565575.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Huang Qiaoying, Yang Dong, Wu Pengxiang, Qu Hui, Yi Jingru, and Metaxas Dimitris. 2019. MRI reconstruction via cascaded channel-wise attention network. In Proceedings of the 2019 IEEE 16th International Symposium on Biomedical Imaging. IEEE, 16221626.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Jung Hong, Ye Jong Chul, and Kim Eung Yeop. 2007. Improved k–t BLAST and k–t SENSE using FOCUSS. Physics in Medicine and Biology 52, 11 (2007), 3201.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Li Guangyuan, Lv Jun, Tian Yapeng, Dou Qi, Wang Chengyan, Xu Chenliang, and Qin Jing. 2022. Transformer-empowered multi-scale contextual matching and aggregation for multi-contrast MRI super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2063620645.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Liang Jingyun, Cao Jiezhang, Fan Yuchen, Zhang Kai, Ranjan Rakesh, Li Yawei, Timofte Radu, and Gool Luc Van. 2022. Vrt: A video restoration transformer. arXiv:2201.12288. Retrieved from https://arxiv.org/abs/2201.12288Google ScholarGoogle Scholar
  20. [20] Liang Jingyun, Cao Jiezhang, Sun Guolei, Zhang Kai, Gool Luc Van, and Timofte Radu. 2021. Swinir: Image restoration using swin transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 18331844.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, and Luc Van Gool. 2022. Flow-guided sparse transformer for video deblurring. In International Conference on Machine Learning. PMLR, 13334–13343.Google ScholarGoogle Scholar
  22. [22] Lingala Sajan Goud, Hu Yue, DiBella Edward, and Jacob Mathews. 2011. Accelerated dynamic MRI exploiting sparsity and low-rank structure: kt SLR. IEEE Transactions on Medical Imaging 30, 5 (2011), 10421054.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Guangming Wang, Jun Lyu, Fanwen Wang, Chengyan Wang, and Jing Qin. 2024. Multi-level temporal information sharing transformer-based feature reuse network for cardiac MRI reconstruction. In Statistical Atlases and Computational Models of the Heart. Regular and CMRxRecon Challenge Papers (STACOM’23), Oscar Camara, et al. (Eds.)., Lecture Notes in Computer Science, vol 14507. Springer, Cham. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Liu Ze, Lin Yutong, Cao Yue, Hu Han, Wei Yixuan, Zhang Zheng, Lin Stephen, and Guo Baining. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 1001210022.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Lv Jun, Huang Wenjian, Zhang Jue, and Wang Xiaoying. 2018. Performance of U-net based pyramidal lucas-kanade registration on free-breathing multi-b-value diffusion MRI of the kidney. The British Journal of Radiology 91, 1086 (2018), 20170813.Google ScholarGoogle Scholar
  26. [26] Lv Jun, Li Guangyuan, Tong Xiangrong, Chen Weibo, Huang Jiahao, Wang Chengyan, and Yang Guang. 2021. Transfer learning enhanced generative adversarial networks for multi-channel MRI reconstruction. Computers in Biology and Medicine 134 (2021), 104504.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Lv Jun, Wang Chengyan, and Yang Guang. 2021. PIC-GAN: A parallel imaging coupled generative adversarial network for accelerated multi-channel MRI reconstruction. Diagnostics 11, 1 (2021), 61.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Lv Jun, Yang Ming, Zhang Jue, and Wang Xiaoying. 2018. Respiratory motion correction for free-breathing 3D abdominal MRI using CNN-based image registration: A feasibility study. The British Journal of Radiology 91, xxxx (2018), 20170788.Google ScholarGoogle Scholar
  29. [29] Lyu Jun, Li Guangyuan, Wang Chengyan, Qin Chen, Wang Shuo, Dou Qi, and Qin Jing. 2023. Region-focused multi-view transformer-based generative adversarial network for cardiac cine MRI reconstruction. Medical Image Analysis 85 (2023), 102760.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Lyu Jun, Sui Bin, Wang Chengyan, Tian Yapeng, Dou Qi, and Qin Jing. 2022. DuDoCAF: Dual-domain cross-attention fusion with recurrent transformer for fast multi-contrast MR imaging. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 474484.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Murugesan Balamurali, Raghavan S. Vijaya, Sarveswaran Kaushik, Ram Keerthi, and Sivaprakasam Mohanasankar. 2019. Recon-glgan: A global-local context based generative adversarial network for mri reconstruction. In Machine Learning for Medical Image Reconstruction: 2nd International Workshop, MLMIR 2019, Held in Conjunction with MICCAI 2019, Shenzhen, China, October 17, 2019, Proceedings 2. Springer, 315.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Otazo Ricardo, Candes Emmanuel, and Sodickson Daniel K.. 2015. Low-rank plus sparse matrix decomposition for accelerated dynamic MRI with separation of background and dynamic components. Magnetic Resonance in Medicine 73, 3 (2015), 11251136.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Piergiovanni AJ, Kuo Weicheng, and Angelova Anelia. 2023. Rethinking video vits: Sparse video tubes for joint image and video learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 22142224.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Qin Chen, Schlemper Jo, Caballero Jose, Price Anthony N., Hajnal Joseph V., and Rueckert Daniel. 2018. Convolutional recurrent neural networks for dynamic MR image reconstruction. IEEE Transactions on Medical Imaging 38, 1 (2018), 280290.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Ramanarayanan Sriprabha, Murugesan Balamurali, Ram Keerthi, and Sivaprakasam Mohanasankar. 2020. DC-WCNN: A deep cascade of wavelet based convolutional neural networks for MR image reconstruction. In Proceedings of the 2020 IEEE 17th International Symposium on Biomedical Imaging. IEEE, 10691073.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Schelbert Erik B. and Messroghli Daniel R.. 2016. State of the art: Clinical applications of cardiac T1 mapping. Radiology 278, 3 (2016), 658676.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Schlemper Jo, Caballero Jose, Hajnal Joseph V., Price Anthony, and Rueckert Daniel. 2017. A deep cascade of convolutional neural networks for MR image reconstruction. In Information Processing in Medical Imaging: 25th International Conference, IPMI 2017, Boone, NC, USA, June 25-30, 2017, Proceedings 25. Springer, 647658.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Simonyan Karen and Zisserman Andrew. 2014. Two-stream convolutional networks for action recognition in videos. Advances in Neural Information Processing Systems 27 (2014).Google ScholarGoogle Scholar
  39. [39] Taylor Andrew J., Salerno Michael, Dharmakumar Rohan, and Jerosch-Herold Michael. 2016. T1 mapping: Basic techniques and clinical applications. JACC: Cardiovascular Imaging 9, 1 (2016), 6781.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Alina L. Machidon and Veljko Pejovic. 2021. Deep learning techniques for compressive sensing-based reconstruction and inference–A ubiquitous systems perspective. arXiv preprint arXiv:2105.13191Google ScholarGoogle Scholar
  41. [41] Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017).Google ScholarGoogle Scholar
  42. [42] Chengyan Wang, et al., 2023. CMRxRecon: An open cardiac MRI dataset for the competition of accelerated image reconstruction. arXiv preprint arXiv:2309.10836 (2023)Google ScholarGoogle Scholar
  43. [43] Wang Xiaoqing, Rosenzweig Sebastian, Roeloffs Volkert, Blumenthal Moritz, Scholand Nick, Tan Zhengguo, Holme H. Christian M., Unterberg-Buchwald Christina, Hinkel Rabea, and Uecker Martin. 2023. Free-breathing myocardial T1 mapping using inversion-recovery radial FLASH and motion-resolved model-based reconstruction. Magnetic Resonance in Medicine 89, 4 (2023), 13681384.Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Wang Yuqing, Xu Zhaoliang, Wang Xinlong, Shen Chunhua, Cheng Baoshan, Shen Hao, and Xia Huaxia. 2021. End-to-end video instance segmentation with transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 87418750.Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Syed Umar Amin, Mansour Alsulaiman, Ghulam Muhammad, Mohamed Amine Mekhtiche, and M. Shamim Hossain. 2019. Deep Learning for EEG motor imagery classification based on multi-layer CNNs feature fusion, Future Generation Computer Systems, 101, (2019), 542–554.Google ScholarGoogle Scholar
  46. [46] Xing Zhaohu, Yu Lequan, Wan Liang, Han Tong, and Zhu Lei. 2022. NestedFormer: Nested modality-aware transformer for brain tumor segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 140150.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Yan Shen, Xiong Xuehan, Arnab Anurag, Lu Zhichao, Zhang Mi, Sun Chen, and Schmid Cordelia. 2022. Multiview transformers for video recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 33333343.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Yu Weihao, Luo Mi, Zhou Pan, Si Chenyang, Zhou Yichen, Wang Xinchao, Feng Jiashi, and Yan Shuicheng. 2022. Metaformer is actually what you need for vision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1081910829.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Iterative Temporal-spatial Transformer-based Cardiac T1 Mapping MRI Reconstruction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 6
      June 2024
      715 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3613638
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 March 2024
      • Online AM: 29 January 2024
      • Accepted: 24 January 2024
      • Revised: 15 January 2024
      • Received: 20 December 2023
      Published in tomm Volume 20, Issue 6

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)165
      • Downloads (Last 6 weeks)27

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text