Skip to main content

Detection of Kidney Stone Based on Super Resolution Techniques and YOLOv7 Under Limited Training Samples

  • Conference paper
  • First Online:
Intelligence of Things: Technologies and Applications (ICIT 2023)

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 188))

Included in the following conference series:

  • 214 Accesses

Abstract

In machine vision, detecting and locating object have always been challenging tasks mostly in medical imaging due to the lacking of data and unique feature of its image. Several studies applied CNN-based models for detecting task but the methods were complex or impracticable to real time tasks while others used YOLO models which were lighter but poor performance. In this study, we propose Super Resolution techniques included EDSR, FSRCNN, LapSRN, ESPCN, ESRGAN and GFPGAN with the proposed architecture of YOLOv7 for kidney stone detection on KUB X-ray image. As a result, our proposed YOLOv7 outperformed the YOLOv7 base version with sensitivity 87.6%, Precision 92.2%, F1 Score 89.8%, mAP50 91.2% and each of Super Resolution methods enabled the model precision and sensitivity to be improved considerably with highest precision reached 97.3% and sensitivity achieved 91.7% on upscaled images compared to the non-upscaled images. Consequently, the re-designed YOLOv7 and Super Resolution methods are proposed to address the problem of detection in diagnosis kidney stone disease.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aksakalli, I., Kaçdioğlu, S., Hanay, Y.S.: Kidney x-ray images classification using machine learning and deep learning methods. Balkan Journal of Electrical and Computer Engineering 9(2), 144–151 (2021)

    Article  Google Scholar 

  2. Bayram, A.F., Gurkan, C., Budak, A., KARATAŞ, H.: A detection and prediction model based on deep learning assisted by explainable artificial intelligence for kidney diseases. Avrupa Bilim ve Teknoloji Dergisi 40, 67–74 (2022)

    Google Scholar 

  3. Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O.: 3D U-Net: learning dense volumetric segmentation from sparse annotation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016: 19th International Conference, Athens, Greece, October 17-21, 2016, Proceedings, Part II, pp. 424–432. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46723-8_49

    Chapter  Google Scholar 

  4. Dong, C., Loy, C.C., Tang, X.: Accelerating the super-resolution convolutional neural network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision – ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II, pp. 391–407. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_25

    Chapter  Google Scholar 

  5. Elton, D.C., Turkbey, E.B., Pickhardt, P.J., Summers, R.M.: A deep learning system for automated kidney stone detection and volumetric segmentation on noncontrast CT scans. Med. Phys. 49(4), 2545–2554 (2022)

    Article  Google Scholar 

  6. Fujii, K., Aoyama, T., Koyama, S., Kawaura, C.: Comparative evaluation of organ and effective doses for Paediatric patients with those for adults in chest and abdominal ct examinations. Br. J. Radiol. 80(956), 657–667 (2007)

    Article  Google Scholar 

  7. Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)

    Google Scholar 

  8. Lai, W.S., Huang, J.B., Ahuja, N., Yang, M.H.: Deep Laplacian pyramid networks for fast and accurate super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 624–632 (2017)

    Google Scholar 

  9. Liu, Y.Y., Huang, Z.H., Huang, K.W.: Deep learning model for computer-aided diagnosis of urolithiasis detection from kidney-ureter-bladder images. Bioengineering 9(12), 811 (2022)

    Article  Google Scholar 

  10. Metaxas, V.I., Messaris, G.A., Lekatou, A.N., Petsas, T.G., Panayiotakis, G.S.: Patient doses in common diagnostic x-ray examinations. Radiat. Prot. Dosimetry. 184(1), 12–27 (2019)

    Article  Google Scholar 

  11. Misra, D.: Mish: A self regularized non-monotonic activation function. arXiv preprint arXiv:1908.08681 (2019)

  12. Parsania, P.S., Virparia, P.V.: A comparative analysis of image interpolation algorithms. Int. J. Adv. Res. Comput. Commun. Eng. 5(1), 29–34 (2016)

    Article  Google Scholar 

  13. Shi, W., et al.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874–1883 (2016)

    Google Scholar 

  14. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696 (2022)

  15. Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: ECA-net: Efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11534–11542 (2020)

    Google Scholar 

  16. Wang, X., Li, Y., Zhang, H., Shan, Y.: Towards real-world blind face restoration with generative facial prior. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9168–9178 (2021)

    Google Scholar 

  17. Wang, X., et al.: Esrgan: Enhanced super-resolution generative adversarial networks. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2018)

    Google Scholar 

  18. Wang, Z., Zhang, Y., Zhang, J., Deng, Q., Liang, H.: Recent advances on the mechanisms of kidney stone formation. Int. J. Mol. Med. 48(2), 1–10 (2021)

    Article  Google Scholar 

  19. Woo, S., Park, J., Lee, J.Y., Kweon, I.S.: Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp. 3–19 (2018)

    Google Scholar 

  20. Yang, L., Zhang, R.Y., Li, L., Xie, X.: Simam: A simple, parameter-free attention module for convolutional neural networks. In: International Conference on Machine Learning, pp. 11863–11874. PMLR (2021)

    Google Scholar 

  21. Yu, J., et al.: Wide activation for efficient and accurate image super-resolution. arXiv preprint arXiv:1808.08718 (2018)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vinh Truong Hoang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nguyen, M.T.P., Le, V.T., Duong, H.T., Hoang, V.T. (2023). Detection of Kidney Stone Based on Super Resolution Techniques and YOLOv7 Under Limited Training Samples. In: Dao, NN., Thinh, T.N., Nguyen, N.T. (eds) Intelligence of Things: Technologies and Applications. ICIT 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 188. Springer, Cham. https://doi.org/10.1007/978-3-031-46749-3_3

Download citation

Publish with us

Policies and ethics