Detection of Kidney Stone Based on Super Resolution Techniques and YOLOv7 Under Limited Training Samples

Nguyen, Minh Tai Pham; Le, Viet Tuan; Duong, Huu Thanh; Hoang, Vinh Truong

doi:10.1007/978-3-031-46749-3_3

Minh Tai Pham Nguyen⁵,
Viet Tuan Le⁶,
Huu Thanh Duong⁶ &
…
Vinh Truong Hoang⁶

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 188))

Included in the following conference series:

International Conference on Intelligence of Things

214 Accesses

Abstract

In machine vision, detecting and locating object have always been challenging tasks mostly in medical imaging due to the lacking of data and unique feature of its image. Several studies applied CNN-based models for detecting task but the methods were complex or impracticable to real time tasks while others used YOLO models which were lighter but poor performance. In this study, we propose Super Resolution techniques included EDSR, FSRCNN, LapSRN, ESPCN, ESRGAN and GFPGAN with the proposed architecture of YOLOv7 for kidney stone detection on KUB X-ray image. As a result, our proposed YOLOv7 outperformed the YOLOv7 base version with sensitivity 87.6%, Precision 92.2%, F1 Score 89.8%, mAP50 91.2% and each of Super Resolution methods enabled the model precision and sensitivity to be improved considerably with highest precision reached 97.3% and sensitivity achieved 91.7% on upscaled images compared to the non-upscaled images. Consequently, the re-designed YOLOv7 and Super Resolution methods are proposed to address the problem of detection in diagnosis kidney stone disease.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aksakalli, I., Kaçdioğlu, S., Hanay, Y.S.: Kidney x-ray images classification using machine learning and deep learning methods. Balkan Journal of Electrical and Computer Engineering 9(2), 144–151 (2021)
Article Google Scholar
Bayram, A.F., Gurkan, C., Budak, A., KARATAŞ, H.: A detection and prediction model based on deep learning assisted by explainable artificial intelligence for kidney diseases. Avrupa Bilim ve Teknoloji Dergisi 40, 67–74 (2022)
Google Scholar
Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O.: 3D U-Net: learning dense volumetric segmentation from sparse annotation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016: 19th International Conference, Athens, Greece, October 17-21, 2016, Proceedings, Part II, pp. 424–432. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46723-8_49
Chapter Google Scholar
Dong, C., Loy, C.C., Tang, X.: Accelerating the super-resolution convolutional neural network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision – ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II, pp. 391–407. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_25
Chapter Google Scholar
Elton, D.C., Turkbey, E.B., Pickhardt, P.J., Summers, R.M.: A deep learning system for automated kidney stone detection and volumetric segmentation on noncontrast CT scans. Med. Phys. 49(4), 2545–2554 (2022)
Article Google Scholar
Fujii, K., Aoyama, T., Koyama, S., Kawaura, C.: Comparative evaluation of organ and effective doses for Paediatric patients with those for adults in chest and abdominal ct examinations. Br. J. Radiol. 80(956), 657–667 (2007)
Article Google Scholar
Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
Google Scholar
Lai, W.S., Huang, J.B., Ahuja, N., Yang, M.H.: Deep Laplacian pyramid networks for fast and accurate super-resolution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 624–632 (2017)
Google Scholar
Liu, Y.Y., Huang, Z.H., Huang, K.W.: Deep learning model for computer-aided diagnosis of urolithiasis detection from kidney-ureter-bladder images. Bioengineering 9(12), 811 (2022)
Article Google Scholar
Metaxas, V.I., Messaris, G.A., Lekatou, A.N., Petsas, T.G., Panayiotakis, G.S.: Patient doses in common diagnostic x-ray examinations. Radiat. Prot. Dosimetry. 184(1), 12–27 (2019)
Article Google Scholar
Misra, D.: Mish: A self regularized non-monotonic activation function. arXiv preprint arXiv:1908.08681 (2019)
Parsania, P.S., Virparia, P.V.: A comparative analysis of image interpolation algorithms. Int. J. Adv. Res. Comput. Commun. Eng. 5(1), 29–34 (2016)
Article Google Scholar
Shi, W., et al.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874–1883 (2016)
Google Scholar
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Yolov7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696 (2022)
Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: ECA-net: Efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11534–11542 (2020)
Google Scholar
Wang, X., Li, Y., Zhang, H., Shan, Y.: Towards real-world blind face restoration with generative facial prior. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9168–9178 (2021)
Google Scholar
Wang, X., et al.: Esrgan: Enhanced super-resolution generative adversarial networks. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshops (2018)
Google Scholar
Wang, Z., Zhang, Y., Zhang, J., Deng, Q., Liang, H.: Recent advances on the mechanisms of kidney stone formation. Int. J. Mol. Med. 48(2), 1–10 (2021)
Article Google Scholar
Woo, S., Park, J., Lee, J.Y., Kweon, I.S.: Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp. 3–19 (2018)
Google Scholar
Yang, L., Zhang, R.Y., Li, L., Xie, X.: Simam: A simple, parameter-free attention module for convolutional neural networks. In: International Conference on Machine Learning, pp. 11863–11874. PMLR (2021)
Google Scholar
Yu, J., et al.: Wide activation for efficient and accurate image super-resolution. arXiv preprint arXiv:1808.08718 (2018)

Download references

Author information

Authors and Affiliations

Faculty of Advanced Program, Ho Chi Minh City Open University, Ho Chi Minh City, Vietnam
Minh Tai Pham Nguyen
Faculty of Information Technology, Ho Chi Minh City Open University, Ho Chi Minh City, Vietnam
Viet Tuan Le, Huu Thanh Duong & Vinh Truong Hoang

Authors

Minh Tai Pham Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Viet Tuan Le
View author publications
You can also search for this author in PubMed Google Scholar
Huu Thanh Duong
View author publications
You can also search for this author in PubMed Google Scholar
Vinh Truong Hoang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vinh Truong Hoang .

Editor information

Editors and Affiliations

Sejong University, Seoul, Korea (Republic of)
Nhu-Ngoc Dao
Vietnam National University Ho Chi Minh City (VNU-HCM), Ho Chi Minh City University of Technology (HCMUT), Ho Chi Minh, Vietnam
Tran Ngoc Thinh
Wroclaw University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nguyen, M.T.P., Le, V.T., Duong, H.T., Hoang, V.T. (2023). Detection of Kidney Stone Based on Super Resolution Techniques and YOLOv7 Under Limited Training Samples. In: Dao, NN., Thinh, T.N., Nguyen, N.T. (eds) Intelligence of Things: Technologies and Applications. ICIT 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 188. Springer, Cham. https://doi.org/10.1007/978-3-031-46749-3_3

Download citation

DOI: https://doi.org/10.1007/978-3-031-46749-3_3
Published: 20 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46748-6
Online ISBN: 978-3-031-46749-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Detection of Kidney Stone Based on Super Resolution Techniques and YOLOv7 Under Limited Training Samples