Investigation of the effect of object size on accuracy of human localisation in images acquired from unmanned aerial vehicles

Rostyslav Tsekhmystro, Oleksii Rubel, Vladimir Lukin

Abstract


The use of unmanned aerial vehicles is gaining wide popularity in various areas of research and information acquisition. More and more often, unmanned aerial vehicles are used to obtain various types of images of the Earth’s surface for its study. In particular, such data can be used in law enforcement, localization of crowds, etc. Typically, such systems operate independently of humans and provide information about objects in an automatic mode, with humans working only under the control of the aircraft. One of the main components of such systems is a neural network for localization and classification of objects, the parameters of which determine both the accuracy of the system as a whole and the design of the aircraft for shooting. In particular, the accuracy of the neural network determines the profitability of such a system, because if the accuracy is insufficient, the use of such systems will not make sense. Therefore, the main subject of research in this paper is a neural network for object localization, in particular YOLO v5, and its accuracy parameters on images obtained from unmanned aerial vehicles. The main focus of this paper is on the parameters of the neural network and the study of its metrics, which are important parameters of a trained neural network. Another important parameter for the further use of a neural network is its training parameters, as well as the data used for training. This study also pays attention to the details of the training process. The main goal of this study is to train a neural network on a selected dataset and to study the accuracy metrics of the trained neural network. The main goal of this study is to determine the dependence of localization accuracy on the area of the object, which will allow for more detailed development of unmanned systems with automatic object detection, as well as to assess the profitability of using such systems in task planning. On the basis of the data obtained, conclusions were drawn about the dependence of localization accuracy on the area of an object in images from unmanned aerial vehicles. These data can serve as a reference for unmanned aerial vehicle developers, particularly when selecting photo modules or planning the system architecture.

Keywords


object localization; YOLO v5; human classification; human localization; UAV

Full Text:

PDF

References


Tang, G., Ni, J., Zhao, Y., Gu, Y. & Cao W. A Survey of Object Detection for UAVs Based on Deep Learning. Remote Sensing, 2024, vol. 16, no. 1, article no. 149. DOI: 10.3390/rs16010149.

Zhao, M., Li, W., Li, L., Hu, J., Ma, P., & Tao, R. Single-Frame Infrared Small-Target Detection: A survey. IEEE Geoscience and Remote Sensing Magazine, 2022, vol. 10, no. 2, pp. 87-119. DOI: 10.1109/MGRS.2022.3145502.

Bondžulić, B., Stojanović, N., Lukin, V., Stankevich, S. A., Bujaković, D., & Kryvenko, S. Target acquisition performance in the presence of JPEG image compression. Defense Technology, 2023. DOI: 10.1016/j.dt.2023.12.006.

Wang, L., Tang, J., & Liao, Q. A Study on Radar Target Detection Based on Deep Neural Networks. IEEE Sensors Letters, 2019, vol. 3, no. 3, article no. 7000504. DOI: 10.1109/LSENS.2019.2896072.

Yu, S. Sonar Image Target Detection Based on Deep Learning. Mathematical Problems in Engineering, 2022, vol. 2022, article no. 5294151. DOI: 10.1155/2022/5294151.

Bustos, N., Mashhadi, M., Lai-Yuen, S. K., Sarkar, S. & Das, T. K. A systematic literature review on object detection using near infrared and thermal images. Neurocomputing, 2022, vol. 560, article no. 126804. DOI: 10.1016/j.neucom.2023.126804.

Zhang, Z., & Zhu, L. A Review on Unmanned Aerial Vehicle Remote Sensing: Platforms, Sensors, Data Processing Methods, and Applications. Drones, 2023, vol. 7, no. 6, article no. 398. DOI: 10.3390/drones7060398.

Wilkowski, A., Kasprzak, W., & Stefańczyk M. Object detection in the police surveillance scenario. 2019 Federated Conference on Computer Science and Information Systems (FedCSIS), 2019, pp. 363-372. DOI: 10.15439/2019F291.

Alsamhi, S. H., Shvetsov, A. V., Kumar, S., Shvetsova, S. V., Alhartomi, M. A., Hawbani, A., Rajput, N. S., Srivastava, S., Saif, A., & Nyangaresi, V. O. UAV Computing-Assisted Search and Rescue Mission Framework for Disaster and Harsh Environment Mitigation. Drones, 2022, vol. 6, no. 7, article no. 154. DOI: 10.3390/drones6070154.

Liu, C., & Szirányi, T. Real-Time Human Detection and Gesture Recognition for On-Board UAV Rescue. Sensors, 2021, vol. 21, no. 6, article no. 2180. DOI: 10.3390/s21062180.

Cao, Z., Kooistra, L., Wang, W., Guo, L., & Valente, J. Real-Time Object Detection Based on UAV Remote Sensing: A Systematic Literature Review. Drones, 2023, no. 7, article no. 620. DOI: 10.3390/drones7100620.

Aposporis, P. Object Detection Methods for Improving UAV Autonomy and Remote Sensing Applications. 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2020, pp. 845-853. DOI: 10.1109/ASONAM49781.2020.9381377.

Tsekhmystro, R., Rubel, O., & Lukin, V. Study of methods for searching and localizing objects in images from aircraft using convolutional neural networks. Radioelectronic and Computer Systems, 2024, no. 1, pp 87-98. DOI: 10.32620/reks.2024.1.08.

Bosquet, B., Mucientes, M., & Brea, V. M. STDnet: Exploiting high resolution feature maps for small object detection. Engineering Applications of Artificial Intelligence, 2020, vol. 91, article no. 103615. DOI: 10.1016/j.engappai.2020.103615.

Inácio, A. D., Ramos, R. H., & Lopes, H. S. Deep Learning for People Counting in Videos by Age and Gender. Anais do 15. Congresso Brasileiro de Inteligência Computacional, 2021. DOI: 10.21528/CBIC2021-53.

Jocher, G. YOLOv5 SOTA Realtime Instance Segmentation. Available at: https://zenodo.org/records/7347926 (аccessed 04.02.2024).

Zaidi, S. S. A., Ansari, M. S., Aslam, A., Kanwal, N., Asghar, M., & Lee, B. A survey of modern deep learning based object detection models. Digital Signal Processing, 2022, vol. 16, article no. 103514. DOI: 10.1016/j.dsp.2022.103514.

Paszke, A., Gross, S., Massa, M., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., Bai, J., & Chintala, S. PyTorch: An Imperative Style, High-Performance Deep Learning Library. ArXiv, 2019. DOI: 10.48550/arXiv.1912.01703.

Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., & Savarese, S. Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 658-666. DOI: 10.48550/arXiv.1902.09630.

Ting, K. M. Precision and Recall. Encyclopedia of Machine Learning, 2010. 781 p. DOI: 10.1007/978-0-387-30164-8_652.

Salari, A., Djavadifar, A., Liu, X., & Najjaran, H. Object recognition datasets and challenges: A review. Neurocomputing, 2022, vol. 495, pp. 129-152. DOI: 10.1016/j.neucom.2022.01.022.

Lai, K. T. NTUT 4K Drone Photo Dataset. Available at: https://datasetninja.com/ntut-4k-drone-photo (аccessed 20.02.2024).

Ruby, U., Theerthagiri, P., Jacob, J., & Vamsidhar, Y. Binary cross entropy with deep learning technique for Image classification. International Journal of Advanced Trends in Computer Science and Engineering, 2020, vol. 9, no. 4, pp. 5393-5397. DOI: 10.30534/ijatcse/2020/175942020

Kingma, D. P., & Ba, J. Adam: A Method for Stochastic Optimization. International Conference on Learning Representations, 2014. DOI: 10.48550/arXiv.1412.6980.

Padilla, R., Wesley L. P., Thadeu L. B. D., Sergio L. N., & Eduardo A. B. da Silva. A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit. Electronics, 2021, vol. 10, no. 3, article no. 279. DOI: 10.3390/electronics10030279.

Beitzel, S. M., Jensen, E. C., & Frieder, O. MAP. Encyclopedia of Database Systems, 2009, pp. 1691-1692. DOI: 10.1007/978-0-387-39940-9_492.




DOI: https://doi.org/10.32620/aktt.2024.2.09