Abstract
With the development of society, more and more people rely on cars to travel and cause traffic congestion. Vehicle density detection can provide intelligent decision-making assistance for traffic video surveillance system and improve traffic efficiency. The camera view can cause the scale difference between vehicles in the same scene. To solve the multi-scale problem, a vehicle density detection method based on deep learning semantic segmentation is proposed. VGG-16 is used to extract vehicle features at the front end of the network, and the output feature image is 1/8 of the original image, which can improve the accuracy of the prediction density map. A dilated convolution module is designed at the back end to capture multi-scale features of the vehicle, enabling the network to capture more scale details and edge information. Finally, the network cascades the output with \(1 \times 1\) convolution to get the prediction density map. And in the output characteristics of the last layer of the network, the quantity is predicted by adding the fully connected layer, and then the final predicted quantity is obtained by summing up the counting results of the network output density map. The performance of the network is tested on the TRANCOS dataset, and the test results show that the MAE of our network is 34.1% higher than that of the Hydra-3s network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Simonyan, K., et al.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. (2014)
Boominathan, L., et al.: Crowdnet: a deep convolutional network for dense crowd counting. In: Proceedings of the 2016 ACM on Multimedia Conference, pages 640–644. ACM. (2016)
Zhang, Y., et al.: Single-image crowd counting via multi-column convolutional neural network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 589–597. (2016)
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: International Conference on Learning Representations, San Juan. ICLR. (2016)
hui, X.C., et al.: End-to-end dilated convolution network for document image semantic segmentation. J. Central South University. (2021)
Lei, G., et al.: Lung segmentation method with dilated convolution based on VGG-16 network. [J]. Comput. Assist. Surg. (Abingdon, England). (2019)
López-Sastre, R., Maldonado-Bascón, S., Guerrero-Gómez-Olmedo, R., et al.: Extremely overlapping vehicle counting. In: Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA (2015)
Fiaschi, L., et al.: Learning to count with regression forest and structured labels. In: Proceedings of the 21st International Conference on Pattern Recognition, ICPR2012, pp. 2685–2688. (2012)
Lempitsky, V., Zisserman, A.: Learning to count objects in images. In: Advances in Neural Information Processing Systems, pages 1324–1332. (2010)
Onoro-Rubio, D., López-Sastre, R.J.: Towards perspective-free object counting with deep learning. In: European Conference on Computer Vision, pp 615–629. Springer. (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Yi, J., Zhang, X., Mao, Z., Du, H., Ma, Y. (2023). Design and Implementation of Vehicle Density Detection Method Based on Deep Learning. In: Xiong, N., Li, M., Li, K., Xiao, Z., Liao, L., Wang, L. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 153. Springer, Cham. https://doi.org/10.1007/978-3-031-20738-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-031-20738-9_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20737-2
Online ISBN: 978-3-031-20738-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)