Skip to main content

Human Detection and Tracking Based on YOLOv3 and DeepSORT

  • Conference paper
  • First Online:
Communication and Intelligent Systems (ICCIS 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 686))

Included in the following conference series:

  • 292 Accesses

Abstract

This paper proposes an enhanced object tracking architecture based on YOLOv3 and DeepSORT. YOLOv3 is used for object detection, but in this case, we have only selected the human class. For object tracking, the DeepSORT, Kalman filter, and Hungarian algorithm are used. The occlusion issue is solved using the Kalman filter. The parallel approach is used in the suggested architecture to increase the base method's speed. In a parallel approach, the input frame is fed simultaneously to the object detector and the object tracker, enabling the tracker to begin following a person as soon as the object detector recognizes them. Here, occlusion and speed are the two issues that we are concentrating on. The experimental outcomes using specific data demonstrate positive outcomes with faster operation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Voigtlaender P, Krause M, Osep A, Luiten J, Sekar BBG, Geiger A, Leibe B (2019) Mots: multi-object tracking and segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7942–7951

    Google Scholar 

  2. Luo W, Xing J, Milan A, Zhang X, Liu W, Kim TK (2021) Multiple object tracking: a literature review. Artif Intell 293:103448

    Article  MathSciNet  MATH  Google Scholar 

  3. Ciaparrone G, Sánchez FL, Tabik S, Troiano L, Tagliaferri R, Herrera F (2020) Deep learning in video multi-object tracking: a survey. Neurocomputing 381:61–88

    Article  Google Scholar 

  4. Emami P, Pardalos PM, Elefteriadou L, Ranka S (2020) Machine learning methods for data association in multi-object tracking. ACM Comput Surv (CSUR) 53(4):1–34

    Article  Google Scholar 

  5. Jainul Rinosha SM, Augasta G (2021) Review of recent advances in visual tracking techniques. Multimedia Tools Appl 80(16):24185–24203

    Google Scholar 

  6. Jeong Y, Son S, Jeong E, Lee B (2018) An integrated self-diagnosis system for an autonomous vehicle based on an IoT gateway and deep learning. Appl Sci 8(7):1164

    Article  Google Scholar 

  7. Bansal P, Kockelman KM (2018) Are we ready to embrace connected and self-driving vehicles? A case study of Texans. Transportation 45(2):641–675

    Google Scholar 

  8. Lotfi F, Taghirad HD (2021) A framework for 3d tracking of frontal dynamic objects in autonomous cars. Expert Syst Appl 183:115343

    Article  Google Scholar 

  9. Shi X, Chai X, Xie J, Sun T (2022) Mc-gcn: a multi-scale contrastive graph convolutional network for unconstrained face recognition with image sets. IEEE Trans Image Process 31:3046–3055

    Article  Google Scholar 

  10. Neto JBC, Ferrari C, Marana AN, Berretti S, Bimbo AD (2022) Learning streamed attention network from descriptor images for cross-resolution 3d face recognition. ACM Trans Multimedia Comput Commun Appl (TOMM) 19(1):1–20

    Google Scholar 

  11. Rezaei F, Yazdi M (2021) Real-time crowd behavior recognition in surveillance videos based on deep learning methods. J Real-Time Image Proc 18(5):1669–1679

    Article  Google Scholar 

  12. Swathi HY, Shivakumar G (2021) Hybrid feature-assisted neural model for crowd behavior analysis. SN Comput Sci 2(4):1–11

    Article  Google Scholar 

  13. Mekler ED, Hornbæk K (2019) A framework for the experience of meaning in human-computer interaction. In: Proceedings of the 2019 CHI conference on human factors in computing systems, pp 1–15

    Google Scholar 

  14. Kashef M, Visvizi A, Troisi O (2021) Smart city as a smart service system: human-computer interaction and smart city surveillance systems. Comput Hum Behav 124:106923

    Article  Google Scholar 

  15. Siddique A, Medeiros H (2020) Tracking passengers and baggage items using multi-camera systems at security checkpoints. arXiv preprint arXiv:2007.07924

  16. Xu R, Nikouei SY, Chen Y, Polunchenko A, Song S, Deng C, Faughnan TR (2018) Real-time human objects tracking for smart surveillance at the edge. In: 2018 IEEE international conference on communications (ICC). IEEE, pp 1–6

    Google Scholar 

  17. Wang Z, Zheng L, Liu Y, Li Y, Wang S (2020) Towards real-time multi-object tracking. In: European conference on computer vision. Springer, Berlin, pp 107–122

    Google Scholar 

  18. Brunetti A, Buongiorno D, Trotta GF, Bevilacqua V (2018) Computer vision and deep learning techniques for pedestrian detection and tracking: a survey. Neurocomputing 300:17–33

    Article  Google Scholar 

  19. Zuo M, Zhu X, Chen Y, Yu J (2022) Survey of object tracking algorithm based on Siamese network. J Phys Conf Ser 2203:012035

    Google Scholar 

  20. Tong K, Wu Y, Zhou F (2020) Recent advances in small object detection based on deep learning: a review. Image Vis Comput 97:103910

    Article  Google Scholar 

  21. Nguyen HQ, Nguyen TB, Le TA, Le TL, Vu TH, Noe A (2019) Comparative evaluation of human detection and tracking approaches for online tracking applications. In: 2019 international conference on advanced technologies for communications (ATC). IEEE, pp 348–353

    Google Scholar 

  22. Llamazares Á, Molinos EJ, Ocaña M (2018) Detection and tracking of moving obstacles (datmo): a review. Robotica 38(5):761–774

    Google Scholar 

  23. Redmon J, Farhadi A. Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767

  24. Soleimanitaleb Z, Keyvanrad MA, Jafari A (2019) Object tracking methods: a review. In: 2019 9th international conference on computer and knowledge engineering (ICCKE). IEEE, pp 282–288

    Google Scholar 

  25. Rakai L, Song H, Sun S, Zhang W, Yang Y (2021) Data association in multiple object tracking: a survey of recent techniques. Expert Syst Appl 116300

    Google Scholar 

  26. Sugirtha T, Sridevi MA (2022) Survey on object detection and tracking in a video sequence. In: Proceedings of international conference on computational intelligence. Springer, Berlin, pp 15–29

    Google Scholar 

  27. Tan L, Dong X, Ma Y, Yu C (2018) A multiple object tracking algorithm based on yolo detection. In: 11th international congress on image and signal processing, biomedical engineering and informatics (CISP-BMEI). IEEE, pp 1–5

    Google Scholar 

  28. Sun S, Akhtar N, Song H, Mian A, Shah M (2019) Deep affinity network for multiple object tracking. IEEE Trans Pattern Anal Mach Intell 43(1):104–119

    Google Scholar 

  29. Lee S, Kim E (2018) Multiple object tracking via feature pyramid Siamese networks. IEEE Access 7:8181–8194

    Article  Google Scholar 

  30. Saleh F, Aliakbarian S, Rezatofighi H, Salzmann M, Gould S (2021) Probabilistic tracklet scoring and inpainting for multiple object tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14329–14339

    Google Scholar 

  31. Peng J, Gu Y, Wang Y, Wang C, Li J, Huang F (2020) Dense scene multiple object tracking with box-plane matching. In: Proceedings of the 28th ACM international conference on multimedia, pp 4615–4619

    Google Scholar 

  32. Zhang J, Zhou S, Chang X, Wan F, Wang J, Wu Y, Huang D (2020) Multiple object tracking by flowing and fusing. arXiv preprint arXiv:2001.11180

  33. Sundararaman R, De Almeida Braga C, Marchand E, Pettre J (2021) Tracking pedestrian heads in dense crowd. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3865–3875

    Google Scholar 

  34. Stadler D, Beyerer J (2021) Multi-pedestrian tracking with clusters. In: 2021 17th IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–10

    Google Scholar 

  35. Wojke N, Bewley A, Paulus D (2017) Simple online and realtime tracking with a deep association metric. In: 2017 IEEE international conference on image processing, pp 3645–3649

    Google Scholar 

  36. https://cocodataset.org/#explore

  37. https://github.com/AlexeyAB/darknet

  38. https://pjreddie.com/darknet/yolov1. Last accessed 2022/08/13

  39. Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271

    Google Scholar 

  40. Wai YJ, Bin Mohd Yussof Z, Bin Salim SI, Chuan LK (2018) Fixed point implementation of Tiny-Yolo-v2 using OpenCL on FPGA. Int J Adv Comput Sci Appl 9(10):506–512

    Google Scholar 

  41. Adarsh P, Rathi P, Kumar M (2020) YOLO v3-tiny: object detection and recognition using one stage improved model. In: 2020 6th international conference on advanced computing and communication systems (ICACCS), Mar 6. IEEE, pp 687–694

    Google Scholar 

  42. Ge Z, Liu S, Wang F, Li Z, Sun J (2021) Yolox: exceeding yolo series in. arXiv preprint arXiv:2107.08430

  43. Xu S, Wang X, Lv W, Chang Q, Cui C, Deng K, Lai B (2022) PP-YOLOE: an evolved version of YOLO. arXiv preprint arXiv:2203.16250

  44. Huang X, Wang X, Lv W, Bai X, Long X, Deng K, Yoshie O (2021) PP-YOLOv2: a practical object detector. arXiv preprint arXiv:2104.10419

  45. Multiple object tracking benchmark, https://motchallenge.net/data/MOT17/. Last accessed: 2022/08/13

  46. Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision, pp 4696–4704

    Google Scholar 

  47. Henschel R, Leal-Taixé L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Computer vision and pattern recognition workshops (CVPRW)

    Google Scholar 

  48. Keuper M, Tang S, Andres B, Brox T, Schiele B (2018) Motion segmentation and multiple object tracking by correlation co-clustering. IEEE Trans Pattern Anal Mach Intell 42(1):140–153

    Google Scholar 

  49. Chen L, Ai H, Zhuang Z, Shang C (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. ICME

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajiv Singh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tyagi, B., Nigam, S., Singh, R. (2023). Human Detection and Tracking Based on YOLOv3 and DeepSORT. In: Sharma, H., Shrivastava, V., Bharti, K.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2022. Lecture Notes in Networks and Systems, vol 686. Springer, Singapore. https://doi.org/10.1007/978-981-99-2100-3_11

Download citation

Publish with us

Policies and ethics