Dual-branch network with memory for video anomaly detection

Wang, Dicong; Hu, Qinghua; Wu, Kaijun

doi:10.1007/s00530-022-00991-x

Dual-branch network with memory for video anomaly detection

Regular Paper
Published: 24 August 2022

Volume 29, pages 247–259, (2023)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Dicong Wang^1,2,
Qinghua Hu¹ &
Kaijun Wu²

522 Accesses
Explore all metrics

Abstract

Anomaly event detection is a video surveillance technology automatically analyzing video sequences without manual intervention by employing machine learning and computer vision technology. In the existing approaches, most of them are utilized to reconstruct or predict the video frame based on an autoencoder (AE). However, impacted by the powerful characterization capabilities of Convolutional Neural Network (CNN), abnormal frames will be improperly reconstructed into normal frames. To solve the above issue, an autoencoder, based on a branch framework of reconstruction and prediction in training, is proposed. A memory module is adopted to reduce the reconstruction error, which is capable of enhancing the robustness of the autoencoder as a prototype memory module. The prediction of high-quality future frames can effectively prevent the reconstruction of abnormal frames, and the two branches can be supplemented with their respective loss functions, thus further elevating the performance of video anomaly detection. The framework for this study is trained from end to end. The methodology put forth in this article is extensively verified on three publicly available data sets, and its robustness to the uncertainty for the common occurrence as well as the efficiency to the sensitivity for the abnormalies are also confirmed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

CBAM: Convolutional Block Attention Module

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Article 04 June 2022

Deepfake: An Overview

References

Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. (CSUR) 41(3), 1–58 (2009)
Article Google Scholar
Khaleghi, A., Mohammad, MS.: Improved anomaly detection in surveillance videos based on a deep learning method. 2018 8th Conference of AI & Robotics and 10th RoboCup Iranopen International Symposium (IRANOPEN). (2018)
Saligrama, V., Konrad, J., Jodoin, P.-M.: Video anomaly identification. IEEE Signal Process. Mag. 27(5), 18–33 (2010). https://doi.org/10.1109/MSP.2010.937393
Article Google Scholar
Suarez, J.J.P., Prospero, C., Naval Jr.: A survey on deep learning techniques for video anomaly detection. arXiv preprint (2020). arXiv:2009.14146.https://doi.org/10.48550/arXiv.2009.14146
Wang, Z., Yang, Z.: A promotion method for generation error-based video anomaly detection. Pattern Recognit. Lett. 140, 88–94 (2020). https://doi.org/10.1016/j.patrec.2020.09.019
Article Google Scholar
Graves, A., Wayne, G., Reynolds, M., Harley, T., Danihelka, I., Grabska-Barwinska, A., Colmenarejo, S.G., Grefenstette, E., Ramalho, T., Agapiou, J.: Hybrid computing using a neural network with dynamic external memory. Nature 538(7626), 471–476 (2016)
Article Google Scholar
Luo, W., Wen, L., Shenghua, G.: Remembering history with convolutional lstm for anomaly detection. 2017 IEEE International Conference on Multimedia and Expo (ICME). (2017). https://doi.org/10.1109/ICME.2017.8019325
Liu, W., Luo, W., Lian, D.: Future frame prediction for anomaly detection–a new baseline. 2018 Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, June 18–23, 2018, Salt Lake City, UT, USA. New York, 6536–6545. (2018). https:// doi.org/https://doi.org/10.48550/arXiv.1712.09867
Park, H., Jongyoun, N., Bumsub, H.: Learning memory-guided normality for anomaly detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. (2020)
Lai, Y., Yahong, H., Yaowei, W.: Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings. (2021). arXiv preprint arXiv:2104.14945
Zong, B.: Deep autoencoding gaussian mixture model for unsupervised anomaly detection. International conference on learning representations. (2018)
Dong, G., Lingqiao, L., Vuong, L., Budhaditya, S., Moussa, R.M., Svetha, V., Anton van den H.: Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. Proceedings of the IEEE/CVF International Conference on Computer Vision. (2019)
Li, W., Mahadevan, V., Vasconcelos, N.: Anomaly detection and localization in crowded scenes. IEEE Trans. Pattern anal. Mach. Intell. 36(1), 18–32 (2013). https://doi.org/10.1109/TPAMI.2013.111
Article Google Scholar
Lu, C., Jianping, S., Jiaya, J.: Abnormal event detection at 150 fps in matlab. Proceedings of the IEEE international conference on computer vision. (2013)
Luo, W., Wen, L., Shenghua, G.: A revisit of sparse coding based anomaly detection in stacked RNN framework. Proceedings of the IEEE International Conference on Computer Vision. (2017)
Chen, Y., Xiang, S.Z., Thomas S.H.: One-class SVM for learning in image retrieval. Proceedings 2001 International Conference on Image Processing (Cat. No. 01CH37205). (2001)
Schölkopf, B., Alexander, J.S., Francis, B.: Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT Press (2002)
Google Scholar
Lukas, R., Robert, V., Nico, G., Lucas, D., Shoaib A.S., Alexander, B., Emmanuel M., Marius, K.: Deep one-class classification. International conference on machine learning. PMLR. (2018)
Chalapathy, R., Aditya K.M., Sanjay, C.: Anomaly detection using one-class neural networks. (2018) arXiv preprint arXiv:1802.06360. https://doi.org/10.48550/arXiv.1802.06360
Zimek, A., Schubert, E., Kriegel, H.-P.: A survey on unsupervised outlier detection in high-dimensional numerical data. Stat Anal. Data Min 5(5), 363–387 (2012)
Article MathSciNet MATH Google Scholar
Zhou, J.T., Du, J., Zhu, H., Peng, X., Liu, Y., Goh, R.S.M.: Anomalynet: an anomaly detection network for video surveillance. IEEE Trans. Inform. Forensic. Secur. 14(10), 2537–2550 (2019)
Article Google Scholar
Ganokratanaa, T., Aramvith, S., Sebe, N.: Video anomaly detection using deep residual-spatiotemporal translation network. Pattern Recognition Letters. Volume 155, March, Pages 143–150 (2022)
Sabzalian, B., Marvi, H., Ahmadyfard, A.: Deep and sparse features for anomaly detection and localization in video. In: 2019 4th International Conference on Pattern Recognition and Image Analysis. IPRIA, 173–178 (2019)
Xu, K., Sun, T., Jiang, X.: Video anomaly detection and localization based on an adaptive intra-frame classification network. IEEE Trans. Multimed. 22(2), 394–406 (2020)
Article Google Scholar
Landi, F., Snoek, C.G.M., Cucchiara, R.: Anomaly locality in video surveillance. ArXiv abs/1901.10364 (2019)
Fan, Y., Wen, G., Li, D., Qiu, S., Levine, M.D., Xiao, F.: Video anomaly detection and localization via Gaussian mixture fully convolutional variational autoencoder. Comput. Vis. Image Underst. 195, 102920 (2020). https://doi.org/10.1016/j.cviu.2020.102920
Article Google Scholar
Mohammadi, B., Mahmood, F., Mohammad, S.: Image/video deep anomaly detection: a survey. (2021) arXiv preprint arXiv:2103.01739
Pang, G., Yan, C., Shen, C., et al.: Self-trained deep ordinal regression for end-to-end video anomaly detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12173–12182 (2020)
Hasan, M., Choi, J., Neumann, J., Amit K.R.-C., Larry S.D.: Learning temporal regularity in video sequences. Proceedings of the IEEE conference on computer vision and pattern recognition. (2016)
Zhao, Y., Deng, B., Shen, C., Liu, Y., Lu, H., Hua, X-S.: Spatio-temporal autoencoder for video anomaly detection. Proceedings of the 25th ACM international conference on Multimedia. (2017)
Ravanbakhsh, M., Nabi, M., Sangineto, E., Marcenaro, L., Regazzoni, C., Sebe, N.: Abnormal event detection in videos using generative adversarial nets. 2017 IEEE International Conference on Image Processing, ICIP. (2017)
Li, D., Nie, X., et al.: Context-related video anomaly detection via generative adversarial network. Pattern Recognit. Lett. 156, 183–189 (2022). https://doi.org/10.1016/j.patrec.2022.03.004
Article Google Scholar
Medel, J.R., Andreas, S.: Anomaly detection in vieo using predictive convolutional long short-term memory networks. (2016). arXiv preprint arXiv:1612.00390
Yu, G., Wang, S., Cai, Z., Zhu, E., Xu, C., Yin, J., Kloft, M.: Cloze test helps: Effective video anomaly detection via learning to complete video events. In Proceedings of the 28th ACM International Conference on Multimedia, 583–591 (2020)
Salisu, A.: Towards a data-driven adaptive anomaly detection system for human activity. Pattern Recognit. Lett. 145, 200–207 (2021)
Article Google Scholar
Yao, Y., Wang, X., Xu, M., Pu, Z., Atkins, E., Crandall, D.: When, where, and what? A new dataset for anomaly detection in driving videos. (2020) arXiv preprint arXiv:2004.03044
Yao, T.: Integrating prediction and reconstruction for anomaly detection. Pattern Recognit. Lett. 129, 123–130 (2020)
Article Google Scholar
Nguyen, T.-N., Jean, M. Anomaly detection in video sequence with appearance-motion correspondence. Proceedings of the IEEE/CVF International Conference on Computer Vision. (2019)
Chang, Y., Tu, Z., Xie, W., et al.: Video anomaly detection with spatio-temporal dissociation[J]. Pattern Recognit. 122, 108213 (2022)
Article Google Scholar
Liu, Z., Nie, Y., Long, C.: A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction Proceedings of the IEEE/CVF International Conference on Computer Vision. 13588–13597 (2021)
Ronneberger, O., Philipp, F., Thomas, B.: U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical image computing and computer-assisted intervention. Springer, Cham. (2015)
Mathieu, M., Camille, C., Yann, L.: Deep multi-scale video prediction beyond mean square error. (2015) arXiv preprint arXiv:1511.05440.https://doi.org/10.48550/arXiv.1511.05440
Müller, S.G., Frank, H.: Trivialaugment: Tuning-free yet state-of-the-art data augmentation. Proceedings of the IEEE/CVF International Conference on Computer Vision. (2021)
Cubuk, E.D.: Autoaugment: Learning augmentation policies from data. (2018) arXiv preprint arXiv:1805.09501. https://doi.org/10.48550/arXiv.1805.09501
Ho, D.: Population based augmentation: Efficient learning of augmentation policy schedules. International Conference on Machine Learning. PMLR. (2019)
Cubuk, E.D.: Randaugment: Practical automated data augmentation with a reduced search space. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. (2020)
Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, Lin Z, Desmaison A, Antiga L, Lerer A.: Automatic differentiation in pytorch. NIPS. 30 (2017)
Kim, J., Kristen, G.: Observe locally, infer globally: a space-time MRF for detecting abnormal activities with incremental updates. 2009 IEEE conference on computer vision and pattern recognition. IEEE. (2009)
Mahadevan, V., Li, W., Bhalodia, V., Vasconcelos, N.: Anomaly detection in crowded scenes. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE. (2010)
Xu, D., Ricci, E., Yan, Y., Song, J., Sebe, N.: Learning deep representations of appearance and motion for anomalous event detection. (2015) arXiv preprint arXiv:1510.01553. https://doi.org/10.48550/arXiv.1510.01553
Tudor Ionescu, R., Smeureanu, S., Alexe B., Popescu M.: Unmasking the abnormal events in video. Proceedings of the IEEE international conference on computer vision. (2017)
Hinami, R., Tao M., Shin'ichi S.: Joint detection and recounting of abnormal events by learning deep generic knowledge. Proceedings of the IEEE International Conference on Computer Vision. (2017)
Chang, Y., Tu, Z., Xie, W., et al.: Clustering driven deep autoencoder for video anomaly detection[C]//European Conference on Computer Vision. Springer, Cham, 329–345 (2020)
Pang, W., He, Q., Li, Y.: Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection. Multimed. Syst. (2022). https://doi.org/10.1007/s00530-022-00915-9
Article Google Scholar
Le, V.T., Kim, Y.G.: Attention-based residual autoencoder for video anomaly detection[J]. Appl. Intell. (2022). https://doi.org/10.1007/s10489-022-03613-1
Article Google Scholar
Chang, Y., Tu, Z., Xie, W., Yuan, J.: Clustering driven deep autoencoder for video anomaly detection, in Proceedings of the European Conference on Computer Vision. Springer, 329–345 (2020)

Download references

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China under Grant No. 61966022, in part by the Natural Science Foundation of Gansu Province (No.21JR7RA300), Open project of Gansu Dunhuang Cultural Relics Protection and Research Center (No. GDW2021YB15), Excellent graduate innovation in Gansu Project of the Stars (2021CXZX-555).

Author information

Authors and Affiliations

College of Intelligence and Computing, Tianjin University, Tianjin, 300354, China
Dicong Wang & Qinghua Hu
School of Electronic and Information Engineering, LanZhou JiaoTong University, Lanzhou, 730070, China
Dicong Wang & Kaijun Wu

Authors

Dicong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qinghua Hu
View author publications
You can also search for this author in PubMed Google Scholar
Kaijun Wu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, datacollection and analysis were performed by [DW]. The first draft of the manuscript was written by [DW] and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Dicong Wang.

Ethics declarations

Conflict of interest

The authors have declared no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, D., Hu, Q. & Wu, K. Dual-branch network with memory for video anomaly detection. Multimedia Systems 29, 247–259 (2023). https://doi.org/10.1007/s00530-022-00991-x

Download citation

Received: 12 May 2022
Accepted: 11 August 2022
Published: 24 August 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s00530-022-00991-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dual-branch network with memory for video anomaly detection

Abstract

Access this article

Similar content being viewed by others

CBAM: Convolutional Block Attention Module

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Deepfake: An Overview

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Dual-branch network with memory for video anomaly detection

Abstract

Access this article

Similar content being viewed by others

CBAM: Convolutional Block Attention Module

Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward

Deepfake: An Overview

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation