Abstract
The video forensics capabilities are constantly improving in terms of evidence accumulating, analysis, processing, and storage. Video forensic analysis involves scientific investigation, comparison, and/or assessment of video files that are considered as proof in the court. In this paper, we focus on inter-frame video forgery detection and localization with respect to frame inserting and deleting that are essential from digital video forensics perspective. We design a 3-dimensional convolutional neural network (3DCNN) model for detecting video inter-frame forgery and localize forgery using multi-scale structural similarity index measurement algorithm. We introduce an absolute difference algorithm to differentiate video frames from each other which minimize the temporal redundancy and identify forgery artefacts within the video frames. This improves the 3DCNN efficiency and accuracy in detecting frame insertion and deletion forgery. The proposed model outperforms existing models in terms of accuracy, precision, recall, and F1 score in various post-processing operations, compression rates, and video length.
Similar content being viewed by others
Data availability
The authors confirm that the data supporting the findings of this study are available within the article [and/or] its supplementary materials.
References
Akhtar, N., Saddique, M., Asghar, K., et al.: Digital video tampering detection and localization: review, representations, challenges and algorithm. Mathematics 10(2), 168 (2022)
Bakas, J., Naskar, R., Bakshi, S.: Detection and localization of inter-frame forgeries in videos based on macroblock variation and motion vector analysis. Comput. Electr. Eng. 89(106), 929 (2021)
Fadl, S., Han, Q., Li, Q.: CNN spatiotemporal features and fusion for surveillance video forgery detection. Signal Process. Image Commun. 90(116), 066 (2021)
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning, PMLR, pp. 448–456 (2015)
Ji, S., Xu, W., Yang, M., et al.: 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 221–231 (2012)
Kaur, H., Jindal, N.: Deep convolutional neural network for graphics forgery detection in video. Wirel. Pers. Commun. 112(3), 1763–1781 (2020)
Kingra, S., Aggarwal, N., Singh, R.D.: Inter-frame forgery detection in h. 264 videos using motion and brightness gradients. Multim. Tools Appl. 76(24), 25767–25786 (2017)
Liu, Y., Huang, T.: Exposing video inter-frame forgery by Zernike opponent chromaticity moments and coarseness analysis. Multim. Syst. 23(2), 223–238 (2017)
Long, C., Basharat, A., Hoogs, A.: A coarse-to-fine deep convolutional neural network framework for frame duplication detection and localization in video forgery (2018). arXiv:1811.10762
Nasr, M.A.S., AlRahmawy, M.F., Tolba, A.: Multi-scale structural similarity index for motion detection. J. King Saud Univ.-Comput. Inf. Sci. 29(3), 399–409 (2017)
Nguyen, X.H., Hu, Y., Amin, M.A., et al.: Detecting video inter-frame forgeries based on convolutional neural network model. Int. J. Image Graph. Signal Process. 10(3), 1 (2020)
Nguyen, X.H., Hu, Y.: VIFFD—a dataset for detecting video inter-frame forgeries. Mendeley Data, V6 (2020). https://doi.org/10.17632/r3ss3v53sj.6
Patel, J., Sheth, R.: An optimized convolutional neural network based inter-frame forgery detection model-a multi-feature extraction framework. ICTACT J. Image Video Process. 12(02), 2570–2581 (2021)
Shelke, N.A., Kasana, S.S.: A comprehensive survey on passive techniques for digital video forgery detection. Multim. Tools Appl. 80(4), 6247–6310 (2021)
Soomro, K., Zamir, A.R., Shah, M.: Ucf101: a dataset of 101 human actions classes from videos in the wild (2012). arXiv:1212.0402
Stamm, M.C., Lin, W.S., Liu, K.R.: Temporal forensics and anti-forensics for motion compensated video. IEEE Trans. Inf. Forensics Secur. 7(4), 1315–1329 (2012)
Wang, W., Jiang, X., Wang, S., et al.: Identifying video forgery process using optical flow. In: International workshop on digital watermarking. Springer, pp. 244–257 (2013)
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thrity-seventh asilomar conference on signals, systems & computers, 2003, pp. 1398–1402. IEEE (2003)
Yu, L., Wang, H., Han, Q., et al.: Exposing frame deletion by detecting abrupt changes in video streams. Neurocomputing 205, 84–91 (2016)
Zampoglou, M., Markatopoulou, F., Mercier, G., et al.: Detecting tampered videos with multimedia forensics and deep learning. In: International conference on multimedia modeling, pp. 374–386. Springer (2019)
Zhang, Z., Hou, J., Ma, Q., et al.: Efficient video frame insertion and deletion detection based on inconsistency of correlations between local binary pattern coded frames. Secur. Commun. Netw. 8(2), 311–320 (2015)
Zhao, D.N., Wang, R.K., Lu, Z.M.: Inter-frame passive-blind forgery detection for video shot based on similarity analysis. Multim. Tools Appl. 77(19), 25389–25408 (2018)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gowda, R., Pawar, D. Deep learning-based forgery identification and localization in videos. SIViP 17, 2185–2192 (2023). https://doi.org/10.1007/s11760-022-02433-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02433-7