Abstract
Audio signal is sometimes stored and/or processed in WAV (waveform) format without any knowledge of its previous compression operations. To perform some subsequent processing, such as digital audio forensics, audio enhancement and blind audio quality assessment, it is necessary to identify its compression history. In this article, we will investigate how to identify a decompressed wave audio that went through one of three popular compression schemes, including MP3, WMA (windows media audio) and AAC (advanced audio coding). By analyzing the corresponding frequency coefficients, including modified discrete cosine transform (MDCT) and Mel-frequency cepstral coefficients (MFCCs), of those original audio clips and their decompressed versions with different compression schemes and bit rates, we propose several statistics to identify the compression scheme as well as the corresponding bit rate previously used for a given WAV signal. The experimental results evaluated on 8,800 audio clips with various contents have shown the effectiveness of the proposed method. In addition, some potential applications of the proposed method are discussed.
- P. Bestagini, A. Allam, S. Milani, M. Tagliasacchi, and S. Tubaro. 2012. Video codec identification. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. 2257--2260.Google Scholar
- T. Bianchi, A. Rosa, and M. Fontani. 2013. Detection and classification of double compressed MP3 audio tracks. In Proceedings of the 1st ACM Workshop on Information Hiding and Multimedia Security. 159--164. Google ScholarDigital Library
- C.-C. Chang and C.-J. Lin. 2011. LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 27:1--27:27. Google ScholarDigital Library
- G. Chen, X. Kong, W. Zhong, and B. Wang. 2012. Detection of double mp3 compression based on fluctuation intensity of quantized MDCT coefficients. In Proceedings of the China Information Hiding and Multimedia Security Workshop. 164--167.Google Scholar
- Z. Fan and R. L. De Queiroz. 2003. Identification of bitmap compression history: JPEG detection and quantizer estimation. IEEE Trans. Image Process. 12, 2, 230--235. Google ScholarDigital Library
- Formatfactory. Formatfactory software - http://www.formatoz.com/.Google Scholar
- D. Fu, Y. Shi, and W. Su. 2007. A generalized benford's law for JPEG coefficients and its applications in image forensics. In Proceedings of SPIE on Electronic Imaging, Security, Steganography, and Watermarking of Multimedia Contents. Vol. 6505.Google Scholar
- Goldwave. Goldwave software - http://www.goldwave.ca/.Google Scholar
- GTZAN. GTZAN Genre Collection - http://marsyas.info/download/data sets/.Google Scholar
- S. Hacker. 2000. MP3: The Definitive Guide. O'Reilly Media. Google ScholarDigital Library
- S. Hiçsönmez, H. T. Sencar, and I. Avcibas. 2011. Audio codec identification through payload sampling. In Proceedings of the International Workshop on Information Forensics and Security. Google ScholarDigital Library
- S. Hiçsönmez, E. Uzun, and H. T. Sencar. 2013. Methods for identifying traces of compression in audio. In Proceedings of the 1st International Conference on Communications, Signal Processing, and Their Applications. 1--6.Google Scholar
- F. Jenner and A. Kwasinski. 2012. Highly accurate non-intrusive speech forensics for codec identifications from observed decoded signals. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. Kyoto, 1737--1740.Google Scholar
- C. Kraetzer, A. Oermann, J. Dittmann, and A. Lang. 2007. Digital audio forensics: A first practical evaluation on microphone and environment classification. In Proceedings of the Workshop on Multimedia and security. 63--74. Google ScholarDigital Library
- Lame MP3 Encoder. http://sourceforge.net/projects/lame/.Google Scholar
- Q. Liu, A. Sung, and M. Qiao. 2010. Detection of double mp3 compression. Cognitive Comput. 2, 291--296.Google ScholarCross Ref
- J. Lukáš and J. Fridrich. 2003. Estimation of primary quantization matrix in double compressed JPEG images. In Proceedings of the Digital Forensic Research Workshop.Google Scholar
- D. Luo, W. Luo, R. Yang, and J. Huang. 2012. Compression history identification for digital audio signal. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. 1733--1736.Google Scholar
- W. Luo, J. Huang, and G. Qiu. 2010a. JPEG error analysis and its applications to digital image forensics. IEEE Trans. Inf. Forensics Secur. 5, 3, 480--491. Google ScholarDigital Library
- W. Luo, Y. Wang, and J. Huang. 2010b. Detection of quantization artifacts and its applications to transform encoder identification. IEEE Trans. Inf. Forensics Secur. 5, 4, 810--815. Google ScholarDigital Library
- H. Malik and H. Farid. 2010. Audio forensics from acoustic reverberation. In Proceedings of the International Conference on Acoustics Speech and Signal Processing. 1710--1713.Google Scholar
- MP3Standard. Information technology - coding of moving pictures and associated audio for digital storage media up to about 1.5 mbit/s.Google Scholar
- T. Painter and A. Spanias. 2000. Perceptual coding of digital audio. Proc. IEEE 88, 4, 451--515.Google ScholarCross Ref
- D. Pan. 1995. A tutorial on MPEG/Audio compression. IEEE Multimedia 2, 2, 60--74. Google ScholarDigital Library
- J. P. Princen, A. W. Johnson, and A. B. Bradley. 1987. Subband/transform coding using filter bank designs based on time domain aliasing cancellation. In Proceedings of the Intenational Conference on Acoustics, Speech, and Signal Processing. 2161--2164.Google Scholar
- M. Qiao, A. Sung, and Q. Liu. 2010. Revealing real quality of double compressed MP3 audio. In Proceedings of the International Conference on Multimedia. 1011--1014. Google ScholarDigital Library
- D. Reynolds, T. Quatieri, and R. Dunn. 2000. Speaker verification using adapted gaussian mixture models. Digital Signal Process. 10, 1, 19--41. Google ScholarDigital Library
- M. Tagliasacchi and S. Tubaro. 2010. Blind estimation of the QP parameter in H.264/AVC decoded video. In Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services. 1--4.Google Scholar
- Voicebox. http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html.Google Scholar
- R. Yang, Z. Qu, and J. Huang. 2008. Detecting digital audio forgeries by checking frame offsets. In Proceedings of the ACM Workshop on Multimedia and Security. 21--26. Google ScholarDigital Library
- R. Yang, Y. Shi, and J. Huang. 2009. Defeating fake-quality MP3. In Proceedings of the ACM Workshop on Multimedia and Security. 117--124. Google ScholarDigital Library
- R. Yang, Y. Shi, and J. Huang. 2010. Detecting double compression of audio signal. In Proceedings of SPIE vol. 7541, Media Forensics and Security II.Google Scholar
Index Terms
- Identifying Compression History of Wave Audio and Its Applications
Recommendations
A Tutorial on MPEG/Audio Compression
This tutorial covers the theory behind MPEG/audio compression. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization (ISO) standard for the high fidelity compression of digital ...
Scalable Audio Compression at Low Bitrates
A perceptually scalable audio coder generates a bit-stream that contains layers of audio fidelity and is encoded in such a way that adding one of these layers enhances the reconstructed audio by an amount that is just noticeable by the listener. Such ...
Comments