Abstract
This article proposes to use the relative distances between adjacent envelope peaks detected in stereo audio as fingerprints for copy identification. The matching algorithm used is the rough longest common subsequence (RLCS) algorithm. The experimental results show that the proposed approach has better identification accuracy than an MPEG-7 based scheme for distorted and noisy audio. When compared with other schemes, the proposed scheme uses fewer bits with comparable performance. The proposed fingerprints can also be used in conjunction with the MPEG-7 based scheme for lower computational burden.
- 3GPP TS 26.404. 2012. 3rd generation partnership project; technical specification group services and system aspects; general audio codec audio process functions; Enhanced aacPlus general audio codec; enhanced aacPlus encoder SBR part, 3GPP TS 26 404, v 11.0.0 (Sept. 2012).Google Scholar
- Shumeet Baluja and Michele Covell. 2007. Audio fingerprinting: combining computer vision and data stream processing. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, II-213--II-216.Google ScholarCross Ref
- Carlo Bellettini and Gianluca Mazzini. 2007. On audio recognition performance via robust hashing. In Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems. 20--23.Google ScholarCross Ref
- Carlo Bellettini and Gianluca Mazzini. 2010. A framework for robust audio fingerprinting. J. Commun. 5, 5, 409--424.Google ScholarCross Ref
- Juan Pablo Bello, Laurent Daudet, Samer Abdalllfh, Chris Duxbury, Mike Davies, and Mark B. Sandler. 2005. A tutorial on onset detection in music signals. IEEE Trans. Speech Audio Process. 13, 5, 1035--1047.Google ScholarCross Ref
- Christopher J. C. Burges, John C. Platt, and Soumya Jana. 2003. Distortion discriminant analysis for audio fingerprinting. IEEE Trans. Speech Audio Process. 11, 3, 165--174.Google ScholarCross Ref
- Pedro Cano, Eloi Battle, Ton Kalker, and Jaap Haitsma. 2005. A review of audio fingerprinting. J. VLSI Signal Process. 41, 3, 271--284. Google ScholarDigital Library
- Vijay Chandrasekha, Matt Sharifi, and David A. Ross. 2011. Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications. In Proceedings of the 12th International Conference on Music Information Retrieval. ISMIR, 801--806.Google Scholar
- Holger Crysandt. 2003. Music identification with MPEG-7. In Proceedings of the 115th AES Convention. Paper 5967, AES, New York, 7 pages.Google Scholar
- P. J. O. Doets, M. Menor Gisbert and R. L. Lagendijk. 2006. On the comparison of audio fingerprints for extracting quality parameters of compressed audio. In Proc. SPIE 6072, Security, Steganography, and Watermarking of Multimedia Contents VIII. SPIE, L-l--12.Google Scholar
- D. Ellis. 2009. Robust landmark-based audio fingerprinting. http://labrosa.ee.columbia.edu/matlab/fingerprint/.Google Scholar
- Leandro de C. T. Gomes, Pedro Cano, Emilia Gomez, Madeleine Bonnet, and Eloi Batlle. 2003. Audio watermarking and fingerprinting: for which applications? J. New Music Research, 32, 1, 65--81.Google ScholarCross Ref
- Dan Gusfield. 1997. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York. Google ScholarDigital Library
- Jaap Haitsma, Michiel van der Veen, Ton Kalker, and Fans Bruekers. 2000. Audio watermarking for monitoring and copy protection. In Proceedings of the ACM Workshops on Multimedia. ACM Press, New York, 119--122. Google ScholarDigital Library
- Jaap Haitsma and Ton Kalker. 2002. A highly robust audio fingerprinting system. In Proceedings of the International Conference on Music Information Retrieval. IRCAM, 107--115.Google Scholar
- Jaap Haitsma and Ton Kalker. 2003. Speed-change resistant audio fingerprinting using auto-correlation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. IV-728--31.Google ScholarCross Ref
- Oliver Hellmuth, Eric Allamance, Markus Cremer, Holger Grossmann, Jurgen Herre, and Thorsten Kastner. 2003. Using MPEG-7 audio fingerprinting in real-world application. In Proceedings of the 115th AES Convention. Paper 5961, AES, New York, 10 pages.Google Scholar
- D. S. Hirschberg. Algorithms for the longest common subsequence problem. 1977. J. ACM 24, 4, 664--675. Google ScholarDigital Library
- ISO/IEC 11172-3. 1993. Information technology: Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s. Part 3: Audio, IS 11172-3. ISO, Geneva, Switzerland.Google Scholar
- ISO/IEC, 13818-3. 1998. Information technology: Generic coding of moving pictures and associated audio information. Part 3: Audio, IS 13818-3, 2nd Ed. ISO, Geneva, Switzerland.Google Scholar
- ISO/IEC 15938-4. 2002. Information technology: Multimedia content description interface. Part 4: Audio, IS 15938-4. ISO, Geneva, Switzerland.Google Scholar
- Y. Ke, D. Hoiem, and R. Sukthankar. 2005. Computer vision for music identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
- Marcos Kiwi, Martin Loebl, and Jiri Matousek. 2005. Expected length of the longest common subsequence for large alphabets. Adv. Math. 197, 480--498.Google ScholarCross Ref
- Jui-Yu Lee and Shingchern D. You. 2005. Dimension-reduction technique for MPEG-7 audio descriptors. In Proceedings of the 6th Pacific-Rim Conference on Multimedia. Lecture Notes in Computer Science, vol. 3768, Springer, 526--537. Google ScholarDigital Library
- Hwei-Jen Lin, Hung-Hsuan Wu, Chun-Wei Wang. 2011. Music matching based on rough longest common subsequence. J. Inf. Sci. Eng. 27, 1, 95--110.Google Scholar
- Mathieu Ramona and Geoffroy Peeters. 2011. Audio identification based on spectral modeling ci barkbands energy and synchronisation through onset detection. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'11). 477--480.Google Scholar
- Mathieu Ramona, Sebastien Fenet, Raphael Blouet, Herve Bredin, Thomas Fillon, and Geoffroy Peeters. 2012. A public audio identification evaluation framework for broadcast monitoring. Appl. Artif. Intell. 26, 1--2, 119--136.Google ScholarCross Ref
- Mathieu Ramona and Geoffroy Peeters. 2013. Audioprint: An efficient audio fingerprint system based on a novel cost-less synchronization scheme. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'13). 818--822.Google ScholarCross Ref
- Avery Li-Chun Wang. 2003. An industrial-strength audio search algorithm. In Proceedings of the International Conference on Music Information Retrieval.Google Scholar
- Avery Li-Chun Wang. 2006. The Shazam music recognition service. Commun. ACM 49, 8, 44--48. Google ScholarDigital Library
- Shingchern D. You and Fan-Yu Cheng. 2012. Spatial localization evaluation model for parametric stereo audio. Appl. Math. Inf. Sci. 6, S2, 397--402.Google Scholar
- Shingchern D. You, Wei-Hwa Chen, and Woei-Kae Chen. 2013. Music identification system using MPEG-7 audio signature descriptors. Sci. World J. 11 pages. DOI:http://dx.doi.org/l0.1155/2013/752464, 2013.Google Scholar
- Shingchern D. You, Wei-Hwa. Chen. 2013. Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimedia Tools Appl. 20 pages. DOI:http://dx.doi.org/1O.1007/s11042-013-1670-y. Google ScholarDigital Library
Index Terms
- Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification
Recommendations
Altered Fingerprints: Analysis and Detection
The widespread deployment of Automated Fingerprint Identification Systems (AFIS) in law enforcement and border control applications has heightened the need for ensuring that these systems are not compromised. While several issues related to fingerprint ...
Fingerprints Recognition Using Minutiae Extraction: a Fuzzy Approach.
ICIAP '07: Proceedings of the 14th International Conference on Image Analysis and ProcessingThe aim of this paper is to study the fingerprint verification based on local ridge discontinuities features (minutiae) only using grey scale images. We extract minutiae using two algorithms those following ridge lines and then recording ridge endings ...
Separating Overlapped Fingerprints
Fingerprint images generally contain either a single fingerprint (e.g., rolled images) or a set of nonoverlapped fingerprints (e.g., slap fingerprints). However, there are situations where several fingerprints overlap on top of each other. Such ...
Comments