skip to main content
research-article

Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

Published:24 August 2015Publication History
Skip Abstract Section

Abstract

This article proposes to use the relative distances between adjacent envelope peaks detected in stereo audio as fingerprints for copy identification. The matching algorithm used is the rough longest common subsequence (RLCS) algorithm. The experimental results show that the proposed approach has better identification accuracy than an MPEG-7 based scheme for distorted and noisy audio. When compared with other schemes, the proposed scheme uses fewer bits with comparable performance. The proposed fingerprints can also be used in conjunction with the MPEG-7 based scheme for lower computational burden.

References

  1. 3GPP TS 26.404. 2012. 3rd generation partnership project; technical specification group services and system aspects; general audio codec audio process functions; Enhanced aacPlus general audio codec; enhanced aacPlus encoder SBR part, 3GPP TS 26 404, v 11.0.0 (Sept. 2012).Google ScholarGoogle Scholar
  2. Shumeet Baluja and Michele Covell. 2007. Audio fingerprinting: combining computer vision and data stream processing. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, II-213--II-216.Google ScholarGoogle ScholarCross RefCross Ref
  3. Carlo Bellettini and Gianluca Mazzini. 2007. On audio recognition performance via robust hashing. In Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems. 20--23.Google ScholarGoogle ScholarCross RefCross Ref
  4. Carlo Bellettini and Gianluca Mazzini. 2010. A framework for robust audio fingerprinting. J. Commun. 5, 5, 409--424.Google ScholarGoogle ScholarCross RefCross Ref
  5. Juan Pablo Bello, Laurent Daudet, Samer Abdalllfh, Chris Duxbury, Mike Davies, and Mark B. Sandler. 2005. A tutorial on onset detection in music signals. IEEE Trans. Speech Audio Process. 13, 5, 1035--1047.Google ScholarGoogle ScholarCross RefCross Ref
  6. Christopher J. C. Burges, John C. Platt, and Soumya Jana. 2003. Distortion discriminant analysis for audio fingerprinting. IEEE Trans. Speech Audio Process. 11, 3, 165--174.Google ScholarGoogle ScholarCross RefCross Ref
  7. Pedro Cano, Eloi Battle, Ton Kalker, and Jaap Haitsma. 2005. A review of audio fingerprinting. J. VLSI Signal Process. 41, 3, 271--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Vijay Chandrasekha, Matt Sharifi, and David A. Ross. 2011. Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications. In Proceedings of the 12th International Conference on Music Information Retrieval. ISMIR, 801--806.Google ScholarGoogle Scholar
  9. Holger Crysandt. 2003. Music identification with MPEG-7. In Proceedings of the 115th AES Convention. Paper 5967, AES, New York, 7 pages.Google ScholarGoogle Scholar
  10. P. J. O. Doets, M. Menor Gisbert and R. L. Lagendijk. 2006. On the comparison of audio fingerprints for extracting quality parameters of compressed audio. In Proc. SPIE 6072, Security, Steganography, and Watermarking of Multimedia Contents VIII. SPIE, L-l--12.Google ScholarGoogle Scholar
  11. D. Ellis. 2009. Robust landmark-based audio fingerprinting. http://labrosa.ee.columbia.edu/matlab/fingerprint/.Google ScholarGoogle Scholar
  12. Leandro de C. T. Gomes, Pedro Cano, Emilia Gomez, Madeleine Bonnet, and Eloi Batlle. 2003. Audio watermarking and fingerprinting: for which applications? J. New Music Research, 32, 1, 65--81.Google ScholarGoogle ScholarCross RefCross Ref
  13. Dan Gusfield. 1997. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jaap Haitsma, Michiel van der Veen, Ton Kalker, and Fans Bruekers. 2000. Audio watermarking for monitoring and copy protection. In Proceedings of the ACM Workshops on Multimedia. ACM Press, New York, 119--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jaap Haitsma and Ton Kalker. 2002. A highly robust audio fingerprinting system. In Proceedings of the International Conference on Music Information Retrieval. IRCAM, 107--115.Google ScholarGoogle Scholar
  16. Jaap Haitsma and Ton Kalker. 2003. Speed-change resistant audio fingerprinting using auto-correlation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. IV-728--31.Google ScholarGoogle ScholarCross RefCross Ref
  17. Oliver Hellmuth, Eric Allamance, Markus Cremer, Holger Grossmann, Jurgen Herre, and Thorsten Kastner. 2003. Using MPEG-7 audio fingerprinting in real-world application. In Proceedings of the 115th AES Convention. Paper 5961, AES, New York, 10 pages.Google ScholarGoogle Scholar
  18. D. S. Hirschberg. Algorithms for the longest common subsequence problem. 1977. J. ACM 24, 4, 664--675. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. ISO/IEC 11172-3. 1993. Information technology: Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s. Part 3: Audio, IS 11172-3. ISO, Geneva, Switzerland.Google ScholarGoogle Scholar
  20. ISO/IEC, 13818-3. 1998. Information technology: Generic coding of moving pictures and associated audio information. Part 3: Audio, IS 13818-3, 2nd Ed. ISO, Geneva, Switzerland.Google ScholarGoogle Scholar
  21. ISO/IEC 15938-4. 2002. Information technology: Multimedia content description interface. Part 4: Audio, IS 15938-4. ISO, Geneva, Switzerland.Google ScholarGoogle Scholar
  22. Y. Ke, D. Hoiem, and R. Sukthankar. 2005. Computer vision for music identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Marcos Kiwi, Martin Loebl, and Jiri Matousek. 2005. Expected length of the longest common subsequence for large alphabets. Adv. Math. 197, 480--498.Google ScholarGoogle ScholarCross RefCross Ref
  24. Jui-Yu Lee and Shingchern D. You. 2005. Dimension-reduction technique for MPEG-7 audio descriptors. In Proceedings of the 6th Pacific-Rim Conference on Multimedia. Lecture Notes in Computer Science, vol. 3768, Springer, 526--537. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Hwei-Jen Lin, Hung-Hsuan Wu, Chun-Wei Wang. 2011. Music matching based on rough longest common subsequence. J. Inf. Sci. Eng. 27, 1, 95--110.Google ScholarGoogle Scholar
  26. Mathieu Ramona and Geoffroy Peeters. 2011. Audio identification based on spectral modeling ci barkbands energy and synchronisation through onset detection. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'11). 477--480.Google ScholarGoogle Scholar
  27. Mathieu Ramona, Sebastien Fenet, Raphael Blouet, Herve Bredin, Thomas Fillon, and Geoffroy Peeters. 2012. A public audio identification evaluation framework for broadcast monitoring. Appl. Artif. Intell. 26, 1--2, 119--136.Google ScholarGoogle ScholarCross RefCross Ref
  28. Mathieu Ramona and Geoffroy Peeters. 2013. Audioprint: An efficient audio fingerprint system based on a novel cost-less synchronization scheme. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'13). 818--822.Google ScholarGoogle ScholarCross RefCross Ref
  29. Avery Li-Chun Wang. 2003. An industrial-strength audio search algorithm. In Proceedings of the International Conference on Music Information Retrieval.Google ScholarGoogle Scholar
  30. Avery Li-Chun Wang. 2006. The Shazam music recognition service. Commun. ACM 49, 8, 44--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Shingchern D. You and Fan-Yu Cheng. 2012. Spatial localization evaluation model for parametric stereo audio. Appl. Math. Inf. Sci. 6, S2, 397--402.Google ScholarGoogle Scholar
  32. Shingchern D. You, Wei-Hwa Chen, and Woei-Kae Chen. 2013. Music identification system using MPEG-7 audio signature descriptors. Sci. World J. 11 pages. DOI:http://dx.doi.org/l0.1155/2013/752464, 2013.Google ScholarGoogle Scholar
  33. Shingchern D. You, Wei-Hwa. Chen. 2013. Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimedia Tools Appl. 20 pages. DOI:http://dx.doi.org/1O.1007/s11042-013-1670-y. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Multimedia Computing, Communications, and Applications
              ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 12, Issue 1
              August 2015
              220 pages
              ISSN:1551-6857
              EISSN:1551-6865
              DOI:10.1145/2816987
              Issue’s Table of Contents

              Copyright © 2015 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 24 August 2015
              • Accepted: 1 February 2015
              • Revised: 1 May 2014
              • Received: 1 October 2013
              Published in tomm Volume 12, Issue 1

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader