research-article

Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

Authors:
Shingchern D. You

National Taipei University of Technology, Taipei, Taiwan

National Taipei University of Technology, Taipei, Taiwan
View Profile

,
Yi-Han Pu

National Taipei University of Technology, Taipei

National Taipei University of Technology, Taipei
View Profile

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 12 Issue 1Article No.: 1pp 1–22https://doi.org/10.1145/2742059

Published:24 August 2015Publication History

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

This article proposes to use the relative distances between adjacent envelope peaks detected in stereo audio as fingerprints for copy identification. The matching algorithm used is the rough longest common subsequence (RLCS) algorithm. The experimental results show that the proposed approach has better identification accuracy than an MPEG-7 based scheme for distorted and noisy audio. When compared with other schemes, the proposed scheme uses fewer bits with comparable performance. The proposed fingerprints can also be used in conjunction with the MPEG-7 based scheme for lower computational burden.

References

3GPP TS 26.404. 2012. 3rd generation partnership project; technical specification group services and system aspects; general audio codec audio process functions; Enhanced aacPlus general audio codec; enhanced aacPlus encoder SBR part, 3GPP TS 26 404, v 11.0.0 (Sept. 2012).Google Scholar
Shumeet Baluja and Michele Covell. 2007. Audio fingerprinting: combining computer vision and data stream processing. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, II-213--II-216.Google ScholarCross Ref
Carlo Bellettini and Gianluca Mazzini. 2007. On audio recognition performance via robust hashing. In Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems. 20--23.Google ScholarCross Ref
Carlo Bellettini and Gianluca Mazzini. 2010. A framework for robust audio fingerprinting. J. Commun. 5, 5, 409--424.Google ScholarCross Ref
Juan Pablo Bello, Laurent Daudet, Samer Abdalllfh, Chris Duxbury, Mike Davies, and Mark B. Sandler. 2005. A tutorial on onset detection in music signals. IEEE Trans. Speech Audio Process. 13, 5, 1035--1047.Google ScholarCross Ref
Christopher J. C. Burges, John C. Platt, and Soumya Jana. 2003. Distortion discriminant analysis for audio fingerprinting. IEEE Trans. Speech Audio Process. 11, 3, 165--174.Google ScholarCross Ref
Pedro Cano, Eloi Battle, Ton Kalker, and Jaap Haitsma. 2005. A review of audio fingerprinting. J. VLSI Signal Process. 41, 3, 271--284. Google ScholarDigital Library
Vijay Chandrasekha, Matt Sharifi, and David A. Ross. 2011. Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications. In Proceedings of the 12th International Conference on Music Information Retrieval. ISMIR, 801--806.Google Scholar
Holger Crysandt. 2003. Music identification with MPEG-7. In Proceedings of the 115th AES Convention. Paper 5967, AES, New York, 7 pages.Google Scholar
P. J. O. Doets, M. Menor Gisbert and R. L. Lagendijk. 2006. On the comparison of audio fingerprints for extracting quality parameters of compressed audio. In Proc. SPIE 6072, Security, Steganography, and Watermarking of Multimedia Contents VIII. SPIE, L-l--12.Google Scholar
D. Ellis. 2009. Robust landmark-based audio fingerprinting. http://labrosa.ee.columbia.edu/matlab/fingerprint/.Google Scholar
Leandro de C. T. Gomes, Pedro Cano, Emilia Gomez, Madeleine Bonnet, and Eloi Batlle. 2003. Audio watermarking and fingerprinting: for which applications? J. New Music Research, 32, 1, 65--81.Google ScholarCross Ref
Dan Gusfield. 1997. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York. Google ScholarDigital Library
Jaap Haitsma, Michiel van der Veen, Ton Kalker, and Fans Bruekers. 2000. Audio watermarking for monitoring and copy protection. In Proceedings of the ACM Workshops on Multimedia. ACM Press, New York, 119--122. Google ScholarDigital Library
Jaap Haitsma and Ton Kalker. 2002. A highly robust audio fingerprinting system. In Proceedings of the International Conference on Music Information Retrieval. IRCAM, 107--115.Google Scholar
Jaap Haitsma and Ton Kalker. 2003. Speed-change resistant audio fingerprinting using auto-correlation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. IV-728--31.Google ScholarCross Ref
Oliver Hellmuth, Eric Allamance, Markus Cremer, Holger Grossmann, Jurgen Herre, and Thorsten Kastner. 2003. Using MPEG-7 audio fingerprinting in real-world application. In Proceedings of the 115th AES Convention. Paper 5961, AES, New York, 10 pages.Google Scholar
D. S. Hirschberg. Algorithms for the longest common subsequence problem. 1977. J. ACM 24, 4, 664--675. Google ScholarDigital Library
ISO/IEC 11172-3. 1993. Information technology: Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s. Part 3: Audio, IS 11172-3. ISO, Geneva, Switzerland.Google Scholar
ISO/IEC, 13818-3. 1998. Information technology: Generic coding of moving pictures and associated audio information. Part 3: Audio, IS 13818-3, 2nd Ed. ISO, Geneva, Switzerland.Google Scholar
ISO/IEC 15938-4. 2002. Information technology: Multimedia content description interface. Part 4: Audio, IS 15938-4. ISO, Geneva, Switzerland.Google Scholar
Y. Ke, D. Hoiem, and R. Sukthankar. 2005. Computer vision for music identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
Marcos Kiwi, Martin Loebl, and Jiri Matousek. 2005. Expected length of the longest common subsequence for large alphabets. Adv. Math. 197, 480--498.Google ScholarCross Ref
Jui-Yu Lee and Shingchern D. You. 2005. Dimension-reduction technique for MPEG-7 audio descriptors. In Proceedings of the 6th Pacific-Rim Conference on Multimedia. Lecture Notes in Computer Science, vol. 3768, Springer, 526--537. Google ScholarDigital Library
Hwei-Jen Lin, Hung-Hsuan Wu, Chun-Wei Wang. 2011. Music matching based on rough longest common subsequence. J. Inf. Sci. Eng. 27, 1, 95--110.Google Scholar
Mathieu Ramona and Geoffroy Peeters. 2011. Audio identification based on spectral modeling ci barkbands energy and synchronisation through onset detection. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'11). 477--480.Google Scholar
Mathieu Ramona, Sebastien Fenet, Raphael Blouet, Herve Bredin, Thomas Fillon, and Geoffroy Peeters. 2012. A public audio identification evaluation framework for broadcast monitoring. Appl. Artif. Intell. 26, 1--2, 119--136.Google ScholarCross Ref
Mathieu Ramona and Geoffroy Peeters. 2013. Audioprint: An efficient audio fingerprint system based on a novel cost-less synchronization scheme. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'13). 818--822.Google ScholarCross Ref
Avery Li-Chun Wang. 2003. An industrial-strength audio search algorithm. In Proceedings of the International Conference on Music Information Retrieval.Google Scholar
Avery Li-Chun Wang. 2006. The Shazam music recognition service. Commun. ACM 49, 8, 44--48. Google ScholarDigital Library
Shingchern D. You and Fan-Yu Cheng. 2012. Spatial localization evaluation model for parametric stereo audio. Appl. Math. Inf. Sci. 6, S2, 397--402.Google Scholar
Shingchern D. You, Wei-Hwa Chen, and Woei-Kae Chen. 2013. Music identification system using MPEG-7 audio signature descriptors. Sci. World J. 11 pages. DOI:http://dx.doi.org/l0.1155/2013/752464, 2013.Google Scholar
Shingchern D. You, Wei-Hwa. Chen. 2013. Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimedia Tools Appl. 20 pages. DOI:http://dx.doi.org/1O.1007/s11042-013-1670-y. Google ScholarDigital Library

Index Terms

Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

Recommendations

Altered Fingerprints: Analysis and Detection

The widespread deployment of Automated Fingerprint Identification Systems (AFIS) in law enforcement and border control applications has heightened the need for ensuring that these systems are not compromised. While several issues related to fingerprint ...
Read More
Fingerprints Recognition Using Minutiae Extraction: a Fuzzy Approach.
ICIAP '07: Proceedings of the 14th International Conference on Image Analysis and Processing

The aim of this paper is to study the fingerprint verification based on local ridge discontinuities features (minutiae) only using grey scale images. We extract minutiae using two algorithms those following ridge lines and then recording ridge endings ...
Read More
Separating Overlapped Fingerprints

Fingerprint images generally contain either a single fingerprint (e.g., rolled images) or a set of nonoverlapped fingerprints (e.g., slap fingerprints). However, there are situations where several fingerprints overlap on top of each other. Such ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Multimedia Computing, Communications, and Applications Volume 12, Issue 1
August 2015
220 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/2816987
Editor:
Ralf Steinmetz
Technische Universität Darmstadt, Germany
Issue’s Table of Contents
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 August 2015
- Accepted: 1 February 2015
- Revised: 1 May 2014
- Received: 1 October 2013
Published in tomm Volume 12, Issue 1

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
MPEG-7 audio signature descriptor
Music fingerprint
envelope peak
rough longest common subsequence
stereo music
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 145
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Altered Fingerprints: Analysis and Detection

Fingerprints Recognition Using Minutiae Extraction: a Fuzzy Approach.

Separating Overlapped Fingerprints

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Altered Fingerprints: Analysis and Detection

Fingerprints Recognition Using Minutiae Extraction: a Fuzzy Approach.

Separating Overlapped Fingerprints

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media