DOI: 10.1145/1991996.1992027
research-article

A comparison of extended fingerprint hashing and locality sensitive hashing for binary audio fingerprints

Published: 18 April 2011

ABSTRACT

Hash tables have been proposed for indexing high-dimensional binary vectors, specifically for identifying media by their fingerprints. In this paper we develop a new model to predict the performance of a hash-based method (Fingerprint Hashing) under varying levels of noise. We show that adjusting two parameters achieves robustness to a higher level of noise. We extend Fingerprint Hashing to a multi-table range search (Extended Fingerprint Hashing) and show that this approach also increases robustness to noise. We then show the relationship between Extended Fingerprint Hashing and Locality Sensitive Hashing and investigate design choices for dealing with higher noise levels. If index size must be held constant, the Extended Fingerprint Hash is the superior method. We also show that, to achieve similar performance at a given level of noise, a Locality Sensitive Hash requires a nearly six-fold increase in index size, which is likely to be impractical for many applications.
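As a rough illustration of the multi-table idea behind Locality Sensitive Hashing on binary vectors, the Python sketch below uses bit sampling: each table keys a fingerprint on its own random subset of bit positions, and a query collects candidates from every table before ranking them by Hamming distance. This is a generic scheme and not the Fingerprint Hashing or Extended Fingerprint Hashing method of the paper. The class name `BitSamplingLSH`, its parameters, the use of Python integers as fingerprints, and the independent bit-error model in `collision_probability` are all assumptions made for the example.

```python
import random
from collections import defaultdict


def collision_probability(p, k, num_tables):
    """Chance that a noisy query matches its original in at least one table.

    Assumes each bit flips independently with probability p (an assumed
    noise model for this sketch), with k sampled bits per table.
    """
    per_table = (1.0 - p) ** k          # all k sampled bits survive
    return 1.0 - (1.0 - per_table) ** num_tables


class BitSamplingLSH:
    """Hypothetical multi-table index over fixed-width binary fingerprints."""

    def __init__(self, n_bits, k, num_tables, seed=0):
        rng = random.Random(seed)
        # Each table hashes on its own random subset of k bit positions.
        self.positions = [rng.sample(range(n_bits), k)
                          for _ in range(num_tables)]
        self.tables = [defaultdict(list) for _ in range(num_tables)]
        self.items = {}

    @staticmethod
    def _key(fp, positions):
        # Project the fingerprint onto the sampled bit positions.
        return tuple((fp >> i) & 1 for i in positions)

    def add(self, item_id, fp):
        # Every item is stored once per table.
        self.items[item_id] = fp
        for positions, table in zip(self.positions, self.tables):
            table[self._key(fp, positions)].append(item_id)

    def query(self, fp):
        # Gather candidates from all tables, then rank by Hamming distance.
        candidates = set()
        for positions, table in zip(self.positions, self.tables):
            candidates.update(table.get(self._key(fp, positions), ()))
        if not candidates:
            return None
        return min(candidates,
                   key=lambda i: bin(self.items[i] ^ fp).count("1"))


index = BitSamplingLSH(n_bits=32, k=8, num_tables=4, seed=1)
index.add("track-a", 0xDEADBEEF)
index.add("track-b", 0x12345678)
best = index.query(0xDEADBEEF)
```

In this sketch every item is stored once per table, so the index grows linearly with the number of tables. That is the kind of size-versus-robustness trade-off the abstract refers to, though the figures reported in the paper come from its own model and experiments, not from this example.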

        Published in

        ICMR '11: Proceedings of the 1st ACM International Conference on Multimedia Retrieval
        April 2011
        512 pages
        ISBN: 9781450303361
        DOI: 10.1145/1991996

        Copyright © 2011 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        Overall Acceptance Rate: 254 of 830 submissions, 31%
