Landmark-based music recognition system optimisation using genetic algorithms

Gutiérrez, Salvador; García, Salvador

doi:10.1007/s11042-015-2963-0

Landmark-based music recognition system optimisation using genetic algorithms

Published: 15 October 2015

Volume 75, pages 16905–16922, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Salvador Gutiérrez¹ &
Salvador García^2,3

327 Accesses
2 Citations
Explore all metrics

Abstract

Audio fingerprinting allows us to label an unidentified music fragment within a previously generated database. The use of spectral landmarks aims to obtain a robustness that lets a certain level of noise be present in the audio query. This group of audio identification algorithms holds several configuration parameters whose values are usually chosen based upon the researcher’s knowledge, previous published experimentation or just trial and error methods. In this paper we describe the whole optimisation process of a Landmark-based Music Recognition System using genetic algorithms. We define the actual structure of the algorithm as a chromosome by transforming its high relevant parameters into various genes and building up an appropriate fitness evaluation method. The optimised output parameters are used to set up a complete system that is compared with a non-optimised one by designing an unbiased evaluation model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review on genetic algorithm: past, present, and future

Article 31 October 2020

Sourabh Katoch, Sumit Singh Chauhan & Vijay Kumar

Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning

Article 19 January 2024

Benyamin Abdollahzadeh, Nima Khodadadi, … Seyedali Mirjalili

Genetic algorithms: theory, genetic operators, solutions, and applications

Article 03 February 2023

Bushra Alhijawi & Arafat Awajan

Notes

Function m a x creates a new vector taking the higher value for each i position from the vectors

References

Almeida LB (1994) Fractional fourier transform and time-frequency representations. IEEE Trans Signal Process 42(11):3084–3091
Article Google Scholar
Apelblat A (2012) Laplace transforms and their applications. Nova Science Publishers
Bellettini C, Mazzini G (2010) A framework for robust audio fingerprinting. J Commun 5(5):409–424
Article Google Scholar
Buqing C, Jianxun L, Liu X, Li B, Dong Z, Kang G (2013) CHC-TSCM: A trustworthy service composition method based on an improved CHC genetic algorithm. Communications, China 10(12):77–91
Article Google Scholar
Cano P, Batlle E, Kalker T, Haitsma J (2005) A review of audio fingerprinting. Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology 41(3):271–284. (SPEC, ISS.)
Article Google Scholar
Casey MA, Veltkamp R, Goto M, Leman M, Rhodes C, Slaney M (2008) Content-based music information retrieval: Current directions and future challenges. Proc IEEE 96(4):668–696
Article Google Scholar
Chen W-H, Smith CH, Fralick SC (1977) Fast computational algorithm for the discrete cosine transform. IEEE Trans Commun COM-25(9):1004–1009
Article MATH Google Scholar
Cordon O, Damas S, Santamaria J (2006) Feature-based image registration by means of the CHC evolutionary algorithm. Image Vis Comput 24(5):525–533
Article Google Scholar
Deng J, Wan W, Swaminathan R, Yu X, Pan X (2011) An audio fingerprinting system based on spectral energy structure. In: IET International Conference on Smart and Sustainable City, 2011, 27
Deng J, Wan W, Yu X, Pan X, Yang W (2011) Audio fingerprinting based on harmonic enhancement and spectral subband centroid. In: IET International Communication Conference on Wireless Mobile and Computing, 2011, 93–96
Duhamel P, Vetterli M (1990) Fast fourier transforms: A tutorial review and a state of the art. Signal Process 19(4):259–299
Article MathSciNet MATH Google Scholar
Dupraz E, Richard G (2010) Robust frequency-based audio fingerprinting. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 281–284
Eiben AE, Smith JE (2003) Introduction to Evolutionary Computing. Springer
Ellis D (2009) Robust landmark-based audio fingerprinting. Available at http://labrosa.ee.columbia.edu/matlab/fingerprint/. Accessed: 2015-01-20.
Eshelman LJ (1991) The CHC adaptive search algorithm: How to have safe search when engaging in nontraditional genetic recombination. Proceedings of the First Workshop on Foundations of Genetic Algorithms, 265–283
Eshelman LJ, Schaffer JD (1992) Real-coded genetic algorithms and interval-schemata. In: LD Whitley (ed) FOGA, 187–202
Filipiak P, Lipiski P (2012) Parallel CHC algorithm for solving dynamic traveling salesman problem using many-core GPU. Lect Notes Comput Sci (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7557 LNAI:305–314
Herrera F, Lozano M, Snchez AM (2003) A taxonomy for the crossover operator for real-coded genetic algorithms: An experimental study. Int J Intell Syst 18(3):309–338
Article MATH Google Scholar
Jiang W, Zhu Y, Bao X, Yu R (2012) Cloud-based audio fingerprinting service. In: 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012
Kamaladas MD, Dialin MM (2013) Fingerprint extraction of audio signal using wavelet transform. In: International Conference on Signal Processing, Image Processing and Pattern Recognition 2013, ICSIPR 2013, 1
Ke Y, Hoiem D, Sukthankar R (2005) Computer vision for music identification. In: Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, I, 597–604
Klapuri A, Davy M (2006) Signal processing methods for music transcription. Springer
Lee I-H, Mahmood MT, Shim S-O, Choi T-S (2014) Optimizing image focus for 3d shape recovery through genetic algorithm. Multimedia Tools and Applications 71(1):247–262
Article Google Scholar
Levy M, Sandler M (2009) Music information retrieval using social tags and audio. IEEE Trans Multimedia 11(3):383–395
Article Google Scholar
Li Z-Y, Zhang W-Q, Liu J (2013) Multi-resolution time frequency feature and complementary combination for short utterance speaker recognition. Multimedia Tools and Applications:1–17
Liu J, Zhang T (2011) Wavelet-based audio fingerprinting algorithm robust to linear speed change. Communications in Computer and Information Science, 234 CCIS(PART 4):360–368
Liu Y, Yun HS, Kim NS (2009) Audio fingerprinting based on multiple hashing in DCT domain. IEEE Signal Process Lett 16(6):525–528
Article Google Scholar
Malekesmaeili M, Ward RK (2012) A novel local audio fingerprinting algorithm. In: 2012 IEEE 14th International Workshop on Multimedia Signal Processing, MMSP 2012 - Proceedings, pages 136–140
Marín J, Molina D, Herrera F (2012) Modeling dynamics of a real-coded CHC algorithm in terms of dynamical probability distributions. Soft Computing 16(2):331–351
Article Google Scholar
Mendoza M, Cobos C, León E, Lozano M, Rodríguez F, Herrera-Viedma E (2014) A new memetic algorithm for multi-document summarization based on CHC algorithm and greedy search. Human-Inspired Computing and Its Applications:125–138
Mohsenfar SM, Mosleh M, Barati A (2013) Audio watermarking method using QR decomposition and genetic algorithm. Multimedia Tools and Applications 74(3):1–21
Google Scholar
Nesmachnow S, Alba E, Cancela H (2012) Scheduling in heterogeneous computing and grid environments using a parallel CHC evolutionary algorithm. Comput Intell 28(2):131–155
Article MathSciNet Google Scholar
Pan X, Yu X, Deng J, Yang W, Wang H (2011) Audio fingerprinting based on local energy centroid. In: IET International Communication Conference on Wireless Mobile and Computing, vol. 2011, pp. 351–354
Ramalingam A, Krishnan S (2006) Gaussian mixture modeling of short-time fourier transform features for audio fingerprinting. IEEE Trans Inf Forensics Secur 1(4):457–463
Article Google Scholar
Seo JS, Jin M, Lee S, Jang D, Lee S, Yoo CD (2005) Audio fingerprinting based on normalized spectral subband centroids. In: IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, volume III, pages 213–216
Sethares W.A (2005) Tuning, timbre, spectrum, scale, 2nd. Springer
Sinitsyn A (2006) Duplicate song detection using audio fingerprinting for consumer electronics devices. In: Proceedings of the International Symposium on Consumer Electronics, ISCE, pages 622–627
Son W, Cho H-T, Yoon K, Lee S-P (2010) Sub-fingerprint masking for a robust audio fingerprinting system in a real-noise environment for portable consumer devices. IEEE Trans Consum Electron 56(1):156–160
Article Google Scholar
Suyoto ISH, Uitdenbogerd AL, Scholer F (2008) Searching musical audio using symbolic queries. IEEE Trans Audio Speech Lang Process 16(2):372–381
Article Google Scholar
Theodoridis S, Koutroumbas K (2009) Pattern Recognition. Elsevier
Tsai T-H, Huang Y-S, Liu P-Y, Chen D-M (2014) Content-based singer classification on compressed domain audio data. Multimedia Tools and Applications:1–21
Typke R, Wiering F, Veltkamp RC (2005) A survey of music information retrieval systems. In: Proceedings of the sixth International conference on Music Information Retrieval, ISMIR 2005, pages 153–160
Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 10(5):293–302
Article Google Scholar
Vaseghi SV (2007) Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications, John Wiley & Sons
Wang A (2004) An industrial-strength audio search algorithm. Proceedings of SPIE - The International Society for Optical Engineering 5307:582–588
Google Scholar
Wang A (2006) The shazam music recognition service. Commun ACM 49(8):44–48
Article Google Scholar
Wang Q, Guo Z, Liu G, Guo J (2012) Audio fingerprinting based on n-grams. International Journal of Digital Content Technology and its Applications 6(10):361–368
Article Google Scholar
Wong GY, Leung FHF, Ling S-H (2014) An under-sampling method based on fuzzy logic for large imbalanced dataset. In: Fuzzy Systems (FUZZ-IEEE), 2014 IEEE International Conference on, 1248–1252
Zhu B, Li W, Wang Z, Xue X (2010) A novel audio fingerprinting method robust to time scale modification and pitch shifting. In: MM’10 - Proceedings of the ACM Multimedia 2010 International Conference, 987–990

Download references

Acknowledgments

This work is supported by the research project TIN2014-57251-P. The authors are very grateful to the anonymous reviewers for their valuable suggestions and comments to improve the quality of this paper.

Author information

Authors and Affiliations

Instituto de Ciencias de la Vid y del Vino, University of La Rioja, CSIC, Gobierno de La Rioja, Apartado Postal 1042 — 26080 Logroño–Finca La Grajera, Crta. De Burgos Km. 6, 26007, Logroño, Spain
Salvador Gutiérrez
Department of Computer Science and Articial Intelligence, CITIC-UGR Research Center on Information and Communications Technology, University of Granada, ETSII, Calle Periodista Daniel Saucedo Aranda S/N, Granada, 18071, Spain
Salvador García
Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
Salvador García

Authors

Salvador Gutiérrez
View author publications
You can also search for this author in PubMed Google Scholar
Salvador García
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Salvador García.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gutiérrez, S., García, S. Landmark-based music recognition system optimisation using genetic algorithms. Multimed Tools Appl 75, 16905–16922 (2016). https://doi.org/10.1007/s11042-015-2963-0

Download citation

Received: 05 March 2015
Revised: 24 July 2015
Accepted: 18 September 2015
Published: 15 October 2015
Issue Date: December 2016
DOI: https://doi.org/10.1007/s11042-015-2963-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Landmark-based music recognition system optimisation using genetic algorithms

Abstract

Access this article

Similar content being viewed by others

A review on genetic algorithm: past, present, and future

Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning

Genetic algorithms: theory, genetic operators, solutions, and applications

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Landmark-based music recognition system optimisation using genetic algorithms

Abstract

Access this article

Similar content being viewed by others

A review on genetic algorithm: past, present, and future

Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning

Genetic algorithms: theory, genetic operators, solutions, and applications

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation