Deep Learning for Cover Song Apperception

Vali, D. Khasim; Bhajantri, Nagappa U.

doi:10.1007/978-981-15-6353-9_9

D. Khasim Vali¹⁹ &
Nagappa U. Bhajantri²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1199))

743 Accesses

Abstract

In this work, we proposed a cover song recognition system using deep learning. From the literature, understand that most of the works extract the discriminate feature that classifies the cover song between a pair of songs and calculates the dissimilarity or similarity between the two songs based on the observation, which is a meaningful pattern between cover songs. Moreover, it inspires reformulating the cover song apperception obstacle in a machine learning framework. In other words, essentially builds the cover song recognition system using Convolution Neural Network (CNN) and Mel Frequency Cepstral Coefficients (MFCCs) features following the construction of the data set composed of cover song pairs. The prepared CNN yields the likelihood of being in the spread tune connection given a cross-closeness grid produced from any two bits of music and recognizes the spread tune by positioning on the likelihood. Test results display the prescribed methodology that has accomplished enhanced execution tantamount to the cutting edge endeavors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bertin-Mahieux, T., Ellis, D.P.W.: Largescale cover song recognition using the 2d Fourier transform magnitude. Int. Soc. Music Inf. Retr. (2012)
Google Scholar
Bertin-Mahieux, T., Ellis, D.P.W.: (2011). Largescale cover song recognition using hashed chroma landmarks. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Google Scholar
Bertin-Mahieux, T., Ellis, D.P.W., Whitman, B., Lamere, P.: The million song dataset. In Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR) (2011)
Google Scholar
Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)
MATH Google Scholar
Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Netw. (2015). https://doi.org/10.1016/j.neunet.2014.09.003
Article Google Scholar
Othman, E., Bazi, Y., Alajlan, N., Alhichri, H., Melgani, F.: Using convolutional features and a sparse autoencoder for land-use scene classification. Int. J. Remote Sens. 37, 2149–2167 (2016)
Google Scholar
Cai, K., Yang, D., Chen, X.: Two-layer large-scale cover song identification system based on music structure segmentation. In: 2016 IEEE 18th International Workshop on Multimedia Signal Processing, MMSP 2016 (2017)
Google Scholar
Cano, P., Batle, E., Kalker, T., Haitsma, J.: A review of algorithms for audio fingerprinting. In Multimedia Signal Processing, 2002 IEEE Workshop on, pp. 169–173. IEEE (2002)
Google Scholar
Chang, S., Lee, J., Keun Choe, S., Lee, K.: Audio cover song identification using a convolutional neural network (2017). arXiv preprint https://arxiv.org/abs/1712.00166
Chen, N., Li, W., Xiao, H.: Fusing similarity functions for cover song identification. Multimed. Tools Appl. 77(2), 2629–2652 (2018)
Google Scholar
Ellis, D.P.W., Poliner, G.E.: Identifying cover songs’ with chroma features and dynamic programming beat tracking. In: Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on. IEEE (2007)
Google Scholar
Foster, P., Dixon, S., Klapuri, A.: Identifying cover songs using information-theoretic measures of similarity. IEEE/ACM Trans. Audio, Speech Languag. Process. (TASLP) 23(6), 993–1005 (2015)
Google Scholar
Heo, H., Kim, H.J., Kim, W.S., Lee, K.: Cover song identification with metric learning using distance as a feature. In: ISMIR (2017)
Google Scholar
Humphrey, E.J., Nieto, O., Bello, J.P.: Data-driven and discriminative projections for large-scale cover song identification. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (2013)
Google Scholar
Khadkevich, M., Omologo, M.: LargeScale cover song identification using chord profiles. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR-2013) (2013)
Google Scholar
Knees, P., Schedl, M.: Music similarity and retrieval: an introduction to audio-and web-based strategies, vol. 36. Springer (2016)
Google Scholar
Knees, P., Schedl, M., Widmer, G.: Multiple lyrics alignment: automatic retrieval of song lyrics. In: International Society for Music Information Retrieval Conference (ISMIR) (2005)
Google Scholar
Manning, C.D., Raghavan, P., Schutze, H.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)
Google Scholar
Muller, M., Kurth, F., Clausen, M.: Audio matching via chroma-based statistical features. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR) (2005)
Google Scholar
Oramas, S., Nieto, O., Barbieri, F., Serra, X.: Multi-label music genre classification from audio, text, and images using deep features. In: International Conference on Music Information Retrieval (ISMIR) (2017)
Google Scholar
Oramas, S., Nieto, O., Sordo, M., Serra, X.: A Deep multimodal approach for coldstart music recommendation. In: Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems—DLRS (2017)
Google Scholar
Osmalskyj, J., Pirard, S., Van Droogenbroeck, M., Embrechts, J.J.: Efficient database pruning for large-scale cover song recognition. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 714–718 (2013)
Google Scholar
Osmalskyj, J., Foster, P., Dixon, S., Jean-Jacques: Embrechts. Combining features for cover song identification. In: 16th International Society for Music Information Retrieval Conference (ISMIR) (2015)
Google Scholar
Rafii, Z., Coover, B., Han, J.: An audio fingerprinting system for live version identification using image processing techniques. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 644–648 (2014)
Google Scholar
Ravuri, S.V., Ellis, D.P.W.: Cover song detection: From high scores to general classification. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Texas, USA (2010)
Google Scholar
Salamon, J., Serra, J., Gomez, E.: Tonal ´ representations for music retrieval: from version identification to query-by-humming. Int. J. Multimed. Inf. Retr. 2(1), 45–58 (2013)
Google Scholar
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
MATH Google Scholar
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of International Symposium on Music Information Retrieval (2000)
Google Scholar
Ellis, D.P.W.: The “covers80” cover song data set (2007)
Google Scholar
Tharwat, A.: AdaBoost classifier: an overview. https://doi.org/10.13140/RG.2.2.19929.01122 (2018)

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Vidyavardhaka College of Engineering, Mysuru, 570017, India
D. Khasim Vali
Department of Computer Science and Engineering, Government College of Engineering, Chamarajanagara, 571313, India
Nagappa U. Bhajantri

Authors

D. Khasim Vali
View author publications
You can also search for this author in PubMed Google Scholar
Nagappa U. Bhajantri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to D. Khasim Vali .

Editor information

Editors and Affiliations

Department of Computer Science, Rama Devi Women's University, Bhubaneswar, India
Chhabi Rani Panigrahi
Department of Computer Science, Rama Devi Women's University, Bhubaneswar, India
Bibudhendu Pati
Department of Computer Science, University of California, Davis, CA, USA
Prasant Mohapatra
Cloud Computing and Distributed Systems (CLOUDS) Lab, School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
Rajkumar Buyya
Department of Computer Science and Information Engineering, Providence University, Taichung, Taiwan
Kuan-Ching Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vali, D.K., Bhajantri, N.U. (2021). Deep Learning for Cover Song Apperception. In: Panigrahi, C.R., Pati, B., Mohapatra, P., Buyya, R., Li, KC. (eds) Progress in Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1199. Springer, Singapore. https://doi.org/10.1007/978-981-15-6353-9_9

Download citation

DOI: https://doi.org/10.1007/978-981-15-6353-9_9
Published: 10 November 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6352-2
Online ISBN: 978-981-15-6353-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics