Abstract
In this work, we proposed a cover song recognition system using deep learning. From the literature, understand that most of the works extract the discriminate feature that classifies the cover song between a pair of songs and calculates the dissimilarity or similarity between the two songs based on the observation, which is a meaningful pattern between cover songs. Moreover, it inspires reformulating the cover song apperception obstacle in a machine learning framework. In other words, essentially builds the cover song recognition system using Convolution Neural Network (CNN) and Mel Frequency Cepstral Coefficients (MFCCs) features following the construction of the data set composed of cover song pairs. The prepared CNN yields the likelihood of being in the spread tune connection given a cross-closeness grid produced from any two bits of music and recognizes the spread tune by positioning on the likelihood. Test results display the prescribed methodology that has accomplished enhanced execution tantamount to the cutting edge endeavors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bertin-Mahieux, T., Ellis, D.P.W.: Largescale cover song recognition using the 2d Fourier transform magnitude. Int. Soc. Music Inf. Retr. (2012)
Bertin-Mahieux, T., Ellis, D.P.W.: (2011). Largescale cover song recognition using hashed chroma landmarks. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Bertin-Mahieux, T., Ellis, D.P.W., Whitman, B., Lamere, P.: The million song dataset. In Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR) (2011)
Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)
Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Netw. (2015). https://doi.org/10.1016/j.neunet.2014.09.003
Othman, E., Bazi, Y., Alajlan, N., Alhichri, H., Melgani, F.: Using convolutional features and a sparse autoencoder for land-use scene classification. Int. J. Remote Sens. 37, 2149–2167 (2016)
Cai, K., Yang, D., Chen, X.: Two-layer large-scale cover song identification system based on music structure segmentation. In: 2016 IEEE 18th International Workshop on Multimedia Signal Processing, MMSP 2016 (2017)
Cano, P., Batle, E., Kalker, T., Haitsma, J.: A review of algorithms for audio fingerprinting. In Multimedia Signal Processing, 2002 IEEE Workshop on, pp. 169–173. IEEE (2002)
Chang, S., Lee, J., Keun Choe, S., Lee, K.: Audio cover song identification using a convolutional neural network (2017). arXiv preprint https://arxiv.org/abs/1712.00166
Chen, N., Li, W., Xiao, H.: Fusing similarity functions for cover song identification. Multimed. Tools Appl. 77(2), 2629–2652 (2018)
Ellis, D.P.W., Poliner, G.E.: Identifying cover songs’ with chroma features and dynamic programming beat tracking. In: Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on. IEEE (2007)
Foster, P., Dixon, S., Klapuri, A.: Identifying cover songs using information-theoretic measures of similarity. IEEE/ACM Trans. Audio, Speech Languag. Process. (TASLP) 23(6), 993–1005 (2015)
Heo, H., Kim, H.J., Kim, W.S., Lee, K.: Cover song identification with metric learning using distance as a feature. In: ISMIR (2017)
Humphrey, E.J., Nieto, O., Bello, J.P.: Data-driven and discriminative projections for large-scale cover song identification. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (2013)
Khadkevich, M., Omologo, M.: LargeScale cover song identification using chord profiles. In: Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR-2013) (2013)
Knees, P., Schedl, M.: Music similarity and retrieval: an introduction to audio-and web-based strategies, vol. 36. Springer (2016)
Knees, P., Schedl, M., Widmer, G.: Multiple lyrics alignment: automatic retrieval of song lyrics. In: International Society for Music Information Retrieval Conference (ISMIR) (2005)
Manning, C.D., Raghavan, P., Schutze, H.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)
Muller, M., Kurth, F., Clausen, M.: Audio matching via chroma-based statistical features. In: Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR) (2005)
Oramas, S., Nieto, O., Barbieri, F., Serra, X.: Multi-label music genre classification from audio, text, and images using deep features. In: International Conference on Music Information Retrieval (ISMIR) (2017)
Oramas, S., Nieto, O., Sordo, M., Serra, X.: A Deep multimodal approach for coldstart music recommendation. In: Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems—DLRS (2017)
Osmalskyj, J., Pirard, S., Van Droogenbroeck, M., Embrechts, J.J.: Efficient database pruning for large-scale cover song recognition. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 714–718 (2013)
Osmalskyj, J., Foster, P., Dixon, S., Jean-Jacques: Embrechts. Combining features for cover song identification. In: 16th International Society for Music Information Retrieval Conference (ISMIR) (2015)
Rafii, Z., Coover, B., Han, J.: An audio fingerprinting system for live version identification using image processing techniques. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 644–648 (2014)
Ravuri, S.V., Ellis, D.P.W.: Cover song detection: From high scores to general classification. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Texas, USA (2010)
Salamon, J., Serra, J., Gomez, E.: Tonal ´ representations for music retrieval: from version identification to query-by-humming. Int. J. Multimed. Inf. Retr. 2(1), 45–58 (2013)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of International Symposium on Music Information Retrieval (2000)
Ellis, D.P.W.: The “covers80” cover song data set (2007)
Tharwat, A.: AdaBoost classifier: an overview. https://doi.org/10.13140/RG.2.2.19929.01122 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vali, D.K., Bhajantri, N.U. (2021). Deep Learning for Cover Song Apperception. In: Panigrahi, C.R., Pati, B., Mohapatra, P., Buyya, R., Li, KC. (eds) Progress in Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1199. Springer, Singapore. https://doi.org/10.1007/978-981-15-6353-9_9
Download citation
DOI: https://doi.org/10.1007/978-981-15-6353-9_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6352-2
Online ISBN: 978-981-15-6353-9
eBook Packages: EngineeringEngineering (R0)