Abstract
This paper studies the detection of singing segments using various features, such as MFCC (Mel Frequency Cepstral Coefficients) and LPCC (Linear Predictive Cepstral Coefficients), with the HMM (Hidden Markov Model) models. The audio clips under test in this paper include isolated segments from different sound tracks and all segments entirely from a sound track. In the literature, these two cases are usually individually investigated. However, we have a unified treatment to both types of segments using the same features and classifiers. In the experiments, we design five experiments to fully examine the performance limitation of the approaches studied in this paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
You, S.D., Chen, W.-H., Chen, W.-K.: Music Identification System Using MPEG-7 Audio Signature Descriptors. The Scientific World Journal (2013), doi:10.1155/2013/752464
Berenzweig, A.L., Ellis, D.P.W.: Locating Singing Voice Segments Within Music Signals. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 21–24. IEEE Press, New York (2001)
Vembu, S., Baumann, S.: Separation of Vocals from Polyphonic Audio Recordings. In: Proc. of 6th International Conference on Music Information Retrieval (ISMIR 2005), pp. 1–8. Queen Mary. University of London (2005)
Rocamora, M., Herrera, P.: Comparing Audio Descriptors for Singing Voice Detection in Music Audio Files. In: Proc. of 11th Brazilian Symposium on Computer Music, San Pablo, Brazil, pp. 1–10 (2007)
Lukashevich, H., et al.: Effective Singing Voice Detection in Popular Music Using ARMA Filtering. In: Proc. 10th International Conference on Digital Audio Effects (DAFx-2007), Bordeaux, France, pp. 10–15 (2007)
New, T.L., et al.: Singing Voice Detection in Popular Music. In: Proc. 12th Annual ACM International Conference on Multimedia, pp. 1–4. ACM, NY (2004)
O’Shaughnessy, D.: Speech Communication: Human and Machine. Addison-Wesley, Reading (1987)
Lindsay, P.H., Norman, D.A.: Human Information Processing: An Introduction to Psychology, 2nd edn. Academic Press, New York (1977)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
You, S.D., Wu, YC. (2015). Comparative Study of Singing Voice Detection Methods. In: Park, J., Stojmenovic, I., Jeong, H., Yi, G. (eds) Computer Science and its Applications. Lecture Notes in Electrical Engineering, vol 330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45402-2_180
Download citation
DOI: https://doi.org/10.1007/978-3-662-45402-2_180
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45401-5
Online ISBN: 978-3-662-45402-2
eBook Packages: EngineeringEngineering (R0)