Comparative Study of Singing Voice Detection Methods

Conference paper

pp 1291–1298

Computer Science and its Applications

Shingchern D. You & Yi-Chung Wu

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 330)

Abstract

This paper studies the detection of singing segments using various features, such as MFCC (Mel Frequency Cepstral Coefficients) and LPCC (Linear Predictive Cepstral Coefficients), with HMM (Hidden Markov Model) classifiers. The audio clips under test include both isolated segments taken from different sound tracks and all segments of an entire sound track. In the literature, these two cases are usually investigated separately; here, we give both types of segments a unified treatment using the same features and classifiers. We design five experiments to fully examine the performance limits of the approaches studied in this paper.
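
As a rough illustration of how HMM-based segment classification can work in general, the sketch below scores a feature sequence under two HMMs with the forward algorithm and picks the label whose model gives the higher log-likelihood. It is not taken from the paper: the model parameters in MODELS, the one-dimensional Gaussian emissions, and the function names are all hypothetical, and a real system would use multi-dimensional MFCC or LPCC vectors with trained parameters.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian (a stand-in for a real feature model)."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def logsumexp(vals):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(vals)
    return m + math.log(sum(math.exp(v - m) for v in vals))

def forward_loglik(obs, hmm):
    """Log-likelihood of the observation sequence under one HMM (forward algorithm)."""
    start, trans = hmm["start"], hmm["trans"]
    means, variances = hmm["means"], hmm["vars"]
    n = len(start)
    # Initialisation: start probability times emission of the first frame.
    alpha = [math.log(start[i]) + log_gauss(obs[0], means[i], variances[i])
             for i in range(n)]
    # Recursion: sum over predecessor states, then emit the current frame.
    for x in obs[1:]:
        alpha = [
            logsumexp([alpha[i] + math.log(trans[i][j]) for i in range(n)])
            + log_gauss(x, means[j], variances[j])
            for j in range(n)
        ]
    return logsumexp(alpha)

def classify(obs, models):
    """Return the label of the model with the highest log-likelihood."""
    return max(models, key=lambda name: forward_loglik(obs, models[name]))

# Hypothetical two-state models; the values are made up for illustration only.
MODELS = {
    "singing": {
        "start": [0.5, 0.5],
        "trans": [[0.8, 0.2], [0.3, 0.7]],
        "means": [2.0, 3.0],
        "vars": [1.0, 1.0],
    },
    "non-singing": {
        "start": [0.5, 0.5],
        "trans": [[0.9, 0.1], [0.2, 0.8]],
        "means": [-2.0, -1.0],
        "vars": [1.0, 1.0],
    },
}
```

With these made-up parameters, classify([2.1, 1.9, 3.2, 2.8], MODELS) returns "singing", because the frames lie near the emission means of that model. Working in the log domain avoids numerical underflow on long frame sequences.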

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei, 106, Taiwan
Shingchern D. You & Yi-Chung Wu

Corresponding author

Correspondence to Shingchern D. You.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Seoul University of Science and Technology (SeoulTech), Seoul, Korea, Republic of (South Korea)
James J. (Jong Hyuk) Park
School of IT and Engineering, University of Ottawa, Ottawa, Ontario, Canada
Ivan Stojmenovic
Humanitas College, Kyung Hee University, Seoul, Korea, Republic of (South Korea)
Hwa Young Jeong
Computer Science & Engineering, Gangneung-Wonju National University, Wonju, Korea, Republic of (South Korea)
Gangman Yi

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

You, S.D., Wu, YC. (2015). Comparative Study of Singing Voice Detection Methods. In: Park, J., Stojmenovic, I., Jeong, H., Yi, G. (eds) Computer Science and its Applications. Lecture Notes in Electrical Engineering, vol 330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45402-2_180

DOI: https://doi.org/10.1007/978-3-662-45402-2_180
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45401-5
Online ISBN: 978-3-662-45402-2
eBook Packages: Engineering, Engineering (R0)
