Abstract
Multimedia content can be described in versatile ways, as its essence is not limited to a single view. For music data, such views include a song's audio features as well as its lyrics. Both modalities have their advantages: text may be easier to search and can capture more of a song's 'content semantics', while omitting other types of semantic categorisation. (Psycho-)acoustic feature sets, on the other hand, make it possible to identify tracks that 'sound similar', while offering less support for other kinds of semantic categorisation. These distinguishing characteristics of different feature sets address users' differing information needs. We explain the nature of text and audio feature sets that describe the same audio tracks. Moreover, we propose the use of textual data on top of low-level audio features for music genre classification, and show the impact of different combinations of audio features and textual features based on content words.
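The combination described above can be illustrated with a minimal sketch: text features are derived from content words of the lyrics, audio descriptors are attached per track, and the two vectors are concatenated before classification. All data, feature dimensions, and the choice of TF-IDF with a nearest-neighbour classifier are illustrative assumptions, not the paper's actual feature sets or experimental setup.

```python
# Hedged sketch: fusing lyrics-based text features with low-level audio
# features for genre classification. Corpus, labels, and audio values
# are toy placeholders.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neighbors import KNeighborsClassifier

# Toy lyrics (content words only) and genre labels for four tracks.
lyrics = [
    "love heart night dance",
    "guitar road whiskey train",
    "dance beat club night",
    "truck road home country",
]
genres = ["pop", "country", "pop", "country"]

# Placeholder low-level audio descriptors per track (e.g. rhythm or
# timbre statistics); random values stand in for real extractors.
rng = np.random.default_rng(0)
audio_features = rng.normal(size=(4, 8))

# Text features: TF-IDF weights over the content words.
text_features = TfidfVectorizer().fit_transform(lyrics).toarray()

# Early fusion: concatenate text and audio feature vectors per track.
combined = np.hstack([text_features, audio_features])

clf = KNeighborsClassifier(n_neighbors=1).fit(combined, genres)
print(clf.predict(combined))
```

In a real evaluation the classifier would be tested on held-out tracks; here the 1-nearest-neighbour model simply memorises the four training examples, so predicting on them returns the training labels.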
Copyright information
© 2007 Springer Berlin Heidelberg
Cite this paper
Neumayer, R., Rauber, A. (2007). Integration of Text and Audio Features for Genre Classification in Music Information Retrieval. In: Amati, G., Carpineto, C., Romano, G. (eds) Advances in Information Retrieval. ECIR 2007. Lecture Notes in Computer Science, vol 4425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71496-5_78
Print ISBN: 978-3-540-71494-1
Online ISBN: 978-3-540-71496-5