Advances in Speech and Music Technology

Computational Aspects and Applications

  • Book
  • © 2023

Overview

  • Presents comprehensive coverage of the interdisciplinary aspects of speech and music processing
  • Offers detailed technological insights and a deep understanding of speech and music processing applications by considering both theory and practice in the relevant topics
  • Topics include music information retrieval and spoken language processing

Part of the book series: Signals and Communication Technology (SCT)

Table of contents (21 chapters)

  1. State-of-the-Art

  2. Machine Learning

  3. Perception, Health and Emotion

  4. Case Studies

About this book

This book presents advances in speech and music within the domain of audio signal processing. It begins with introductory chapters on the basics of speech and music, then proceeds to their computational aspects, including music information retrieval and spoken language processing. The authors discuss the intersection of computer science, musicology and speech analysis, and how the multifaceted nature of speech and music information processing calls for dedicated algorithms and for systems that use sophisticated signal processing and machine learning techniques to better extract useful information. They also discuss how a deep understanding of both speech and music in terms of perception, emotion, mood, gesture and cognition is essential for successful application. A further topic is the overwhelming amount of data generated across the world, which requires efficient processing for better maintenance, retrieval, indexing and querying, and why machine learning and artificial intelligence are particularly well suited to these computational tasks. The book provides both technological knowledge and a comprehensive treatment of essential topics in speech and music processing.

Editors and Affiliations

  • Department of Computer Science & Engineering, National Institute of Technology Silchar, Cachar, India

    Anupam Biswas

  • Department of Media and Culture Studies, Utrecht University, Utrecht, The Netherlands

    Emile Wennekes

  • Multimedia Department, Polish-Japanese Academy of Information Technology, Warsaw, Poland

    Alicja Wieczorkowska

  • Department of Electronics & Communication Engineering, National Institute of Technology Silchar, Cachar, India

    Rabul Hussain Laskar

About the editors

Dr. Anupam Biswas received his Ph.D. degree in computer science and engineering from Indian Institute of Technology (BHU), Varanasi, India, in 2017. He received his M.Tech. and B.E. degrees in computer science and engineering from Nehru National Institute of Technology Allahabad, Prayagraj, India, in 2013, and Jorhat Engineering College, Jorhat, Assam, in 2011, respectively. He is currently working as Assistant Professor in the Department of Computer Science & Engineering, National Institute of Technology Silchar, Assam, India. He has published several research papers in reputed international journals, conferences, and book chapters. His research interests include computational music, machine learning, fuzzy systems, information retrieval, and evolutionary computation. He has served as Program Chair of the International Conference on Big Data, Machine Learning and Applications (BigDML 2019). He has also served as General Chair of the 25th International Symposium Frontiers of Research in Speech and Music (FRSM 2020) and co-edited the proceedings of FRSM 2020, published as a book volume in the Springer AISC Series. He has co-edited three books: “Health Informatics: A Computational Perspective in Healthcare” and “Principles of Social Networking: The New Horizon and Emerging Challenges” with Springer, and “Principles of Big Graph: In-depth Insight” with Elsevier.

Prof. Dr. Emile Wennekes is Chair Professor of Musicology at Utrecht University, The Netherlands. He was appointed full professor there in 2000. From 2006 to 2011, he was the first Head of the School of the Media and Culture Studies department. In 2017, his chair was modified from Musicology: Post-1800 Music History into Musicology: Music and Media, now also officially embracing his main field of research. Wennekes has written on a broad range of subjects, including a co-authored book on contemporary Dutch music available in six languages. His work has been published by, among others, Oxford University Press, Routledge, Michigan University Press, and Brepols. Most recently, he edited the volume Cinema Changes: Incorporation of Jazz in the Film Soundtrack (Brepols 2019) together with Dr. Emilio Audissino. Wennekes founded and chairs the Study Group Music and Media (MaM) under the auspices of the International Musicological Society. He coordinates its annual conferences.

Dr. Alicja A. Wieczorkowska, Ph.D., D.Sc., is a computer scientist specializing in audio signal analysis. She holds a Ph.D. from the Gdansk University of Technology and is an alumna of the State School of Music (Second Level). Her Ph.D. thesis examined the automatic recognition of musical instrument sounds, depending on the parameterization and classifiers applied. Dr. Wieczorkowska is presently Associate Professor and Head of the Multimedia Laboratory at the Polish-Japanese Academy of Information Technology, Warsaw, Poland. She is also an associate member of the Graduate Faculty at the University of North Carolina at Charlotte. Her scientific interests include audio information retrieval, music and speech processing, data mining, computer graphics, multimedia, and automated identification of emotions from various signals.

Dr. Rabul Hussain Laskar received his Ph.D. degree in Electronics & Communication Engineering from National Institute of Technology Silchar, India, in 2013. He received his M.Tech. degree in Electronics and Communication Engineering from Indian Institute of Technology, Guwahati, India, in 2007, and his B.E. degree in Electrical Engineering from National Institute of Technology, Silchar, Assam, in 1998. He is currently working as Associate Professor in the Department of Electronics & Communication Engineering, National Institute of Technology Silchar, Assam, India. He has published several research papers in reputed international journals, conferences, and book chapters. His research interests include speech and audio processing, image and video processing, multimedia systems, biomedical signal processing, machine learning and soft computing techniques. He has served as a technical program committee member and session chair at various national and international conferences.


Bibliographic Information

  • Book Title: Advances in Speech and Music Technology

  • Book Subtitle: Computational Aspects and Applications

  • Editors: Anupam Biswas, Emile Wennekes, Alicja Wieczorkowska, Rabul Hussain Laskar

  • Series Title: Signals and Communication Technology

  • DOI: https://doi.org/10.1007/978-3-031-18444-4

  • Publisher: Springer Cham

  • eBook Packages: Engineering, Engineering (R0)

  • Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023

  • Hardcover ISBN: 978-3-031-18443-7; Published: 02 January 2023

  • Softcover ISBN: 978-3-031-18446-8; Published: 03 January 2024

  • eBook ISBN: 978-3-031-18444-4; Published: 01 January 2023

  • Series ISSN: 1860-4862

  • Series E-ISSN: 1860-4870

  • Edition Number: 1

  • Number of Pages: XVII, 443

  • Number of Illustrations: 34 b/w illustrations, 120 illustrations in colour

  • Topics: Signal, Image and Speech Processing, Engineering Acoustics, Music
