Skip to main content

Object-Based Spatial Audio: Concept, Advantages, and Challenges

  • Chapter
  • First Online:
3D Future Internet Media

Abstract

One of the primary objectives of modern audiovisual media creation and reproduction techniques is realistic perception of the delivered contents by the consumer. Spatial audio-related techniques in general attempt to deliver the impression of an auditory scene where the listener can perceive the spatial distribution of the sound sources as if he/she were in the actual scene. Advances in spatial audio capturing and rendering techniques have led to a new concept of delivering audio which does not only aim to present to the listener a realistic auditory scene just as captured but also gives more control over the delivered auditory scene to the producer and/or the listener. This is made possible by being able to control the attributes of individual sound objects that appear in the delivered scene. In this section, this so-called “object-based” approach is introduced, with its key features that distinguish it from the conventional spatial audio production and delivery techniques. The related applications and technologies will also be introduced, and the limitations and challenges that this approach is facing will follow.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Begault DR (1994) 3-D sound for virtual reality and multimedia. AP Professional, Boston

    Google Scholar 

  2. Berkhout AJ, DD V, Vogel P (1993) Acoustic control by wave field synthesis. J Acoust Soc Am 93(5):2764–2778

    Article  Google Scholar 

  3. Blauert J (1997) Spatial hearing: the psychophysics of human sound localization, 2nd revised edn. Mit Press, Cambridge, MA

    Google Scholar 

  4. Geier M, Ahrens J, Spors S (2010) Object-based audio reproduction and the audio scene description format. Organ Sound 15(03):219–227. doi:10.1017/S1355771810000324

    Article  Google Scholar 

  5. Herre J, Purnhagen H, Koppens J, Hellmuth O, Engdegård J, Hilper J, Villemoes L, Terentiv L, Falch C, Hölzer A, Valero ML, Resch B, Mundt H, Oh H-O (2012) MPEG spatial audio object coding-the ISO/MPEG standard for efficient coding of interactive audio scenes. J Audio Eng Soc 60(9):655–673

    Google Scholar 

  6. ISO/IEC (1997) Information technology—computer graphics and image processing—the virtual reality modeling language (VRML)—part 1: functional specification and UTF-8 encoding. ISO/IEC 14772–1:1997. International Standards Organization (ISO)

    Google Scholar 

  7. ISO/IEC (2006) Information technology—generic coding of moving pictures and associated audio information—part 7: advanced audio coding (AAC). ISO/IEC 13818–7:2006(E). International Standard Organization (ISO)

    Google Scholar 

  8. Malham D (2008) Spatial hearing mechanisms and sound reproduction. Music Technology Group, University of York. http://www.york.ac.uk/inst/mustech/3d_audio/ambis2.htm. Accessed 17 Mar 2013

  9. Peters N, Lossius T, Schacher JC (2012) SpatDIF: principles, specification, and examples. In: The 9th sound and music computing conference, Copenhagen, Denmark. pp 500–505

    Google Scholar 

  10. Pulkki V (1997) Virtual sound source positioning using vector base amplitude panning. J Audio Eng Soc 45(6):456–466

    Google Scholar 

  11. Rumsey F (2011) Whose head is it anyway? Optimizing binaural audio. J Audio Eng Soc 59(9):672–675

    Google Scholar 

  12. Scheirer ED, Väänänen R, Huopaniemi J (1999) AudioBIFS: describing audio scenes with the MPEG-4 multimedia standard. IEEE Trans Multimed 1(3):237–250

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chungeun Kim .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this chapter

Cite this chapter

Kim, C. (2014). Object-Based Spatial Audio: Concept, Advantages, and Challenges. In: Kondoz, A., Dagiuklas, T. (eds) 3D Future Internet Media. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8373-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-8373-1_5

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-8372-4

  • Online ISBN: 978-1-4614-8373-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics