
Basics of Digital Audio

Fundamentals of Multimedia

Abstract

Audio data has special properties. For example, while it can be useful to occasionally drop a video frame from a stream, we simply cannot do the same with audio information, or all sense will be lost from that dimension. How to compress sound information sensibly is therefore an important question. We begin with a discussion of just what makes up sound and consider the digitization of sound information. We introduce the Nyquist theorem as a fundamental property of sampling. Signal-to-noise ratio (SNR) is defined and adopted as a useful measure of audio (and, in general, signal) quality, including the effect of quantization noise. Linear and nonlinear quantization, including companding for audio data, are discussed. Synthetic sounds are introduced, and we then give a thorough introduction to MIDI as an enabling technology to capture, store, and play back musical notes. We look at some details of audio quantization and give introductory information on how digital audio is handled for storage and transmission. This entails a first discussion of how subtracting predicted values from a signal yields numbers that are close to zero and hence easier to deal with. Pulse code modulation (PCM) is introduced, followed by differential coding of audio and lossless predictive coding. Finally, differential pulse code modulation (DPCM) and adaptive DPCM are introduced, and we take a look at encoder/decoder schema.
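
The prediction idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes the simplest possible predictor (the previous sample), integer samples, and an initial prediction of 0; the function names and the sample values are hypothetical and are not taken from the chapter.

```python
def diff_encode(samples):
    """Return residuals e[n] = f[n] - f[n-1]; the first residual is f[0] - 0."""
    residuals = []
    prev = 0  # assumed initial prediction (illustrative choice)
    for s in samples:
        residuals.append(s - prev)  # difference from the predicted value
        prev = s                    # the current sample predicts the next one
    return residuals


def diff_decode(residuals):
    """Invert diff_encode by accumulating the residuals."""
    samples = []
    prev = 0  # must match the encoder's initial prediction
    for e in residuals:
        prev = prev + e
        samples.append(prev)
    return samples


signal = [21, 22, 27, 25, 22]          # hypothetical sample values
residuals = diff_encode(signal)        # [21, 1, 5, -2, -3]
restored = diff_decode(residuals)      # recovers the original samples exactly
```

After the first value, the residuals cluster near zero, which is what makes them cheaper to code than the raw samples. Because no quantization is applied here, the round trip is lossless.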


Notes

  1. This ratio is actually the peak signal-to-quantization-noise ratio, or PSQNR.

  2. The hexadecimal numbers derived from the four LSBs range from 0 to 15. In practice, musicians and software refer to the MIDI channels from 1 to 16, so there is a difference of 1 when coding them in hexadecimal, e.g., Channel 1 is coded “0,” and Channel 16 is coded “F.”
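
The off-by-one channel numbering described in the second note can be sketched as follows. This assumes the four LSBs in question are those of a MIDI channel voice status byte (0x80 to 0xEF), and the helper names are hypothetical, chosen only for illustration.

```python
def midi_channel(status_byte):
    """Return the 1-based channel number encoded in a channel voice status byte."""
    if not 0x80 <= status_byte <= 0xEF:
        raise ValueError("not a channel voice status byte")
    return (status_byte & 0x0F) + 1  # four LSBs hold 0..15; musicians count 1..16


def channel_hex_digit(channel):
    """Return the hexadecimal digit that encodes a 1-based channel number."""
    if not 1 <= channel <= 16:
        raise ValueError("MIDI channels are numbered 1 to 16")
    return format(channel - 1, "X")


first = midi_channel(0x90)       # Note On with low nibble 0 -> channel 1
last = midi_channel(0x9F)        # Note On with low nibble F -> channel 16
digit_1 = channel_hex_digit(1)   # "0"
digit_16 = channel_hex_digit(16) # "F"
```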


Author information


Corresponding author

Correspondence to Jiangchuan Liu.


Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter


Cite this chapter

Li, ZN., Drew, M.S., Liu, J. (2021). Basics of Digital Audio. In: Fundamentals of Multimedia. Texts in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-030-62124-7_6


  • DOI: https://doi.org/10.1007/978-3-030-62124-7_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62123-0

  • Online ISBN: 978-3-030-62124-7

  • eBook Packages: Computer Science (R0)
