INTRODUCTION

Repeat recreational exposure to loud music is a key risk factor for noise-induced hearing loss (NIHL) and represents a major public health concern.1 The World Health Organization estimates that 50% of individuals aged 12–35 in developed nations are regularly exposed to unsafe levels of sound from personal music players, with up to 1.1 billion youth worldwide at risk of consequent NIHL.2

While prior studies have established the deleterious effects of high music volume expressed in decibels of sound pressure level (dBSPL),3 this end-product perceived by the human ear is a composite of several inter-related factors leading to the perception of loudness, including user-controlled volume settings and the intrinsic sound intensity of the sound media itself. In acoustical engineering, the latter may be expressed as decibels relative to full scale (dBFS), an absolute measurement of instantaneous energy which, when all other factors (such as user-defined volume levels) are held equal, is correlated to dBSPL.4

Few studies have examined the evolution of the underlying audio engineering of music, a systemic factor which may contribute to NIHL. As industry trends may have progressively accentuated popular music sound intensity over time, a phenomenon generally referred to as the “loudness wars,”5 we undertook a big data analysis of the evolution of such musical sound intensity, as measured in dBFS, to quantitatively assess temporal trends in the loudness of popular music released over the past six decades.

METHODS

We analyzed the Million Song Dataset, a public dataset of tracks released from 1922 to 2011. The development and content of this dataset have been previously described.6 Briefly, it consists of the metadata of predominantly occidental popular music, with loudness data reported in dBFS. Spanning from the 1920s to the 2010s, mean (standard deviation [SD]) of song loudness was modelled linearly for each decade that contained > 1000 songs. Kernel density estimation was used to normalize the frequency distribution of song sound intensity and create ridgeline plots for each decade.

RESULTS

Combined year and loudness data were available for 515,412 songs released from 1922 to 2011, with all decades from the 1950s onward containing n > 1000 songs (total n = 514,579) (panel A). Overall mean (SD) song loudness was − 9.7 (4.9) dBFS, with means ranging from − 14.4 (5.1) dBFS in the 1950s to − 8.2 (4.3) dBFS in the 2010s. When modelled linearly, mean loudness increased by 1.23 dBFS/10 years (95% confidence interval 1.22–1.25) (p < 0.001) (panel B).

DISCUSSION

Social exposure to noise from personal music players is a major cause of preventable hearing loss and a growing public health concern.1,2,3 In analyzing a large database of songs spanning several decades, we demonstrate a significant increase in the mean intrinsic sound intensity of popular music (Fig. 1), likely related to progressive changes in audio engineering technology in the transition to digital media. The advent of dynamic range compression in particular, an operation in audio signal processing whereby the differences between peaks and troughs of sound intensity within a song are minimized, results in tracks with consistently higher sound intensity.5

Figure 1
figure 1

Temporal trends in the loudness of songs in decibels relative to full scale (dBFS) by decade, 1950s to 2010s. a Ridgeline plots of popular music song loudness by decade; kernel density estimation was used to normalize the frequency distribution of loudness in decibels relative to full scale (dBFS) for each decade from 1950s to 2010s. The maximum height of each curve represents the most common (modal) loudness in dBFS occurring in the associated decade. b Popular music song loudness as a function of time, modelled linearly.

Although our study demonstrates an ongoing, progressive accentuation of the intrinsic sound intensity of popular music over time, a limitation is that the subsequent user-level response in volume adjustment cannot be accounted for. However, higher volume levels and longer durations of exposure are well-established user-level risk factors for NIHL1, 3 and dynamic range compression may exacerbate these phenomena as there are fewer quiet troughs providing “rest periods” within tracks.5 Additionally, this imposes increased onus on the part of the consumer to engage in frequent manual volume adjustments between tracks of varying compression to prevent deleterious sound levels. Such trends in popular music sound intensity therefore represent a potentially preventable, but music industry-level, risk factor for NIHL which should be the target of future research.