Individual Differences in Behavioral Estimates of Cochlear Nonlinearities

Poling, Gayla L.; Horwitz, Amy R.; Ahlstrom, Jayne B.; Dubno, Judy R.

doi:10.1007/s10162-011-0291-2

Individual Differences in Behavioral Estimates of Cochlear Nonlinearities

Published: 22 September 2011

Volume 13, pages 91–108, (2012)
Cite this article

Download PDF

Journal of the Association for Research in Otolaryngology Aims and scope Submit manuscript

Individual Differences in Behavioral Estimates of Cochlear Nonlinearities

Download PDF

Gayla L. Poling^1,2,
Amy R. Horwitz¹,
Jayne B. Ahlstrom¹ &
…
Judy R. Dubno¹

1095 Accesses
10 Citations
Explore all metrics

Abstract

Psychophysical methods provide a mechanism to infer the characteristics of basilar membrane responses in humans that cannot be directly measured. Because these behavioral measures are indirect, the interpretation of results depends on several underlying assumptions. Ongoing uncertainty about the suitability of these assumptions and the most appropriate measurement and compression estimation procedures, and unanswered questions regarding the effects of cochlear hearing loss and age on basilar membrane nonlinearities, motivated this experiment. Here, estimates of cochlear nonlinearities using temporal masking curves (TMCs) were obtained in a large sample of adults of various ages whose hearing ranged from normal to moderate cochlear hearing loss (Experiment 1). A wide range of compression slopes was observed, even for subjects with similar ages and thresholds, which warranted further investigation (Experiment 2). Potential sources of variance contributing to these individual differences were explored, including procedural-related factors (test–retest reliability, suitability of the linear-reference TMC, probe sensation levels, and parameters of TMC fitting algorithms) and subject-related factors (age and age-related changes in temporal processing, strength of cochlear nonlinearities estimated with distortion-product otoacoustic emissions, estimates of changes in cochlear function from damage to outer hair cells versus inner hair cells). Subject age did not contribute significantly to TMC or compression slopes, and TMC slopes did not vary significantly with threshold. Test–retest reliability of TMCs suggested that TMC masker levels and the general shapes of TMCs did not change in a systematic way when re-measured many weeks later. Although the strength of compression decreased slightly with increasing hearing loss, the magnitude of individual differences in compression estimates makes it difficult to determine the effects of hearing loss and cochlear damage on basilar membrane nonlinearities in humans.

Computational Modeling of Individual Differences in Behavioral Estimates of Cochlear Nonlinearities

Article 30 September 2014

Cochlear Compression: Recent Insights from Behavioural Experiments

On the use of envelope following responses to estimate peripheral level compression in the auditory system

Article Open access 26 March 2021

Introduction

The ability to perceive sounds over a wide range of sound pressure levels has been attributed to the normal functioning of outer hair cells and basilar membrane compression. Physiological evidence indicates that the healthy basilar membrane is sensitive to very low-level sounds and that the response of the basilar membrane grows compressively as sound level increases (see Robles and Ruggero 2001 for review). Although the basilar membrane response is highly compressive close to the characteristic frequency (CF), the response to signals at frequencies well below the CF (at least for high CFs) is more linear regardless of input level (Rhode and Cooper 1996). Damage to outer hair cells results in elevated thresholds, narrower dynamic ranges, less compressive response growth, and poorer tuning (e.g., Ruggero et al. 1997). Several behavioral methods have been proposed to estimate cochlear nonlinearities in humans, including growth-of-masking functions (Oxenham and Plack 1997), additivity of masking (Plack et al. 2006), and temporal masking curves (TMC; Nelson et al. 2001). These methods provide a way to infer the characteristics of the basilar membrane response in humans that cannot be directly measured (see Oxenham and Bacon 2003 for review). Because these measures are indirect, the interpretation of results depends on several underlying assumptions. Investigations of the suitability of these assumptions and, in turn, the most appropriate measurement and compression estimation procedures are ongoing (e.g., Wojtczak and Oxenham 2009). Here, the focus is on the TMC method, a commonly used technique, which has been considered among the more accurate methods because it minimizes off-frequency listening.

The TMC method described by Nelson et al. (2001) involves a forward-masking task whereby the masker level required to just mask a fixed, low-level pure-tone probe is measured as a function of the masker-probe interval to produce a temporal masking curve. Limiting the probe to a low level minimizes spread of excitation along the basilar membrane and effects of off-frequency listening (Nelson et al. 2001). Because the probe level is fixed, the masker level needed increases with increasing masker-probe interval, resulting in TMCs that have positive slopes. For an off-frequency masker, which is assumed to be processed linearly, the TMC is assumed to reflect decay of masking. As the masker-probe interval is increased, the masker level required at masked threshold increases to compensate for the time course of decay. For an on-frequency masker, the TMC is assumed to reflect the combined effects of the decay of masking and compression applied to the masker. Therefore, a larger change in masker level would be required to produce a given change in basilar membrane excitation when the response to the masker is compressive. This would be reflected as a steeper on-frequency TMC compared to an off-frequency TMC. Assuming the time course of decay of the masker is identical for all masker frequencies (and levels), the degree of basilar membrane compression can be estimated by comparing the slope of a TMC for an on-frequency masker against the slope of a TMC for a masker that is processed linearly by the basilar membrane (i.e., off-frequency masker or linear-reference TMC). Basilar membrane responses can be inferred by plotting the off-frequency masker level against the on-frequency masker level required for each masker-probe interval.

The accuracy of compression estimates obtained with TMCs depends on the selection of a suitable linear reference. Citing physiologic data (e.g., Yates 1990; Ruggero 1992; Rhode and Recio 2000) and behavioral data (e.g., Nelson and Schroder 1997; Oxenham and Plack 1997), Nelson et al. (2001) proposed using an off-frequency masker that is nearly an octave below the probe frequency, assuming it produces a linear response at the probe-frequency place in the cochlea. Subsequently, Lopez-Poveda et al. (2003) proposed using off-frequency maskers measured for higher frequency probes, citing physiological evidence that compression extends to lower frequencies (relative to CF) for apical cochlear sites (Rhode and Cooper 1996). However, this suggestion requires an additional assumption, namely that the rate of decay of forward masking is the same for all probe frequencies. Further, Lopez-Poveda and Alves-Pinto (2008) reported that for a 4.0-kHz probe frequency, basilar membrane responses to a 2.2-kHz masker (i.e., 0.55*probe frequency) showed approximately 2:1 compression, whereas responses to a 1.6-kHz masker (i.e., 0.40*probe frequency) reflected linear basilar membrane responses. In addition to the frequency separation between the probe and masker, other important factors may include the absolute levels of the maskers and effects related to the medial olivocochlear reflex (MOCR; e.g., Wojtczak and Oxenham 2010).

In addition to the ongoing uncertainties concerning parameters required for a suitable linear-reference TMC, a more complete understanding of compression estimates obtained from TMCs is further complicated by the limited data obtained from individuals with cochlear hearing loss. Previous studies have included subjects with a range of hearing loss (mild to severe) and a variety of signal and masker frequencies and levels (Nelson et al. 2001; Plack et al. 2004; Rosengard et al. 2005). Plack et al. (2004) examined the impact of hearing loss on the basilar membrane response by comparing behavioral measures obtained from individuals with mild to moderate cochlear hearing loss (12 ears) to those with normal hearing (16 ears). Slopes for both off- and on-frequency TMCs were shallower for hearing-impaired than for normal-hearing subjects. The shapes of inferred basilar membrane input–output responses derived from the on-frequency and off-frequency TMCs suggested a compressive region at high levels and a truncated linear region at lower levels. Plack et al. (2004) interpreted these findings as indicating reduced gain for lower level CF tones, with little change in maximum compression associated with mild-to-moderate hearing loss. Rosengard et al. (2005) investigated compression effects in individuals with normal hearing and hearing loss ranging from mild to severe (five listeners in each group). Subjects with more hearing loss had slopes that were generally similar for the off- and on-frequency TMCs. Consistent with Plack et al. (2004), slopes for both off- and on-frequency TMCs were shallower for hearing-impaired than for normal-hearing subjects. With very shallow TMC slopes resulting from restricted ranges of masker levels, derived input–output functions cover a restricted range of input and output levels, thereby providing an incomplete description of the basilar membrane response. In contrast to Plack et al. (2004), Rosengard et al. (2005) found no evidence for a range of levels over which compression was near normal. Stainsby and Moore (2006) obtained TMCs for three ears with moderate sensorineural hearing loss for probe frequencies from 0.5 to 6.0 kHz. Most TMCs were well-fit by straight lines that were roughly parallel for each probe frequency tested, resulting in linear response functions consistent with an absence of basilar membrane compression. However, slopes of the TMCs for 0.5 and 1.0 kHz were steeper than for higher frequencies, calling into question the assumption that the rate of decay of masking is the same for all probe frequencies. Most recently, Jepsen and Dau (2011) obtained TMCs for three normal-hearing and 10 hearing-impaired subjects to estimate compression at 1.0 and 4.0 kHz and found that compression slope generally increased with pure-tone thresholds, but noted “remarkable variation” in input–output functions among individuals with similar audiograms.

Given these conflicting findings, the effect of hearing loss on behavioral estimates of cochlear nonlinearities is unclear. It remains to be determined to what extent the variation observed across previous studies reflects true differences in cochlear compression among individuals or measurement factors, such as choice of linear-reference TMC and fitting algorithms for inferred input–output responses. The issue may be further complicated by the contribution of subject age, which has not been explored systematically in sample sizes large enough and age ranges wide enough to permit separating effects of age and hearing loss (Lopez-Poveda and Alves-Pinto 2008). The present study investigated the use of the TMC method to estimate cochlear nonlinearities in a large sample of adults of various ages and with hearing thresholds ranging from normal to moderate hearing loss. TMCs were measured and basilar membrane responses were inferred to explore the impact of hearing loss and age on compression estimates (Experiment 1). Large individual differences observed in TMCs and compression estimates prompted investigation of potential sources of this variance (Experiment 2).

Experiment 1

Methods

Subjects

Fifty-one adults, ranging in age from 19 to 87 years (mean age = 55.3 years; 19 males; 32 females), participated in this experiment. Recruitment efforts focused on obtaining subjects across a continuum of ages and hearing thresholds. All subjects provided written informed consent prior to their participation, which was approved by the Institutional Review Board of the Medical University of South Carolina. For data analysis, subjects were organized into three groups according to the threshold for the probe measured in quiet (described in the next section): (1) low threshold group (n = 24; mean age = 39.6 years, range = 19–77); (2) mid threshold group (n = 11; mean age = 59.3 years, range = 22–80); and (3) high threshold group (n = 16; mean age = 76.1 years, range = 65–87). Mean audiometric thresholds (in dB HL) for the right (test) ear for each group are shown in Figure 1.

Stimuli and apparatus

TMCs were measured for a 1.0-kHz probe at 10 dB above the quiet threshold for the probe as a function of the time interval between the masker and the probe, for on-frequency (1.0 kHz) and off-frequency (0.5 kHz) maskers. The 1.0-kHz probe was selected to complement data being collected in a parallel study measuring detection of gaps in noise markers centered at 1.0 kHz (Horwitz et al. 2011). Masker-probe intervals ranged from 0 to 70 ms in 10-ms steps targeting a minimum of seven masker-probe intervals. For cases where masker levels would have exceeded maximum level restrictions using 10-ms steps, 5-ms steps were used. Probe and masker durations were 20 and 200 ms, respectively, with 10-ms raised-cosine rise and fall ramps.

During data collection for TMCs, the subject was seated inside a double-walled, sound-attenuating booth and registered responses via a button box (TDT RBOX). The probe and maskers were digitally generated with custom Labview software (Labview 8.5, National Instruments) and converted to analog using two channels of a 16-bit digital-to-analog converter (National Instruments, model 6052E) with a sampling rate of 50 kHz. The amplitudes of all signals were controlled individually using fixed attenuators (TDT PA4). The probe was added to the masker (TDT SM3) and then passed through a headphone buffer (TDT HB5) for monaural presentation to the test ear through TDH-39 (Telephonics) headphones.

Procedures

A three-interval, three-alternative forced-choice adaptive procedure with feedback was used to measure masker levels and detection thresholds for the probe and maskers. The adaptive procedure converged on the 70.7% point with a two-down, one-up tracking technique (Levitt 1971). The probe level was fixed at 10 dB SL and masker level was varied adaptively. The step size of the adaptive track was 4 dB for the first four reversals and then reduced to 2 dB for the subsequent eight reversals. A run terminated after 12 reversals, and a threshold estimate was obtained by averaging the masker levels at the last six reversals. The maximum allowable masker level was set at 102 dB SPL; the run was aborted if the masker level determined by the adaptive track would have exceeded this limit. A threshold measurement was discarded and repeated if the standard deviation of masker levels of the last six reversals exceeded 6 dB (Lopez-Poveda and Alves-Pinto 2008). Three threshold estimates were obtained and averaged at each masker-probe interval. When the standard deviation of the mean of those three thresholds exceeded 6 dB, a fourth estimate was obtained (this occurred in 17% of cases) and included in the average (Lopez-Poveda and Alves-Pinto 2008). Therefore, three to four threshold estimates were obtained for each masker-probe interval.

Test order for the masker (on- or off-frequency) and masker-probe intervals (0 to 70 ms) were selected randomly for each subject. Masker levels were obtained for a complete set of masker-probe intervals, alternating between maskers. Quiet thresholds for the 1.0-kHz probe were measured first using the same adaptive paradigm; when the standard deviation of the means of three threshold estimates exceeded 3 dB, a fourth estimate was obtained (this occurred once) and included in the average.

Each subject practiced the TMC task prior to data collection. Practice started with a longer masker-probe interval (i.e., 60 or 70 ms) and the off-frequency masker to ensure that the probe was relatively easy to hear and that the subject understood the task. These conditions, for which the masker level would most likely exceed the maximum, also determined if a shorter maximum masker-probe interval and 5-ms step sizes were required for that subject. Additional masker-probe intervals were practiced until the subject was familiarized with the task. Thresholds obtained during this practice session were discarded. During each subsequent day of data collection, the first run collected was also discarded.

Fitting procedures for TMCs and inferred input–output responses

Estimating compression from TMCs involves inferring basilar membrane input–output responses by plotting the level of the off-frequency masker against the level of the on-frequency masker for each masker-probe interval. For cases where off-frequency masker levels at longer masker-probe intervals would have exceeded the maximum permissible level, a maximum of one extrapolated point in the off-frequency TMC was included where a corresponding on-frequency data point had been collected. To obtain the extrapolated point, a straight-line fit was used (if the number of measured points in the off-frequency TMC was ≥3) or a double exponential function was used (if the number of measured points was ≥5; Johannesen and Lopez-Poveda 2008). The extrapolated thresholds for these two procedures differed by <5 dB and, with the limited data available, the straight-line fit could not be reliably differentiated from the nonlinear fit. Therefore, a straight-line fit was used to generate an extrapolated point, when needed. Finally, slopes of off-frequency TMCs were estimated using straight-line fits.

A three-segment linear regression procedure, reported by Yasin and Plack (2003) and Plack et al. (2004), was used to infer the basilar membrane input–output response. The three-segment fit corresponds to the linear–compressed–linear segments typical of these functions and was used to estimate the breakpoints (the transition point from the linear to the compressed segments and a second transition point from the compressed segment to the higher level linear segment) and the slope of the mid-level compressed segment. A custom Matlab program including the fminsearch function was used, whereby the slopes of the lower and upper segments were fixed at 1.0 (linear response). The slope of the mid-level segment and the location of the lower and upper breakpoints joining the three segments were varied by the fitting procedure to satisfy a least-squares regression criterion. At least five points were required for the fitting algorithm (Plack et al. 2004), with at least three of those points falling in the compressed region. The root-mean-square (rms) error between fitted and measured values must be <5 dB. Using these criteria, compression slope estimates were obtained for 43 of 51 subjects. An estimate of breakpoint was included only if at least one measured point occurred below the estimated breakpoint, which was the case for 27 of 51 subjects.

In addition to the three-segment linear regression, a third-order polynomial fit (Johannesen and Lopez-Poveda 2008) was used to infer basilar membrane input–output responses. Compression slopes estimated by the two procedures were similar and significantly correlated (N = 25; r = 0.543, p = 0.004). Because data from more subjects could be fit with the three-segment regression procedure than with third-order polynomials, only responses inferred using three-segment fits are presented here.

Results and discussion

Compression estimates from inferred input–output responses

Compression slope estimates plotted against threshold for the 1.0-kHz probe are shown in Figure 2 (top). Compression slopes ranged from strongly compressive to expansive, even among subjects with similar thresholds. Specifically, slopes ranged from 0.083 to 1.749 dB/dB in the low threshold group, 0.150 to 0.862 dB/dB in the mid threshold group, and 0.174 to 1.648 dB/dB in the high threshold group. Using an analysis of covariance (ANCOVA) with threshold as a grouping factor and age as a covariate, no significant main effects of threshold group [F(2,39) = 1.90, p = 0.163] or age [F(1,39) = 0.04, p = 0.839] on compression slope were observed. Removing age from the model in a subsequent ANOVA assessing the effect of threshold group did not affect the results.

As shown in Figure 2, compression slopes increased slightly but significantly with increasing probe threshold (r = 0.302, p = 0.049). The slope of the regression indicates that for each 10-dB increase in probe threshold, compression slope increases by 0.088 dB/dB. Moreover, the modest correlation means that probe threshold accounted for only 9.1% of the variance in compression slope. These results were further examined using Mplus (Version 5.2) with a path analysis (Loehlin 2004), which uses a different model to determine effects of threshold on compression slope (while controlling for correlations among other variables, including age). Probe threshold was found to account for a non-significant 10.8% of the variance in slope (p = 0.091). Taken together, these analyses suggest that the contribution of threshold to compression slope is small and that the magnitude of individual differences makes it difficult to offer a strong conclusion about the relationship between hearing loss and compression slope.

Figure 2 (bottom) shows breakpoints plotted against probe threshold. Breakpoints ranged from 18.7 to 42.7 dB in the low threshold group, 34.8 to 56.7 dB in the mid threshold group, and 57.5 to 69.3 dB in the high threshold group. Using ANCOVA with threshold as a grouping factor and age as a covariate, a significant main effect of threshold group on breakpoint was found [F(2,23) = 25.52, p < 0.001], but no significant main effect of age [F(1,23) = 1.82, p = 0.190]. Removing age from the model in a subsequent ANOVA assessing the effect of threshold group did not affect the results. Breakpoints were strongly positively correlated with probe threshold (r = 0.915; p < 0.001), such that subjects with higher probe thresholds had higher breakpoints. The slope of the regression was nearly linear (1.1 dB/dB), indicating that for every dB increase in probe threshold, breakpoint increases by approximately the same amount (1.1 dB). Using Mplus with a path analysis, probe threshold was found to uniquely account for 76.6% of the variance in breakpoints (p < 0.001). Breakpoint estimates and the strong association with probe threshold are consistent with previous findings derived from growth-of-masking functions at higher frequencies (Oxenham and Plack 1997; Dubno et al. 2007; Horwitz et al. 2007).

The current results showing a range of compression estimates and a slight increase in compression slope with increasing threshold are generally consistent with previous studies that included hearing-impaired subjects. With fewer subjects, Rosengard et al. (2005) also estimated steeper compression slopes for hearing-impaired than for normal-hearing subjects. Results from Plack et al. (2004) showed a non-significant trend for slightly increasing compression slope with increasing hearing loss. Estimates from some hearing-impaired subjects with shallow slopes were based on line fits with only one or two values within the compressed region or relied on extrapolated off-frequency data. Measuring growth-of-masking functions at 2.0 and 4.0 kHz instead of TMCs, we also reported a range of compression slopes for subjects with similar thresholds (Dubno et al. 2007; Horwitz et al. 2007); fewer subjects with elevated thresholds were included. Although an increasing trend in slopes was seen in some cases, compression slope did not vary significantly with probe threshold. These differences and similarities across studies highlight the uncertainty remaining regarding the nature of the effect of hearing loss on basilar membrane compression in human subjects.

On-frequency and off-frequency TMCs

Analysis of TMCs included a subset of subjects (N = 36) for whom masker levels were available for on- and off-frequency maskers for six masker-probe intervals (0, 10, 20, 30, 40, and 50 ms). The same pattern of results was obtained when seven masker-probe intervals were included (0–60 ms), with data from fewer subjects available (N = 32). Masker levels were examined using a repeated-measures ANCOVA with masker type (on- and off-frequency) and masker-probe interval as within-subjects factors, and probe threshold as a covariate. Because age did not show a significant effect when added as a covariate [F(1,32) = 0.29, p = 0.592], age was not included in this ANCOVA.

Significant interactions were found between masker type and masker-probe interval [F(5,170) = 6.30; p < 0.001] and among masker type, masker-probe interval, and probe threshold [F(5,170) = 4.14; p = 0.001]. The interaction between masker type and masker-probe interval simply reflects the expected differences in masker levels for off- and on-frequency maskers as the duration between the masker and probe increases, which is the basis for the TMC method. The three-way interaction with probe threshold reflects that the interval-dependent differences in masker levels for off- and on-frequency maskers change with increases in hearing loss at the probe frequency.

One of the assumptions of the TMC method is that slopes of the off-frequency TMCs depend only on the decay of masking. However, slopes of off-frequency TMCs have been shown in previous studies to become shallower with increasing hearing loss (Plack et al. 2004; Rosengard et al. 2005; Lopez-Poveda et al. 2005). To determine the extent to which slopes of off-frequency TMCs vary with probe threshold in this large dataset, slopes plotted against probe threshold are shown in Figure 3. As noted previously for compression slope estimates, large individual differences in off-frequency TMC slopes were observed for subjects with similar thresholds, even individuals with essentially normal hearing. For example, off-frequency TMC slopes as steep as 0.6–0.8 dB/dB were observed for subjects whose probe thresholds ranged from ~10 to 20 dB SPL. Although there is a trend for shallower slopes with increasing hearing loss, off-frequency TMC slope did not vary significantly with probe threshold (r = −0.216; p = 0.129). These results were further examined using Mplus with a path analysis to determine effects of threshold on TMC slope. Probe threshold was found to account for only 0.08% of the variance in off-frequency TMC slope (p = 0.875), which is not consistent with previous findings (e.g., Plack et al. 2004; Lopez-Poveda et al. 2005), perhaps due to the larger sample size and wider and continuous distributions of probe thresholds in the current experiment.

A linear reference can be difficult to obtain for subjects with hearing loss because the required masker levels may exceed maximum limits, especially at higher frequencies where hearing loss is greater. As noted earlier, one factor in the selection of signal frequencies in the current experiment was a parallel study measuring gap detection in noise markers centered at 1.0 kHz. As a compromise to meet all requirements, a 1.0-kHz probe, 1.0-kHz on-frequency masker, and 0.5-kHz off-frequency masker were selected for this experiment, which follow parameters used in earlier studies (e.g., Nelson et al. 2001). The use of higher frequency off-frequency maskers has been proposed to avoid potential compression influences on lower frequency off-frequency maskers (e.g., Lopez-Poveda et al. 2003). Nonetheless, this selection requires the additional assumption that the rate of recovery from forward masking does not depend on probe frequency. Although the rationale for the selection of masker and probe frequencies for this experiment was sound, the large individual differences seen in the slopes of the off-frequency TMCs call into question the suitability of a 0.5-kHz off-frequency masker for a mid-frequency probe.

Accordingly, the linear-reference TMC (along with several other factors) was investigated in Experiment 2 as a potential source of variance contributing to the large individual differences seen in the results of Experiment 1. In addition to masker and probe frequencies for the linear-reference TMC, other factors included: test–retest reliability of TMCs, effects of probe sensation level, parameters of TMC fitting algorithms, and subject-related variables such as age, strength of cochlear nonlinearities estimated with distortion-product otoacoustic emissions (DPOAEs), and presumed underlying cochlear damage (outer hair cells versus inner hair cells).