Abstract
All sounds are multidimensional, yet the relationships among auditory dimensions have been studied only infrequently. General recognition theory (GRT; Ashby & Townsend, 1986) is a multidimensional generalization of signal detection theory and, as such, provides powerful tools well suited to the study of the relationships among perceptual dimensions. However, previous uses of GRT have been limited in serious ways. We present methods designed to overcome these limitations, and we use these methods to apply GRT to investigations of the relationships among auditory perceptual dimensions that previous work suggests are independent (frequency, duration) or not (fundamental frequency [ f0], spectral shape). Results from Experiment 1 confirm that frequency and duration do not interact decisionally, and they extend this finding with evidence of perceptual independence. Results from Experiment 2 show that f0 and spectral shape tend to interact perceptually, decisionally, or both, and that perceptual interactions occur within, but not between, stimuli (i.e., the interactions suggest correlated noise across processing channels corresponding to perceptually separable dimensions). The results are discussed in relation to lower level sensory modeling and higher level cognitive and linguistic issues.
Article PDF
Similar content being viewed by others
References
Abel, S. M. (1972). Duration discrimination of noise and tone bursts. Journal of the Acoustical Society of America, 51, 1219–1223.
Ashby, F. G. (2000). A stochastic version of general recognition theory. Journal of Mathematical Psychology, 44, 310–329.
Ashby, F. G., & Gott, R. E. (1988). Decision rules in the perception and categorization of multidimensional stimuli. Journal of Experimental Psychology: Learning, Memory, & Cognition, 14, 33–53.
Ashby, F. G., & Lee, W. W. (1991). Predicting similarity and categorization from identification. Journal of Experimental Psychology: General, 120, 150–172.
Ashby, F. G., & Townsend, J. T. (1986). Varieties of perceptual independence. Psychological Review, 93, 154–179.
Christensen, L. A., & Humes, L. E. (1996). Identification of multidimensional complex sounds having parallel dimension structure. Journal of the Acoustical Society of America, 99, 2307–2315.
Christensen, L. A., & Humes, L. E. (1997). Identification of multidimensional stimuli containing speech cues and the effects of training. Journal of the Acoustical Society of America, 102, 2297–2310.
Corcoran, D. W. (1967). Perceptual independence and recognition of two-dimensional auditory stimuli. Journal of the Acoustical Society of America, 42, 139–142.
Creelman, C. D. (1962). Human discrimination of auditory duration. Journal of the Acoustical Society of America, 34, 582–593.
DeCarlo, L. T. (2002). Signal detection theory with finite mixture distributions: Theoretical developments with applications to recognition memory. Psychological Review, 109, 710–721.
Eddins, D. A., Hall, J. W., III, & Grose, J. H. (1992). The detection of temporal gaps as a function of frequency region and absolute noise bandwidth. Journal of the Acoustical Society of America, 91, 1069–1077.
Eimas, P. D., Tartter, V. C., Miller, J. L., & Keuthen, N. J. (1978). Asymmetric dependencies in processing phonetic features. Perception & Psychophysics, 23, 12–20.
Espinoza-Varas, B., & Jamieson, D. G. (1984). Integration of spectral and temporal cues separated in time and frequency. Journal of the Acoustical Society of America, 76, 732–738.
Florentine, M., Fastl, H., & Buus, S. (1988). Temporal integration in normal hearing, cochlear impairment, and impairment simulated by masking. Journal of the Acoustical Society of America, 84, 195–203.
Garner, W. R. (1974). The processing of information and structure. Potomac, MD: Erlbaum.
Green, D. M., & Forrest, T. G. (1989). Temporal gaps in noise and sinusoids. Journal of the Acoustical Society of America, 86, 961–970.
Green, D. M., & Swets, J. A. (1966). Signal detection theory and psychophysics. New York: Wiley.
Holt, L. L., & Lotto, A. J. (2006). Cue weighting in auditory categorization: Implications for first and second language acquisition. Journal of the Acoustical Society of America, 119, 3059–3071.
Kadlec, H., & Townsend, J. T. (1992a). Implications of marginal and conditional detection parameters for the separabilities and independence of perceptual dimensions. Journal of Mathematical Psychology, 36, 325–374.
Kadlec, H., & Townsend, J. T. (1992b). Signal detection analyses of dimensional interactions. In F. G. Ashby (Ed.), Multidimensional models of perception and cognition (pp. 181–227). Hillsdale, NJ: Erlbaum.
Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90, 773–795.
Kidd, G., Jr., & Mason, C. R. (1992). A new technique for measuring spectral shape discrimination. Journal of the Acoustical Society of America, 91, 2855–2864.
Kingston, J., Diehl, R. L., Kirk, C. J., & Castleman, W. A. (2008). On the internal perceptual structure of distinctive features: The [voice] contrast. Journal of Phonetics, 36, 28–54.
Kingston, J., & Macmillan, N. A. (1995). Integrality of nasalization and F1 in vowels in isolation and before oral and nasal consonants: A detection-theoretic application of the Garner paradigm. Journal of the Acoustical Society of America, 97, 1261–1285.
Kingston, J., Macmillan, N. A., Walsh-Dickey, L. W., Thorburn, R., & Bartels, C. (1997). Integrality in the perception of tongue root position and voice quality in vowels. Journal of the Acoustical Society of America, 101, 1696–1709.
Krumhansl, C. L., & Iverson, P. (1992). Perceptual interactions between musical pitch and timbre. Journal of Experimental Psychology: Human Perception & Performance, 18, 739–751.
Macmillan, N. A., Kingston, J., Thorburn, R., Walsh-Dickey, L. W., & Bartels, C. (1999). Integrality of nasalization and F1: II. Basic sensitivity and phonetic labeling measure distinct sensory and decision-related interactions. Journal of the Acoustical Society of America, 106, 2913–2932.
Maddox, W. T. (1992). Perceptual and decisional separability. In F. G. Ashby (Ed.), Multidimensional models of perception and cognition (pp. 147–180). Hillsdale, NJ: Erlbaum.
Maddox, W. T. (1995). Base-rate effects in multidimensional perceptual categorization. Journal of Experimental Psychology: Learning, Memory, & Cognition, 21, 288–301.
Maddox, W. T., & Bohil, C. J. (1998a). Base-rate and payoff effects in multidimensional perceptual categorization. Journal of Experimental Psychology: Learning, Memory, & Cognition, 24, 1459–1482.
Maddox, W. T., & Bohil, C. J. (1998b). Overestimation of base-rate differences in complex perceptual categories. Perception & Psychophysics, 60, 575–592.
Maddox, W. T., & Bohil, C. J. (2003). A theoretical framework for understanding the effects of simultaneous base-rate and payoff manipulations on decision criterion learning in perceptual categorization. Journal of Experimental Psychology: Learning, Memory, & Cognition, 29, 307–320.
Massaro, D. W., & Oden, G. C. (1980). Evaluation and integration of acoustic features in speech perception. Journal of the Acoustical Society of America, 67, 996–1013.
Meddis, R., & O’Mard, L. (1997). A unitary model of pitch perception. Journal of the Acoustical Society of America, 102, 1811–1820.
Melara, R. D., & Marks, L. E. (1990a). Interaction among auditory dimensions: Timbre, pitch, and loudness. Perception & Psychophysics, 48, 169–178.
Melara, R. D., & Marks, L. E. (1990b). Perceptual primacy of dimensions: Support for a model of dimensional interaction. Journal of Experimental Psychology: Human Perception & Performance, 16, 398–414.
Mirman, D., Holt, L. L., & McClelland, J. L. (2004). Categorization and discrimination of nonspeech sounds: Differences between steadystate and rapidly changing acoustic cues. Journal of the Acoustical Society of America, 116, 1198–1207.
Moore, B. C. J., Glasberg, B. R., & Peters, R. W. (1985). Relative dominance of individual partials in determining the pitch of complex tones. Journal of the Acoustical Society of America, 77, 1853–1860.
Nearey, T. (1992). Context effects in a double-weak theory of speech perception. Language & Speech, 35, 153–171.
Oden, G. C., & Massaro, D. W. (1978). Integration of featural information in speech perception. Psychological Review, 85, 172–191.
Olzak, L. A. (1986). Widely separated spatial frequencies: Mechanism interactions. Vision Research, 26, 1143–1153.
Olzak, L. A., & Wickens, T. D. (1997). Discrimination of complex patterns: Orientation information is integrated across spatial scale; spatial-frequency and contrast information are not. Perception, 26, 1101–1120.
Pastore, R. E., & Sorkin, R. D. (1972). Simultaneous two-channel signal detection: I. Simple binaural stimuli. Journal of the Acoustical Society of America, 51, 544–551.
Pitt, M. (1994). Perception of pitch and timbre by musically trained and untrained listeners. Journal of Experimental Psychology: Human Perception & Performance, 20, 976–986.
Plomp, R. (1967). Pitch of complex tones. Journal of the Acoustical Society of America, 41, 1526–1533.
Plomp, R., & Bouman, M. A. (1959). Relation between hearing threshold and duration for tone pulses. Journal of the Acoustical Society of America, 31, 749–758.
Richards, V. M., & Lentz, J. J. (1998). Sensitivity to changes in level and envelope patterns across frequency. Journal of the Acoustical Society of America, 104, 3019–3029.
Semal, C., & Demany, L. (1991). Dissociation of pitch from timbre in auditory short-term memory. Journal of the Acoustical Society of America, 89, 2404–2410.
Shailer, M. J., & Moore, B. C. J. (1983). Gap detection as a function of frequency, bandwidth, and level. Journal of the Acoustical Society of America, 74, 467–473.
Shailer, M. J., & Moore, B. C. J. (1985). Detection of temporal gaps in bandlimited noise: Effects of variations in bandwidth and signalto-masker ratio. Journal of the Acoustical Society of America, 77, 635–639.
Shailer, M. J., & Moore, B. C. J. (1987). Gap detection and the auditory filter: Phase effects using sinusoidal stimuli. Journal of the Acoustical Society of America, 81, 1110–1117.
Silbert, N., & de Jong, K. (2008). Focus, prosodic context, and phonological feature specification: Patterns of variation in fricative production. Journal of the Acoustical Society of America, 123, 2769–2779.
Singh, P. G., & Hirsh, I. J. (1992). Influence of spectral locus and f0 changes on the pitch and timbre of complex tones. Journal of the Acoustical Society of America, 92, 2650–2661.
Smits, R. (2001). Evidence for hierarchical categorization of coarticulated phonemes. Journal of Experimental Psychology: Human Perception & Performance, 27, 1145–1162.
Smits, R., Sereno, J., & Jongman, A. (2006). Categorization of sounds. Journal of Experimental Psychology: Human Perception & Performance, 32, 733–754.
Sorkin, R. D., Pastore, R. E., & Pohlmann, L. D. (1972). Simultaneous two-channel signal detection: II. Correlated and uncorrelated signals. Journal of the Acoustical Society of America, 51, 1960–1965.
Sorkin, R. D., Pohlmann, L. D., & Gilliom, J. D. (1973). Simultaneous two-channel signal detection: III. 630- and 1400-Hz signals. Journal of the Acoustical Society of America, 53, 1045–1050.
Stevens, K. N. (1998). Acoustic phonetics. Cambridge, MA: MIT Press.
Tanner, W. P. (1956). Theory of recognition. Journal of the Acoustical Society of America, 28, 882–888.
Thomas, R. D. (1995). Gaussian general recognition theory and perceptual independence. Psychological Review, 102, 192–200.
Thomas, R. D. (1996). Separability and independence of dimensions within the same-different judgment task. Journal of Mathematical Psychology, 40, 318–341.
Thomas, R. D. (1999). Assessing sensitivity in a multidimensional space: Some problems and a definition of a general d’. Psychonomic Bulletin & Review, 6, 224–238.
Thomas, R. D. (2001a). Characterizing perceptual interactions in face identification using multidimensional signal detection theory. In M. J. Wenger & J. T. Townsend (Eds.), Computational, geometric, and process perspectives on facial cognition: Contexts and challenges (pp. 193–227). Mahwah, NJ: Erlbaum.
Thomas, R. D. (2001b). Perceptual interactions of facial dimensions in speeded classification and identification. Perception & Psychophysics, 63, 625–650.
Thomas, R. D. (2003). Further considerations of a general d’ in multidimensional space. Journal of Mathematical Psychology, 47, 220–224.
Wang, X., & Humes, L. E. (2008). Classification and cue weighting of multidimensional stimuli with speech-like cues for young normal hearing and elderly hearing-impaired listeners. Ear & Hearing, 29, 725–745. doi:10.1097/AUD.0b013e31817bdd42
Warrier, C. M., & Zatorre, R. J. (2002). Influence of tonal context and timbral variation on perception of pitch. Perception & Psychophysics, 64, 198–207.
Wickens, T. D. (1992). Maximum-likelihood estimation of a multivariate Gaussian rating model with excluded data. Journal of Mathematical Psychology, 36, 213–234.
Wickens, T. D., & Olzak, L. A. (1989). The statistical analysis of concurrent detection ratings. Perception & Psychophysics, 45, 514–528.
Zagorski, M. (1975). Perceptual independence of pitch and loudness in a signal detection experiment: A processing model for 2ATFC (21FC) experiments. Perception & Psychophysics, 17, 525–531.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by NIH Grants R01-MH0577-17-07A1 and T32 MH019879-15.
Rights and permissions
About this article
Cite this article
Silbert, N.H., Townsend, J.T. & Lentz, J.J. Independence and separability in the perception of complex nonspeech sounds. Attention, Perception, & Psychophysics 71, 1900–1915 (2009). https://doi.org/10.3758/APP.71.8.1900
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/APP.71.8.1900