Abstract
Phonotactic probability refers to the frequency with which phonological segments and sequences of phonological segments occur in words in a given language. We describe one method of estimating phonotactic probabilities based on words in American English. These estimates of phonotactic probability have been used in a number of previous studies and are now being made available to other researchers via a Web-based interface. Instructions for using the interface, as well as details regarding how the measures were derived, are provided in the present article. The Phonotactic Probability Calculator can be accessed athttp://www.people.ku.edu/~mvitevit/PhonoProbHome.html.
Article PDF
Similar content being viewed by others
References
Baayen, R. H., Piepenbrock, R., &Gulikers, L. (1995).CELEX-2 [CD-ROM]. Philadelphia: University of Pennsylvania, Linguistic Data Consortium.
Bailey, T. M., &Hahn, U. (2001). Determinants of wordlikeness: Phonotactics or lexical neighborhoods?Journal of Memory & Language,44,568–591.
Balota, D. A., Pilotti, M., &Cortese, M. J. (2001). Subjective frequency estimates for 2,938 monosyllabic words.Memory & Cognition,29,639–647.
Bernstein, N. (1982).Acoustic study of mothers’ speech to language-learning children: An analysis of vowel articulatory characteristics. Unpublished doctoral dissertation, Boston University.
Crystal, D. (1992).An encyclopedic dictionary of language and languages. Middlesex, U.K.: Blackwell.
Cutler, A., &Butterfield, S. (1992). Rhythmic cues to speech segmentation: Evidence from juncture misperception.Journal of Memory & Language,31,218–236.
Dell, G. S., Reed, K. D., Adams, D. R., &Meyer, A. S. (2000). Speech errors, phonotactic constraints, and implicit learning: A study of experience in language production.Journal of Experimental Psychology: Learning, Memory, & Cognition,26,1355–1367.
Eukel, B. (1980). Phonotactic basis for word frequency effects: Implications for lexical distance metrics [Abstract].Journal of the Acoustical Society of America,68, S33.
Francis, W. N., &Kučera, H. (1984).Frequency analysis of English usage: Lexicon and grammar. Boston: Houghton Mifflin.
Gaygen, D. E. (1997).The effects of probabilistic phonotactics on the segmentation of continuous speech. Unpublished doctoral dissertation, SUNY, Buffalo.
Jusczyk, P. W., Friederici, A. D., Wessels, J. M. I., Svenkerud, V. Y., &Jusczyk, A. M. (1993). Infants’ sensitivity to the sound patterns of native language words.Journal of Memory & Language,32,402–420.
Jusczyk, P. W., Luce, P. A., &Charles-Luce, J. (1994). Infants’ sensitivity to phonotactic patterns in the native language.Journal of Memory & Language,33,630–645.
Kelly, M. (1992). Using sound to solve syntactic problems: The role of phonology in grammatical category assignment.Psychological Review,99,349–364.
Kucera, H., &Francis, W. N. (1967).Computational analysis of present-day American English. Providence, RI: Brown University Press.
Landauer, T. K., &Streeter, L. A. (1973). Structural differences between common and rare words: Failure of equivalence assumptions for theories of word recognition.Journal of Verbal Learning & Verbal Behavior,12,119–131.
Luce, P. A., &Large, N. R. (2001). Phonotactics, density, and entropy in spoken word recognition.Language & Cognitive Processes,16,565–581.
Luce, P. A., &Pisoni, D. B. (1998). Recognizing spoken words: The neighborhood activation model.Ear & Hearing,19,1–36.
MacWhinney, B. (1991).The CRILDES project: Tools for analyzing talk. Hillsdale, NJ: Erlbaum.
Mattys, S. L., &Jusczyk, P. W. (2001). Phonotactic cues for segmentation of fluent speech by infants.Cognition,78,91–121.
Mayzner, M. S., &Tresselt, M. E. (1965). Tables of single-letter and digram frequency counts for various word-length and letter-position combinations.Psychonomic Monograph Supplements,1,13–32.
Messer, S. (1967). Implicit phonology in children.Journal of Verbal Learning & Verbal Behavior,6, 609–613.
Mook, D. G. (1983). In defense of external invalidity.American Psychologist,38,379–387.
NIH-NOT-OD-02-035 (2002). NIH announces draft statement on sharing research data. Retrieved February 14,2003 fromhttp://grants2.nih.gov/ grants/guide/notice-files/NOT-OD-02-035.html.
Nusbaum, H. C., Pisoni, D. B., &Davis, C. K. (1984).Sizing up the Hoosier mental lexicon: Measuring the familiarity of 20,000 words (Research on Speech Perception, Progress Report No. 10). Bloomington: Indiana University, Psychology Department, Speech Research Laboratory.
Pertz, D. L., &Bever, T. G. (1975). Sensitivity to phonological universals in children and adolescents.Language,39, 347–370.
Pisoni, D. B., & Garber, E. E. (1991). Lexical memory in visual and auditory modalities: The case for a common lexicon. In H. Fujisaki (Ed.),Proceedings of the 1990 International Conference of Spoken Language Processing. Kobe, Japan.
Pitt, M. A., &McQueen, J. M. (1998). Is compensation for coarticulation mediated by the lexicon?Journal of Memory & Language,39,347–370.
Storkel, H. L. (2001). Learning new words: Phonotactic probability in language development.Journal of Speech, Language, & Hearing Research,44,1321–1337.
Storkel, H. L. (2003). Learning new words II: Phonotactic probability in verb learning.Journal of Speech, Language, & Hearing Research,46,1312–1323.
Storkel, H. L. (in press). Methods for minimizing the confounding effects of word length in the analysis of phonotactic probability and neighborhood density.Journal of Speech, Language, & Hearing Research.
Storkel, H. L., &Rogers, M. A. (2000). The effect of probabilistic phonotactics on lexical acquisition.Clinical Linguistics & Phonetics,14,407–425.
Treiman, R. (1986). The division between onsets and rimes in English syllables.Journal of Memory & Language,25,476–491.
Vitevitch, M. S. (2002a). Influence of onset density on spoken-word recognition.Journal of Experimental Psychology: Human Perception & Performance,28,270–278.
Vitevitch, M. S. (2002b). The influence of phonological similarity neighborhoods on speech production.Journal of Experimental Psychology: Learning, Memory, & Cognition,28,735–747.
Vitevitch, M. S. (2003). The influence of sublexical and lexical representations on the processing of spoken words in English.Clinical Linguistics & Phonetics,6, 487–499.
Vitevitch, M. S., Armbrüster, J., &Chu, S. (2004). Sublexical and lexical representations in speech production: Effects of phonotactic probability and onset density.Journal of Experimental Psychology: Learning, Memory, & Cognition,30,514–529.
Vitevitch, M. S., &Luce, P. A. (1998). When words compete: Levels of processing in spoken word perception.Psychological Science,9,325–329.
Vitevitch, M. S., &Luce, P. A. (1999). Probabilistic phonotactics and spoken word recognition.Journal of Memory & Language,40,374–408.
Vitevitch, M. S., Luce, P. A., Charles-Luce, J., &Kemmerer, D. (1997). Phonotactics and syllable stress: Implications for the processing of spoken nonsense words.Language & Speech,40,47–62.
Vitevitch, M. S., Luce, P. A., Pisoni, D. B., &Auer, E. T. (1999). Phonotactics, neighborhood activation, and lexical access for spoken words.Brain & Language,68306–311.
Vitevitch, M. S., Pisoni, D. B., Kirk, K. I., Hay-McCutcheon, M., &Yount, S. L. (2002). Effects of phonotactic probabilities on the processing of spoken words and nonwords by postlingually deafened adults with cochlear implants.Volta Review,102, 283–302.
Zipf, G. K. (1935).The psycho-biology of language: An introduction to dynamic philology. Cambridge, MA: Houghton Mifflin.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported in part by Grants R03 DC 04259 and P30 HD 002528 (University of Kansas), and R01 DC 0265801 (SUNY Buffalo) from the National Institute on Deafness and Other Communication Disorders, National Institutes of Health.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Vitevitch, M.S., Luce, P.A. A Web-based interface to calculate phonotactic probability for words and nonwords in English. Behavior Research Methods, Instruments, & Computers 36, 481–487 (2004). https://doi.org/10.3758/BF03195594
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03195594