Abstract
It is well known that the statistical characteristics of a language, such as word frequency or the consistency of the relationships between orthography and phonology, influence literacy acquisition. Accordingly, linguistic databases play a central role by compiling quantitative and objective estimates about the principal variables that affect reading and writing acquisition. We describe a new set of Web-accessible databases of French orthography whose main characteristic is that they are based on frequency analyses of words occurring in reading books used in the elementary school grades. Quantitative estimates were made for several infralexical variables (syllable, grapheme-to-phoneme mappings, bigrams) and lexical variables (lexical neighborhood, homophony and homography). These analyses should permit quantitative descriptions of the written language in beginning readers, the manipulation and control of variables based on objective data in empirical studies, and the development of instructional methods in keeping with the distributional characteristics of the orthography.
Article PDF
Similar content being viewed by others
References
Alegria, J., &Mousty, P. (1996). The development of spelling procedures in French-speaking, normal and reading-disabled children: Effects of frequency and lexicality.Journal of Experimental Child Psychology,63, 312–338.
Ans, B., Carbonnel, S., &Valdois, S. (1998). A connectionist multiple-trace memory model for polysyllabic word reading.Psychological Review,105, 678–723.
Baayen, R. H., Piepenbrock, R., &Gulikers, L. (1995).The CELEX lexical database [CD-ROM]. Philadelphia: Linguistic Data Consortium, University of Pennsylvania.
Balota, D. A., &Chumbley, J. I. (1984). Are lexical decisions a good measure of lexical access? The role of word frequency in the neglected decision stage.Journal of Experimental Psychology: Human Perception & Performance,10, 340–357.
Balota, D. A., Cortese, M. J., Sergent-Marshall, S. D., Spieler, D. H., &Yap, M. J. (2004). Visual word recognition of single-syllable words.Journal of Experimental Psychology: General,133, 283–316.
Berndt, R. S., D’Autrechy, C. L., &Reggia, J. A. (1994). Functionnal pronunciation units in English words.Journal of Experimental Psychology: Learning, Memory, & Cognition,20, 977–991.
Brown, G. D. A., &Deavers, R. P. (1999). Units of analysis in nonword reading: Evidence from children and adults.Journal of Experimental Child Psychology,73, 208–242.
Bruck, M., Genesee, F., &Caravolas, M. (1997). A cross linguistic study of early literacy acquisition. In B. Blachman (Ed.),Foundations of reading acquisition and dyslexia: Implications for early intervention (pp. 145–162). Mahwah, NJ: Erlbaum.
Bruck, M., &Waters, G. (1990). An analysis of component spelling and reading skills of good readers—good spellers, good readers—poor spellers and poor readers—poor spellers. In T. H. Carr & B. A. Levy (Eds.),Reading and its development: Component skills approaches (pp. 161–206). San Diego: Academic Press.
Brand, M., Rey, A., Peereman, R., & Spieler, D. (2007).Syllable frequency effects in disyllabic word reading: A large-scale study. Manuscript in preparation.
Carroll, J. B., Davies, P., & Richman, B. (Eds.) (1971).The American heritage word-frequency book. Boston: Houghton Mifflin.Catach, N. (1980).L’orthographe française: Traité théorique et pratique [French orthography: Theoretical and practical treatise]. Paris: Nathan.
Catach, N. (1984).La phonétisation automatique du français: les ambiguïtés de la langue écrite. Paris: Presse du CNRS.
Catach, N., Jejcic, F., &HESO Group (1984).Les listes orthographiques de base du français (LOB). Les mots les plus fréquents et leurs formes fléchies les plus fréquentes. Paris: Nathan.
Chateau, D., &Jared, D. (2003). Spelling—sound consistency effects in disyllabic word naming.Journal of Memory & Language,48, 255–280.
Clements, G. N. (1990). The role of the sonority cycle in core syllabation. In J. Kingston, & M. E. Beckman (Eds.),Papers in laboratory phonology: 1. Between the grammar and physics of speech (pp. 283–333). Cambridge: Cambridge University Press.
Colé, P., Magnan, A., &Grainger, J. (1999). Syllable-sized units in visual word recognition: Evidence from skilled and beginning readers.Applied Psycholinguistics,20, 507–32.
Colé, P., Sprenger-Charolles, L., Siegel, L., & Jimenez Gonzales, J. E. (2004, June).Syllables in learning to read in English, French and Spanish. Paper presented at the SSS-R Congress, Amsterdam.
Coltheart, M., Rastle, K., Perry, C., Langdon, R., &Ziegler, J. (2001). DRC: A Dual Route Cascaded model of visual word recognition and reading aloud.Psychological Review,108, 204–256.
Content, A. (1991). The effect of spelling-to-sound regularity on naming in French.Psychological Research,53, 3–12.
Content, A., Mousty, P., &Radeau, M. (1990). BRULEX: Une base de données lexicales informatisée pour le français écrit et parlé.L’Année Psychologique,90, 551–566.
Content, A., &Peereman, R. (1992). Single and multiple process models of print to sound conversion. In J. Alegria, D. Holender, J. Morais, & M. Radeau (Eds.),Analytic approaches to human cognition. Amsterdam: Elsevier.
Cuetos, F. (1989). Lectura y escritura de palabras a traves de la ruta phonologica [Involvement of the phonological reading route in word reading and spelling].Infencia y Aprendizaje,45, 71–84.
Cutler, A., &Carter, D. M. (1987). The predominance of strong initial syllables in the English vocabulary.Computer Speech & Language,2, 133–142.
De Cara, B., &Goswami, U. (2002). Similarity relations among spoken words: The special status of rimes in English.Behavior Research Methods, Instruments, & Computers,34, 416–423.
Dubois, F., & Buyse, R. (1952).Échelle Dubois3-Buyse. Bulletin de la Société Alfred Binet, No. 405. (Original work published 1940)
Duncan, L. G., &Seymour, P. H. K. (2003). How do children read multisyllabic words? Some preliminary observations.Journal of Research in Reading,26, 101–120.
Elman, J. L. (1993). Learning and development in neural networks: The importance of starting small.Cognition,48, 71–99.
Eme, E., &Golder, C. (2005). Word-reading and word-spelling styles of French beginners: Do all children learn to read and spell in the same way?Reading & Writing: An Interdisciplinary Journal,18, 157–188.
Folk, J. R., &Morris, R. K. (1995). Multiple lexical codes in reading: Evidence from eye movements, naming time, and oral reading.Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 1412–1429.
Frauenfelder, U. H., Baayen, R. H., Hellwig, F. M., &Schreuder, R. (1993). Neighborhood density and frequency across languages and modalities.Journal of Memory & Language,32, 781–804.
Frederiksen, J. R., &Kroll, J. F. (1976). Spelling and sound: Approaches to the internal lexicon.Journal of Experimental Psychology: Human Perception & Performance,2, 361–379.
Frith, U., Wimmer, H., &Landerl, K. (1998). Differences in phonological recoding in German- and English-speaking children.Scientific Studies of Reading,2, 31–54.
Gahl, S., Jurafsky, D., &Roland, D. (2004). Verb subcategorization frequencies: American English corpus data, methodological studies, and cross-corpus comparisons.Behavior Research Methods, Instruments, & Computers,36, 432–443.
Goswami, U., Gombert, J. E., &Barrera, L. F. (1998). Children’s orthographic representations and linguistic transparency: Nonsense word reading in English, French and Spanish.Applied Psycholinguistics,19, 19–52.
Goswami, U., Ziegler, J. C., Dalton, L., &Schneider, W. (2003). Nonword reading across orthographies: How flexible is the choice of reading units?Applied Psycholinguistics,24, 235–247.
Gottlob, L. R., Goldinger, S.D., Stone, G.O., &Van Orden, G.C. (1999). Reading homographs: Orthographic, phonologic, and semantic dynamics.Journal of Experimental Psychology: Human Perceptual Processes,25, 561–574.
Gougenheim, G., Michéa, R. Rivenc, P., &Sauvageot, A. (1964).L’élaboration du français fondamental (1° degré). Paris: Didier.
Hudson, P. T. W., &Bergman, M. W. (1985). Lexical knowledge in word recognition: Word length and word frequency in naming and lexical decision tasks.Journal of Memory & Language,24, 46–58.
Imbs, P. (1971).Dictionnaire des fréquences: Vocabulaire littéraire des XIXe et XXe siècles. I: Table alphabétique. II: Table des fréquences décroissantes. Nancy, Paris: CNRS, Didier.
Jared, D. (1997). Spelling—sound consistency affects the naming of highfrequency words.Journal of Memory & Language,36, 687–715.
Jimenez, J. E., &Guzman, R. (2003). The influence of code-oriented versus meaning-oriented approaches to reading instruction on word recognition in the Spanish language.International Journal of Psychology,38, 65–78.
Jones, M. N., &Mewhort, D. J. K. (2004). Case-sensitive letter and bigram frequency counts from large-scale English corpora.Behavior Research Methods, Instruments, & Computers,36, 388–396.
Jusczyk, P. W. (1997).The discovery of spoken language. Cambridge, MA: MIT Press.
Kessler, B. &Treiman, R. (1997). Syllable structure and the distribution of phonemes in English syllables.Journal of Memory & Language,37, 295–311.
Kučera, H., &Francis, W.N. (1967).Computational analysis of present-day American English. Providence, RI: Brown University Press.
Kwantes, P. J., &Mewhort, D. J. K. (1999). Evidence for sequential processing in visual word recognition.Journal of Experimental Psychology: Human Perception & Performance,25, 376–381.
Laeufer, C. (1992). Syllabification and resyllabification in French. In C. Laeufer & T. A. Morgan (Eds.),Theoretical analyses in Romance linguistics (pp. 18–36). Amsterdam: Benjamins.
Lambert, E., &Chesnet, D. (2001). Novlex: Une base de données lexicales pour les élèves de primaire.L’Année Psychologique,101, 277–288.
Lamberts, K. (2005). Interpretation of orthographic uniqueness point effects in visual word recognition.Journal of Experimental Psychology: Human Perception & Performance,31, 14–19.
Landauer, T. K., &Streeter, L. A. (1973). Structural differences between common and rare words: Failure or equivalence assumption for theories of word recognition.Journal of Learning & Verbal Behavior,12, 119–131.
Lange, M. (2000).De l’orthographe à la prononciation: Nature des processus de conversion graphème-phonème dans la reconnaissance des mots écrits. Unpublished doctoral dissertation, Université libre de Bruxelles.
Laxon, V., Coltheart, V., &Keating, C. (1988). Children find friendly words friendly too: Words with many orthographic neighbours are easier to read and spell.British Journal of Educational Psychology,58, 103–119.
Laxon, V., Masterson, J., &Moran, R. (1994). Are children’s representations of words distributed? Effects of orthographic neighbourhood size, consistency, and regularity of naming.Language & Cognitive Processes,9, 1–27.
Le Petit Robert (2001).Dictionnaires Le Robert. Electronic version 2.2.
Lété, B., Sprenger-Charolles, L., &Colé, P. (2004). MANULEX: A grade-level lexical database from French elementary-school readers.Behavior Research Methods, Instruments, & Computers,36, 156–166.
Leybaert, J., &Content, A. (1995). Reading and spelling acquisition in two different teaching methods: A test of the independence hypothesis.Reading & Writing: An Interdisciplinary Journal,7, 65–88.
Lindell, A. K., Nicholls, M. E. R., Kwantes, P. J. K., &Castles, A. (2005). Sequential processing in hemispheric word recognition: The impact of initial letter discriminability on the OUP naming effect.Brain & Language,93, 160–172.
Marslen-Wilson, W. D. (1984). Function and process in spoken word recognition. In H. Bouma & D. G. Bouwhuis (Eds.),Attention and performance X: Control of language processes (pp. 125–150). Hillsdale, NJ: Erlbaum.
Martensen, H., Maris, E., &Dijkstra, T. (2003). Phonological ambiguity and context sensitivity: On sublexical clustering in visual word recognition.Journal of Memory & Language,89, 375–395.
Masterson, J., Stuart, M., Dixon, M., Desmond, L., & Lovejoy, S. (2003).The children’s printed word data base. Retrieved July 5, 2006, from www.essex.ac.uk/psychology/cpwd.
Maxwell, J. P., Masters, R. S. W., Kerr, E., &Weedon, E. (2001). The implicit benefit of learning without errors.Quarterly Journal of Experimental Psychology,54A, 1049–1068.
McClelland, J. L., &Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: Part I: An account of basic findings.Psychological Review,88, 375–407.
Monsell, S. (1991). The nature and locus of word frequency effects in reading. In D. Besner & G. W. Humphreys (Eds.),Basic processes in reading: Visual word recognition (pp. 148–197). Hillsdale, NJ: Erlbaum.
New, B., Pallier, C., Brysbaert, M., &Ferrand, L. (2004). Lexique 2: A new French lexical database.Behavior Research Methods, Instruments, & Computers,36, 516–524.
Novick, L. R., &Sherman, S. J. (2004). Type-based bigram frequencies for five-letter words.Behavior Research Methods, Instruments, & Computers,36, 397–401.
Paulesu, E., Démonet, J.-F., Fazio, F., McCrory, E., Chanoine, V., Brunswick, N., et al. (2001). Dyslexia, cultural diversity and biological unity.Science,291, 2165–2167.
Peereman, R. (1995). Naming regular and exception words: Further examination of the effect of phonological dissension among lexical neighbours.European Journal of Cognitive Psychology,7, 307–330.
Peereman, R., &Content, A. (1995). The neighborhood size effect in naming: Lexical activation or sublexical correspondences?Journal of Experimental Psychology: Learning, Memory, & Cognition,21, 409–421.
Peereman, R., &Content, A. (1997). Orthographic and phonological neighborhoods in naming: Not all neighbors are equally influential in orthographic space.Journal of Memory & Language,37, 382–421.
Peereman, R., & Content, A. (1998).Quantitative analyses of orthography to phonology mapping in English and French. Retrieved July 5, 2006, from homepages.ulb.ac.be/~acontent/OPMapping.html.
Peereman, R., &Content, A. (1999). LexOP: A Lexical database with Orthography—phonology statistics for French monosyllabic words.Behavior Research Methods, Instruments, & Computers,31, 376–379.
Peereman, R., &Dufour, S. (2003). Un correctif aux codifications phonétiques de la base de données LEXIQUE.L’Année Psychologique,103, 103–108.
Pexman, P. M., Lupker, S. J., &Jared, D. (2001). Homophone effects in lexical decision.Journal of Experimental Psychology: Learning, Memory, & Cognition,27, 139–156.
Plaut, D. C., McClelland, J. L., Seidenberg, M.S., &Patterson, K. (1996). Understanding normal and impaired word reading: Computational principles in quasi-regular domains.Psychological Review,103, 56–115.
Powell, D., Plaut, D. C., & Funnell, E. (in press). Does the PMSP connectionist model of single word reading learn to read in the same way as a child?Journal of Research in Reading.
Pring, L. (1981). Phonological codes and functional spelling units: Reality and implications.Perception & Psychophysics,30, 573–578.
Pulgram, E. (1970).Syllable, word, nexus, cursus. The Hague: Mouton.
Radeau, M., &Morais J. (1990). The uniqueness point effect in the shadowing of spoken words.Speech Communication,9, 155–164.
Rey, A., Jacobs, A. M., Schimdt-Weigand, F., &Ziegler, J. C. (1998). A phoneme effect in visual word recognition.Cognition,68, 41–50.
Rohde, D. L. T., &Plaut, D. C. (1999). Language aquisition in the absence of explicit negative evidence: How important is starting small?Cognition,72, 67–109.
Saffran, J. R., Newport, E. L., &Aslin, R. N. (1996). Word segmentation: The role of distributional cues.Journal of Memory & Language,35, 606–621.
Seidenberg, M. S., Waters, G. S., Barnes, M. A., &Tanenhaus, M. K. (1984). When does irregular spelling or pronunciation influence word recognition?Journal of Verbal Learning & Verbal Behavior,23, 383–404.
Seidenberg, M. S., &Zevin, J. D. (2006). Connectionist models in de velopmental cognitive neuroscience: Critical periods and the paradox of success. In Y. Munakata & M. Johnson (Eds.),Attention and performance XXI: Processes of change in brain and cognitive development. Oxford: Oxford University Press.
Seymour, P. H. K., Aro, M., &Erskine, J. M. (2003). Foundation literacy acquisition in European orthographies.British Journal of Psychology,94, 143–174.
Sprenger-Charolles, L. (2003). Reading acquisition: Cross linguistic data. In T. Nunes & P. Bryant (Eds.),Handbook of children’s literacy (pp. 43–66). Dordrecht: Kluwer.
Sprenger-Charolles, L., Colé, P., Béchennec, D., &Kipffer-Piquard, A. (2005). French normative data on reading and related skills: from EVALEC, a new computarized battery of tests.European Review of Applied Psychology,55, 157–186.
Sprenger-Charolles, L., Siegel, L. S., Béchennec, D., &Serniclaes, W. (2003). Development of phonological and orthographic processing in reading aloud, in silent reading and in spelling: A four year longitudinal study.Journal of Experimental Child Psychology,84, 194–217.
Sprenger-Charolles, L., Siegel, L. S., &Bonnet, P. (1998). Phonological mediation and orthographic factors in reading and spelling.Journal of Experimental Child Psychology,68, 134–155.
Stage S. A., &Wagner R. K. (1992). Development of young children’s phonological and orthographic knowledge as revealed by their spellings.Developmental Psychology,28, 287–296.
Stanback, M. L. (1992). Syllable and rime patterns for teaching reading: Analysis of a frequency-based vocabulary of 17,602 words.Annals of Dyslexia,42, 196–221.
Tamaoka, K., &Makioka, S. (2004). Frequency of occurrence for units of phonemes, morae and syllables appearing in a lexical corpus of a Japanese newspaper.Behavior Research Methods, Instruments, & Computers,36, 531–547.
Treiman, R., Mullenix, J., Bijeljac-Babic, R., &Richmond-Welty, E. D. (1995). The special role of rimes in the description, use, and acquisition of English orthography.Journal of Experimental Psychology: General,124, 107–136.
Vander Beke G. E. (1935).French word book. New York: Macmillan.
Véronis, J. (1986). Etude quantitative sur le système graphique et phono-graphique du Français.Cahiers de Psychologie Cognitive,6, 501–531.
Wimmer, H., &Goswami, U. (1994). The influence of orthographic consistency on reading development: Word recognition in English and German children.Cognition,51, 91–103.
Wimmer, H., &Hummer, P. (1990). How German speaking first graders read and spell: Doubts on the importance of the logographic stage.Applied Psycholinguistics,11, 349–368.
Wingfield, A., Goodglass, H., Lindfield K. C. (1997). Word recognition from acoustic onsets and acoustic offsets: Effects of cohort size and syllabic stress.Applied Psycholinguistics,18, 85–100.
Zeno, S. M., Ivenz, S. H., Millard, R. T., &Duvvuri, R. (1995).The educator’s word frequency guide. Brewster, NY: Touchstone Applied Science Associates.
Zevin, J. D., &Seidenberg, M. S. (2002). Age of acquisition effects in word reading and other tasksJournal of Memory & Language,47, 1–29.
Zevin, J. D., &Seidenberg, M. S. (2004). Age of acquisition effects in reading aloud: Tests of cumulative frequency and frequency trajectory.Memory & Cognition,32, 31–38.
Ziegler, J., &Goswami, U. (2005). Reading acquisition, developmental dyslexia and skilled reading accross languages: A psycholinguistic grain size theory.Psychological Bulletin,13, 3–29.
Ziegler, J. C., Jacobs, A. M., &Stone, G. O. (1996). Statistical analyses of the bidirectional inconsistency of spelling and sound in French.Behavior Research Methods, Instruments, & Computers,28, 504–515.
Ziegler, J. C., Perry, C., &Coltheart, M. (2003). Speed of lexical and nonlexical processing in French: The case of the regularity effect.Psychonomic Bulletin & Review,10, 947–953.
Ziegler, J. C., Stone, G. O., &Jacobs, A. M. (1997). What is the pronunciation for -ough and the spelling for /u/? A data base for computing feedforward and feedback consistency in English.Behavior Research Methods, Instruments, & Computers,29, 600–618. (Manuscript received January 20, 2006; revision accepted for publication June 30, 2006.)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by grants from the French Ministry of Research (Ecole et Sciences Cognitives) to Arnaud Rey and R.P. and from the Conseil Régional de Bourgogne (Contrat de Plan Etat-Région 2006) to R.P. The authors are grateful to Caroline Calmus for her help with the phonological transcriptions of proper names, and to Vivian Waltz for her assistance with the English.
Rights and permissions
About this article
Cite this article
Peereman, R., Lété, B. & Sprenger-Charolles, L. Manulex-infra: Distributional characteristics of grapheme—phoneme mappings, and infralexical and lexical units in child-directed written material. Behavior Research Methods 39, 579–589 (2007). https://doi.org/10.3758/BF03193029
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03193029