Reading without spaces: The role of precise letter order

Mirault, Jonathan; Snell, Joshua; Grainger, Jonathan

doi:10.3758/s13414-018-01648-6

Reading without spaces: The role of precise letter order

Open access
Published: 09 January 2019

Volume 81, pages 846–860, (2019)
Cite this article

Download PDF

You have full access to this open access article

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Reading without spaces: The role of precise letter order

Download PDF

Jonathan Mirault¹,
Joshua Snell¹ &
Jonathan Grainger¹

Abstract

Prior research points to efficient identification of embedded words as a key factor in facilitating the reading of text printed without spacing between words. Here we further tested the primary role of bottom-up word identification by altering this process with a letter transposition manipulation. In two experiments, we examined silent reading and reading aloud of normal sentences and sentences containing words with letter transpositions, in both normally spaced and unspaced conditions. We predicted that letter transpositions should be particularly harmful for reading unspaced text. In line with our prediction, the majority of our measures of reading fluency showed that unspaced text with letter transpositions was disproportionately difficult to read. These findings provide further support for the claim that reading text without between-word spacing relies principally on efficient bottom-up processing, enabling accurate word identification in the absence of visual cues to identify word boundaries.

Effects of word spacing on children’s reading: Evidence from eye movements

Article 09 October 2021

Between-word processing and text-level skills contributing to fluent reading of (non)word lists and text

Article Open access 03 April 2024

On the noisy spatiotopic encoding of word positions during reading: Evidence from the change-detection task

Article Open access 09 October 2020

Introduction

A number of studies have investigated the ability of skilled readers to read text in which the extra interword spacing has been removed (e.g., Dreighe, Fitzsimmons & Liversedge, 2017; Epelboim, Booth, Ashkenazy, Taleghani & Steinman, 1997; Morris, Rayner & Pollatsek, 1990; Perea & Acha, 2009; Rayner, Fischer, & Pollatsek, 1998; Veldre, Drieghe & Andrews, 2017). The results of this research indicate that reading unspaced text is slower by about 40–70% relative to reading normally spaced text (Rayner & Pollatsek, 1996; Rayner et al., 1998). Readers make shorter saccades accompanied by longer fixations and more regressions when reading unspaced text, and the effect of word frequency on fixation durations is greater with unspaced text (Rayner et al., 1998). Furthermore, given the overall shorter saccade lengths, initial landing positions are closer to the beginning of words in unspaced text (Paterson & Jordan, 2010; Perea & Acha, 2009).^{Footnote 1}

The conclusion that has emerged from this research is that removing the spacing between words disrupts two distinct processes: saccade programming and word identification (Perea & Acha, 2009; Rayner et al., 1998). Firstly, given the key role for interword spaces in guiding eye movements during the reading of normally spaced text (e.g., Inhoff, Eiter, Radach, & Juhasz, 2003), removing interword spaces will logically affect saccade programming. The results of prior research suggest that readers adopt a more cautious oculomotor strategy when reading unspaced text, leading to a greater number of saccades per sentence (both forward and regressive) that are shorter in length. Secondly, the longer time spent inspecting each word when reading unspaced text (as reflected by longer fixation durations) is most likely due to the absence of visual cues for word beginnings and endings, and also possibly due to crowding effects occurring not only for the word’s inner-positioned letters, as is the case in normal (spaced) reading (e.g., Tydgat & Grainger, 2009), but also for the word’s outer-positioned letters.

In another study on reading unspaced text (Mirault, Snell, & Grainger, 2018) we investigated the role of sentence-level structures. In that study we compared reading of grammatically correct sentences and shuffled versions of the same words presented both with normal spacing and without spaces. In line with prior research, we found that reading was hampered by removing sentence structure (Schad, Nuthmann, & Engbert, 2010). Furthermore, there was only limited evidence that sentence structure facilitated the reading of unspaced text more so than reading spaced text. This pattern of results suggests that our ability to read grammatically correct unspaced text is not principally due to a greater involvement of top-down feedback from sentence-level structures.

On the other hand, our prior research did point to a key role for word identification processes in reading unspaced text, not only for linguistic processing, but also for guiding eye movements. We found that the length of the currently fixated word determined the amplitude of forward saccades leaving that word during the reading of unspaced text. This result suggests that readers of unspaced text use length information about the currently fixated word in order to program a saccade beyond that word’s rightward boundary. In the absence of visual cues, such length information can only be obtained by word identification providing access to information about word length. We therefore concluded that the relative ease with which skilled readers can read unspaced text is mainly due to efficient bottom-up word identification processes continuing to operate, and that support from sentence-level structures can facilitate these processes in certain conditions. Further support for this conclusion was found in the significantly greater impact of word frequency in the unspaced condition compared with normal spacing (see also, Veldre et al., 2017).

The present study was designed to further investigate the hypothesized importance of bottom-up word identification processes when reading unspaced text. Why might word identification be more important for reading unspaced text? First of all, we have shown that word identification guides eye movements when reading unspaced text, whereas the visual cues provided by interword spacing are the principal guiding factor when reading normally spaced text. Secondly, when reading unspaced text, word identification provides word order information that is necessary for the construction of a sentence-level representation. That is, the order in which words are identified is the main source of word order information, whereas with normally spaced text, the construction of a sentence-level representation benefits from the presence of interword spaces that facilitate the assignment of order information to word identities (Grainger, 2018; Snell & Grainger, 2017; Snell, Meeter, & Grainger, 2017). In line with this reasoning is the evidence obtained from readers of Thai, a language with an alphabetic script that does not use between-word spacing. It has been shown that Thai readers benefit from the artificial insertion of interword spaces, and the eye-movements of these readers suggest that this facilitation arises mainly at the level of word identification and sentence-level comprehension (Winskel, Perea, & Ratitamkul, 2012; Winskel, Radach, & Luksaneeyanawin, 2009).

In the present study, we tested the hypothesized greater role for word identification in reading unspaced text by selectively perturbing this process. We did so by introducing letter transpositions in certain words in each sentence. In a seminal study, Rayner, White, Johnson, and Liversedge (2006) recorded eye movements while participants read sentences that could either be formed of normally written words or contained a number of words with letter transpositions (e.g., The boy cuold not slove the probelm so he aksed for help). Rayner et al. reported that although reading text containing letter transpositions was relatively fluent, in line with prior findings from the single word recognition literature (e.g., Perea & Lupker, 2004; see Grainger, 2008, for a review), there was nonetheless a cost. That is, reading text containing letter transpositions induced longer fixation durations and more refixations and regressions compared with normally written text (see also Blythe, Johnson, Liversedge, & Rayner, 2014; White, Johnson, Liversedge, & Rayner, 2008).

The specific aim of the present study was to test the prediction that letter transpositions should have a significantly greater impact on reading unspaced text compared with normally spaced text. Two prior studies have conjointly manipulated between-word spacing and letter transpositions and have produced contradictory findings. Winskel et al. (2012) investigated the effects of letter transpositions and interword spacing in Thai. These authors reported an interfering effect of letter transpositions that did not interact with the spacing manipulation. However, the lack of an interaction in this study is likely due to the fact that Thai readers have developed efficient mechanisms for word segmentation in the absence of interword spacing, plus the fact that the presence of interword spaces is not natural for Thai readers. More directly related to the present study is the work of Johnson and Eisler (2012), who investigated effects of letter transpositions and interword spacing in English. The key results are those obtained in their Experiment 3, where rather than replacing interword spaces with filler stimuli, the typical greater spacing between words compared with inter-letter spacing was cancelled by increasing inter-letter spacing. The specific aim of that study, however, was to investigate effects of the position of letter transpositions, and the critical interaction between transposition effects (measured relative to a no-transposition condition) and spacing was not tested. Nevertheless, the condition means revealed much greater transposition effects in the absence of extra between-word spacing in all reading time measures except for first fixation durations. However, the choice to increase between-letter spacing rather than reducing between-word spacing might have impacted on their results. Therefore, in the present study we provide a further test of the predicted interaction between transposition effects and interword spacing in two experiments where normal inter-letter spacing was retained and the additional space between words was removed. In Experiment 1 we recorded eye movements while participants silently read sentences, and in Experiment 2 we collected audio recordings while participants read aloud the same set of sentences.

Experiment 1: Silent reading

Method

Participants

Thirty-two participants (24 female)^{Footnote 2} from Aix-Marseille University, Marseille, France, received either €10 per hour or course credit for their participation. The participants were all native French speakers and gave written consent prior to the experiment. They reported having normal or corrected-to-normal vision, ranged in age from 18 to 28 years (M = 22.07, SD = 2.46), and were naïve with regard to the purpose of the experiment. French language skills were assessed using a Spelling Dictation test (Beyersmann, Casalis, Ziegler, & Grainger, 2015) and the LexTale vocabulary test (Brysbaert, 2013). Participants’ average scores were 78.59% (SD = 13.75) on the dictation test, and 88.37% (SD = 4.86) on the vocabulary test.

Design and stimuli

We constructed 104 sentences in French, each containing seven words. The sentences ranged in length from 37 to 57 characters including spaces (M = 47.94, SD = 3.77), and the average word frequency was 1,825 parts per million (ppm) (based on the Lexique2 film frequency counts: New, Pallier, Brysbaert, & Ferrand, 2004), which is equivalent to 6.26 Zipf (van Heuven, Mandera, Keuleers, & Brysbaert, 2014). Following a 2 × 2 factorial design, we manipulated between-word spacing (Spacing: spaced vs. unspaced) and word letter order (Transposition: normally written words vs. words containing transposed letters). The introduction of letter transpositions was constrained by five criteria: (i) the letters were adjacent consonants^{Footnote 3}, (ii) the first two and the last two letters of words were never transposed, (iii) the letters did not form a complex grapheme, (iv) the word containing the transposed letters was at least five letters long, and (v) words containing the transposed letters were always at the second position (verb), the fourth position (noun), and the fifth position (adjective) in sentences (i.e., three critical words per sentence contained letter transpositions in the transposed-letter condition). These critical words had an average frequency of 4.55 Zipf and an average length of eight letters. Words containing letter transpositions were never repeated across the different sentences seen by a given participant. Sentences were presented in lower case, except for the initial uppercase letter, and only contained letters without accents (see Appendix for a complete list the sentences and their transposed-letter versions). A Latin-square design was used with four groups of participants to ensure that all sentences were tested in all four conditions, but were seen only once per participant. Therefore a given set of three critical words were seen normally written and written with letter transpositions in both the spaced and unspaced conditions but by different participants.

Apparatus

Stimuli were displayed using OpenSesame (Mathôt, Schreij & Theeuwes, 2012), with each sentence occupying a single line. Eye movements were recorded with an EyeLink 1000 system (SR Research, Mississauga, ON, Canada) with high spatial resolution (0.01°) and a sampling rate of 1,000 Hz. Viewing was binocular, but only the right eye was monitored. The sentences were displayed on a gamma-calibrated 20-in. ViewSonic CRT monitor with a refresh rate of 150 Hz and a screen resolution of 1,024 x 768 pixels (30 x 40 cm). Stimuli were presented in black (0.15 Cd/m²) on a gray background (21.70 Cd/m²). Participants were seated 86 cm from the monitor, such that 3.6 characters equaled approximately 1° of visual angle. A chin-rest and a forehead-rest were used to minimize head movements.

Procedure

At the beginning of the experiment, the participant’s eye position was calibrated using a 9-point calibration grid. Each trial started with a drift correction dot located 200 pixels to the right of the left edge of the display (Fig. 1). Participants were instructed to focus on this dot, which would trigger the onset of a sentence stimulus. The distance between the fixation point and the beginning of the sentence was randomly determined, within a range of -54 to +32 pixels. Participants were instructed to read from left to right for comprehension. An invisible boundary was defined at the end of the sentence, such that the sentence disappeared when the eyes crossed that boundary. Next, participants were shown a question that allowed us to check whether they had paid attention to the word sequence. Participants were instructed to indicate whether they had seen a given word (e.g., “Did you see the word ‘table’?”) by means of a two-button response for, respectively, “yes” and “no” responses (probe word classification). Half of these questions concerned a word that was present in the sentence, and the other half a word that was not present in the sentence. The probe words never contained a letter transposition. Finally, a feedback dot was presented over 2,000 ms after the probe word classification response (green if the response was correct or red if the response was incorrect). The sentences were presented in a different random order for each participant. Participants received ten practice trials to familiarize them with the experimental procedure.

Analyses

We used linear mixed-effects models (LMEs) to analyze our data, with items and participants as crossed random effects (including by-item and by-participant random intercepts) (Baayen, Davidson, & Bates, 2008) and with random slopes (Barr, Levy, & Tily, 2013), and with Spacing and Transposition plus their interaction as fixed effects. The model successfully converged under this maximal random-effects structure in some but not all cases. In case of a failure to converge, we excluded the by-item random slopes (a Chi-square test indicated that a model including the by-item random slopes did not differ significantly from a model including the by-participant random slopes, so this was an arbitrary choice); and if a model then still failed to converge, we included only random intercepts. Generalized (logistic) linear mixed-effects models (GLMEs) were used to analyze the error rate and fixation probabilities. The models were fitted with the lmer (for LMEs) and glmer (for GLMEs) functions from the lme4 package (Bates, Maechler, Bolker, & Walker, 2015) in the R statistical computing environment. The condition with normal spacing and without letter transpositions was used as a reference and we reported regression coefficients (b), standard errors (SE),s and t-values (for LMEs) or z-values (for GLMEs) for all factors. Fixed effects were deemed reliable if |t| or |z| > 1.96 (Baayen, 2008). All duration measures were inverse-transformed (-1,000/duration) prior to analysis for the purpose of normalization.

Results

The eye-movement data of one participant were removed prior to analysis due to a large number of eye blinks. All other participants depicted normal eye-movement behavior and responded with accuracy higher than 90% (M = 94.10, SD = 23.54) on the probe word classification trials. Response accuracy was significantly higher (b = 2.91; SE = 0.41; t = 6.97) with normally spaced sentences (98.47%) compared with unspaced sentences (89.75%). Prior to analysis we excluded trials containing blinks (5.04%) and trials with incorrect responses on the probe word classification task (5.89%). For the local word-based analyses, we used the data concerning the three critical target words in each sentence while excluding words that were skipped during first pass (1.81%). We measured and analyzed target word fixation durations and saccade type probabilities (skips, refixations, regressions), initial landing positions (ILPs; the location of the first forward fixation on a word), sentence reading speed, and estimated reading difficulty (evaluated by participants during post-experiment debriefing).

Fixation durations

From the eye-tracking data, we computed three fixation duration variables: First Fixation Duration (FFD), which represents the duration of the fixation immediately following the first forward saccade into a word; Gaze Duration (GD), which is the sum of all fixation durations on a word before the eyes leave that word (first pass fixations); and Total Viewing Time (TVT), which is the sum of all fixation durations on a word (thus including fixations made following a regressive saccade back to the word). These values were computed for the three critical target words in each sentence (i.e., words that involved a letter transposition manipulation in the transposition condition) and the average value per sentence entered in the analysis. From these data, we excluded words with values beyond 2.5 SD from the grand mean (FFD: 2.38%, GD: 2.46%, TVT: 2.98%). The mean duration values (in milliseconds) per experimental condition are presented in Fig. 2.

All the duration measures revealed significant effects of Spacing and Transposition, and in total viewing times there was also a significant interaction between these variables (see Table 1). Transposition effects were greater in the unspaced condition (83 ms) compared to the spaced condition (67 ms). We also analyzed second-pass reading times, which represent the amount of time spent re-reading a word after first-pass reading (Juhasz & Pollatsek, 2011). Here, we found a significant effect of Transposition (b = 0.10; SE = 0.03; t = 2.85) with longer reading times in the transposed-letter condition, but neither the effects of Spacing (b = 0.02; SE = 0.05; t = 0.40) nor the interaction were significant (b = 0.09; SE = 0.05; t = 1.73).

Table 1 Fixed effects from the LMEs for the Fixation Duration measures in Experiment 1

Full size table

Saccade-type probabilities

We calculated the probability of skipping a word (when a word is not fixated during first-pass forward eye movements), of refixating a word prior to leaving the word (within-word saccade), and of refixating a word after leaving that word (between-word regressive saccade). The average probabilities per experimental condition are shown in Fig. 3 and the results of the statistical analyses are reported in Table 2. We found that the absence of interword spaces caused a decrease in skipping probability accompanied by an increase in the probability of refixations and regressions. Letter transpositions had a significant effect in all three measures (Table 2), decreasing the skipping rate in the spaced condition and decreasing the skipping rate in the unspaced condition and increasing refixation and regression probabilities. For skipping probabilities, we observed an interaction between Spacing and Transposition, with a greater influence of letter transpositions in the unspaced condition compared to the spaced condition.

Table 2 Fixed effects from the GLMEs for the different measures of saccade type probabilities in Experiment 1

Full size table

Initial landing position (ILP)

Prior to statistical analysis of the initial landing positions (ILPs) we first excluded values lying beyond 2.5 SD from the mean (1.99%). Table 3 provides the mean ILP per experimental condition expressed in normalized values between the beginning (0) and the end (1) of words. The distributions of ILPs in each condition are shown in Fig. 4. There was a significant effect of Spacing (b = 0.06; SE = 0.00; t = 7.54), with ILPs being closer to the beginning of words in the unspaced condition, and a significant effect of Transposition (b = 0.01; SE = 0.00; t = 2.04), with the presence of transpositions causing the ILPs to shift slightly toward the beginning of words. The interaction between Spacing and Transposition was not significant (b = 0.00; SE = 0.00; t = 0.60).

Table 3 Mean initial landing positions from 0 (the beginning of the word) to 1 (the end of the word) in Experiment 1

Full size table

Sentence reading times

Sentence reading time was measured as the time between presentation of the stimulus and the moment participants’ eyes crossed the end boundary of the sentence. Thus, this measure gathered duration values for all the seven words of the sentence. Prior to analysis we excluded values beyond 2.5 SD from the mean (2.61% of trials). The average reading times (in ms) per experimental condition are shown in Table 4.

Table 4 Mean sentence reading times (ms) in the four experimental conditions of Experiment 1

Full size table

We found a significant effect of Spacing (b = 0.23; SE = 0.01; t = 16.72) and Transposition (b = 0.05; SE = 0.01; t = 5.21). The interaction between Spacing and Transposition was also significant (b = 0.02; SE = 0.01; t = 2.06) with greater transposition effects in the unspaced condition compared to the spaced condition (see Table 4).

Estimated reading difficulty

In order to evaluate the subjective difficulty of reading in the different conditions, at the end of the experiment we asked participants to estimate their experienced reading difficulty in each condition. To do so, they were instructed to move a cursor on a scale from 0 to 100, and the corresponding number of the location of the cursor was always visible. Responses were collected without time limit, and no data were excluded prior to analysis. The average values for each condition are reported in Table 5.

Table 5 Mean of estimated reading difficulty on a scale from 0 (very easy) to 100 (very difficult) in Experiment 1

Full size table

We found significant effects of Spacing (b = 28.29; SE = 3.32; t = 8.51) and Transposition (b = 17.80; SE = 2.60; t = 6.83), and also a significant interaction (b = 21.29; SE = 3.68 t = 5.87), with letter transpositions having a stronger effect when reading unspaced sentences compared to the normally spaced sentences (see Table 5).

Effects of vocabulary and spelling

Here we examined the impact of participants’ scores on the vocabulary and spelling dictation tests on the different dependent measures of Experiment 1, and whether these scores influenced the effects of Spacing and Transposition. We only report significant effects from LME and GLME analyses that successfully converged.

Gaze durations, total viewing times and initial landing positions were significantly influenced by vocabulary level (GD: b = 0.03; SE = 0.01; t = 2.05; TVT: b = 0.04; SE = 0.01; t = 2.27; ILP: b = 0.00; SE = 0.00; t = 2.62), with higher vocabulary scores leading to shorter viewing durations and a shift of the ILP toward the middle of words. First fixation duration, gaze duration and total viewing times were significantly influenced by spelling ability (FFD: b = 0.01; SE = 0.00; t = 2.30; GD: b = 0.01; SE = 0.00; t = 2.05; TVT: b = 0.01; SE = 0.00; t = 1.98), with higher spelling scores leading to shorter viewing durations.

More interesting is the fact that vocabulary level interacted with the effects of letter transpositions in gaze durations (b = 0.02; SE = 0.01; t = 1.98), such that the interfering effect of transposing letters was greater in participants with higher vocabulary scores. Letter transposition effects also interacted with spelling ability in gaze durations (b = 0.00; SE = 0.00; t = 2.62) and total viewing times (b = 0.01; SE = 0.00; t = 3.06).

Effects of boundary letter frequency

In these analyses we report on the effects of boundary letter frequencies. Boundary letter frequency refers to the position-specific token frequency of the first and last letters in words. These analyses are motivated by the findings of Kasisopa, Reilly, Luksaneeyanawin, and Burnham (2013) showing an impact of such variables when reading in Thai and suggesting that these letter frequencies might act as a cue to word boundaries when reading unspaced text. Averages of the first letter and last letter frequency values (in Zipf) across the three critical words in each sentence were used in the LME and GLME analyses. First and last letter frequency were entered as separate variables given that Kasisopa et al. found more robust effects of these two variables when analyzed separately as opposed to a combined bigram frequency measure. Here we only report significant effects obtained in analyses that successfully converged.^{Footnote 4}

There was a significant three-way interaction involving first letter frequency in the gaze durations (b = 0.37; SE = 0.15; t = 2.32) and total viewing times (b = 0.34; SE = 0.14; t = 2.42). The two-way interaction between Spacing and Transposition (i.e., the greater effect of letter transpositions in the unspaced condition) was found to be stronger with low first letter frequencies (GD: b = 0.29; SE = 0.10; t = 2.78; TVT: b = 0.32; SE = 0.09; t = 3.57) compared to high first letter frequencies (GD: b = 0.00.; SE = 0.11; t = 0.00; TVT: b = 0.11; SE = 0.10; t = 1.03). There was also a significant three-way interaction involving last letter frequency in the total viewing times (b = 0.34; SE = 0.14; t = 2.42). Again, the critical interaction between Spacing and Transposition was stronger when last letter frequency was low (b = 0.32; SE = 0.13; t = 2.46) compared to high (b = 0.13; SE = 0.14; t = 0.96).

Discussion

The results of Experiment 1 showed clear effects of both the Spacing factor and the Transposition factor on the majority of our measures of reading difficulty, both in terms of sentence-level measures (sentence reading speed and estimated reading difficulty), and in terms of local eye-movement behavior concerning the three critical target words in each sentence. The eye-movement results are in line with prior reports of effects of letter transpositions on fixation durations, and number of regressions and refixations (Blythe et al., 2014; Rayner et al., 2006; White et al., 2008), as well as prior reports of the influence of removing interword spaces on fixation durations, saccade-type probabilities, and initial landing positions (e.g., Mirault et al., 2018; Perea & Acha, 2009; Rayner et al., 1998). Crucial, with respect to the hypothesis under test, is that we observed a significantly stronger influence of letter transpositions when reading unspaced text compared with normally spaced text in total viewing times (per critical word) as well as for the sentence reading time and the estimated sentence reading difficulty. We also found that words containing letter transpositions were skipped more when reading unspaced text, whereas the opposite pattern was seen with normally spaced text. We return to discuss this finding in the General discussion. Overall, this pattern of results is in line with the hypothesized greater role for word identification processes when reading unspaced text, with letter transpositions selectively perturbing this process during reading.

In additional analyses we examined how the vocabulary scores and spelling ability of our participants influenced their reading behavior. The general pattern we observed was that higher vocabulary or spelling scores led to faster reading times in various measures. However, only vocabulary level affected initial landing positions, with a shift toward the middle of words for participants with higher scores. We also observed that the influence of vocabulary and spelling scores on certain duration measures was most pronounced in the condition with no letter transpositions. Vocabulary and spelling level had a much-reduced impact when reading text containing letter transpositions because participants with higher scores on these tests were more affected by interference from letter transpositions.

Finally, we found that differences in the frequency of the first and last letters of critical words impacted on the key interaction between Spacing and Transposition. The greater influence of letter transpositions in the unspaced condition significantly increased when first or last letter frequency was low. Low first and last letter frequencies increase uncertainty with respect to word boundaries, hence increasing the interference caused by introducing letter transpositions when there are no visual cues to word boundaries.

Experiment 2: Reading aloud

Eye-movement recordings do not actually tell us if words are correctly identified in the different conditions, and more precisely, whether or not participants were actually identifying the basewords from which the transposed-letter stimuli were generated. Experiment 2 was therefore run in order to measure how well participants can actually identify words, including the basewords of transposed-letter stimuli, in the different experimental conditions. To do so, we asked participants to read aloud the same set of sentences as tested in Experiment 1, and we recorded the vocal output.