In sentence reading, the processing of a word is affected by lexical properties such as frequency (Inhoff & Rayner, 1986; Rayner & Duffy, 1986) and the presence of orthographic neighbors (Andrews, 1989, 1997; Carreiras et al., 1997; Pollatsek et al., 1999). In addition, contextual variables such as a word’s predictability (Balota et al., 1985; Ehrlich & Rayner, 1981; Rayner & Well, 1996) and plausibility (Rayner et al., 2004; Warren & McConnell, 2007) also influence processing. In the present study, we manipulated the orthographic neighborhood and the predictability of target words in Chinese sentences, in two eye-movement experiments, to assess whether orthographic neighborhood effects are modulated by predictability. The reduction or elimination of neighborhood effects when a target word is predictable would suggest that a predictable target word is preactivated to such an extent that the word’s neighbors do not effectively compete for selection. Although effects of both orthographic neighborhood and predictability have been demonstrated numerous times, we know of no studies that have examined these variables simultaneously, in either alphabetic orthographies or in character-based orthographies (with the exception of a very recent study focusing on readers with central visual field deficits; Sauvan et al., 2020). Such studies may provide important insight into how predictability influences lexical processing.

The important role of prediction in lexical processing has been repeatedly demonstrated (Balota et al., 1985; Ehrlich & Rayner, 1981; Rayner & Well, 1996) and is captured in models of eye movements in reading such as E-Z Reader (Reichle et al., 2009). Current accounts of predictability effects in sentence reading (e.g., Luke & Christianson, 2016; Staub, 2015) propose that a predictable word is preactivated by the sentence context, approximately in proportion to its cloze probability, and that such pre-activation facilitates early stages of lexical processing. One important source of evidence in favor of an early locus for predictability effects is that this variable consistently influences the very earliest eye-movement measures (i.e., word skipping and first fixation duration; see Staub, 2015, for a review).

However, several empirical results present potential challenges for this view. For example, the predictability effect on early eye movement measures in reading is usually found to be additive with, rather than interactive with, the effect of word frequency (Ashby et al., 2005; Hand et al., 2010; Kretzschmar et al., 2015; Rayner et al., 2004; cf. Sereno et al., 2018). In addition, Staub (2020) has recently shown that predictability demonstrates only very weak, if any, interaction with visual contrast and font difficulty in the earliest eye-movement measures. These findings are arguably surprising on the view that predictability influences early processing stages by means of preactivation, given that these stages are, presumably, also influenced by factors such as frequency and contrast.

The current study further explored the nature of predictability effects on lexical processing by examining how the effect of predictability interacts with orthographic neighborhood effects in Chinese reading. Chinese has a nonalphabetic writing system in which each word is composed of one or more characters; each character is a morpheme that makes a contribution to word meaning. A significant feature of the Chinese writing system is that there is no explicit indication of word boundaries, which causes Chinese readers to rely on semantic contextual information to segment words (Huang et al., 2021) and process words (Braze & Gong, 2017).

Due to the difference in scripts, orthographic neighbors are also defined differently between alphabetic languages and Chinese. In alphabetic languages, a word’s orthographic neighbors are the words that can be created by changing one letter of the stimulus word while preserving the letters at other positions (Coltheart et al., 1977). In Chinese, however, orthographic neighbors can be defined in various ways, none of which are precise analogues of the notion of “neighbor” in alphabetic scripts. On the character level, characters sharing the same radical (Q. L. Li et al., 2011; Wang et al., 2014; Wu & Chen, 2003; F. L. Yang & Wu, 2014) or sharing around two thirds of strokes (Dong et al., 2015) may be defined as orthographic neighbors. Furthermore, characters’ orthographic neighbors can also be defined visually, based on the degree of pixel overlap, specifically Levenshtein distance (Sun et al., 2018). On the word level, orthographic neighbors may be defined as words that only have one constituent character difference while sharing the other characters in each position (Huang et al., 2006; Tsai et al., 2006). Because words are clearly critical processing units in Chinese sentence reading (X. Li & Pollatsek, 2020; J. M. Yang et al., 2009; Zhou & Li, 2021), and because predictability is usually regarded as a property of words, in the current research we focused on neighborhood effects at the word level. Nevertheless, it is important to recognize the important differences between neighbors in Chinese and in alphabetic languages; in particular, in the former case there is likely to be meaning as well as orthographic overlap, due to the fact that neighbors share a morpheme.

Previous studies have found that in Chinese reading, the recognition of a word is affected by its orthographic neighbors. The recognition of a word is facilitated when it has many neighbors compared with few (the facilitatory Neighborhood Size effect, henceforth the NS effect; Li et al., 2015; Tsai et al., 2006). Also, words with higher frequency neighbors are harder to process than words without higher frequency neighbors (the inhibitory Neighbor Frequency effect, henceforth the NF effect; Huang et al., 2006; Li et al., 2017). Though as we have noted, the notion of orthographic neighbor in Chinese is not closely analogous to the notion of neighbor in alphabetic scripts, it is also important to note that both facilitatory NS effects (Andrews, 1997; Sears et al., 1995; Yates et al., 2008; cf. Pollatsek et al., 1999) and inhibitory NF effects (Acha & Perea, 2008; Grainger, 1990; Grainger & Jacobs, 1996; Grainger & Segui, 1990; Perea & Pollatsek, 1998) have been observed in alphabetic languages.

Both the NS effect and the NF effect in Chinese can potentially be explained by the Chinese Reading Model (henceforth CRM; Li & Pollatsek, 2020). Based on the Interactive Activation Model (McClelland & Rumelhart, 1981; Rumelhart & McClelland, 1982), the CRM proposes that Chinese words are connected to position-specific characters through excitatory or inhibitory connections, while connected to each other through inhibitory connections. Visual input activates the matching position-specific characters, which then activate the corresponding words. To be specific, all the characters in the perceptual span are activated, and all the possible words formed by these characters (i.e., orthographic neighbors) are activated and compete with each other for recognition. The representation of each candidate word is continuously excited by the visual input and inhibited by orthographic neighbors. Once one of these candidates wins the competition, the word is identified/recognized. In this process, both NS (how many neighbors a word has) and NF (whether or not a word has higher frequency neighbors) affect lexical identification. The facilitatory NS effect occurs when the activation from a large number of words feeds back to facilitate character recognition, while the inhibitory NF effect occurs when the activation of a high frequency neighbor inhibits activation of the target word. From this perspective, word-level orthographic neighbors have an important role in lexical processing in Chinese reading. Therefore, the current research could offer important insights about the mechanisms of lexical processing in Chinese sentence reading by investigating how orthographic neighbor effects are modulated by predictability.

In the current study, we explored whether predictability reduces or even eliminates the effect of a word’s neighborhood in Chinese reading. To be specific, will the preactivation of target words that are highly predictable based on their prior context be strong enough to eliminate the activation of neighbors? Or will neighbors still be activated and having a detectable effect on the word recognition process even with preactivated target words? If the preactivation of a highly predictable word is strong enough to prevent the activation of neighbors (due to lateral inhibitory connections at the word level), neighbors would exert little or no influence during the word recognition process. Thus, there would no longer be a facilitatory NS effect, nor an inhibitory NF effect. Such a pattern would provide clear evidence that bottom-up activation of neighbors can be squashed by context in Chinese reading. On the other hand, it is also possible that predictability does not influence neighbor effects. Even though a word is preactivated, its neighbors may still be activated and compete with the target for recognition. In this case, neighbor effects are expected in both high- and low-predictability conditions. Such a pattern would suggest that contextual information has limited effects on bottom-up activation of neighbors.

We conducted two eye-tracking experiments to test these two possibilities. In Experiment 1, the target word did not have a higher frequency neighbor, and we explored the interaction between NS and predictability. In Experiment 2, NS did not differ between sets of target words, and we explored the interaction between NF and predictability.

Experiment 1

In this experiment, we explored whether the NS effect in Chinese is modulated by predictability in sentence reading. Importantly, and in contrast to Experiment 2, all target words were the highest frequency member in their neighbor groups.

Participants

Forty-two students (30 females, age range 18-40 years) from universities around the Institute of Psychology, Chinese Academy of Sciences participated in the experiment. The number of participants was determined based on the method proposed by Westfall et al. (2014). A pilot study with the same material and procedure reported below was conducted on 20 participants. Their total reading times on the target words in the low-predictability condition were analyzed using linear mixed-effect models focusing on the effect of NS, since previous studies found reliable facilitatory NS effects in isolated word recognition and in neutral sentence context in Chinese (Tsai et al., 2006). The powerSim function (Judd et al., 2012) was used to explore how power varies as a function of participant number. Results showed that when the participant number was 40, the power reached .85. All participants were native speakers of Mandarin Chinese and had normal or corrected-to-normal vision.

Materials

We focus on two-character words’ orthographic neighborhood effects, due to the fact that 72% of Chinese words are two-character words (according to the Lexicon of Common Words in Contemporary Chinese Research Team, 2008). For example, “花园” (/hua1yuan2/, garden) and “花店” (/hua1dian4/, flower store) both have the constituent “花” (/hua1/, flower) in the first position. These two words can be called first character orthographic neighbors. On the other hand, “花园” (/hua1yuan2/, garden) and “公园” (/gong1yuan2/, park), which share the second constituent “园” (an area of land for a certain purpose), are second character orthographic neighbors. Because the leading character has a dominant influence on word recognition in Chinese (Huang et al., 2006), we focus on first character orthographic neighbors.

There were two groups of target words: words with large NS (the number of neighbors ranged from 10 to 91, M = 28.25), and words with small NS (the number of neighbors ranged from 1 to 6, M = 3.64). Each group contained 28 words. All of these target words do not have any higher frequency neighbor (they are the highest frequency words in their neighbor groups). The descriptive features of these words are summarized in Table 1. Pairs of sentences were composed so that in one sentence of the pair, a large NS target word was predictable and a small NS target word was unpredictable, and in the other sentence, this was reversed; see Table 2 for an example. Target words’ predictability in each condition was estimated in a cloze task (Taylor, 1953). Twenty university students participated in this task, in which each sentence was presented to participants up to the word before the target word. Participants were asked to write down the first word that came to their mind on reading the preceding fragment. The proportions of target words given by participants were calculated as their cloze probability. In the high-predictability condition all target words had cloze probability greater than .80, while in the low-predictability condition target words’ cloze probability was lower than .10. The cloze probability of target words did not differ significantly between the large NS and small NS groups, as shown in Table 1.

Table 1 The descriptive features of target words in Experiment 1
Table 2 Material Examples in Experiment 1

In sum, there were four conditions: words with large NS and high predictability, words with large NS and low predictability, words with small NS and high predictability, and words with small NS and low predictability. Each condition contained 28 sentences. All of these sentences (112 in total) were normed by 10 native Mandarin speakers on a 7-point scale to ensure that they were grammatical and acceptable. The norming scores of sentences with target words associated with large or small NS were matched within high- and low-predictability conditions, as shown in Table 1. The lengths of the sentences varied from 15 to 28 characters. These 112 experimental sentences were pseudorandomly divided into two lists to ensure that each list contained all the target words, but only one version (either high or low predictability) of the sentences for each target word. In addition, there were 30 fillers and five practice trials in each list.

Apparatus

Participants’ eye movements were recorded using an EyeLink 1000 eye tracker with a sampling rate of 1000 Hz. Each sentence was presented in one line in the middle of a 21-inch cathode ray tube (CRT) monitor. Participants were seated 57 cm away from the monitor. Following a 3-point horizontal calibration and validation, the maximum gaze-position error was less than 0.5°. Eye movements were recorded from the right eye, but viewing was binocular.

Procedure

Participants were tested individually. The experiment started with a brief instruction and a standard horizontal three-point grid calibration and validation. Then, five practice trials were run to ensure that the participants understood the task and were familiar with the apparatus. Critical experimental trials were run after the practice trials. Participants were required to read sentences on the screen silently. At the beginning of each trial, a drift check was conducted. Each sentence appeared after participants fixated on a character-sized box at the location of the first character of each sentence. After finishing reading one sentence, participants were asked to press a button so that the original sentence disappeared and was replaced by a meaning-related question, to which the participant then responded by button press. The entire experiment lasted 30 mins.

Results

Four eye movement measures on the target words were analyzed. First fixation duration (FFD) is the duration of the first fixation in a region. Gaze duration (GD) is the sum of the fixation durations before the eyes first move out of a region. Total reading time (TT) is the sum of the durations of all fixations in a region. Skipping rate (SKR) is the probability of a region being skipped.

The mean response accuracy was 93%. We chose only the correctly answered trials for the final analysis. Fixations shorter than 80 ms or longer than 1,000 ms were removed. Linear mixed-effects (LME) models (Baayen et al., 2008) were constructed for log-transformed FFD, GD, and TT (because the distributions of these measures were not normal, especially for TT, which was strongly right-skewed), and a generalized LME model was used on the SKR. Target words’ NS (large vs. small) and predictability (high vs. low) as well as their interaction were treated as fixed effects, specifying participants and items as crossed random effects.Footnote 1 The target word was defined as the experimental item; thus, while predictability is a within-item manipulation, NS is between-item. Fixed effects in these models were centered, with the factor levels coded as −.5 and .5. The random effects structure of the models was trimmed down from the maximal structure (i.e., random intercepts and slopes for all fixed effects, for both subjects and items) when the models failed to converge based on Bates et al. (2015). The statistical procedure was conducted using the lmer program (Bates et al., 2011) and lmerTest program (Kuznetsova et al., 2017) in R (version 3.5.1; R Core Team 2018).

As shown in Table 3, the main effect of predictability was significant in FFD and TT, and marginally significant in GD and SKR (p = .05 and .07, respectively; see Table 4 for details). Words with low predictability had longer fixations (FFD, GD, and TT) and lower SKR than words with high predictability. The main effect of NS was significant in TT and marginally significant in GD and SKR (p = .08 and .05 respectively). Words with large NS had shorter fixations and higher SKR than words with small NS. The interaction between NS and predictability was significant in TT.

Table 3 Summary of the Mean and SE (in parentheses) for each index in each condition in Experiment 1
Table 4 The results of LME/GLME models in Experiment 1

Paired comparisons (conducted using emmeans package; Lenth, 2018) of TT indicated that a significant facilitatory NS effect was observed in the low-predictability condition (t = −4.01, p <.005) but not in the high-predictability condition (t = 0.44, p = .66). A Bayes factor analysis was further conducted on log-transformed TT to determine the strength of evidence for the null/significant NS effect in the high-/low-predictability conditions. The Bayes factors were computed using the lmBF function from the BayesFactor package for R (Morey et al., 2015). Following Abbott and Staub (2015) and Staub and Goddard (2019), in all analyses, we assumed the default Cauchy prior for effect size. In each condition, Bayes Factors were calculated to compare a model that included subject and item intercepts and the NS effect to a null model that included only subject and item intercepts. This analysis favored the null model in the high-predictability condition by a factor of 0.131, which delivered evidence against the NS effect. In the low-predictability condition, the Bayes factor was 31.118, offering supportive evidence for the observed NS effect.

Experiment 2

In this experiment, we aimed to explore whether the NF effect is modulated by predictability in sentence reading. In this experiment, all target words had a large NS.

Participants

Sixty-two students (43 females, age range 18–40 years) from universities around the Institute of Psychology, Chinese Academy of Sciences participated in the experiment. To find out how many participants were needed in Experiment 2, a pilot study with the same material and procedure reported below was conducted on 20 participants. Their TT on the target words in the low-predictability condition was analyzed using linear mixed-effect models focusing on the effect of NF, since previous studies found reliable inhibitory NF effects in isolated word recognition in Chinese (Huang et al., 2006; M. F. Li et al., 2017). Results showed that when the participant number was 60, the power reached .80. All participants were native speakers of Mandarin Chinese and had normal or corrected-to-normal vision. None of them participated in Experiment 1. Two participants were excluded from the data analysis due to their low accuracy on comprehension questions ( <75%).

Materials

There were two groups of target words: words with higher frequency neighbors (HFN) and words without HFN. Each group contained 27 words. All of these words have a large NS. The descriptive features of these words are summarized in Table 5. For each target word, two sentences were composed so that the target word was highly predictable in the high-predictability condition and was not predictable at all in the low-predictability condition. Target words’ predictability in each condition was estimated in a cloze task which was conducted on 20 participants in a similar way to Experiment 1. In the high-predictability condition, all target words had cloze probability greater than .70, while in the low-predictability condition target words’ cloze probability was lower than .15. The cloze probability of target words in two groups was matched, as shown in Table 5.

Table 5 The descriptive features of target words in Experiment 2

It should be noted that, instead of using the same sentence frames for high- and low-predictability words as in Experiment 1 (as illustrated in Table 2), we used different sentence frames for high- and low-predictability words in Experiment 2 (as illustrated in Table 6). As pointed out by Rayner et al. (2004), when a design as in Experiment 1 is used, the sentence frames in the high-predictability condition are always highly constraining toward one specific word, which means that in the low-predictability condition, it is not only the case that the target word is unpredictable, but that some other word is highly predictable. This arrangement could conceivably result in longer than normal fixation times on the unpredictable target words, since readers might have been expecting one word but encountered another. This is suggested by the norming scores of the sentences in Experiment 1: For words in both large and small NS groups, the sentences in the high-predictability condition were more natural and acceptable than in the low-predictability condition, ps <.001. Although previous studies found that this counterpredictability may have little effect on lexical processing as long as the unpredictable words were plausible (Frisson et al., 2017), we wanted to further explore the potential interaction between predictability and neighbor effects by matching the acceptability of sentences in high- and low-predictability conditions. Thus, in Experiment 2, different sentence frames were used for high- and low-predictability words. Similar to Experiment 1, all the sentences were normed by 10 native Mandarin speakers on a 7-point scale to measure their acceptability. The norming scores of sentences with target words that with or without HFN were matched within and cross high- and low-predictability conditions, as shown in Table 5.

Table 6 Material examples in Experiment 2

In sum, there were 108 experimental sentences (54 words × 2 versions of sentences) which were pseudorandomly divided into two lists to ensure that each list contained all the target words, but only one version (either high or low predictability) of the sentences for each target word. In addition, there were 58 fillers and five practice trials in each list.

Apparatus

Same as in Experiment 1.

Procedure

Same as in Experiment 1.

Results

Similar to Experiment 1, four eye-movement indexes (FFD, GD, TT, and SKR) on the target words were analyzed. The mean response accuracy for the 60 participants was 96%. The methods of data cleaning and modelling are the same as in Experiment 2, except substituting NS with NF.

As shown in Table 7, the main effect of predictability was significant for all eye-movement measures. Words with low predictability had longer fixations (FFD, GD, and TT) and lower SKR than words with high predictability (Table 8). The main effect of NF was significant in TT, with words with HFN having longer TT than words without HFN. The interaction between NF and predictability was significant in TT and marginally significant in GD (p = .09).

Table 7 Summary of the Mean and SE (in parentheses) for each index in each condition in Experiment 2
Table 8 The Results of LME/GLME models in Experiment 2

A paired comparison on TT indicated that a significant inhibitory NF effect was observed in the low-predictability condition (t = 4.75, p <.001) but not in the high-predictability condition (t = −0.25, p = .80). A Bayes factor analysis was further conducted to determine the strength of evidence for the null/significant NF effect in the high-/low-predictability condition. This analysis favored the null model in the high-predictability condition by a factor of 0.118, which delivered evidence against the NF effect. In the low-predictability condition, the Bayes factor was 256.68, offering supportive evidence for the observed NF effect.

General discussion

We conducted two eye-tracking experiments to investigate whether predictability modulates neighbor effects in Chinese reading. Target words’ predictability and NS or NF were manipulated. In both experiments, we observed a significant main effect of predictability: compared with words with low predictability, words with high predictability were associated with shorter processing times (FFD, GD, and TT) and higher SKR. This predictability benefit effect is consistent with previous studies in English (Rayner & Well, 1996) and Chinese (Rayner et al., 2005). In addition, we observed a facilitatory NS effect in TT in Experiment 1 when controlling the number of HFN, which is consistent with previous studies in alphabetic languages (Carreiras et al., 1997; Pollatsek et al., 1999; Sears et al., 1995) and Chinese (Huang et al., 2006). In Experiment 2, we observed an inhibitory NF effect in TT, which is consistent with previous studies in alphabetic languages (Perea & Pollatsek, 1998) and Chinese (Huang et al., 2006; Li et al., 2017).

More importantly, we observed a clear modulation of neighborhood effects by predictability. In the TT measure, in the low-predictability condition we observed a significant facilitatory NS effect in Experiment 1, and a significant inhibitory NF effect in Experiment 2, but neither of these effects were in evidence in the high-predictability condition. Furthermore, the null and significant effects of the neighborhood manipulations in the high- and low-predictability conditions respectively, in both experiments, were confirmed by Bayes Factor analysis.

The reliable interaction between predictability and neighbor effects indicates that in Chinese sentence reading, neighbor effects (both NS and NF effects) can be modulated by the word’s predictability based on prior context. Specially, when context strongly constrains towards the target word, the effects from neighbors can be squashed. We assume that in the high-predictability conditions, target words are preactivated to a level that results in inhibition of the activation of neighbors, leading to little or no effect from the neighbors. On the other hand, in the low-predictability condition, target words are not preactivated. Thus, neighbors that are activated by the visual input strengthens the activation of sublexical (character) units, ultimately facilitating recognition of the target (the facilitatory NS effect), or laterally inhibiting the activation of the target if the neighbors are more frequent (the inhibitory NF effect).

In the current research, we found neighbor effects on TT in both experiments, which were qualified by an interaction with predictability. The main effect of NS or NF did not reach significance in the earlier measures. However, it is important to note that in both experiments, all measures showed trends in the expected direction in the low-predictability conditions. Indeed, paired comparisons for these conditions reveal that the facilitatory NS effect was marginally significant on FFD (t = −1.74, p = .09) and significant on GD (t = −2.28, p = .03); the inhibitory NF effect was significant on GD (t = 2.24, p = .03). On the other hand, paired comparisons for the high-predictability conditions did not reveal any significant neighbor effect on either FFD or GD (ps <.6). Thus, we think that drawing a clear distinction between the presence of neighbor effects on TT and their absence in earlier measures would not actually be warranted. We do not see a reason to suppose that neighbor effects are exclusively “late” effects. It should be noted that as described in the result sections, the interactions between NF and predictability were not significant for FFD and GD (except a trend of interaction of GD in Experiment 2). Thus, the claims here are accompanied by a caveat, and the trends here need to be confirmed in future empirical research.

Despite the fact that we found similar patterns of neighborhood effects in Chinese and alphabetic languages, it should be noted that it remains unclear whether the predictability-by-neighborhood interactions observed in the current study would be obtained in alphabetic languages. There are two reasons for this uncertainty. First, compared to readers in alphabetic languages, Chinese readers have a stronger reliance on contextual semantic information in lexical processing and word segmentation due to the lack of word boundaries. Thus, prediction generated from prior context may affect lexical processing to a different extent in Chinese and alphabetic languages. Second, orthographic neighbors are defined differently in Chinese and alphabetic languages. The processing effects of neighbors sharing a character in a certain position may not be the same as the effects of neighbors differing on a position-specific letter. Chinese orthographic neighbors on the word level (the definition used in the current research), may be seen as analogous to English words like sunflower, sunbird, sunblock, and sunbathe.Footnote 2 In addition, as mentioned above, there are many different ways to define orthographic neighbors in Chinese. It remains unclear whether the observed predictability-by-neighborhood interactions would hold when using other operational definitions of neighbors. Further research is needed to tackle these questions.

In conclusion, in the current research, we conducted two eye-tracking experiments to investigate whether neighbor effects can be modulated by predictability in Chinese reading. A clear interaction between predictability and neighbor effects was observed in two experiments, with both the facilitatory NS effect and the inhibitory NF effect being eliminated when the target was highly predictable. These findings indicate that in Chinese sentence reading, predictability and orthographic neighbors jointly affect lexical processing, and neighbor effects can be eliminated by high predictability.