A central question for any model of visual word identification is the representation of the position at which letters are encoded. Consequently, all recently proposed models offer an explanation of this process in the course of orthographic processing (e.g., the LTRS model: Adelman, 2011; spatial-coding model: Davis, 2010; overlap model: Gomez, Ratcliff, & Perea, 2008; overlap open-bigram model: Grainger, Granier, Farioli, Van Assche, & van Heuven, 2006; Bayesian reader model: Norris, Kinoshita, & van Casteren, 2010; and SERIOL model: Whitney, 2001). Not surprisingly, most of the evidence that those models have tried to account for has been obtained using readers of languages with the Roman alphabet.

Expanding the empirical record to include the process of letter position coding in non-Roman alphabets is a worthwhile endeavor, beyond the goals of inclusion and diversity, because it encourages theory development; namely, the orthographic features of different writing systems could shape the process of letter position coding. Hence, the research agenda while studying different writing systems is to explore both the differences (the changes in the processes as a function of writing systems) and the invariances (the parts of the processes that stay the same across writing systems) in letter position coding (see Wiley, Wilson, & Rapp, 2016, for an examination of the similarities/differences in letter identity coding between the Arabic vs. Roman alphabets).

In this article, we examine whether the orthographic-specific characteristics of the Thai writing system shape the process of letter position coding. Prior research using the Roman alphabet has offered remarkably similar patterns of letter position coding across a variety of languages (e.g., across the Germanic [English: Perea & Lupker, 2003], Romance [Spanish: Perea & Lupker, 2004], pre-Indo-European [Basque: Perea & Carreiras, 2006], Semitic [Maltese: Perea, Gatt, Moret-Tatay, & Fabri, 2012], and Uralic [Hungarian: Tóth & Csépe, 2016] families). This could be taken to suggest that letter position coding processes are to some degree language-independent, at least in alphabetic languages using the Roman alphabet.Footnote 1

The Thai writing system has an alphabetic orthography, but it has two idiosyncratic features that make it particularly interesting to compare with the Roman alphabet in relation to letter position encoding. First, Thai has some commonly used vowels (i.e., เ /e:/, แ /ɛ:/, โ /o:/, ไ /aj/, ใ /aj/) that are written before the consonant but articulated after the consonant in speech (e.g., the word แบน “flat” is not pronounced as /ɛ:bn/ but as /bɛ:n/—i.e., the written vowel แ is misaligned relative to the spoken vowel /ɛ:/); however, not all vowels follow this pattern, since other vowels are pronounced in the order that they are written (e.g., the word บาท ‘Baht’ is pronounced /ba:t/—i.e., the written vowel า is aligned; see Winskel & Iemwanthong, 2010). Therefore, letter position coding in Thai needs to be flexible enough to deal with the processing of words containing either aligned or misaligned vowels. Second, the Thai script does not have interword spaces (e.g., the sentence นักศึกษาไปซื้อตะไคร้มาจากตลาด “The student goes to the market to buy lemon grass”). As Winskel, Perea, and Ratitamkul (2012) indicated, this implies that during sentence reading “there is a degree of ambiguity in relation to which word a given letter belongs to” (p. 1523) (e.g., the boundaries of the word ตะไคร้ “lemon grass” are difficult to discern in the sentence above).

Our tool for comparing letter position coding in the Thai and Roman writing systems is a mathematical/computational model: the overlap model (Gomez et al., 2008). The basic assumption of this model is that perceptual uncertainty is associated with locating objects in space (i.e., letters in words), so the locations of letters in a letter string should be considered distributions along a dimension rather than exact points (see Logan, 1996). Each letter position has a different degree of “perceptual uncertainty,” which is treated as a free parameter in the model (s parameter)—other models of visual word identification have incorporated similar mechanisms (e.g., Adelman, 2011; Davis, 2010; Grainger et al., 2006; Norris et al., 2010). As was shown by Gomez et al., the overlap model can account for letter position encoding in the Roman alphabet across a variety of manipulations: letter replacement, letter transposition, letter addition/deletion, and letter migration.

One of the critical benchmarks in the experiments and fits reported by Gomez et al. (2008) is that the overlap model captured the first letter as being more important for letter position coding than were interior letters in the Roman alphabet. According to the model, this occurs because the degree of perceptual noise in the internal letter positions is larger than the perceptual noise in the first letter position (see Gomez et al., 2008, Fig. 14). In other words, jugde resembles judge much more closely than ujdge—note that in the Roman alphabet, transposed-letter effects tend not to occur when the initial letter is involved (see White, Johnson, Liversedge, & Rayner, 2008). The presence of some misaligned vowels in Thai words and the absence of interword spaces may lead to a more flexible process of letter position coding than occurs in languages that employ the Roman alphabet. Indeed, some evidence from Thai supports the view that letter position coding is quite flexible, even in relation to the initial letter position (see Perea, Winskel, & Ratitamkul, 2012, and Winskel et al., 2012, for evidence from masked priming and sentence reading).

To compare how letter position coding is attained in Thai in comparison with English (i.e., a language that uses the Roman alphabet), we conducted an experiment that paralleled Experiment 3 in Gomez et al. (2008). In this experiment they examined how letter positions are encoded in English across 23 conditions that involved letter replacements, letter transpositions, and letter migrations. The present experiment with Thai words and Thai participants allowed us to compare the fits obtained by Gomez et al. in English with the fits obtained in the present Thai experiment.

Therefore, on the basis of previous research and the characteristics of Thai, we explored two questions: (1) Would the position uncertainty model (the overlap model) be able to account for the data obtained with Thai readers? (2) If the model could account for the data, would the configuration of the parameter values nonetheless be related to some of the idiosyncrasies of the Thai writing system? The present experiment involved 23 conditions that included (a) a single-letter replacement condition, (b) an adjacent-letter transposition condition, (c) a letter migration condition, and (d) an orthographic control for the letter migration condition (see Table 1 for further details). These are the same conditions as in Gomez et al.’s (2008) Experiment 3, in which they found an abrupt increase in perceptual uncertainty from the initial position to the other letter positions.

Table 1 Types of stimuli in the experiment

Method

Participants

Twenty students and staff from Chulalongkorn University, Bangkok, participated in the experiment. All of them were native speakers of Thai and had normal or corrected-to-normal vision.

Materials

In this experiment, sets of similar stimuli were formed by rearranging the letters in Thai pseudowords. The creation of the stimuli was parallel to that employed by Gomez et al. (2008, Exp. 1) with the Roman alphabet in English readers. We created 1,196 pseudowords by substituting one letter within five-letter Thai words obtained from the Thai National Corpus (Aroonmanakun, 2007). The base words included both aligned- and misaligned-vowel words. Tone diacritics were not included in the experimental pseudoword stimuli. Each target stimulus (or any of the items generated from it) was presented only once. During the generation of items, vowels were always substituted for vowels, and consonants were always substituted for consonants in the replacement conditions. There were 23 conditions altogether, which can be divided into four categories: (a) a single-letter replacement condition, (b) an adjacent-letter transposition condition, (c) a letter migration condition, and (d) a letter migration + replacement condition (refer to Table 1). Overall, the experimental session was composed of 13 blocks of 92 trials each (1,196 trials).

Procedure

Participants were tested individually in a quiet room at the Center for Research on Speech and Language Processing. Stimuli were presented in the 24-point Courier Proportional Thai font. The DMDX software (Forster & Forster, 2003) was used to display the stimuli and record the participants’ responses. We used a two-alternative forced choice paradigm that mimicked the procedure used by Gomez et al. (2008). On each trial, a fixation point was initially presented on the computer screen for 500 ms, and then a target stimulus was presented for 83 ms in the center of the screen, which was subsequently masked with segments of Thai letters. Note that Gomez et al. employed a duration of 50 ms for the target stimulus; however, a pilot study in Thai revealed that accuracy was substantially lower than in the Gomez et al. experiment, so the target duration was increased to 83 ms in order to produce more comparable results for the two scripts.Footnote 2 The participant had to choose between two alternatives that were presented simultaneously below the mask (one to the right and the other to the left of the mask, as can be seen in Gomez et al.’s, 2008, Fig. 2). The alternatives were the pseudoword and a letter string from one of the four conditions outlined below. Participants were asked to indicate which alternative was the letter string that had been presented briefly. Each alternative was the correct response on an equal number of trials. The order of trials was randomized for each participant. Prior to the experimental trials, 14 practice trials were given. The experiment took approximately 1 h to complete.

Results

The empirical findings are described in this section, and the model fits will be described and evaluated in the Model Fits subsection.Footnote 3 Given the two-alternative forced choice procedure presented in this article, our analysis focused on accuracy rates (the RT analyses are available in the online appendix). The average accuracy rates for each condition are shown in Table 2 in the columns labeled Thai. The results were rather straightforward: The transposition conditions were more difficult than the letter replacement conditions. To examine the evidence for and against differences among the different conditions within this experiment, we utilized Bayes factors. Bayes factors are ratios of the probabilities of the data given two competing models (H0 vs. H1, in this case; see Rouder, Morey, Speckman, & Province, 2012).

Table 2 Proportions of correct responses for the present experiment, as well as the results from Gomez et al. (2008) in English, for comparison purposes

As might be expected, the overall the accuracy rates were higher for replacement conditions than for transposition conditions, and when comparing replacements versus transpositions at specific letter positions, the comparisons that provide the most support for the H1 are replacement of the 1st letter versus transposition of Letters 1 and 2 (t = 3.89, BF10 = 36.9) and replacement of the 4th letter versus transposition of Letters 4 and 5 (t = 3.46, BF10 = 16).Footnote 4 Table 3 shows the BF10s for other comparisons.

Table 3 Comparisons among conditions in the experiment

In addition, we compared the accuracies for the present experiment to those of the parallel experiment in English. Notably, the conditions with the greatest evidence for a cross-language difference are those that involve the second letter: transposition of 2 and 3 (t = 3.16, BF10 = 12.55), migration of 2 to 4 (t = 2.791, BF10 = 5.81), migration of 4 to 2 (t = 2.761, BF10 = 5.47), migration of 5 to 2 (t = 4.459, BF10 = 320.67), and migration of 5 to 2 (t = 2.488, BF10 = 3.27). For the rest of the conditions, the Bayes factors are less than 1, meaning that there is in fact support for the null model (see Table 3).

To summarize, the data show some relevant qualitative patterns: (a) There is a transposed-letter effect, since the transposed-letter conditions are more difficult than the single-replacement conditions; (b) although it is numerically smaller than in the English data, there is also an advantage for first-letter manipulations, since the conditions in which the first letter are manipulated are easier than the conditions in which the first letter was not manipulated; and finally, (c) performance is quite similar in this experiment versus its English counterpart, except for conditions in which the second letter was different between the target and the lure, in which case the performance of Thai readers was superior to that of English readers.

Model fits

The main objective of the present experiment in Thai was not to establish that transpositions are harder to detect than replacements. Instead, the goal was to assess whether the overlap model could account for the differences and similarities in performance across the 23 conditions in our experiment. The model assumes that in our experimental paradigm the briefly presented target would be compared to the two alternatives; this comparison yields a measurement of orthographic similarity, which is computed from the overlap between the flashed string and the alternatives. The overlap is calculated by the model as follows: For each letter position, the area under the curve of the letter in the slot within the target string is multiplied by the area of the same letter in the corresponding slot within the study string, and these products are summed over all the slots, as described by Eq. 1 in Gomez et al. (2008):

$$ overlap = {\displaystyle \sum_{i = 1}^L}{\displaystyle \underset{i-\frac{1}{2}}{\overset{i + \frac{1}{2}}{\int }}}{f}_1(x) d x{\displaystyle {\int}_{i-\frac{1}{2}}^{i + \frac{1}{2}}{\displaystyle \int }{f}_2(x) d x,} $$

where i is the center of the position slot, f 1(x) is the distribution of the first stimulus centered on i, f 2(x) is the distribution of the second stimulus, and x is the position along the word order dimension (horizontal axis).

Given the use of two alternatives in our experiment, the model computes the two overlap measurements (i.e., the target stimulus with the correct alternative [overlap t] and the target stimulus with the foil [overlap f]). The overlaps between the target stimulus and the two alternatives are transformed into correct-response proportions using a power function (Eq. 2 in Gomez et al., 2008):

$$ p\left(\mathrm{correct}\right) = \frac{{\left( overla{p}_t\right)}^a}{{\left( overla{p}_t\right)}^a + {\left( overla{p}_f\right)}^a}, $$

where a is a scaling parameter greater than 1; this allows small differences in overlap to produce larger effects in accuracy.

As in the Gomez et al. (2008) study, to fit the overlap model to the data, we utilized the general-purpose optimization method in R based on the Nelder–Mead algorithm (Nocedal & Wright, 1999), which adjusts the values of the six parameters of the model. The data entered into the minimization routine were the response proportions for the 23 conditions. These data could be averages across participants, but they could also be the data for each individual participant. We performed calculations both ways: When comparing the model fits, we present the fits to the averaged data (as in Table 2 and Fig. 1). However, when performing statistical inferences—for example, when comparing the values of parameters for the present experiment against the parameter values in Gomez et al. (2008)—we use the parameters obtained from fitting the model to each participant’s data as our dependent variable. For these comparisons (see Table 4), we utilized Bayes factors, which revealed substantial evidence of a cross-language difference only for the second letter (with a larger s 2 value for English), and anecdotal evidence of a difference for the final letter position (with larger s 5 values for Thai).

Fig. 1
figure 1

Values of the s parameter of the overlap model for each of the letter positions: s 1 to s 5. For comparison purposes, we plot the parameter values for the experiment in English from Gomez et al. (2008) as the “E” points, and the parameter values for the present experiment as the “T” points. Note that these are parameters obtained from fitting the model to the averaged data. The solid line represents the exponential growth-to-asymptote function that best fits the E points (s parameters for the experiment in English), whereas the thick dashed line represents a linear function that fits the T points (s parameters for the experiment in Thai). The thin dotted line represents the exponential growth-to-asymptote function that best fits the T points (which misses the points badly). Refits of the empirical data with the simplified models can be seen in the online appendix: for English (Figs. A12 to A17) and for Thai (Figs. A4 to A9).

Table 4 Parameters of the overlap model for Thai, again with parallel data from Gomez et al. (2008) in English for comparison purposes

We can compare the parameters of the model for this experiment to the parameters in the parallel experiment with English-speaking participants. As can be observed in Fig. 1, there are qualitative differences between the parameters for the two languages. Namely, the parameters that describe the position uncertainty have higher values as the letter position increases in both languages; however, the growth in value is more pronounced in the English than in the Thai experiment.

Gomez et al. (2008) argued that the five s parameters could be described with a simple exponential growth-to-asymptote function over letter positions:

$$ {S}_i = d\left(1-{e}^{-\left( i-0.5\right)/ r}\right). $$

This function represents the idea that the value of the s parameter rises across letter positions (i) at a rate r until it reaches an asymptotic value d.

Importantly, the s parameters do not behave in this manner for the Thai experiment. Indeed, the s parameters for Thai rise in a linear manner. For comparison purposes, in Fig. 1 we show the parameters and the exponential growth to asymptote from Gomez et al.’s (2008) article, as well as both the linear and the exponential growth-to-asymptote fits for the s parameters from the present experiment with Thai. We compared the sums of squares of the residuals and found that the linear function had a much better fit (SS = 0.02) than the exponential growth function (SS = 0.07). Conversely, when we reanalyzed the Gomez et al. data, the exponential growth-to-asymptote function fared better (SS = 0.07, vs. SS = 13 for the linear function). Note that both the linear and exponential approaches to a limit have two parameters.

Discussion

The main aim of the present study was to uncover differences in the encoding of letter position due to the nature of the script in Thai (i.e., an unspaced orthographic system in which some of the letters may be misaligned). We used the overlap model as a medium to explore this question. The conclusions are straightforward: Whereas the general perceptual-uncertainty mechanisms might be the same in Thai and in English—as deduced by the good fits in both Thai and English—the specific characteristics of the languages do seem to modulate this mechanism. In both scripts, the degrees of perceptual uncertainty are not the same across all letter positions, with a growth in the value of the uncertainty parameter as a function of condition. Critically, the forms of the function are different for Thai and for English.

Although for English it can be described as an exponential growth to a limit, in Thai it seems to be a linear function. This means that the position encoding for the first letter is better than those for the rest of the positions in both languages, but not as dramatically different in Thai as in English. These results support the prediction that position uncertainty for initial letter position is somewhat greater in Thai than in English. Thus, these results also support the view that orthographic-specific characteristics of the Thai writing system play a role in shaping the process of letter position coding. Specifically, two differences between Thai and English writing might be at play: (1) the lack of interword spacing might make the first letter less salient for Thai readers, which might then make the decay of the position coding accuracy less pronounced than for English readers; and (2) the misaligned vowels might deemphasize the role of the first letter, since it is not necessarily the one that needs to articulated first during reading. We acknowledge that determining the relative importance of these two factors is beyond the scope of our work, and we do hope that these findings might result in further research with Thai readers.

Although the present study focused on the fits from the overlap model of letter position coding, we acknowledge that other models that have employed the principle of “perceptual uncertainty” to encode letter position can also capture the obtained effects (e.g., the spatial-coding model, overlap open-bigram model, Bayesian reader model, or LTRS model).

In sum, the present findings are consistent with well-studied phenomena related to the learned aspects of perception and attention. Indeed, previous research in our laboratory revealed that the degree of letter position coding during visual-word recognition is modulated by expertise in orthographic–lexical processing (Perea, Marcet, & Gomez, 2016). Although the task will be challenging, future directions of research may aim to uncover the possible trajectories of learning letter encoding (see Wiley et al., 2016, for evidence from letter identity coding).