Skip to main content
Log in

Effects of Syntactic Distance and Word Order on Language Processing: An Investigation Based on a Psycholinguistic Treebank of English

  • Original Research
  • Published:
Journal of Psycholinguistic Research Aims and scope Submit manuscript

Abstract

We conducted a broad-coverage investigation of the effects of syntactic distance and word order on language processing against a dependency-annotated reading time corpus of English. A combined method of quantitative syntax and psycholinguistic analyses was adopted to yield converging evidence. It was found that (i) head-initial structures allow greater structural complexity, i.e., larger head-dependent distance, than head-final structures in both language comprehension and production; (ii) within the capacity limit of working memory, syntactic distance is a positive predictor of reading time for a word with a preceding head, whereas a negative predictor of reading time for a word with a following head; and (iii) at the sentence level, syntactic distance is a significant predictor of sentence reading time. These results suggest that (i) different word orders may enjoy different processing mechanisms in terms of cognitive difficulty and processes, which can be explained by an incremental language parser; and (ii) in addition to distance, word order should also be considered as a factor affecting language processing, which is an important extension to distance-based language processing models. Taken as a whole, our study paves the way for corpus-based integration of quantitative linguistic and psycholinguistic methods into understanding language processing and its underlying cognitive mechanisms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig.1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. Demberg and Keller (2008) claimed that integration cost is not a sufficiently broad-coverage measure for naturally-occurring texts, because it only assigns difficulty to nouns and verbs. As assumed by the DLT, only the processing of nouns and verbs induces cognitive effort, but Demberg and Keller found proof suggesting the processing difficulty of other word categories, e.g., the auxiliaries, is larger than zero. This contradiction made them to conclude “It could be interesting to extend DLT in a way that makes it possible to also assign an integration cost to those word categories”.

  2. We did not choose the Universal Dependencies parses because its linguistic validity is being debated within the dependency grammar community, due mainly to its treatment of the content word as the head of the function word (e.g., Osborne & Gerdes, 2019).

  3. Note that DDM was investigated as a property of individual languages rather than different word orders within a language (e.g., Ferrer-i-Cancho et al., 2022; Gildea & Temperley, 2010). Given the aim of the current study, however, it is necessary to control for word orders, i.e., DDir.

  4. Although the fitting to power-law functions is good overall as suggested by R2, it gives a bad fit when DD = 1, as pointed out by one reviewer. Given that short and long dependencies are best fit by exponential and power-law models respectively (e.g., Ferrer-i-Cancho et al., 2022; Lu & Liu, 2016), the bad fit when DD = 1 may suggest that the distributions of DD is exponential with more than one regime. We did a re-fitting of our data, and found that DD distribution is best fit by an exponential function when DD ≤ 2 (head-final: y = 7863.1e−0.982x, head-initial: y = 2807.9e−0.452x, R2 > 0.9), and a power-law function when DD > 2 (head-final: y = 581.2x−1.547, head-initial: y = 690.09x−1.26, R2 > 0.9). Note that under each regime, head-final dependencies have a smaller regression coefficient than head-initial dependencies, which is consistent with the conclusions above.

  5. The probability of head-final or head-initial dependencies at a given DD = the frequency of head-final or head-initial dependencies at a given DD / the frequency of total dependencies at a given DD. For example, when DD = 1, there are 4,731 dependencies, among which 2,945 are head-final and the other 1,786 are head-initial. Thus, the proportions of head-final and head-final dependencies when DD = 1 are 62.25% (= 2,946 / 4,731) and 37.75% (= 1,785 / 4,731), respectively. This method helps one obtain a quick overview of the distributions of word orders at different syntactic complexities (DDs).

  6. Note that we define DD as a word’s linear distance to its head (see “Dependency distance and dependency direction” section). By this definition, only words that have heads are assigned DD, excluding non-headed words, i.e., sentence roots.

References

  • Altmann-Fitter. (2013). Altmann-Fitter user guide. Available at https://www.ram-verlag.eu/software-neu/software/. Accessed 5 June 2019.

  • Baddeley, A. D., & Hitch, G. (1974). Working memory. In G. A. Bower (Ed.), Recent advances in learning and motivation (Vol. 8, pp. 47–89). Academic Press.

    Google Scholar 

  • Balling, L. W., & Kizach, J. (2017). Effects of surprisal and locality on danish sentence processing: An eye-tracking investigation. Journal of Psycholinguistic Research, 46, 1119–1136.

    Article  PubMed  Google Scholar 

  • Best, K.-H. (2006). Quantitative Linguistik. Eine Annäherung (Göttinger Linguistische Abhandlungen 3). 3., stark überarbeitete undergänzte Auflage. Peust & Gutschmidt.

  • Brown, J. (1958). Some tests of the decay theory of immediate memory. Quarterly Journal of Experimental Psychology, 10(1), 12–21.

    Article  Google Scholar 

  • Coloma, G. (2017). The existence of negative correlation between linguistic measures across languages. Corpus Linguistics and Linguistic Theory, 13(1), 1–26.

    Article  Google Scholar 

  • Covington, M. (2003). A Free-word-order dependency parser in Prolog. Available at http://www.covingtoninnovations.com/mc/dparser/dparser.pdf. Accessed 5 June 2019.

  • Cowan, N. (2000). The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behavioral and Brain Sciences, 24, 87–114.

    Article  Google Scholar 

  • Culicover, P. W. (2013). Grammar and complexity: Language at the intersection of competence and performance. Oxford University Press.

    Google Scholar 

  • Demberg, V., & Keller, F. (2008). Data from eye-tracking corpora as evidence for theories of syntactic processing complexity. Cognition, 109(2), 193–210.

    Article  PubMed  Google Scholar 

  • Ferrer-i-Cancho, R. (2004). Euclidean distance between syntactically linked words. Physical Review E, 70, 056135.

    Article  Google Scholar 

  • Ferrer-i-Cancho, R. (2017). The placement of the head that maximizes predictability. An information theoretic approach. https://arxiv.org/pdf/1705.09932.pdf. Accessed 5 June 2019.

  • Ferrer-i-Cancho, R., Gómez-Rodríguez, C., Esteban, J. L., & Alemany-Puig, L. (2022). Optimality of syntactic dependency distances. Physical Review E, 105(1), 014308.

    Article  PubMed  Google Scholar 

  • Frazier, L., & Rayner, K. (1988). Parameterizing the language processing system: Left vs. right-branching within and across languages. In J. Hawkins (Ed.), Explaining language universals (pp. 247–279). Blackwell.

    Google Scholar 

  • Futrell, R., Mahowald, K., & Gibson, E. (2015). Large-scale evidence of dependency length minimization in 37 languages. PNAS, 112(33), 10336–10341.

    Article  PubMed  PubMed Central  Google Scholar 

  • Futrell, R., Levy, R. P., & Gibson, E. (2020). Dependency locality as an explanatory principle for word order. Language, 96(2), 371–412.

    Article  Google Scholar 

  • Futrell, R., Gibson, E., Tily, H., Blank, I., Vishnevetsky, A., Piantadosi, S. T., & Fedorenko, E. (2021). A reading-time corpus of English texts containing rare syntactic constructions. Language Resources & Evaluation, 55, 63–77.

    Article  Google Scholar 

  • Gibson, E. (1998). Linguistic complexity: Locality of syntactic dependencies. Cognition, 68(1), 1–76.

    Article  PubMed  Google Scholar 

  • Gibson, E. (2000). The dependency locality theory: A distance-based theory of linguistic complexity. In A. P. Marantz, Y. Miyashita, & W. O’Neil (Eds.), Image, language, brain (pp. 95–126). The MIT Press.

    Google Scholar 

  • Gildea, D., & Temperley, D. (2010). Do grammars minimize dependency length? Cognitive Science, 34, 286–310.

    Article  PubMed  Google Scholar 

  • Givón, T. (1991). Markedness in grammar: Distributional, communicative and cognitive correlates of syntactic structure. Studies in Languages, 15(2), 335–370.

    Article  Google Scholar 

  • Grodner, D. J., & Gibson, E. (2005). Consequences of the serial nature of linguistic input for sentential complexity. Cognitive Science, 29, 261–290.

    Article  PubMed  Google Scholar 

  • Hawkins, J. A. (1994). A performance theory of order and constituency. Cambridge University Press.

    Google Scholar 

  • Hawkins, J. A. (2004). Efficiency and complexity in grammar. Oxford University Press.

    Book  Google Scholar 

  • Heringer, H. J., Strecker, B., & Wimmer, R. (1980). Syntax: Fragen, Lösungen, Alternativen. Wilhelm Fink Verlag.

    Google Scholar 

  • Hockett, C. F. (1958). A course in modern linguistics. The Macmillan Company.

    Book  Google Scholar 

  • Hudson, R. (1993). Do we have heads in our minds? In G. G. Corbett, N. M. Fraser, & S. McGlashan (Eds.), Heads in grammatical theory (pp. 266–291). Cambridge University Press.

    Chapter  Google Scholar 

  • Hudson, R. (1995). Measuring syntactic difficulty. Unpublished paper. Available at http://dickhudson.com/wp-content/uploads/2013/07/Difficulty.pdf. Accessed 5 Dec 2018.

  • Hudson, R. (2010). An introduction to word grammar. Cambridge University Press.

    Book  Google Scholar 

  • Jiang, J., & Liu, H. (2015). The effects of sentence length on dependency distance, dependency direction and the implications–based on a parallel English-Chinese dependency treebank. Language Sciences, 50, 93–104.

    Article  Google Scholar 

  • Just, M. A., Carpenter, P. A., & Woolley, J. D. (1982). Paradigms and processes in reading comprehension. Journal of Experimental Psychology: General, 111(2), 228–238.

    Article  Google Scholar 

  • Köhler, R., Altmann, G., & Piotrowski, R. G. (Eds.). (2005). Quantitative Linguistik. Ein internationales Handbuch [Quantitative linguistics. An international handbook]. Mouton de Gruyter.

    Google Scholar 

  • Levy, R., Fedorenko, E., & Gibson, E. (2013). The syntactic complexity of Russian relative clauses. Journal of Memory and Language, 69(4), 461–495.

    Article  PubMed  PubMed Central  Google Scholar 

  • Lewis, R. L., Vasishth, S., & Van Dyke, J. A. (2006). Computational principles of working memory in sentence comprehension. Trends in Cognitive Sciences, 10(10), 447–454.

    Article  PubMed  PubMed Central  Google Scholar 

  • Liu, H. (2007). Probability distribution of dependency distance. Glottometrics, 15, 1–12.

    Google Scholar 

  • Liu, H. (2008). Dependency distance as a metric of language comprehension difficulty. Journal of Cognitive Science, 9(2), 159–191.

    Article  Google Scholar 

  • Liu, H. (2010). Dependency direction as a means of word-order typology: A method based on dependency treebanks. Lingua, 120(6), 1567–1578.

    Article  Google Scholar 

  • Liu, H. (2018). Language as a human-driven complex adaptive system. Physics of Life Reviews, 26–27, 149–151.

    Article  PubMed  Google Scholar 

  • Liu, H., Xu, C., & Liang, J. (2017). Dependency distance: A new perspective on syntactic patterns in natural languages. Physics of Life Reviews, 21, 171–193.

    Article  PubMed  Google Scholar 

  • Lu, Q., & Liu, H. (2016). Yicun juli fenbu you guilv ma? [Does dependency distance distribute regularly?]. Journal of Zhejiang University (Humanities and Social Sciences), 46(4), 63–76.

    Google Scholar 

  • Mel’čuk, I. (1988). Dependency syntax: Theory and practice. State University of New York Press.

    Google Scholar 

  • Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63, 81–97.

    Article  PubMed  Google Scholar 

  • Mitchell, J., Lapata, M., Demberg, V., & Keller, F. (2010). Syntactic and semantic factors in processing difficulty: An integrated measure. In Proceedings of the 48th annual meeting of the association for computational linguistics (pp. 196–206).

  • Nivre, J. (2015). Towards a universal grammar for natural language processing. In A. Gelbukh (Ed.), CICLing 2015, (Part I, LNCS 9041, pp. 3–16).

  • Osborne, T., & Gerdes, K. (2019). The status of function words in dependency grammar: A critique of universal dependencies (UD). Glossa, 4(1), 17.

    Article  Google Scholar 

  • Pritchett, B. L. (1991). Head position and parsing ambiguity. Journal of Psycholinguistic Research, 20(3), 251–270.

    Article  Google Scholar 

  • Shain, C., van Schijndel, M., Futrell, R., Gibson, E., & Schuler, W. (2016). Memory access during incremental sentence processing causes reading time latency. In Proceedings of the workshop on computational linguistics for linguistic complexity (CL4LC 2016) (pp. 49–58).

  • Temperley, D. (2007). Minimization of dependency length in written English. Cognition, 105(2), 300–333.

    Article  PubMed  Google Scholar 

  • Tesnière, L. (2015). Elements of structural syntax (T. Osborne & S. Kahane, Trans.). John Benjamins (Original work published 1959).

  • Ueno, M., & Polinsky, M. (2009). Does headedness affect processing? A new look at the VO–OV contrast1. Journal of Linguistics, 45(3), 675–710.

    Article  Google Scholar 

  • Vasishth, S., & Lewis, R. L. (2006). Argument-head distance and processing complexity: Explaining both locality and antilocality effects. Language, 82, 767–794.

    Article  Google Scholar 

  • Weyerts, H., Penke, M., Münte, T. F., Heinze, H., & Clahsen, H. (2002). Word order in sentence processing: An experimental study of verb placement in German. Journal of Psycholinguistic Research, 31(3), 211–268.

    Article  PubMed  Google Scholar 

  • Yadav, H., Vaidya, A., Shukla, V., & Husain, S. (2020). Word order typology interacts with linguistic complexity: A cross-linguistic corpus study. Cognitive Science, 44, e12822.

    Article  PubMed  Google Scholar 

  • Yngve, V. (1960). A model and a hypothesis for language structure. In Proceedings of the American philosophical society (Vol. 104, No. 5, pp. 444–466).

  • Zipf, G. K. (1949). Human behavior and the principle of least effort: An introduction to human ecology. Addison-Wesley Press.

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Timothy Osborne for improving the language of an earlier draft and the two anonymous reviewers for their insightful comments.

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Authors and Affiliations

Authors

Contributions

RN: Conceptualization, Methodology, Formal analysis and investigation, Writing—original draft preparation, Writing—review and editing. HL: Conceptualization, Writing—review and editing, Resources, Supervision.

Corresponding author

Correspondence to Haitao Liu.

Ethics declarations

Conflicts of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Niu, R., Liu, H. Effects of Syntactic Distance and Word Order on Language Processing: An Investigation Based on a Psycholinguistic Treebank of English. J Psycholinguist Res 51, 1043–1062 (2022). https://doi.org/10.1007/s10936-022-09878-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10936-022-09878-4

Keywords

Navigation