Abstract
It is arguably the case that ‘The purpose of language testing is always to render information to aid in making intelligent decisions about possible courses of action’ (Carroll, 1961, p. 314). This holds true whether the decisions are primarily pedagogic, or affect the future education or employment of the test taker. If fair and useful decisions are to be made, three conditions must hold. Firstly, valid inferences must be made about the meaning of test scores. Secondly, score meaning must be relevant and generalisable to a real-world domain. Thirdly, score meaning should be (at least partially) predictive of post-decision performance. If any of these conditions are not met, the process of assessment and decision-making may be questioned not only in theory but also in the courts (Fulcher, 2014a). It is therefore not surprising that historically, testing practice has rested on the assumption that language competence, however defined, is a relatively stable cognitive trait. This is expressed clearly in classic statements of the role of measurement in the ‘human sciences’, such as this by father of American psychology James McKeen Cattell (1890, p. 380):
One of the most important objects of measurement … is to obtain a general knowledge of the capacities of a man by sinking shafts, as it were, at a few critical points. In order to ascertain the best points for the purpose, the sets of measures should be compared with an independent estimate of the man’s powers. We thus may learn which of the measures are the most instructive.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bachman, L. F. (2006). Generalizability: A journey into the nature of empirical research in applied linguistics. In M. Chalhoub-Deville (Ed.), Inference and generalizability in applied linguistics: Multiple perspectives (pp. 165–207). Amsterdam: John Benjamins.
Bachman, L. F., & Palmer, A. S. (1996). Language testing in practice. Oxford University Press, Oxford.
Biber, D. (2006). University language: A corpus-based study of spoken and written registers. Amsterdam: John Benjamins.
Brown, A. (2003). Interviewer variation and the co-construction of speaking proficiency. Language Testing 20 (1), 1–25.
Carroll, J. B. (1961). Fundamental considerations in testing for English language proficiency of foreign students. In A. Campbell (Ed.), Teaching English as a second language. A book of readings (pp. 311–321) (2nd ed., 1965). New York: McGraw-Hill.
Cattell, J. M. (1890). Mental tests and measurements. Mind, 59 (3), 373–381.
Cattell, R. B. (1946). Description and measurement of personality. Yonkers on Hudson: World Books.
CellarTracker (n.d.) Wine review website. Retrieved from http://www.cellar-tracker.com/
Chapelle, C. (1998). Construct definition and validity inquiry in SLA research. In L. F. Bachman & A. D. Cohen (Eds.), Interfaces between second language acquisition and language testing research (pp. 32–70). Cambridge: Cambridge University Press.
Cizek, G. J. (2012). Setting performance standards: Foundations, methods and innovations (2nd ed.). New York: Routledge.
Crease, R. P. (2011). World in the balance: The historic quest for an absolute system of measurement. New York & London: WW Norton and Co.
Cureton, E. E. (1951). Validity. In E. F. Lindquist (Ed.), Educational measurement (pp. 621–694). Washington, DC: American Council on Education.
Davies, A. (1978). Language testing. Reprinted from Language Teaching and Linguistics Abstracts. In V. Kinsella (Ed.), (1982) Surveys 1: Eight state-of-the-art articles on key areas in language teaching (pp. 127–159). Cambridge: Cambridge University Press.
Dörnyei, Z. (2009). Individual differences: Interplay of learner characteristics and learning environment. Language Learning 59 (Suppl. 1), 230–248.
Fulcher, G. (1995). Variable competence and second language acquisition: A problem for research methodology. System 25 (1), 25–33.
Fulcher, G. (2003). Testing second language speaking. Harlow: Longman.
Fulcher, G. (2010). Practical language testing. London: Hodder Education.
Fulcher, G. (2012). Scoring performance tests. In G. Fulcher & F. Davidson (Eds.), The Routledge handbook of language testing (pp. 378–392). London: Routledge.
Fulcher, G. (2014a). Language testing in the dock. In A. Kunnan (Ed.), Companion to language assessment (pp. 1553–1570). London: Wiley-Blackwell.
Fulcher, G. (2014b). Philosophy and language testing. In A. Kunnan (Ed.), Companion to language assessment (pp. 1431–1451). London: Wiley-Blackwell.
Fulcher, G., & Márquez-Reiter, R. (2003). Task difficulty in speaking tests. Language Testing 20 (3), 321–344.
Fulcher, G., & Svalberg, A. M-L. (2013). Limited aspects of reality: Frames of reference in language assessment. International Journal of English Studies, 13 (2), 1–19.
Galaczi, E. D. (2008). Peer-peer interaction in a speaking test: The case of the First Certificate in English examination. Language Assessment Quarterly, 5 (2), 89–119.
Glaser, R., & Klaus, D. (1962). Proficiency measurement: Assessing human performance. In R. M. Gagné (Ed.), Psychological principles in system development (pp. 421–427). New York: Holt, Rinehart and Winston.
Kaulfers, W. V. (1944). Wartime development in modern-language achievement testing. Modern Language Journal, 28 (2), 136–150.
Keverberg, Baron de. (1827). Notes. Appended to A. Quetelet (1827). Recherches sur la population, les naissances, les décès, les prisons, les dépôts de mendicité, etc., dans le royaume des Pays-Bas (pp. 175–192). Nouveaux mémoirs de l’Académie royale des sciences et belles-lettres de Bruxelles, 4, 117–192.
Klein, H. A. (1974). Masterpieces, mysteries and muddles of metrology: The world of measurement. London: George Allen and Unw in Ltd.
Lado, R., (1961). Language testing: The construction and use of foreign language tests. London: Longman.
Latham, H. (1877). On the action of examinations considered as a means of selection. Cambridge: Dighton, Bell and Company.
Lazaraton, A. (1996). Interlocutor support in oral proficiency interviews: The case of CASE. Language Testing, 13 (2), 151–172.
Loevinger, J. (1957). Objective tests as instruments of psychological theory. Psychological Reports 3, Monograph Supplement, 9, 635–694.
Martyniuk, W. (2010). Aligning tests with the CEFR: Reflections on using the Council of Europe’s draft manual. Cambridge: Cambridge University Press.
McNamara, T. (1995). Modelling performance: Opening Pandora’s box. Applied Linguistics, 16 (2), 159–179.
McNamara, T. (2001). Language assessment as social practice: challenges for research. Language Testing, 18 (4), 333–349.
McNamara, T. (2006). Validity and values: Inferences and generalizability in language testing. In M. Chalhoub-Deville (Ed.), Inference and generalizability in applied linguistics: Multiple perspectives (pp. 27–45). Amsterdam: John Benjamins.
Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (pp. 13–103). New York: American Council on Education/Macmillan.
Mihailović, D. T., Mimić, G., & Arsenić, I. (2014). Climate predictions: The chaos and complexity in climate models. Advances in Meteorology. Retrieved from http://dx.doi.org/10.1155/2014/878249
Mlodinow, L. (2008). The drunkard’s walk: The story of randomness and its role in our lives. New York: Pantheon.
Morrow, K. (1979). Communicative language testing: revolution of evolution? In C. K. Brumfit & K. Johnson (Eds.), The communicative approach to language teaching (pp. 143–159). Oxford: Oxford University Press.
Morrow, K. (2012). Communicative language testing. In C. Coombe, P. Davidson, B. O’Sullivan, & S. Stoynoff (Eds.), The Cambridge guide to second language assessment (pp. 140–146). Cambridge: Cambridge University Press.
Norton, J. (2013). Performing identities in speaking tests: Co-construction revisited. Language Assessment Quarterly, 10 (3), 309–330.
Peirce, C. S., & Jastrow, J. (1885). On small differences of sensation. Memoirs of the National Academy of Sciences 1884 (pp. 73–83). Washington, DC: National Academy of Sciences.
Ross, S., & Berwick, R. (1992). The discourse of accommodation in oral proficiency interviews. Studies in Second Language Acquisition, 14 (1), 159–176.
Tarone, E. (1998). Research on interlanguage variation: Implications for language testing. In L. F. Bachman & A. D. Cohen (Eds.), Interfaces between second language acquisition and language testing research (pp. 71–89). Cambridge: Cambridge University Press.
Van Moere, A. (2012). A psycholinguistic approach to oral language assessment. Language Testing, 29 (2), 325–344.
Editor information
Editors and Affiliations
Copyright information
© 2016 Glenn Fulcher
About this chapter
Cite this chapter
Fulcher, G. (2016). Context and Inference in Language Testing. In: King, J. (eds) The Dynamic Interplay between Context and the Language Learner. Palgrave Macmillan, London. https://doi.org/10.1057/9781137457134_12
Download citation
DOI: https://doi.org/10.1057/9781137457134_12
Publisher Name: Palgrave Macmillan, London
Print ISBN: 978-1-349-57489-6
Online ISBN: 978-1-137-45713-4
eBook Packages: EducationEducation (R0)