Abstract
The Leximancer system is a relatively new method for transforming lexical co-occurrence information from natural language into semantic patterns in an unsupervised manner. It employs two stages of co-occurrence information extraction—semantic andrelational—using a different algorithm for each stage. The algorithms used are statistical, but they employ nonlinear dynamics and machine learning. This article is an attempt to validate the output of Leximancer, using a set of evaluation criteria taken from content analysis that are appropriate for knowledge discovery tasks.
Article PDF
Similar content being viewed by others
References
Apté, C., Damerau, F., &Weiss, S. M. (1994). Towards language independent automated learning of text categorization models. In W. B. Croft & C. J. van Rijsbergen (Eds.),Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 23–30). New York: Springer.
Bahr, L. S., &Johnston, B. (Eds.) (1992).Collier’s encyclopedia. New York: Macmillan Educational.
Bassford, C. (1994).Clausewitz in English: The reception of Clausewitz in Britain and America, 1815–1945. New York: Oxford University Press. (See also www.clausewitz.com)
Beeferman, D., Berger, A., &Lafferty, J. (1997). A model of lexical attraction and repulsion. In P. R. Cohen & W. Wahlster (Eds.),Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics (pp. 373–380). Madrid: Association for Computational Linguistics.
Burgess, C., &Lund, K. (1997). Modelling parsing constraints with high-dimensional context space.Language & Cognitive Processes,12, 177–210.
Chalmers, M., &Chitson, P. (1992). Bead: Explorations in information visualisation. In N. J. Belkin, P. Ingwersen, & A. M. Pejtersen (Eds.), published as a special issue of sigir forum,Proceedings of the 15th Annual ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 330–337). New York: ACM Press.
Clausewitz, C. von (1873).On war (J. J. Graham, Ed. and Trans.). London: Trübner. (Original work published 1832) (This text obtained from: www.clausewitz.com)
Dumais, S. T., Platt, J., Heckerman, D., &Sahami, M. (1998). Inductive learning algorithms and representations for text categorization. In G. Gardarin, J. C. French, N. Pissinou, K. Makki, & L. Bougamin (Eds.),CIKM ’98: Proceedings of the 7th International Conference on Information and Knowledge Management (pp. 148–155). New York: ACM Press.
Grant, U. S. (1885).The personal memoirs of U. S. Grant. Retrieved from Project Gutenberg, www.gutenberg.org, September 2004.
Grech, M. R., Horberry, T., &Smith, A. (2002). Human error in maritime operations: Analyses of accident reports using the leximancer tool. InProceedings of the Human Factors and Ergonomics Society 46th Annual Meeting. Baltimore: Human Factors and Ergonomics Society.
International Rugby Board (2003).The laws of the game of rugby union: 2003 edition. Available at www.irb.com.
Katter, R. V., Montgomery, C. A., &Thompson, J. R. (1979).Human processes in intelligence analysis: Phase I overview (Research Rep. 1237). Woodland Hills, CA: Operating Systems, Inc.
Krippendorff, K. (2004).Content analysis: An introduction to its methodology (2nd ed.). Newbury Park, CA: Sage.
Landauer, T., &Dumais, S. (1997). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge.Psychological Review,104, 211–240.
Landauer, T., Foltz, P., &Laham, D. (1998). Introduction to latent semantic analysis.Discourse Processes,25, 259–284.
Lefebvre, S. (2004). A look at intelligence analysis.International Journal of Intelligence & Counterintelligence,17, 231–264.
Major League Baseball (1999).Official baseball rules: 1999 edition. Available at www.amherst.edu/~baseball/rules.html.
Marylebone Cricket Club (2003).The laws of cricket (2000 Code 2nd edition). Available at www.lords.org.
Nelson, D., McEvoy, C. L., &Pointer, L. (2003). Spreading activation or spooky action at a distance?Journal of Experimental Psychology: Learning, Memory, & Cognition,29, 42–52.
Nisbett, R. E., &Wilson, T. D. (1977). Telling more than we can know: Verbal reports on mental processes.Psychological Review,84, 231–259.
North American Football League (2003).2003 playing rules of the NAFL. Available at www.nafl.org.
Osgood, C. E., Suci, G. J., &Tannenbaum, P. H. (1957).The measurement of meaning. Urbana: University of Illinois Press.
Salton, G. (1989).Automatic text processing: The transformation, analysis, and retrieval of information by computer. Reading, MA: Addison-Wesley.
Smith, A. E. (2000a). Machine learning of well-defined thesaurus concepts. In A.-H. Tan & P. S. Yu (Eds.),Proceedings of the International Workshop on Text and Web Mining (PRICAI 2000) (pp. 72–79). Melbourne.
Smith, A. E. (2000b). Machine mapping of document collections: The leximancer system. InProceedings of the Fifth Australasian Document Computing Symposium. Sunshine Coast, Australia: DSTC.
Smith, A. E. (2003). Automatic extraction of semantic networks from text using Leximancer. InHLT-NAACL 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: Companion volume (pp. Demo23-Demo24). Edmonton: ACL.
Sowa, J. F. (2000).Knowledge representation: Logical, philosophical, and computational foundations. Pacific Grove, CA: Brooks Cole.
Stubbs, M. (1996).Text and corpus analysis: Computer-assisted studies of language and culture. Oxford: Blackwell.
U.S. Marine Corps (1997).Marine corps doctrinal publications: Capstone publications (MCDP Nos. 1, 1–1, 1–2, 1–3). Washington, DC: United States Government. (Available at www.doctrine.usmc.mil)
Weber, R. (1990).Basic content analysis. Newbury Park, CA: Sage.
Yarowsky, D. (1995). Unsupervised word-sense disambiguation rivaling supervised methods. InProceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL-95) (pp. 189–196). Morristown, NJ: Association for Computational Linguistics.
Author information
Authors and Affiliations
Corresponding author
Additional information
A.E.S. has inventor rights to the Leximancer intellectual property.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Smith, A.E., Humphreys, M.S. Evaluation of unsupervised semantic mapping of natural language with Leximancer concept mapping. Behavior Research Methods 38, 262–279 (2006). https://doi.org/10.3758/BF03192778
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03192778