A Quantitative Categorization of Phonemic Dialect Features in Context

  • Conference paper
Modeling and Using Context (CONTEXT 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3554)

Abstract

We test a method of clustering dialects of English according to patterns of shared phonological features. Previous linguistic research has generally considered phonological features as independent of each other, but context is important: rather than considering each phonological feature individually, we compare the patterns of shared features, or Mutual Information (MI). The dependence of one phonological feature on the others is quantified and exploited. The results of this method of categorizing 59 dialect varieties by 168 binary internal (pronunciation) features are compared to traditional groupings based on external features (e.g., ethnic, geographic). The MI and size of the groups are calculated for taxonomies at various levels of granularity and these groups are compared to other analyses of geographic and ethnic distribution. Applications that could be improved by using MI methods are suggested.
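As a rough illustration of the quantity the abstract relies on, the sketch below computes the mutual information between two binary features observed across a set of dialects. It is not the authors' implementation: the function name `mutual_information` and the four-dialect toy vectors are assumptions made for this example, and the paper's clustering procedure itself is not reproduced here.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Mutual information, in bits, between two equal-length sequences.

    Each position is one dialect; each value is 1 if the dialect has the
    feature and 0 if it does not (hypothetical encoding for illustration).
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(x)
    joint = Counter(zip(x, y))  # counts of each (x, y) value pair
    count_x = Counter(x)        # marginal counts for x
    count_y = Counter(y)        # marginal counts for y
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p(a, b) / (p(a) * p(b)) simplifies to count * n / (count_a * count_b)
        mi += p_ab * math.log2(count * n / (count_x[a] * count_y[b]))
    return mi


# Toy data (assumed, not from the paper): three features over four dialects.
feature_a = [1, 1, 0, 0]
feature_b = [1, 1, 0, 0]  # always co-occurs with feature_a
feature_c = [1, 0, 1, 0]  # statistically independent of feature_a

mi_dependent = mutual_information(feature_a, feature_b)
mi_independent = mutual_information(feature_a, feature_c)
```

In this toy case the fully dependent pair carries one bit of shared information and the independent pair carries none, which is the sense in which MI quantifies how much one feature tells us about another.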

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nagy, N., Zhang, X., Nagy, G., Schneider, E.W. (2005). A Quantitative Categorization of Phonemic Dialect Features in Context. In: Dey, A., Kokinov, B., Leake, D., Turner, R. (eds) Modeling and Using Context. CONTEXT 2005. Lecture Notes in Computer Science, vol 3554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508373_25

  • DOI: https://doi.org/10.1007/11508373_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26924-3

  • Online ISBN: 978-3-540-31890-3

  • eBook Packages: Computer Science, Computer Science (R0)