Abstract
Automatic content analysis is more and more becoming an accepted research method in social science. In political science researchers are using party manifestos and transcripts of political speeches to analyze the positions of different actors. Existing approaches are limited to a single dimension, in particular, they cannot distinguish between the positions with respect to a specific topic. In this paper, we propose a method for analyzing and comparing documents according to a set of predefined topics that is based on an extension of Latent Dirichlet Allocation for inducing knowledge about relevant topics. We validate the method by showing that it can reliably guess which member of a coalition was assigned a certain ministry based on a comparison of the parties’ election manifestos with the coalition contract.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Andrzejewski, D., Zhu, X., Craven, M., Recht, B.: A framework for incorporating general domain knowledge into latent dirichlet allocation using first-order logic. In: Proceedings of the 22nd International Joint Conference on Artificial Intelligence, IJCAI 2011 (2011)
Benoit, K., Mikhaylov, S., Laver, M.: Treating words as data with error: Uncertainty in text statements of policy positions. American Journal of Political Science 53(2), 495–513 (2009)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of Machine Learning Research (JMLR) 3, 993–1022 (2003)
Casella, G., George, E.I.: Explaining the gibbs sampler. The American Statistician 46(3), 167–174 (1992)
Hearst, M.: Texttiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics 23(1), 33–64 (1997)
Laver, M., Garry, J.: Estimating policy positions from political texts. American Journal of Political Science 44(3), 619–634 (2000)
Laver, M., Sergenti, E.: Party Competition: An Agent-Based Model. Princeton University Press (2011)
Lee, L.: On the effectiveness of the skew divergence for statistical language analysis. In: Artificial Intelligence and Statistics, pp. 65–72 (2001)
Pappi, F.U., Seher, N.M., Kurella, A.-S.: Das politikangebot deutscher parteien in den bundestagswahlen seit 1976 im dimensionsweisen vergleich: Gesamtskala und politikfeldspezifische skalen. Working Paper 142, Mannheimer Zentrum für Europäische Sozialforschung, MZES (2011)
Seher, N.M., Pappi, F.U.: Politikfeldspezifische positionen der landesverbände der deutschen parteien. Working Paper 139, Mannheimer Zentrum für Europäische Sozialforschung, MZES (2011)
Slapin, J.B., Proksch, S.-O.: A scaling model for estimating time-series policy positions from texts. American Journal of Political Science 52(3), 705–722 (2008)
Toutanova, K., Klein, D., Manning, C., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of HLT-NAACL 2003, pp. 252–259 (2003)
Volkens, A., Lacewell, O., Lehmann, P., Regel, S., Schultze, H., Werner, A.: The Manifesto Data Collection. Manifesto Project (MRG/CMP/MARPOR), Wissenschaftszentrum Berlin fĂĽr Sozialforschung, WZB (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stuckenschmidt, H., Zirn, C. (2012). Multi-dimensional Analysis of Political Documents. In: Bouma, G., Ittoo, A., MĂ©tais, E., Wortmann, H. (eds) Natural Language Processing and Information Systems. NLDB 2012. Lecture Notes in Computer Science, vol 7337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31178-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-31178-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31177-2
Online ISBN: 978-3-642-31178-9
eBook Packages: Computer ScienceComputer Science (R0)