Abstract
Global and national events in recent years have shown that social media, and particularly micro-blogging services such as Twitter, can be a force for good (e.g., the Arab Spring) and for harm (e.g., the London riots). In both of these examples, social media played a key role in group formation and organisation, and in the coordination of the group’s subsequent collective actions (i.e., the move from rhetoric to action). Surprisingly, despite its clear importance, little is understood about the factors that lead to this kind of group development and the transition to collective action. This paper focuses on an approach to analysing social media data to detect weak signals, i.e., indicators that initially appear at the fringes but are, in fact, early indicators of such large-scale real-world phenomena. Our approach contrasts with existing research, which focuses on analysing major themes, i.e., the strong signals, prevalent in a social network at a particular point in time. Analysis of weak signals opens up possibilities for forecasting, with online user-generated content being used to identify and anticipate possible future offline events. We demonstrate our approach by analysing tweets collected during the London riots in 2011 and using the detected weak signals to predict tipping points in that context.
Notes
1. http://theguardian.com/world/2011/aug/05/man-shot-police-london-arrest (Accessed: 3/3/2016).
2. http://theguardian.com/uk/2011/aug/07/tottenham-riots-peaceful-protest (Accessed: 3/3/2016).
3. http://en.wikipedia.org/wiki/Topsy_Labs (Accessed: 3/3/2016).
4. http://sentistrength.wlv.ac.uk/ (Accessed: 3/3/2016).
5. http://ucrel.lancs.ac.uk/wmatrix/ (Accessed: 3/3/2016).
6. http://ucrel.lancs.ac.uk/bnc2sampler/sampler.htm (Accessed: 3/3/2016).
7. http://theguardian.com/uk/2011/dec/05/anger-police-fuelled-riots-study (Accessed: 3/3/2016).
8. http://theguardian.com/uk/2011/aug/07/tottenham-riot-looting-north-london (Accessed: 3/3/2016).
9. http://www.cs.waikato.ac.nz/ml/weka (Accessed: 3/3/2016).
10. http://ucrel.lancs.ac.uk/claws6tags.html (Accessed: 3/3/2016).
11. http://ucrel.lancs.ac.uk/usas/USASSemanticTagset.pdf (Accessed: 3/3/2016).
Acknowledgements
We would like to express our thanks to Dr. Paul Rayson for providing us access to Wmatrix’s web interface and API, and Prof. Mike Thelwall for providing us with the SentiStrength Java version.
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Charitonidis, C., Rashid, A., Taylor, P.J. (2017). Predicting Collective Action from Micro-Blog Data. In: Kawash, J., Agarwal, N., Özyer, T. (eds) Prediction and Inference from Social Networks and Social Media. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-51049-1_7
DOI: https://doi.org/10.1007/978-3-319-51049-1_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51048-4
Online ISBN: 978-3-319-51049-1
eBook Packages: Computer Science, Computer Science (R0)