Abstract
Social networks have become big data production engines and their analytics can reveal insightful trending topics, such that hidden knowledge can be utilized in various applications and settings. This paper addresses the problem of popular topics’ and trends’ early prediction out of social networks data streams which demand distributed software architectures. Under an online time series classification model, which is implemented in a flexible and adaptive distributed framework, trending topics are detected. Emphasis is placed on the early detection process and on the performance of the proposed framework. The implemented framework builds on the lambda architecture design and the experimentation carried out highlights the usefulness of the proposed approach in early trends detection with high rates in performance and with a validation aligned with a popular microblogging service.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arkaitz, Z.: Real-time classification of Twitter trends. J. Assoc. Inf. Sci. Technol. 66, 462–473 (2015)
Salvatore, G., Lo Re, G., Morana, M.: A framework for real-time Twitter data analysis. Comput. Commun. 73, 236–242 (2016)
Li, J.: Bursty event detection from microblog: a distributed and incremental approach. Concurrency Comput. Pract. Exp. 28, 3115–3130 (2015)
Manirupa, D.: Towards methods for systematic research on big data. In: IEEE International Conference on Big Data (Big Data). IEEE (2015)
Giatsoglou, M., Chatzakou, D., Shah, N., Faloutsos, C., Vakali, A.: Retweeting activity on Twitter: signs of deception. In: Cao, T., Lim, E.-P., Zhou, Z.-H., Ho, T.-B., Cheung, D., Motoda, H. (eds.) PAKDD 2015. LNCS, vol. 9077, pp. 122–134. Springer, Heidelberg (2015)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the Nineteenth International WWW Conference. ACM (2010)
Stanislav, N., Chen, G., Shah, D.: A latent source model for nonparametric time series classification. In: Advances in Neural Information Processing Systems (2013)
Kontaki, M., Papadopoulos, A.N., Manolopoulos, Y.: Continuous trend-based classification of streaming time series. In: Eder, J., Haav, H.-M., Kalja, A., Penjam, J. (eds.) ADBIS 2005. LNCS, vol. 3631, pp. 294–308. Springer, Heidelberg (2005)
Szomszor, M., Kostkova, P., de Quincey, E.: #Swineflu: Twitter predicts swine flu outbreak in 2009. In: Szomszor, M., Kostkova, P. (eds.) e-Health. LNICST, vol. 69, pp. 18–26. Springer, Heidelberg (2011)
Lei, S.: Predicting US primary elections with Twitter. http://snap.stanford.edu/social2012/papers/shi.pdf. Accessed 2012
Wang, Y.: To Follow or Not to Follow: Analyzing the Growth Patterns of the Trumpists on Twitter. arXiv:1603.08174 (2016)
Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream. In: SIGMOD ACM (2010)
Gorton, I., Klein, K.: Distribution, data, deployment: Software architecture convergence in big data systems. IEEE Softw. 32(3), 78–85 (2015)
Tang, B.: A hierarchical distributed fog computing architecture for big data analysis in smart cities. In: Proceedings of the ASE BigData & SocialInformatics. ACM (2015)
Mariam, K.: Lambda architecture for cost-effective batch and speed big data processing. In: IEEE Big Data International Conference (2015)
Martínez-Prieto, M.: The solid architecture for real-time management of big semantic data. Future Gener. Comput. Syst. 47, 62–79 (2015)
Marz, N.: Big data : principles and best practices of scalable realtime data systems. O’Reilly Media (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Vakali, A., Kitmeridis, N., Panourgia, M. (2017). A Distributed Framework for Early Trending Topics Detection on Big Social Networks Data Threads. In: Angelov, P., Manolopoulos, Y., Iliadis, L., Roy, A., Vellasco, M. (eds) Advances in Big Data. INNS 2016. Advances in Intelligent Systems and Computing, vol 529. Springer, Cham. https://doi.org/10.1007/978-3-319-47898-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-47898-2_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47897-5
Online ISBN: 978-3-319-47898-2
eBook Packages: EngineeringEngineering (R0)