Abstract
Today lives of people have become so strenuous and hectic that they are least bothered about getting updated with the latest news. Presently, there are various sources of news ranging from newspaper to various news mobile applications. In case of newspaper, we have to read long articles and regarding news applications, we have to search for the ones which provide us credible and relevant news everyday which is also a very time-consuming task. To address this problem, a web application ‘TweetsDaily’ which fetches news in the form of tweets of two domains—Indian and Global which are posted by various news channels on their respective Twitter user timelines. The tweets of each domain are further classified into five categories, namely, Politics, Sports, Entertainment, Technology, and Miscellaneous (news related to day-to-day crimes, stocks and investment, communal disputes, terror attacks, etc.) In this paper, we represent a method for the classification of tweets in five categories of each domain. The method is a text-based classification technique based on ensemble model of different classification algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Gretel Liz De la Peña Sarracén [21] proposed a system based on an ensemble of five classifiers for IberEval2017 on Classification Of Spanish Election Tweets (COSET) task. F1-macro value for ensemble approach of the system is 0.5847, whereas F1-macro value for our Indian News Ensemble Classifier is 0.797.
References
Kwak, H., Lee, C., Park, H., & Moon, S. (2010, April). What is Twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web (pp. 591–600). ACM.
Ritholdz, B. (2013). Twitter is becoming the first and quickest source of investment news. Retrieved July 15, 2018, from https://www.theguardian.com/technology/2013/apr/23/twitter-first-source-investment-news.
Manning, C., Raghavan, P., & Schütze, H. (2010). Introduction to information retrieval. Natural Language Engineering, 16(1), 258–262.
Manning, C., Raghavan, P., & Schütze, H. (2010). Introduction to information retrieval. Natural Language Engineering, 16(1), 263–270.
Wright, R. E. (1995). Logistic regression.
Ng, A., Ngiam, J., Foo, C. Y., Mai, Y., Suen, C., Coates, A., et al. (2018). Optimization: Stochastic gradient descent. Retrieved July 15, 2018, from http://ufldl.stanford.edu/tutorial/supervised/OptimizationStochasticGradientDescent.
Huang, T. M., Kecman, V., & Kopriva, I. (2006). Kernel based algorithms for mining huge data sets (Vol. 1). Heidelberg: Springer.
Go, A., Bhayani, R., & Huang, L. (2009). Twitter sentiment classification using distant supervision. CS224 N Project Report, Stanford (Vol. 1, No. 12, p. 2009).
Pang, B., Lee, L., Vaithyanathan, S. (2002). Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing (Vol. 10, pp. 79–86). Association for Computational Linguistics.
Da Silva, N. F., Hruschka, E. R., & Hruschka, E. R., Jr. (2014). Tweet sentiment analysis with classifier ensembles. Decision Support Systems, 66, 170–179.
Wan, Y., & Gao, Q. (2015, November). An ensemble sentiment classification system of twitter data for airline services analysis. In 2015 IEEE International Conference on Data Mining Workshop (ICDMW) (pp. 1318–1325). IEEE.
Sankaranarayanan, J., Samet, H., Teitler, B. E., Lieberman, M. D., & Sperling, J. (2009, November). Twitterstand: News in tweets. In Proceedings of the 17th ACM Sigspatial International Conference on Advances in Geographic Information Systems (pp. 42–51).
Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M. (2010, July). Short text classification in twitter to improve information filtering. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 841–842). ACM.
Genc, Y., Sakamoto, Y., Nickerson, J. V. (2011, July). Discovering context: Classifying tweets through a semantic transform based on wikipedia. In International Conference on Foundations of Augmented Cognition (pp. 484–492). Berlin, Heidelberg: Springer
Kinsella, S., Passant, A., & Breslin, J. G. (2011, April). Topic classification in social media using metadata from hyperlinked objects. In European Conference on Information Retrieval (pp. 201–206). Berlin, Heidelberg: Springer.
Mohanavalli, S., Karthika, S., Srividya, K. R., & Sandya, N., (2018). Categorisation of Tweets using ensemble classification. International Journal of Engineering and Technology, 7(3), 12 (2018). Special Issue 12, 722–725.
Munjal, P., Kumar, S., Kumar, L., & Banati, A. (2017). Opinion dynamics through natural phenomenon of grain growth and population migration. In Hybrid intelligence for social networks (pp. 161–175). Cham: Springer.
Munjal, P., Narula, M., Kumar, S., & Banati, H. (2018). Twitter sentiments based suggestive framework to predict trends. Journal of Statistics and Management Systems, 21(4), 685–693.
Munjal, P., Kumar, L., Kumar, S., & Banati, H. (2019). Evidence of Ostwald Ripening in opinion driven dynamics of mutually competitive social networks. Physica A: Statistical Mechanics and its Applications.
Bird, S., Klein, E., & Loper, E. (2009). Natural language processing with Python: Analyzing text with the natural language toolkit. O’Reilly Media, Inc.
De la Peña Sarracén, G. L. (2017). Ensembles of methods for Tweet topic classification. In IberEval@ SEPLN (pp. 15–19).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Gupta, D., Sharma, A., Kumar, M. (2020). TweetsDaily: Categorised News from Twitter. In: Sharma, H., Govindan, K., Poonia, R., Kumar, S., El-Medany, W. (eds) Advances in Computing and Intelligent Systems. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-0222-4_5
Download citation
DOI: https://doi.org/10.1007/978-981-15-0222-4_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0221-7
Online ISBN: 978-981-15-0222-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)