Abstract
Fake news spreads much faster than real news. False information and misleading texts are the most important elements that lead to disasters and even life threats. One such strategy is fake news, which has become a never-ending phenomenon with the rise of the internet. There can be several devastating consequences due to fake news spreading. It is therefore important to prevent the spread of fake news. This paper shows how we prepared fake news data sets for a few low-resource languages and how we used Logistic Regression and BERT models to perform fake news classification in low-resource languages. Through rigorous experiments, we show that BERT-based-multilingual-cased and Logistic Regression models reach maximum F1 scores of around 98% and 95% respectively. We have done fake news classification with the models for low-resource Indian languages like Tamil, Kannada, Gujarati, and Malayalam.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Batailler, C., Brannon, S.M., Teas, P.E., Gawronski, B.: A signal detection approach to understanding the identification of fake news. Perspect. Psychol. Sci. 17(1), 78–98 (2022)
Wickens, T.D.: Elementary Signal Detection Theory. Oxford University Press, Oxford (2001)
Pandey, S., Prabhakaran, S., Reddy, N.V.S., Acharya, D.: Fake news detection from online media using machine learning classifiers. In: Journal of Physics: Conference Series, vol. 2161, no. 1, p. 012027. IOP Publishing (2022)
Kareem, I., Awan, S.M.: Pakistani media fake news classification using machine learning classifiers. In: 2019 International Conference on Innovative Computing (ICIC), pp. 1–6. IEEE (2019)
Kar, D., Bhardwaj, M., Samanta, S., Azad, A.P.: No rumours please! a multi-indic-lingual approach for COVID fake-tweet detection. In: 2021 Grace Hopper Celebration India (GHCI), pp. 1–5. IEEE (2021)
Lee, J., Devlin, M., Chang, K., Toutanova, K.: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Magueresse, A., Carles, V., Heetderks, E.: Low-resource languages: a review of past work and future challenges. arXiv preprint arXiv:2006.07264 (2020)
Slovikovskaya, V.: Transfer learning from transformers to fake news challenge stance detection (FNC-1) task. arXiv preprint arXiv:1910.14353 (2019)
Kakwani, D., et al.: IndicNLPSuite: monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 4948–4961 (2020)
Saurav, K., Saunack, K., Kanojia, D., Bhattacharyya, P.: A Passage to India: Pre-trained Word Embeddings for Indian Languages. arXiv preprint arXiv:2112.13800 (2021)
Kong, S.H., Tan, L.M., Gan, K.H., Samsudin, N.H.: Fake news detection using deep learning. In: 2020 IEEE 10th Symposium on Computer Applications & Industrial Electronics (ISCAIE), pp. 102–107. IEEE (2020)
Guo, A., Yang, T.: Research and improvement of feature words weight based on TFIDF algorithm. In: 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference, pp. 415–419. IEEE, 2016
Kula, S., Choraś, M., Kozik, R.: Application of the BERT-based architecture in fake news detection. In: Herrero, Á., Cambra, C., Urda, D., Sedano, J., Quintián, H., Corchado, E. (eds.) CISIS 2019. AISC, vol. 1267, pp. 239–249. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-57805-3_23
Sommers, J.: On the characteristics of language tags on the web. In: Beverly, R., Smaragdakis, G., Feldmann, A. (eds.) PAM 2018. LNCS, vol. 10771, pp. 18–30. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76481-8_2
Nada, F., Khan, B. F., Maryam, A., Zuha, N., Ahmed, Z.: Fake news detection using logistic regression. Int. Res. J. Eng. Technol. (IRJET) 6 (2019). https://www.irjet.net/archives/V6/i5/IRJET-V6I5733.pdf
Koroteev, M.V.: BERT: A review of applications in natural language processing and understanding. arXiv preprint arXiv:2103.11943, 2021
Hirlekar, V.V., Kumar, A.: Natural language processing based online fake news detection challenges-a detailed review. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES), pp. 748–754. IEEE (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sivanaiah, R., Ramanathan, N., Hameed, S., Rajagopalan, R., Suseelan, A.D., Thanagathai, M.T.N. (2023). Fake News Detection in Low-Resource Languages. In: M, A.K., et al. Speech and Language Technologies for Low-Resource Languages . SPELLL 2022. Communications in Computer and Information Science, vol 1802. Springer, Cham. https://doi.org/10.1007/978-3-031-33231-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-33231-9_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33230-2
Online ISBN: 978-3-031-33231-9
eBook Packages: Computer ScienceComputer Science (R0)