Fake News Detection in Low-Resource Languages

Sivanaiah, Rajalakshmi; Ramanathan, Nishaanth; Hameed, Shajith; Rajagopalan, Rahul; Suseelan, Angel Deborah; Thanagathai, Mirnalinee Thanka Nadar

doi:10.1007/978-3-031-33231-9_23

Rajalakshmi Sivanaiah¹²,
Nishaanth Ramanathan¹²,
Shajith Hameed¹²,
Rahul Rajagopalan¹²,
Angel Deborah Suseelan¹² &
…
Mirnalinee Thanka Nadar Thanagathai¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1802))

Included in the following conference series:

International Conference on Speech and Language Technologies for Low-resource Languages

231 Accesses
2 Citations

Abstract

Fake news spreads much faster than real news. False information and misleading texts are the most important elements that lead to disasters and even life threats. One such strategy is fake news, which has become a never-ending phenomenon with the rise of the internet. There can be several devastating consequences due to fake news spreading. It is therefore important to prevent the spread of fake news. This paper shows how we prepared fake news data sets for a few low-resource languages and how we used Logistic Regression and BERT models to perform fake news classification in low-resource languages. Through rigorous experiments, we show that BERT-based-multilingual-cased and Logistic Regression models reach maximum F1 scores of around 98% and 95% respectively. We have done fake news classification with the models for low-resource Indian languages like Tamil, Kannada, Gujarati, and Malayalam.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Fake News Detection on Indian Sources

Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results

Fake News Detection in Mainstream Media Using BERT

References

Batailler, C., Brannon, S.M., Teas, P.E., Gawronski, B.: A signal detection approach to understanding the identification of fake news. Perspect. Psychol. Sci. 17(1), 78–98 (2022)
Article Google Scholar
Wickens, T.D.: Elementary Signal Detection Theory. Oxford University Press, Oxford (2001)
Book Google Scholar
Pandey, S., Prabhakaran, S., Reddy, N.V.S., Acharya, D.: Fake news detection from online media using machine learning classifiers. In: Journal of Physics: Conference Series, vol. 2161, no. 1, p. 012027. IOP Publishing (2022)
Google Scholar
Kareem, I., Awan, S.M.: Pakistani media fake news classification using machine learning classifiers. In: 2019 International Conference on Innovative Computing (ICIC), pp. 1–6. IEEE (2019)
Google Scholar
Kar, D., Bhardwaj, M., Samanta, S., Azad, A.P.: No rumours please! a multi-indic-lingual approach for COVID fake-tweet detection. In: 2021 Grace Hopper Celebration India (GHCI), pp. 1–5. IEEE (2021)
Google Scholar
Lee, J., Devlin, M., Chang, K., Toutanova, K.: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Magueresse, A., Carles, V., Heetderks, E.: Low-resource languages: a review of past work and future challenges. arXiv preprint arXiv:2006.07264 (2020)
Slovikovskaya, V.: Transfer learning from transformers to fake news challenge stance detection (FNC-1) task. arXiv preprint arXiv:1910.14353 (2019)
Kakwani, D., et al.: IndicNLPSuite: monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 4948–4961 (2020)
Google Scholar
Saurav, K., Saunack, K., Kanojia, D., Bhattacharyya, P.: A Passage to India: Pre-trained Word Embeddings for Indian Languages. arXiv preprint arXiv:2112.13800 (2021)
Kong, S.H., Tan, L.M., Gan, K.H., Samsudin, N.H.: Fake news detection using deep learning. In: 2020 IEEE 10th Symposium on Computer Applications & Industrial Electronics (ISCAIE), pp. 102–107. IEEE (2020)
Google Scholar
Guo, A., Yang, T.: Research and improvement of feature words weight based on TFIDF algorithm. In: 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference, pp. 415–419. IEEE, 2016
Google Scholar
Kula, S., Choraś, M., Kozik, R.: Application of the BERT-based architecture in fake news detection. In: Herrero, Á., Cambra, C., Urda, D., Sedano, J., Quintián, H., Corchado, E. (eds.) CISIS 2019. AISC, vol. 1267, pp. 239–249. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-57805-3_23
Chapter Google Scholar
Sommers, J.: On the characteristics of language tags on the web. In: Beverly, R., Smaragdakis, G., Feldmann, A. (eds.) PAM 2018. LNCS, vol. 10771, pp. 18–30. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76481-8_2
Chapter Google Scholar
Nada, F., Khan, B. F., Maryam, A., Zuha, N., Ahmed, Z.: Fake news detection using logistic regression. Int. Res. J. Eng. Technol. (IRJET) 6 (2019). https://www.irjet.net/archives/V6/i5/IRJET-V6I5733.pdf
Koroteev, M.V.: BERT: A review of applications in natural language processing and understanding. arXiv preprint arXiv:2103.11943, 2021
Hirlekar, V.V., Kumar, A.: Natural language processing based online fake news detection challenges-a detailed review. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES), pp. 748–754. IEEE (2020)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, Tamil Nadu, India
Rajalakshmi Sivanaiah, Nishaanth Ramanathan, Shajith Hameed, Rahul Rajagopalan, Angel Deborah Suseelan & Mirnalinee Thanka Nadar Thanagathai

Authors

Rajalakshmi Sivanaiah
View author publications
You can also search for this author in PubMed Google Scholar
Nishaanth Ramanathan
View author publications
You can also search for this author in PubMed Google Scholar
Shajith Hameed
View author publications
You can also search for this author in PubMed Google Scholar
Rahul Rajagopalan
View author publications
You can also search for this author in PubMed Google Scholar
Angel Deborah Suseelan
View author publications
You can also search for this author in PubMed Google Scholar
Mirnalinee Thanka Nadar Thanagathai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rajalakshmi Sivanaiah .

Editor information

Editors and Affiliations

National Institute of Technology Karnataka, Mangalore, India
Anand Kumar M
National University of Ireland, Galway, Ireland
Bharathi Raja Chakravarthi
Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, India
Bharathi B
National University of Ireland, Galway, Ireland
Colm O’Riordan
Indian Institute of Technology Madras, Chennai, India
Hema Murthy
Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, India
Thenmozhi Durairaj
University of Hildesheim, Hildesheim, Germany
Thomas Mandl

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sivanaiah, R., Ramanathan, N., Hameed, S., Rajagopalan, R., Suseelan, A.D., Thanagathai, M.T.N. (2023). Fake News Detection in Low-Resource Languages. In: M, A.K., et al. Speech and Language Technologies for Low-Resource Languages . SPELLL 2022. Communications in Computer and Information Science, vol 1802. Springer, Cham. https://doi.org/10.1007/978-3-031-33231-9_23

Download citation

DOI: https://doi.org/10.1007/978-3-031-33231-9_23
Published: 29 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33230-2
Online ISBN: 978-3-031-33231-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fake News Detection in Low-Resource Languages

Abstract

Access this chapter

Similar content being viewed by others

Fake News Detection on Indian Sources

Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results

Fake News Detection in Mainstream Media Using BERT

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Fake News Detection in Low-Resource Languages

Abstract

Access this chapter

Similar content being viewed by others

Fake News Detection on Indian Sources

Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results

Fake News Detection in Mainstream Media Using BERT

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation