Skip to main content

Fake News Detection in Low-Resource Languages

  • Conference paper
  • First Online:
Speech and Language Technologies for Low-Resource Languages (SPELLL 2022)

Abstract

Fake news spreads much faster than real news. False information and misleading texts are the most important elements that lead to disasters and even life threats. One such strategy is fake news, which has become a never-ending phenomenon with the rise of the internet. There can be several devastating consequences due to fake news spreading. It is therefore important to prevent the spread of fake news. This paper shows how we prepared fake news data sets for a few low-resource languages and how we used Logistic Regression and BERT models to perform fake news classification in low-resource languages. Through rigorous experiments, we show that BERT-based-multilingual-cased and Logistic Regression models reach maximum F1 scores of around 98% and 95% respectively. We have done fake news classification with the models for low-resource Indian languages like Tamil, Kannada, Gujarati, and Malayalam.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Batailler, C., Brannon, S.M., Teas, P.E., Gawronski, B.: A signal detection approach to understanding the identification of fake news. Perspect. Psychol. Sci. 17(1), 78–98 (2022)

    Article  Google Scholar 

  2. Wickens, T.D.: Elementary Signal Detection Theory. Oxford University Press, Oxford (2001)

    Book  Google Scholar 

  3. Pandey, S., Prabhakaran, S., Reddy, N.V.S., Acharya, D.: Fake news detection from online media using machine learning classifiers. In: Journal of Physics: Conference Series, vol. 2161, no. 1, p. 012027. IOP Publishing (2022)

    Google Scholar 

  4. Kareem, I., Awan, S.M.: Pakistani media fake news classification using machine learning classifiers. In: 2019 International Conference on Innovative Computing (ICIC), pp. 1–6. IEEE (2019)

    Google Scholar 

  5. Kar, D., Bhardwaj, M., Samanta, S., Azad, A.P.: No rumours please! a multi-indic-lingual approach for COVID fake-tweet detection. In: 2021 Grace Hopper Celebration India (GHCI), pp. 1–5. IEEE (2021)

    Google Scholar 

  6. Lee, J., Devlin, M., Chang, K., Toutanova, K.: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  7. Magueresse, A., Carles, V., Heetderks, E.: Low-resource languages: a review of past work and future challenges. arXiv preprint arXiv:2006.07264 (2020)

  8. Slovikovskaya, V.: Transfer learning from transformers to fake news challenge stance detection (FNC-1) task. arXiv preprint arXiv:1910.14353 (2019)

  9. Kakwani, D., et al.: IndicNLPSuite: monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 4948–4961 (2020)

    Google Scholar 

  10. Saurav, K., Saunack, K., Kanojia, D., Bhattacharyya, P.: A Passage to India: Pre-trained Word Embeddings for Indian Languages. arXiv preprint arXiv:2112.13800 (2021)

  11. Kong, S.H., Tan, L.M., Gan, K.H., Samsudin, N.H.: Fake news detection using deep learning. In: 2020 IEEE 10th Symposium on Computer Applications & Industrial Electronics (ISCAIE), pp. 102–107. IEEE (2020)

    Google Scholar 

  12. Guo, A., Yang, T.: Research and improvement of feature words weight based on TFIDF algorithm. In: 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference, pp. 415–419. IEEE, 2016

    Google Scholar 

  13. Kula, S., Choraś, M., Kozik, R.: Application of the BERT-based architecture in fake news detection. In: Herrero, Á., Cambra, C., Urda, D., Sedano, J., Quintián, H., Corchado, E. (eds.) CISIS 2019. AISC, vol. 1267, pp. 239–249. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-57805-3_23

    Chapter  Google Scholar 

  14. Sommers, J.: On the characteristics of language tags on the web. In: Beverly, R., Smaragdakis, G., Feldmann, A. (eds.) PAM 2018. LNCS, vol. 10771, pp. 18–30. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76481-8_2

    Chapter  Google Scholar 

  15. Nada, F., Khan, B. F., Maryam, A., Zuha, N., Ahmed, Z.: Fake news detection using logistic regression. Int. Res. J. Eng. Technol. (IRJET) 6 (2019). https://www.irjet.net/archives/V6/i5/IRJET-V6I5733.pdf

  16. Koroteev, M.V.: BERT: A review of applications in natural language processing and understanding. arXiv preprint arXiv:2103.11943, 2021

  17. Hirlekar, V.V., Kumar, A.: Natural language processing based online fake news detection challenges-a detailed review. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES), pp. 748–754. IEEE (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajalakshmi Sivanaiah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sivanaiah, R., Ramanathan, N., Hameed, S., Rajagopalan, R., Suseelan, A.D., Thanagathai, M.T.N. (2023). Fake News Detection in Low-Resource Languages. In: M, A.K., et al. Speech and Language Technologies for Low-Resource Languages . SPELLL 2022. Communications in Computer and Information Science, vol 1802. Springer, Cham. https://doi.org/10.1007/978-3-031-33231-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-33231-9_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-33230-2

  • Online ISBN: 978-3-031-33231-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics