Spam Classification: Genetically Optimized Passive-Aggressive Approach

Naravajhula, Priyatam; Naravajula, Alekhya

doi:10.1007/s42979-022-01517-y

Original Research
Published: 17 December 2022

Volume 4, article number 93 (2023)
SN Computer Science

Priyatam Naravajhula¹ &
Alekhya Naravajula²

Abstract

The growth of data has brought a sharp rise in the volume of messages sent for business purposes, making spam classification a matter of paramount importance. This paper proposes a novel approach to spam classification that applies algorithms from the passive-aggressive family with genetic optimization. It discusses the application of this online learning algorithm to spam classification and presents a comparative study against existing approaches. The results demonstrate the robustness of the selected algorithm and include a study of the effect of hyperparameters on classification. The datasets used are a public SMS spam dataset, a spam review dataset, and a Twitter spam dataset; 80% of each dataset was used for training and 20% for testing. The proposed algorithm outperforms standard benchmark algorithms in terms of accuracy, precision, and recall.
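
As a rough illustration of the idea summarized in the abstract, the sketch below trains a passive-aggressive classifier and tunes its aggressiveness parameter C with a small genetic search. It is not the authors' implementation: the PA-I update rule (Crammer et al., 2006), the bag-of-words features, the toy messages, the genetic operators, and all function names are assumptions made for illustration, since this preview does not specify them.

```python
import random
from collections import Counter

def featurize(text):
    """Bag-of-words term counts (assumed features, not the paper's)."""
    return Counter(text.lower().split())

def dot(w, x):
    return sum(w.get(term, 0.0) * value for term, value in x.items())

def train_pa(samples, C, epochs=5):
    """PA-I online updates; labels are +1 (spam) or -1 (ham)."""
    w = {}
    for _ in range(epochs):
        for x, y in samples:
            loss = max(0.0, 1.0 - y * dot(w, x))      # hinge loss
            norm_sq = sum(v * v for v in x.values())
            if loss == 0.0 or norm_sq == 0:           # passive: margin already satisfied
                continue
            tau = min(C, loss / norm_sq)              # aggressive step, capped by C
            for term, value in x.items():
                w[term] = w.get(term, 0.0) + tau * y * value
    return w

def accuracy(w, samples):
    hits = sum(1 for x, y in samples if (1 if dot(w, x) >= 0 else -1) == y)
    return hits / len(samples)

def genetic_search_C(fit, valid, pop_size=8, generations=5, seed=0):
    """Evolve log10(C) in [-3, 1]; fitness is validation accuracy."""
    rng = random.Random(seed)
    pop = [rng.uniform(-3.0, 1.0) for _ in range(pop_size)]

    def fitness(gene):
        return accuracy(train_pa(fit, 10 ** gene), valid)

    for _ in range(generations):
        parents = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]  # truncation selection
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = (a + b) / 2 + rng.gauss(0.0, 0.3)  # averaging crossover + Gaussian mutation
            children.append(min(1.0, max(-3.0, child)))
        pop = parents + children
    return 10 ** max(pop, key=fitness)

# Toy messages, invented for this sketch only.
spam = ["win a free prize now", "free cash claim now", "urgent claim your free reward",
        "cheap loans apply now", "you won a free holiday", "claim free cash prize today",
        "free entry win cash", "urgent offer cheap pills", "win money now call",
        "exclusive free offer click now"]
ham = ["are we meeting for lunch today", "see you at the office", "call me when you get home",
       "the report is due tomorrow", "thanks for the help yesterday", "can you send the notes",
       "dinner at eight tonight", "happy birthday have a great day",
       "running late see you soon", "please review the attached draft"]
data = [(featurize(t), 1) for t in spam] + [(featurize(t), -1) for t in ham]

random.Random(42).shuffle(data)
split = int(0.8 * len(data))                # 80% training, 20% testing
train, test = data[:split], data[split:]
fit, valid = train[:-4], train[-4:]         # hold out part of training for GA fitness

best_C = genetic_search_C(fit, valid)
test_acc = accuracy(train_pa(train, best_C), test)
print(f"best C = {best_C:.4f}, test accuracy = {test_acc:.2f}")
```

Fitness is measured on a validation slice carved out of the training portion so that the 20% test split is touched only once, after the search finishes; whether the paper proceeds this way is not stated in the preview.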

Data availability

All the data used in the paper are publicly available on Kaggle.

Funding

No funding.

Author information

Authors and Affiliations

CSE, Chaitanya Bharathi Institute of Technology, Gandipet, Hyderabad, Telangana, 500075, India
Priyatam Naravajhula
CSE, Vasavi College of Engineering, Ibrahimbagh, Hyderabad, Telangana, 500089, India
Alekhya Naravajula

Corresponding author

Correspondence to Alekhya Naravajula.

Ethics declarations

Conflict of interest

The authors declare that no conflict of interest exists.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Naravajhula, P., Naravajula, A. Spam Classification: Genetically Optimized Passive-Aggressive Approach. SN COMPUT. SCI. 4, 93 (2023). https://doi.org/10.1007/s42979-022-01517-y

Received: 07 April 2021
Accepted: 19 November 2022
Published: 17 December 2022
DOI: https://doi.org/10.1007/s42979-022-01517-y
