Protecting Against Malicious Code Injection in Reviews on Web Applications
Nivetha. V1, Shalini. S2, Deepa. R3

1V. Nivetha*, P.G Graduate, Department of Computer Science Engineering, Prince Dr. K. .Vasudevan College of Engineering and Technology, Ponmar, Chennai, India.
2S. Shalini, Assistant Professor, Department of Computer Science Engineering, Prince Dr. K.. Vasudevan College of Engineering and Technology, Ponmar, Chennai, India.
3R. Deepa, M.E, Head of Department of Computer Science Engineering, Prince Dr. K.. Vasudevan College of Engineering and Technology, Ponmar, Chennai, India.
Manuscript received on December 11, 2021. | Revised Manuscript received on January 17, 2022. | Manuscript published on January 30, 2022. | PP: 13-17 | Volume-11, Issue-3, January 2022 | Retrieval Number: 100.1/ijitee.C97200111322 | DOI: 10.35940/ijitee.C9720.0111322
Open Access | Ethics and  Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Malicious code injection is done by the attackers or hackers mainly in reviews, these are the fake user identities that are created by the attackers, through sending continuous links to some user till the user clicks on that link. When user clicks the link, that particular users identity will be stolen by the attackers which is done like the phishing, generally these reviews which are maliciously injected are mixed with the original users review of the product in a website. To identify these malicious injection attacked reviews, in this paper the Naive Bayes Classifier (NBC) algorithm is used. Then to eliminate the unwanted disturbed data in the website the Natural Language Processing technique is used. The Natural Language Processing (NLP) technique, which removes unwanted data by understanding the text or words in review. Then the length of the reviews in the given sample dataset is reduced for easy understanding by using the Principal Component Analysis (PCA) algorithm which reduces the dimensionality of the reviews. Then user input review is compared with the sample dataset and classified as good review and bad review and also detected as malicious or not by using the Naïve Bayes Classifier algorithm which is used for the classification of the objects. 
Keywords: Malicious, Naïve Bayes Classifier, Natural Language Processing, Principal Component Analysis
Scope of the Article: Computer Science and Engineering.