Abstract
In recent years, the crime rate has increased considerably and there is a need to properly identify the different types of crimes so that it can be tackled. In this paper, a Bi-LSTM neural network for classification is proposed that classifies the different types of crime on data collected from Google News and Twitter. The data is pre-processed and an initial step of labeling is performed with the help of Fuzzy c-means algorithm and Term Frequency – Inverse Document Frequency vectors. GloVe word embeddings were performed for feature extraction. Dynamically generated ontologies with minimal human supervision using a weighted graph modeled from Google News and Social Web like Twitter has been encompassed in order to enhance the quality of crime classification. The proposed method has proven, after experiments, to achieve evaluation metrics better than the existing methods; evaluated on four different datasets and compared with four different methods with an increase in Accuracy and decrease in FNR for four distinguished datasets.
Similar content being viewed by others
References
Abbass Z, Ali Z, Ali M, Akbar B, Saleem A (2020) "a framework to predict social crime through twitter tweets by using machine learning," 2020 IEEE 14th International Conference on Semantic Computing (ICSC), San Diego, CA, USA, 2020, pp. 363–368, https://doi.org/10.1109/ICSC.2020.00073.
Alatrista-Salas H, Morzán-Samamé J, Nunez-del-Prado M (2020). “Crime Alert! Crime Typification in News Based on Text Mining”. In: Arai K., Bhatia R. (eds) Advances in Information and Communication. FICC 2019. Lecture notes in networks and systems, vol 69. Springer, Cham.
Anuar S, Selamat A, Sallehuddin R (2015) “Hybrid artificial neural network with artificial bee Colony algorithm for crime classification”. In: Phon-Amnuaisuk S., au T. (eds) computational intelligence in information systems. Advances in intelligent systems and computing, vol 331. Springer, Cham.
Ashagrie M, Tekli J, Taddesse FG, Chbeir R, Tekli G (2019) Generic metadata representation framework for social-based event detection, description, and linkage. Knowledge-Based Systems 188. https://doi.org/10.1016/j.knosys.2019.06.025
Bhalla A, Pawar RP (2019) Crime in India 2018, National Crime Records Bureau (Ministry of Home Affairs) Government of India
Bhati S, Vikramaditya and Tiwari S, Mandloi J, (2019). “Machine Learning and Deep Learning Integrated Model to Predict, Classify and Analyze Crime in Indore City”. Proceedings of Recent Advances in Interdisciplinary Trends in Engineering & Applications (RAITEA) 2019. Available at SSRN: https://ssrn.com/abstract=3364984 or https://doi.org/10.2139/ssrn.3364984.
Boppuru PR, Ramesha K (2019) Geo-spatial crime analysis using newsfeed data in Indian context. International Journal of Web-Based Learning and Teaching Technologies (IJWLTT) 14(4):49–64. https://doi.org/10.4018/IJWLTT.2019100103
Chen H, Chung W, Xu J, Wang G, Qin Y, Chau M (2004) Crime data mining: a general framework and some examples. IEEE Explore-Computer 37(4):50–56
Das P, Das AK (2020). “Graph-based crime reports clustering using relations extracted from named entities”. In: Behera H., Nayak J., Naik B., Pelusi D. (eds) computational intelligence in data mining. Advances in intelligent systems and computing, vol 990. Springer, Singapore
Das P, Das A, Nayak J, Pelusi D, Ding W (2019) Group incremental adaptive clustering based on neural network and rough set theory for crime report categorization. Neurocomputing. https://doi.org/10.1016/j.neucom.2019.10.109
Fares M, Moufarrej A, Jreij E, Tekli J, Grosky W (2019) Unsupervised word-level affect analysis and propagation in a lexical knowledge graph. Knowl Based Syst 165:432–459
Gerber M (2014) Predicting crime using twitter and kernel density estimation. Decis Support Syst 61. https://doi.org/10.1016/j.dss.2014.02.003
Ghankutkar S, Sarkar N, Gajbhiye P, Yadav S, Kalbande D, Bakereywala N (2019) "modelling machine learning for Analysing crime news", 2019 International Conference on Advances in Computing, Communication and Control (ICAC3), Mumbai, India, pp. 1–5, https://doi.org/10.1109/ICAC347590.2019.9036769.
Hardy J, Bell P, Allan D (2020) A crime script analysis of the Madoff investment scheme. Crime Prev Community Saf 22:68–97
Hochreiter S, Schmidhuber J (1997) Long Short-term Memory. Neural computation 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Jurafsky D, Martin J. (2008). Speech and language processing: an introduction to natural language processing, Computational Linguistics, and Speech Recognition
Kumar A, Verma A, Shinde G, Sukhdeve Y, Lal N (2020). "crime prediction using K-nearest neighboring algorithm," 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), Vellore, India, 2020, pp. 1–4, https://doi.org/10.1109/ic-ETITE47903.2020.155
Lal, Sangeeta & Tiwari, Lipika & Ranjan, Ravi & Verma, Ayushi & Sardana, Neetu & Mourya, Rahul. (2020). “Analysis and Classification of Crime Tweets”. Procedia Computer Science. 167. 1911–1919. https://doi.org/10.1016/j.procs.2020.03.211.
Mikolov T, Chen K, Corrado G, Dean J (2013) “Efficient estimation of word representations in vector space”, CoRR (2013) 1–12 abs/ 1301.3781.
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J (2013) “Distributed representations of words and phrases and their compositionality”, in: Proceedings of the 26th International Conference on Neural Information Processing Systems, Se- ries = NIPS’13, Vol. 2, 2013, pp. 3111–3119. abs/ 1310.4546.
Munir K, Anjum MS (2018) The use of ontologies for effective knowledge modelling and information retrieval. Applied Computing and Informatics 14:116–126
Nair S, Soniminde S, Sureshbabu S, Tamhankar A, Kulkarni S, (2019). “Assist Crime Prevention Using Machine Learning”. Proceedings 2019: Conference on Technologies for Future Cities (CTFC).
Noormanshah WMU, Nohuddin PNE, Zainol Z (2020) “Document content analysis based on random Forest algorithm”. In: Peng SL., son L., Suseendran G., Balaganesh D. (eds) intelligent computing and innovation on data science. Lecture notes in networks and systems, vol 118. Springer, Singapore
Pangestuti D, Herdiani A, Selviandro N (2019) “Analysis and implementation of ontology based text classification on criminality digital news”. IOP conference series: materials science and engineering. 662. 022135. https://doi.org/10.1088/1757-899X/662/2/022135.
Pennington J, Socher R, Manning C (2014). “Glove: global vectors for word representation”, in: proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 2014, pp. 1532–1543
Priandini N, Zaman B, Purwanti E (2017). Categorizing document by fuzzy C-Means and K-nearest neighbors approach. AIP Conference Proceedings. 1867. 020012. https://doi.org/10.1063/1.4994415.
Ramasubbareddy S, Aditya Sai Srinivas T, Govinda K, Manivannan SS (2020). Crime prediction system. In: Saini H., Sayal R., Buyya R., Aliseri G. (eds) innovations in computer science and engineering. Lecture notes in networks and systems, vol 103. Springer, Singapore
Saha R, Naskar A, Dasgupta T, and Dey L (2020) “A System for Analysis, Visualization and Retrieval of Crime Documents”. In Proceedings of the 7th ACM IKDD CoDS and 25th COMAD (CoDS COMAD 2020). Association for Computing Machinery, New York, NY, USA, 317–321.
Soleimanian Gharehchopogh F, Haggi S (2020) An optimization K-modes clustering algorithm with elephant herding optimization algorithm for crime clustering. Journal of Advances in Computer Engineering and Technology 6(2):78–87
Sreejith AG, Lansy A, Krishna KSA, Haran VJ, Rakhee M (2020). Crime analysis and prediction using graph mining. In: Ranganathan G., Chen J., Rocha Á. (eds) inventive communication and computational technologies. Lecture notes in networks and systems, vol 89. Springer, Singapore
Sundhara Kumar KB, Bhalaji N. (2020) A Novel Hybrid RNN-ELM Architecture for Crime Classification. In: Smys S., Senjyu T., Lafata P. (eds) Second International Conference on Computer Networks and Communication Technologies. ICCNCT 2019. Lecture notes on data engineering and communications technologies, vol 44. Springer, Cham
Thilagam P, Karur S (2019) Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers. Information Processing & Management:56. https://doi.org/10.1016/j.ipm.2019.102059
Wang P, Yu F, Niu S, Yang Z, Zhang Y, Guo J. 2019. Hierarchical matching network for crime classification. In proceedings of the 42nd international ACM SIGIR conference on Research and Development in information retrieval (SIGIR’19). Association for Computing Machinery, New York, NY, USA, 325–334.
Wang M, Cai Q, Wang L, Li J, Wang X. (2020) "Chinese news text classification based on attention-based CNN-BiLSTM", proc. SPIE 11430, MIPPR 2019: pattern recognition and computer vision
Zaidi NAS, Mustapha A, Mostafa SA, Razali MN (2020) “A classification approach for crime prediction”. Communications in Computer and Information Science, 68–78.
Zhang Z, Huang J, Hao J et al (2020) Extracting relations of crime rates through fuzzy association rules mining. Appl Intell 50:448–467
Haoxi Zhong, Guo Zhipeng, Cunchao Tu, Chaojun Xiao, Zhiyuan Liu, and Maosong Sun. 2018. Legal Judgment Prediction via Topological Learning. In Proceedings of the 2018 Conference on empirical methods in natural language processing. Association for Computational Linguistics, 3540–3549.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Deepak, G., Rooban, S. & Santhanavijayan, A. A knowledge centric hybridized approach for crime classification incorporating deep bi-LSTM neural network. Multimed Tools Appl 80, 28061–28085 (2021). https://doi.org/10.1007/s11042-021-11050-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11050-4