Skip to main content

Advertisement

Log in

A knowledge centric hybridized approach for crime classification incorporating deep bi-LSTM neural network

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In recent years, the crime rate has increased considerably and there is a need to properly identify the different types of crimes so that it can be tackled. In this paper, a Bi-LSTM neural network for classification is proposed that classifies the different types of crime on data collected from Google News and Twitter. The data is pre-processed and an initial step of labeling is performed with the help of Fuzzy c-means algorithm and Term Frequency – Inverse Document Frequency vectors. GloVe word embeddings were performed for feature extraction. Dynamically generated ontologies with minimal human supervision using a weighted graph modeled from Google News and Social Web like Twitter has been encompassed in order to enhance the quality of crime classification. The proposed method has proven, after experiments, to achieve evaluation metrics better than the existing methods; evaluated on four different datasets and compared with four different methods with an increase in Accuracy and decrease in FNR for four distinguished datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Abbass Z, Ali Z, Ali M, Akbar B, Saleem A (2020) "a framework to predict social crime through twitter tweets by using machine learning," 2020 IEEE 14th International Conference on Semantic Computing (ICSC), San Diego, CA, USA, 2020, pp. 363–368, https://doi.org/10.1109/ICSC.2020.00073.

  2. Alatrista-Salas H, Morzán-Samamé J, Nunez-del-Prado M (2020). “Crime Alert! Crime Typification in News Based on Text Mining”. In: Arai K., Bhatia R. (eds) Advances in Information and Communication. FICC 2019. Lecture notes in networks and systems, vol 69. Springer, Cham.

  3. Anuar S, Selamat A, Sallehuddin R (2015) “Hybrid artificial neural network with artificial bee Colony algorithm for crime classification”. In: Phon-Amnuaisuk S., au T. (eds) computational intelligence in information systems. Advances in intelligent systems and computing, vol 331. Springer, Cham.

  4. Ashagrie M, Tekli J, Taddesse FG, Chbeir R, Tekli G (2019) Generic metadata representation framework for social-based event detection, description, and linkage. Knowledge-Based Systems 188. https://doi.org/10.1016/j.knosys.2019.06.025

  5. Bhalla A, Pawar RP (2019) Crime in India 2018, National Crime Records Bureau (Ministry of Home Affairs) Government of India

  6. Bhati S, Vikramaditya and Tiwari S, Mandloi J, (2019). “Machine Learning and Deep Learning Integrated Model to Predict, Classify and Analyze Crime in Indore City”. Proceedings of Recent Advances in Interdisciplinary Trends in Engineering & Applications (RAITEA) 2019. Available at SSRN: https://ssrn.com/abstract=3364984 or https://doi.org/10.2139/ssrn.3364984.

  7. Boppuru PR, Ramesha K (2019) Geo-spatial crime analysis using newsfeed data in Indian context. International Journal of Web-Based Learning and Teaching Technologies (IJWLTT) 14(4):49–64. https://doi.org/10.4018/IJWLTT.2019100103

    Article  Google Scholar 

  8. Chen H, Chung W, Xu J, Wang G, Qin Y, Chau M (2004) Crime data mining: a general framework and some examples. IEEE Explore-Computer 37(4):50–56

    Article  Google Scholar 

  9. Das P, Das AK (2020). “Graph-based crime reports clustering using relations extracted from named entities”. In: Behera H., Nayak J., Naik B., Pelusi D. (eds) computational intelligence in data mining. Advances in intelligent systems and computing, vol 990. Springer, Singapore

  10. Das P, Das A, Nayak J, Pelusi D, Ding W (2019) Group incremental adaptive clustering based on neural network and rough set theory for crime report categorization. Neurocomputing. https://doi.org/10.1016/j.neucom.2019.10.109

  11. Fares M, Moufarrej A, Jreij E, Tekli J, Grosky W (2019) Unsupervised word-level affect analysis and propagation in a lexical knowledge graph. Knowl Based Syst 165:432–459

    Article  Google Scholar 

  12. Gerber M (2014) Predicting crime using twitter and kernel density estimation. Decis Support Syst 61. https://doi.org/10.1016/j.dss.2014.02.003

  13. Ghankutkar S, Sarkar N, Gajbhiye P, Yadav S, Kalbande D, Bakereywala N (2019) "modelling machine learning for Analysing crime news", 2019 International Conference on Advances in Computing, Communication and Control (ICAC3), Mumbai, India, pp. 1–5, https://doi.org/10.1109/ICAC347590.2019.9036769.

  14. Hardy J, Bell P, Allan D (2020) A crime script analysis of the Madoff investment scheme. Crime Prev Community Saf 22:68–97

    Article  Google Scholar 

  15. Hochreiter S, Schmidhuber J (1997) Long Short-term Memory. Neural computation 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735

    Article  Google Scholar 

  16. Jurafsky D, Martin J. (2008). Speech and language processing: an introduction to natural language processing, Computational Linguistics, and Speech Recognition

  17. Kumar A, Verma A, Shinde G, Sukhdeve Y, Lal N (2020). "crime prediction using K-nearest neighboring algorithm," 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), Vellore, India, 2020, pp. 1–4, https://doi.org/10.1109/ic-ETITE47903.2020.155

  18. Lal, Sangeeta & Tiwari, Lipika & Ranjan, Ravi & Verma, Ayushi & Sardana, Neetu & Mourya, Rahul. (2020). “Analysis and Classification of Crime Tweets”. Procedia Computer Science. 167. 1911–1919. https://doi.org/10.1016/j.procs.2020.03.211.

  19. Mikolov T, Chen K, Corrado G, Dean J (2013) “Efficient estimation of word representations in vector space”, CoRR (2013) 1–12 abs/ 1301.3781.

  20. Mikolov T, Sutskever I, Chen K, Corrado G, Dean J (2013) “Distributed representations of words and phrases and their compositionality”, in: Proceedings of the 26th International Conference on Neural Information Processing Systems, Se- ries = NIPS’13, Vol. 2, 2013, pp. 3111–3119. abs/ 1310.4546.

  21. Munir K, Anjum MS (2018) The use of ontologies for effective knowledge modelling and information retrieval. Applied Computing and Informatics 14:116–126

    Article  Google Scholar 

  22. Nair S, Soniminde S, Sureshbabu S, Tamhankar A, Kulkarni S, (2019). “Assist Crime Prevention Using Machine Learning”. Proceedings 2019: Conference on Technologies for Future Cities (CTFC).

  23. Noormanshah WMU, Nohuddin PNE, Zainol Z (2020) “Document content analysis based on random Forest algorithm”. In: Peng SL., son L., Suseendran G., Balaganesh D. (eds) intelligent computing and innovation on data science. Lecture notes in networks and systems, vol 118. Springer, Singapore

  24. Pangestuti D, Herdiani A, Selviandro N (2019) “Analysis and implementation of ontology based text classification on criminality digital news”. IOP conference series: materials science and engineering. 662. 022135. https://doi.org/10.1088/1757-899X/662/2/022135.

  25. Pennington J, Socher R, Manning C (2014). “Glove: global vectors for word representation”, in: proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 2014, pp. 1532–1543

  26. Priandini N, Zaman B, Purwanti E (2017). Categorizing document by fuzzy C-Means and K-nearest neighbors approach. AIP Conference Proceedings. 1867. 020012. https://doi.org/10.1063/1.4994415.

  27. Ramasubbareddy S, Aditya Sai Srinivas T, Govinda K, Manivannan SS (2020). Crime prediction system. In: Saini H., Sayal R., Buyya R., Aliseri G. (eds) innovations in computer science and engineering. Lecture notes in networks and systems, vol 103. Springer, Singapore

  28. Saha R, Naskar A, Dasgupta T, and Dey L (2020) “A System for Analysis, Visualization and Retrieval of Crime Documents”. In Proceedings of the 7th ACM IKDD CoDS and 25th COMAD (CoDS COMAD 2020). Association for Computing Machinery, New York, NY, USA, 317–321.

  29. Soleimanian Gharehchopogh F, Haggi S (2020) An optimization K-modes clustering algorithm with elephant herding optimization algorithm for crime clustering. Journal of Advances in Computer Engineering and Technology 6(2):78–87

    Google Scholar 

  30. Sreejith AG, Lansy A, Krishna KSA, Haran VJ, Rakhee M (2020). Crime analysis and prediction using graph mining. In: Ranganathan G., Chen J., Rocha Á. (eds) inventive communication and computational technologies. Lecture notes in networks and systems, vol 89. Springer, Singapore

  31. Sundhara Kumar KB, Bhalaji N. (2020) A Novel Hybrid RNN-ELM Architecture for Crime Classification. In: Smys S., Senjyu T., Lafata P. (eds) Second International Conference on Computer Networks and Communication Technologies. ICCNCT 2019. Lecture notes on data engineering and communications technologies, vol 44. Springer, Cham

  32. Thilagam P, Karur S (2019) Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers. Information Processing & Management:56. https://doi.org/10.1016/j.ipm.2019.102059

  33. Wang P, Yu F, Niu S, Yang Z, Zhang Y, Guo J. 2019. Hierarchical matching network for crime classification. In proceedings of the 42nd international ACM SIGIR conference on Research and Development in information retrieval (SIGIR’19). Association for Computing Machinery, New York, NY, USA, 325–334.

  34. Wang M, Cai Q, Wang L, Li J, Wang X. (2020) "Chinese news text classification based on attention-based CNN-BiLSTM", proc. SPIE 11430, MIPPR 2019: pattern recognition and computer vision

  35. Zaidi NAS, Mustapha A, Mostafa SA, Razali MN (2020) “A classification approach for crime prediction”. Communications in Computer and Information Science, 68–78.

  36. Zhang Z, Huang J, Hao J et al (2020) Extracting relations of crime rates through fuzzy association rules mining. Appl Intell 50:448–467

    Article  Google Scholar 

  37. Haoxi Zhong, Guo Zhipeng, Cunchao Tu, Chaojun Xiao, Zhiyuan Liu, and Maosong Sun. 2018. Legal Judgment Prediction via Topological Learning. In Proceedings of the 2018 Conference on empirical methods in natural language processing. Association for Computational Linguistics, 3540–3549.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gerard Deepak.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Deepak, G., Rooban, S. & Santhanavijayan, A. A knowledge centric hybridized approach for crime classification incorporating deep bi-LSTM neural network. Multimed Tools Appl 80, 28061–28085 (2021). https://doi.org/10.1007/s11042-021-11050-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11050-4

Keywords

Navigation