Skip to main content
Log in

A comprehensive survey on feature selection in the various fields of machine learning

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

In Machine Learning (ML), Feature Selection (FS) plays a crucial part in reducing data’s dimensionality and enhancing any proposed framework’s performance. However, in real-world applications, FS work suffers from high dimensionality, computational and storage complexity, noisy or ambiguous nature, high performance, etc. The area of FS is very vast and challenging in its nature. There are lots of work that have been reported on FS over the various area of applications. This paper has discussed FS’s framework and the multiple models of FS with detailed descriptions. We have also classified the various FS algorithms with respect to the data, i.e., structured or labeled data and unstructured data for the different applications of ML. We have also discussed what essential features are, the commonly used FS methods, the widely used datasets, and the widely used work done in the various ML fields for the FS task. Here we try to view the multiple comparison experimental results of FS work in different result discussions. This paper draws a descriptive survey on FS with the associated area of real-world problem domains. This paper’s main objective is to understand the main idea of FS work and identify the core idea of how FS will be applicable in various problem domains.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Sahu B, Dehuri S, Jagadev A (2018) A Study on the Relevance of Feature Selection Methods in Microarray Data. Open Bioinform J Bentham Open 11:117–139

    Article  Google Scholar 

  2. Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, Liu H (2017) Feature Selection: A Data Perspective. ACM Comput Surv 50

  3. Liu H, Motoda H (1998) Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, New York

  4. Agarwal S, Dhyani A, Ranjan P (2019) Newton’s second law based PSO for feature selection: Newtonian PSO. J Intell Fuzzy Syst 37:4923–4935

    Article  Google Scholar 

  5. Narendra PM, Fukunaga K (1977) A branch and bound algorithm for feature subset selection. IEEE Trans Comput C-26:917–922

    Article  MATH  Google Scholar 

  6. Khammassi C, Krichen S (2017) A GA-LR wrapper approach for feature selection in network intrusion detection. Comput Secur 70:255–277

    Article  Google Scholar 

  7. Brassard G, Bratley P (1996) Feature Selection for Knowledge Discovery and Data Mining. Prentice Hall, New Jersey

  8. Ververidis D, Kotropoulos C (2008) Fast and accurate sequential floating forward feature selection with the Bayes classifier applied to speech emotion recognition. Signal Process 88:2956–2970

    Article  MATH  Google Scholar 

  9. Sharan RV, Moir TJ (2018) Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition. Appl Acoust 140:198–204

    Article  Google Scholar 

  10. Schumer M, Steiglitz K. (1968) Adaptive step size random search. IEEE Trans Autom Control 13:270–276

    Article  Google Scholar 

  11. Coetzee FM (2005) Correcting the Kullback-Leibler distance for feature selection. Pattern Recogn Lett 26:1675–1683

    Article  Google Scholar 

  12. Zhang J, Zhang J (2018) An Analysis of CNN Feature Extractor Based on KL Divergence. Int J Image Graph World Sci Publish Company 18

  13. Lei S (2012) A Feature Selection Method Based on Information Gain and Genetic Algorithm. 2012 International Conference on Computer Science and Electronics Engineering, IEEE

  14. Lee C, Lee GG (2006) Information gain and divergence based feature selection for machine learning based text categorization. Inf Process Manag 42:155–165

    Article  Google Scholar 

  15. Yamada M, Jitkrittum W, Sigal L, Xing EP, Sugiyama M (2014) High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso. Neural Comput 26:185–207

    Article  MathSciNet  MATH  Google Scholar 

  16. He X, Li L, Liu Y, Yu X, Meng J (2017) A Two-Stage Biomedical Event Trigger Detection Method Integrating Feature Selection and Word Embeddings. IEEE/ACM Trans Comput Biol Bioinform 15:1325–1332

    Article  Google Scholar 

  17. Liu X, Ma L, Song L, Zhao Y, Zhao X, Zhou C (2014) Recognizing Common CT Imaging Signs of Lung Diseases through a New Feature Selection Method based on Fisher Criterion and Genetic Optimization. IEEE J Biomed Health Inform 19:635–647

    Article  Google Scholar 

  18. Vijayanand R, Devaraj D, Kannapiran B (2018) Intrusion detection system for wireless mesh network using multiple support vector machine classifiers with genetic-algorithm-based feature selection. Comput Secur 77:304–314

    Article  Google Scholar 

  19. Li B, Meng MQ-H (2012) Tumor Recognition in Wireless Capsule Endoscopy Images Using Textural Features and SVM-Based Feature Selection. IEEE Trans Inf Technol Biomed 16:323–329

    Article  Google Scholar 

  20. Saari P, Eerola T, Lartillot O (2010) Generalizability and Simplicity as Criteria in Feature Selection: Application to Mood Classification in Music. IEEE Trans Audio Speech Lang Process 19:1802–1812

    Article  Google Scholar 

  21. Mirzaei S, ElYacoubi M., Garcia-Salicetti S., Boudy J., Kahindo C., Cristancho-Lacroix V., Kerhervé H., Rigaud A-S (2018) Two-Stage Feature Selection of Voice Parameters for Early Alzheimer’s Disease Prediction. IRBM 39:430–435

    Article  Google Scholar 

  22. Novovicova J., Pudil P., Kittler J. (1996) Divergence based feature selection for multimodal class densities. IEEE Trans Pattern Anal Mach Intell 18:218–223

    Article  Google Scholar 

  23. Zhang Y, Li S, Wang T, Zhang Z (2013) Divergence based feature selection for separate classes. Neurocomputing 101:32–42

    Article  Google Scholar 

  24. Wang T, Li W (2017) Kernel learning and optimization with Hilbert-Schmidt independence criterion. Int J Mach Learn Cybern 9:1707–1717

    Article  Google Scholar 

  25. Song L, Smola A, Gretton A, Bedo J, Borgwardt K (2012) Feature Selection via Dependence Maximization. J Mach Learn Res 3:1393–1434

    MathSciNet  MATH  Google Scholar 

  26. Song L, Smola A, Gretton A, Borgwardt KM, Bedo J (2007) Supervised feature selection via dependence estimation. ICML ’07: Proceedings of the 24th international conference on Machine learning, pp 823–830

  27. Liu H, Setiono R (1996) A probabilistic approach to feature selection - a filter solution ICML’96: Proceedings of the Thirteenth International Conference on International Conference on Machine Learning, pp 319–327

  28. Dash M, Liu H (2003) Consistency based search in feature selection. Artif Intell 151:155–176

    Article  MathSciNet  MATH  Google Scholar 

  29. Ötürk O, Aksaç A, Elsheikh A, Özyer T, Alhajj R (2013) A Consistency Based Feature Selection Method Allied with Linear SVMs for HIV-1 Protease Cleavage Site Prediction PLOS ONE 8

  30. Neumann J, Schnörr C, Steidl G (2005) Combined SVM-Based Feature Selection and Classification. Mach Learn 61:129–150

    Article  MATH  Google Scholar 

  31. Bahassine S, Madani A, Al-Sarem M, Kissi M (2020) Feature selection using an improved Chi-square for Arabic text classification. J King Saud Univ Comput Inf Sci 32:225–231

    Google Scholar 

  32. Huffman C, Sobral H, Hinojosa ET (2019) Laser-induced breakdown spectroscopy spectral feature selection to enhance classification capabilities: A t-test filter approach. Spectroch Acta Part B: Atom Spectrosc 162

  33. Ravisankar P., Ravi V., Raghava Rao G., Bose I (2011) Detection of financial statement fraud and feature selection using data mining techniques. Decis Support Syst 50:491–500

    Article  Google Scholar 

  34. Elssied NOF, Ibrahim O, Osman AH (2014) A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification. Res J Appl Sci Eng Technol 7:625–638

    Article  Google Scholar 

  35. Tang X, Dai Y, Xiang Y (2019) Feature selection based on feature interactions with application to text categorization. Expert Syst Appl 120:207–216

    Article  Google Scholar 

  36. Azam N, Yao JT (2012) Comparison of term frequency and document frequency based feature selection metrics in text categorization. Expert Syst Appl 39:4760–4768

    Article  Google Scholar 

  37. Mesut Togaçar M, Ergen B, Cömert Z (2020) Detection of lung cancer on chest CT images using minimum redundancy maximum relevance feature selection method with convolutional neural networks. Biocybern Biomed Eng 40:23–39

    Article  Google Scholar 

  38. Shao W, Liu M, Xu Y-Y, Shen H-B, Zhang D (2017) An Organelle Correlation-Guided Feature Selection Approach for Classifying Multi-Label Subcellular Bio-Images. IEEE/ACM Trans Comput Biol Bioinform 15:828–838

    Article  Google Scholar 

  39. Song QJ, Jiang H, Liu J (2017) Feature selection based on FDA and F-score for multi-class classification. Expert Syst Appl 81:22–27

    Article  Google Scholar 

  40. Liu Z-T, Wu M, Cao W-H, Mao J-W, Xu J-P, Tan G-Z (2018) Speech emotion recognition based on feature selection and extreme learning machine decision tree. Neurocomputing 273:271–280

    Article  Google Scholar 

  41. Lu W, Li Z, Chu J (2017) A novel computer-aided diagnosis system for breast MRI based on feature selection and ensemble learning. Comput Biol Med 83:157–165

    Article  Google Scholar 

  42. Zini L, Noceti N, Fusco G, Odone F (2015) Structured multi-class feature selection with an application to face recognition. Pattern Recogn Lett 55:35–41

    Article  Google Scholar 

  43. Kushwaha N, Pant M (2018) Link based BPSO for feature selection in big data text clustering. Fut Gener Comput Syst 82:190–199

    Article  Google Scholar 

  44. Wang H, Jing K, Niu B (2017) A Discrete Bacterial Algorithm for Feature Selection in Classification of Microarray Gene Expression Cancer Data. Knowl Based Syst 126:8–19

    Article  Google Scholar 

  45. Wang H, Jing X, Niu B (2016) Bacterial-inspired feature selection algorithm and its application in fault diagnosis of complex structures. In: 2016 IEEE Congress on Evolutionary Computation (CEC), pp 3809–3816

  46. Kanan HR, Faez K (2008) An improved feature selection method based on ant colony optimization (ACO) evaluated on face recognition system. Appl Math Comput 205:716–725

    MATH  Google Scholar 

  47. http://csse.szu.edu.cn/staff/zhuzx/Datasets.html, Accessed: 2020-12-21

  48. http://sdmc.i2r.a-star.edu.sg/GEDatasets/, Accessed: 2020-12-21

  49. Wang M, Barbu A (2019) Are screening methods useful in feature selection? an empirical study. PLOS ONE 14(9):1–15

    Article  Google Scholar 

  50. Dhal P, Azad C (2021) A multi-objective feature selection method using newton’s law based pso with gwo. Appl Soft Comput 107:107394

    Article  Google Scholar 

  51. https://www.kaggle.com/noaa/hurricane-database, Accessed: 2020-12-21

  52. http://www.bom.gov.au/climate/data, Accessed: 2020-12-21

  53. Wang Z, Wang T, Wan B, Han M (2020) Partial classifier chains with feature selection by exploiting label correlation in multi-label classification. Entropy 22(10)

  54. Al-Tashi Q, Abdulkadir S J, Rais H M, Mirjalili S, Alhussian H, Ragab M G, Alqushaibi A (2020) Binary multi-objective grey wolf optimizer for feature selection in classification. IEEE Access 8:106247–106263

    Article  Google Scholar 

  55. Pilnenskiy N, Smetannikov I (2020) Feature selection algorithms as one of the python data analytical tools. Fut Internet 12(3)

  56. Sakkis G, Androutsopoulos I, Paliouras G, Karkaletsis V, Spyropoulos C D, Stamatopoulos P (2001) Stacking classifiers for anti-spam filtering of e-mail. CoRR, arXiv:cs.CL/0106040

  57. Sha Z-C, Liu Z-M, Ma C, Chen J (2021) Feature selection for multi-label classification by maximizing full-dimensional conditional mutual information. Appl Intell 51:326–340

    Article  Google Scholar 

  58. Aich S, Al-Absi A A, Lee Hui K, Sain M (2019) Prediction of quality for different type of wine based on different feature sets using supervised machine learning techniques. In: 2019 21st International Conference on Advanced Communication Technology (ICACT), pp 1122–1127

  59. Neves C (2014) Categorical data analysis, third edition. J Appl Stat 41(4):915–916. https://doi.org/10.1080/02664763.2013.854979

    Article  MATH  Google Scholar 

  60. Maxwell J A (2010) Using numbers in qualitative research. Qual Inq 16(6):475–482

    Article  Google Scholar 

  61. Wahid A, Khan DM, Iqbal N, Khan SA, Ali A, Khan M, Khan Z (2020) Feature Selection and Classification for Gene Expression Data Using Novel Correlation Based Overlapping Score Method via Chou’s 5-Steps Rule. Chemometr Intell Labor Syst 199

  62. Potharaju SP, Sreedevi M (2019) Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance. Clin Epidemiol Glob Health 7:171–176

    Article  Google Scholar 

  63. Sun L, Zhang X, Qian Y, Xu J, Zhang S (2019) Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification. Inf Sci 502:18–41

    Article  MathSciNet  MATH  Google Scholar 

  64. Arora N, Kaur PD (2020) A Bolasso based consistent feature selection enabled random forest classification algorithm: An application to credit risk assessment. Appl Soft Comput 86

  65. Jain I, Jain VK, Jain R (2018) Correlation Feature Selection based improved-Binary Particle Swarm Optimization for Gene Selection and Cancer Classification. Appl Soft Comput 62:203–215

    Article  Google Scholar 

  66. Moon S-H, Kim Y-H (2020) An improved forecast of precipitation type using correlation based feature selection and multinomial logistic regression. Atmosph Res 40

  67. Pooja S.B, Siva Balan R.V., Anisha M, Muthukumaran M.S, Jothikumar R (2020) Techniques Tanimoto correlated feature selection system and hybridization of clustering and boosting ensemble classification of remote sensed big data for weather forecasting. Comput Commun 151:266–274

    Article  Google Scholar 

  68. Wang H, Tan L, Niu B (2019) Feature selection for classification of microarray gene expression cancers using Bacterial Colony Optimization with multi-dimensional population. Swarm Evol Comput 48:172–181

    Article  Google Scholar 

  69. Urbanowicz R J, Meeker M, La Cava W, Olson R S, Moore J H (2018) Relief-based feature selection: Introduction and review. J Biomed Inform 85:189–203

    Article  Google Scholar 

  70. Jia J, Yang N, Zhang C, Yue A, Yang J, Zhu D (2013) Object-oriented feature selection of high spatial resolution images using an improved relief algorithm. Math Comput Model 58(3):619–626. Computer and Computing Technologies in Agriculture 2011 and Computer and Computing Technologies in Agriculture 2012

    Article  Google Scholar 

  71. Xue B, Zhang M, Browne W N, Yao X (2016) A survey on evolutionary computation approaches to feature selection. IEEE Trans Evol Comput 20(4):606–626

    Article  Google Scholar 

  72. Palma-Mendoza R-J, de Marcos L, Rodriguez D, Alonso-Betanzos A (2019) Distributed correlation-based feature selection in spark. Inf Sci 496:287–299

    Article  Google Scholar 

  73. Chormunge S, Jena S (2018) Correlation based feature selection with clustering for high dimensional data. J Electr Syst Inf Technol 5(3):542–549

    Article  Google Scholar 

  74. https://www.yelp.com/dataset, Accessed: 2020-12-22

  75. https://github.com/cjhutto/vaderSentiment, Accessed: 2020-12-22

  76. http://www.cs.jhu.edu/~mdredze/datasets/sentiment/, Accessed: 2020-12-22

  77. http://www.cs.cornell.edu/people/pabo/movie-review-data/, Accessed: 2020-12-22

  78. https://alt.qcri.org/semeval2014/task4/, Accessed: 2020-12-22

  79. http://www.cad.zju.edu.cn/home/dengcai/Data/TextData.html, Accessed: 2020-12-21

  80. http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/, Accessed: 2020-12-21

  81. http://qwone.com/~jason/20Newsgroups/, Accessed: 2020-12-21

  82. http://gabrilovich.com/resources/data/techtc/techtc100/, Accessed: 2020-12-21

  83. Lloret E, Romá-Ferri M T, Palomar M (2013) Compendium: A text summarization system for generating abstracts of research papers. Data Knowl Eng 88:164–175

    Article  Google Scholar 

  84. Erkan G, Radev D R (2004) Lexrank: Graph-based lexical centrality as salience in text summarization. J Artif Int Res 22(1):457–479

    Google Scholar 

  85. Saranyamol C S, Sindhu L (2014) A Survey on Automatic Text Summarization. Int J Comput Sci Inf Technol 5

  86. Gupta V, Lehal GS (2010) A Survey of Text Summarization Extractive Techniques. J Emerg Technol Web Intell 2:258–268

    Google Scholar 

  87. Akhtar M S, Gupta D, Ekbal A, Bhattacharyya P (2017) Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis. Knowl-Based Syst 125:116–135

    Article  Google Scholar 

  88. Hatzivassiloglou V, McKeown K R (1997) Predicting the semantic orientation of adjectives. In: 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, Madrid, pp 174–181

  89. Miller G A (1995) Wordnet: A lexical database for english. Commun ACM 38(11):39–41

    Article  Google Scholar 

  90. Gupta V K, Siddiqui T J (2012) Multi-document summarization using sentence clustering. In: 2012 4th International Conference on Intelligent Human Computer Interaction (IHCI), pp 1–5

  91. El-Kassas W S, Salama C R, Rafea A A, Mohamed H K (2021) Automatic text summarization: A comprehensive survey. Expert Syst Appl 165:113679

    Article  Google Scholar 

  92. Toh Z, Wang W (2014) Dlirec: Aspect term extraction and term polarity classification system. In: SemEval@COLING

  93. Bharti KK, Singh PK (2016) Opposition chaotic fitness mutation based adaptive inertia weightBPSO for feature selection in text clustering. Appl Soft Comput 43:20–34

    Article  Google Scholar 

  94. Baccianella S, Esuli A, Sebastiani F (2013) Using micro-documents for feature selection: The case of ordinal text classification. Expert Syst Appl 40:4687–4696

    Article  Google Scholar 

  95. Yousefpour A, Ibrahim R, Hamed HNA (2017) Ordinal-based and Frequency-based Integration of Feature Selection Methods for Sentiment Analysis. Expert Syst Appl 75:80–93

    Article  Google Scholar 

  96. Sundararaman A, Ramanathan SV, Thati R (2018) Novel Approach to Predict Hospital Readmissions Using Feature Selection from Unstructured Data with Class Imbalance. Big Data Res 13:65–75

    Article  Google Scholar 

  97. Manochandar S., Punniyamoorthy M. (2018) Scaling Feature Selection Method for Enhancing the Classification Performance of Support Vector Machines in Text Mining. Comput Ind Eng 124:139–156

    Article  Google Scholar 

  98. Al-Salemi B, Ayob M, Noah SAM (2018) Feature ranking for enhancing boosting-based multi-label text categorization. Expert Syst Appl 113:531–543

    Article  Google Scholar 

  99. Yarlagadda M, Rao KG, Srikrishna A (2019) Frequent itemset-based feature selection and Rider Moth Search Algorithm for document clustering. Journal of King Saud University - Computer and Information Sciences

  100. Rehman A, Javed K, Babri HA (2017) Feature selection based on a normalized difference measure for text classification. Inf Process Manag 53:473–489

    Article  Google Scholar 

  101. Zhang Y, Wanga S, Phillips P, Ji G (2014) Binary PSO with mutation operator for feature selection using decision tree applied to spam detection. Knowl-Based Syst 64:22–31

    Article  Google Scholar 

  102. Sanghani G, Kotecha K (2019) Incremental Personalized E-mail Spam Filter using Novel TFDCR Feature Selection with Dynamic Feature Update. Expert Syst Appl 115:287–299

    Article  Google Scholar 

  103. Metin SK (2018) Feature selection in multiword expression recognition. Expert Syst Appl 92:106–123

    Article  Google Scholar 

  104. Karabulut M (2013) Fuzzy unordered rule induction algorithm in text categorization on top of geometric particle swarm optimization term selection. Knowl-Based Syst 54:288–297

    Article  Google Scholar 

  105. Lu Y, Liang M, Ye Z, Cao L (2015) Improved particle swarm optimization algorithm and its application in text feature selection. Appl Soft Comput 35:629–636

    Article  Google Scholar 

  106. Singh S, Singh A K (2018) Web-spam features selection using cfs-pso. Proced Comput Sci 125:568–575. The 6th International Conference on Smart Computing and Communications

    Article  Google Scholar 

  107. Bharti K K, Singh P K (2016) Opposition chaotic fitness mutation based adaptive inertia weight bpso for feature selection in text clustering. Appl Soft Comput 43:20–34

    Article  Google Scholar 

  108. Chen L, Jiang L, Li C (2021) Modified dfs-based term weighting scheme for text classification. Expert Syst Appl 168:114438

    Article  Google Scholar 

  109. Liu Y, Loh H T, Sun A (2009) Imbalanced text classification: A term weighting approach. Expert Syst Appl 36(1):690– 701

    Article  Google Scholar 

  110. Lan M, Tan C L, Su J, Lu Y (2009) Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans Pattern Anal Mach Intell 31(4):721–735

    Article  Google Scholar 

  111. Dogan T, Uysal A K (2019) Improved inverse gravity moment term weighting for text classification. Expert Syst Appl 130:45–59

    Article  Google Scholar 

  112. Uguz H (2011) A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm. Knowl-Based Syst 24(7):1024–1032

    Article  Google Scholar 

  113. Shang C, Li M, Feng S, Jiang Q, Fan J (2013) Feature selection via maximizing global information gain for text classification. Knowl-Based Syst 54:298–309

    Article  Google Scholar 

  114. Sadeghian Z, Akbari E, Nematzadeh H (2021) A hybrid feature selection method based on information theory and binary butterfly optimization algorithm. Eng Appl Artif Intell 97:104079

    Article  Google Scholar 

  115. Lee C, Lee G G (2006) Information gain and divergence-based feature selection for machine learning-based text categorization. Inf Process Manag 42(1):155–165. Formal Methods for Information Retrieval

    Article  Google Scholar 

  116. Wang D, Zhang H, Liu R, Lv W (2012) Feature selection based on term frequency and t-test for text categorization. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM ’12. Association for Computing Machinery, New York, pp 1482– 1486

  117. Wang D, Zhang H, Liu R, Lv W, Wang D (2014) t-test feature selection approach based on term frequency for text categorization. Pattern Recogn Lett 45:1–10

    Article  Google Scholar 

  118. Blei D M, Ng A Y, Jordan M I (2003) Latent dirichlet allocation. J Mach Learn Res 3 (null):993–1022

    MATH  Google Scholar 

  119. Zhou N, Wang L (2007) A modified T-test feature selection method and its application on the HapMap genotype data. Genom Proteom Bioinform:242–249

  120. https://www.nist.gov/itl/products-and-services/color-feret-database, Accessed: 2020-12-21

  121. http://vision.ucsd.edu/~leekc/ExtYaleDatabase/ExtYaleB.html, Accessed: 2020-12-21

  122. http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html, Accessed: 2020-12-21

  123. http://www.cbsr.ia.ac.cn/IrisDatabase.htm, Accessed: 2020-12-21

  124. http://vis-www.cs.umass.edu/lfw/, Accessed: 2020-12-21

  125. Tang K, Hou X, Shao Z, Ma L (2017) Deep feature selection and projection for cross-age face retrieval. In: 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pp 1–7

  126. Cheng F, Yu J, Xiong H (2010) Facial expression recognition in jaffe dataset based on gaussian process classification. IEEE Trans Neural Netw 21(10):1685–1690

    Article  Google Scholar 

  127. http://www.ehu.eus/ccwintco/index.php?title=Hyperspectral_Remote_Sensing_Scenes, Accessed: 2020-12-21

  128. http://engineering.purdue.edu/biehl/MultiSpec, Accessed: 2020-12-21

  129. http://www.csr.utexas.edu/hyperspectral, Accessed: 2020-12-21

  130. http://cancerimagingarchive.net/, Accessed: 2020-12-21

  131. http://leo.ugr.es/elvira/DBCRepository/, Accessed: 2020-12-21

  132. https://www.ncbi.nlm.nih.gov/, Accessed: 2020-12-21

  133. https://luna16.grand-challenge.org/Data/, Accessed: 2020-12-21

  134. https://www.kaggle.com/navoneel/brain-mri-images-for-brain-tumor-detection/metadata, Accessed: 2020-12-21

  135. Dhal P, Azad C (2020) A novel approach for blood vessel segmentation with exudate detection in diabetic retinopathy. In: 2020 International Conference on Artificial Intelligence and Signal Processing (AISP), pp 1–6

  136. Dabbaghchian S, Aghagolzadeh A, Moin M S (2007) Feature extraction using discrete cosine transform for face recognition. In: 2007 9th International Symposium on Signal Processing and Its Applications, pp 1–4

  137. Wang X, Ding X, Liu C (2002) Optimized gabor filter based feature extraction for character recognition. In: Object recognition supported by user interaction for service robots, vol 4, 223–226

  138. Arivazhagan S, Ganesan L, Angayarkanni V (2005) Color texture classification using wavelet transform. In: Sixth International Conference on Computational Intelligence and Multimedia Applications (ICCIMA’05), pp 315–320

  139. Lowe D G (1999) Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, vol 2, pp 1150–1157

  140. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol 1, pp 886–893

  141. Calonder M, Lepetit V, Strecha C, Fua P (2010) Brief: Binary robust independent elementary features. In: Daniilidis K, Maragos P, Paragios N (eds) Computer Vision – ECCV 2010. Springer, Berlin, pp 778–792

  142. Kumar R M (2014) A survey on image feature descriptors

  143. Bober M (2001) Mpeg-7 visual shape descriptors. IEEE Trans Circ Syst Video Technol 11 (6):716–719

    Article  Google Scholar 

  144. Zhang S, Huang J, Huang Y, Yu Y, Li H, Metaxas D N (2010) Automatic image annotation using group sparsity. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 3312–3319

  145. Lacombe T, Favreliere H, Pillet M (2020) Modal features for image texture classification. Pattern Recogn Lett 135:249– 255

    Article  Google Scholar 

  146. Tahir F, Fahiem MA (2014) A Statistical-Textural-Features Based Approach for Classification of Solid Drugs Using Surface Microscopic Images. Computational and Mathematical Methods in Medicine, Hindawi Publishing Corporation 2014

  147. Ojala T, Pietikäinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recogn 29(1):51–59

    Article  Google Scholar 

  148. Leandro D, Vignolo D H M (2013) Feature selection for face recognition based on multi-objective evolutionary wrappers. Expert Syst Appl 40:5077–5084

    Article  Google Scholar 

  149. Sharma V., Juglan K. C. (2018) Automated Classification of Fatty and Normal Liver Ultrasound Images Based on Mutual Information Feature Selection. IRBM 39:313–323

    Article  Google Scholar 

  150. Wang Y-Y, Peng W-X, Qiu C-H, Jiang J, Xia S-R (2019) Fractional-order Darwinian PSO-Based Feature Selection for Media-Adventitia Border Detection in Intravascular Ultrasound Images. Ultrasonics 92:1–7

    Article  Google Scholar 

  151. Yurtkan K, Demırel H (2014) Feature selection for improved 3D facial expression recognition. Pattern Recogn Lett 38:26–33

    Article  Google Scholar 

  152. Ajit Krisshnaa N. L., Kadetotad Deepak V., Manikantan K., Ramachandran S. (2014) Face recognition using transform domain feature extraction and PSO-based feature selection. Appl Soft Comput 22:141–161

    Article  Google Scholar 

  153. Mistry KK, Zhang L, Neoh SC, Lim CP, Fielding B (2016) A Micro-GA Embedded PSO Feature Selection Approach to Intelligent Facial Emotion Recognition. IEEE Trans Cybern 47:1496–1509

    Article  Google Scholar 

  154. Peralta D, Triguero I, García S, Saeys Y, Benitez JM, Herrera F (2017) Distributed incremental fingerprint identification with re duce d database penetration rate using a hierarchical classification based on feature fusion and selection. Knowl-Based Syst 126:91–103

    Article  Google Scholar 

  155. Jiang Y, Li C (2015) mrmr-based feature selection for classification of cotton foreign matter using hyperspectral imaging. Comput Electron Agric 119:191–200

    Article  Google Scholar 

  156. Fu Y, Jia X, Huang W, Wang J (2014) A comparative analysis of mutual information based feature selection for hyperspectral image classification. In: 2014 IEEE China Summit International Conference on Signal and Information Processing (ChinaSIP), pp 148–152

  157. Ajit Krisshna NL, Deepak V K, Manikantan K, Ramachandran S (2014) Face recognition using transform domain feature extraction and pso-based feature selection. Appl Soft Comput 22:141–161

    Article  Google Scholar 

  158. Shetty S, Kelkar P, Manikantan K, Ramachandran S (2013) Shift invariance based feature extraction and weighted bpso based feature selection for enhanced face recognition. Procedia Technol 10:822–830. First International Conference on Computational Intelligence: Modeling Techniques and Applications (CIMTA) 2013

    Article  Google Scholar 

  159. Aneesh MU, Masand A AK, Manikantan K (2012) Optimal feature selection based on image pre-processing using accelerated binary particle swarm optimization for enhanced face recognition. Procedia Eng 30:750–758. International Conference on Communication Technology and System Design 2011

    Article  Google Scholar 

  160. López-Franco C, Villavicencio L, Arana-Daniel N, Alanis AY (2014) Image Classification Using PSO-SVM and an RGB-D Sensor. Mathematical Problems in Engineering, Hindawi 2014

  161. Li B, Lai Y-K, Rosin P L (2017) Example-based image colorization via automatic feature selection and fusion. Neurocomputing 266:687–698

    Article  Google Scholar 

  162. Li J, Wang J Z, Wiederhold G (2000) Classification of textured and non-textured images using region segmentation. In: Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101), vol 3, pp 754–757

  163. Busso C, Bulut M, Lee C-C, Kazemzadeh A, Mower E, Kim S, Chang J N, Lee S, Narayanan S S (2008) IEMOCAP: interactive emotional dyadic motion capture database. Lang Resour Eval 42 (4):335–359

    Article  Google Scholar 

  164. Burkhardt F, Paeschke A, Rolfes M, Sendlmeier W F, Weiss B (2005) A database of german emotional speech. In: INTERSPEECH 2005 - eurospeech, 9th european conference on speech communication and technology, ISCA, Lisbon, pp 1517–1520

  165. http://kahlan.eps.surrey.ac.uk/savee/Download.html, Accessed: 2020-12-21

  166. Tao J, Liu F, Zhang M, Jia H (2008) Design of speech corpus for mandarin text to speech

  167. Martin O, Kotsia I, Macq B, Pitas I (2006) The enterface’ 05 audio-visual emotion database. In: 22nd International Conference on Data Engineering Workshops (ICDEW’06), pp 8–8

  168. G. C, I. I, A. P, M. T (2014) Emovo corpus: An italian emotional speech database, pp 3501–3504

  169. Wu T, Yang Y, Wu Z, Li D (2006) Masc: A speech corpus in mandarin for emotion analysis and affective speaker recognition. In: 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop, pp 1–5

  170. Akçay M B, Oğuz K (2020) Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers. Speech Comm 116:56–76

    Article  Google Scholar 

  171. Yasmin G, Das A K, Nayak J, Pelusi D, Ding W (2020) Graph based feature selection investigating boundary region of rough set for language identification. Expert Syst Appl 158:113575

    Article  Google Scholar 

  172. Bóna J (2014) Temporal characteristics of speech: The effect of age and speech style. J Acoust Soc Amer 136(2):EL116–EL121

    Article  Google Scholar 

  173. Caka N (2015) What are the spectral and temporal features in speech signal?

  174. Teixeira J P, Oliveira C, Lopes C (2013) Vocal acoustic analysis - jitter, shimmer and hnr parameters. Procedia Technol 9:1112–1122. CENTERIS 2013 - Conference on ENTERprise Information Systems / ProjMAN 2013 - International Conference on Project MANagement/ HCIST 2013 - International Conference on Health and Social Care Information Systems and Technologies

    Article  Google Scholar 

  175. Özseven T (2019) A novel feature selection method for speech emotion recognition. Appl Acoust 146:320–326

    Article  Google Scholar 

  176. Casale S, Russo A, Serrano S (2007) Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Commun 49:801–810

    Article  Google Scholar 

  177. Mencattini A, Martinelli E, Costantini G, Todisco M, Basile B, Bozzali M, Di Natal C (2014) Speech emotion recognition using amplitude modulation parameters and a combined feature selection procedure. Knowl-Based Syst 63:68–81

    Article  Google Scholar 

  178. Rong J, Li G, Chen Y-PP (2009) Acoustic feature selection for automatic emotion recognition from speech. Inf Process Manag 45:315–328

    Article  Google Scholar 

  179. Liu Z-T, Wu M, Cao W-H, Mao J-W, Xu J-P, Tan G-Z (2018) Speech emotion recognition based on feature selection and extreme learning machine decision tree. Neurocomputing 273:271–280

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pradip Dhal.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dhal, P., Azad, C. A comprehensive survey on feature selection in the various fields of machine learning. Appl Intell 52, 4543–4581 (2022). https://doi.org/10.1007/s10489-021-02550-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-02550-9

Keywords

Navigation