Feature selection techniques for microarray datasets: a comprehensive review, taxonomy, and future directions

  • Review
Frontiers of Information Technology & Electronic Engineering

Abstract

Retrieving relevant features from microarray datasets to obtain optimal results has become a hot topic for researchers studying feature selection (FS) techniques. This review provides a thorough description of various recent FS techniques. It also focuses on techniques proposed for multiclass classification problems on microarray datasets and on different ways to enhance the performance of learning algorithms. We attempt to understand and resolve the dataset imbalance problem in order to support researchers working on microarray datasets. An analysis of the literature highlights the many challenges and issues in finding the optimal feature subset using various FS techniques. A case study demonstrates the implementation process: three microarray cancer datasets are used to evaluate the classification accuracy and convergence ability of several wrapper and hybrid algorithms in identifying the optimal feature subset.

Author information

Contributions

Kulanthaivel BALAKRISHNAN designed the research. Kulanthaivel BALAKRISHNAN and Ramasamy DHANALAKSHMI processed the data. Kulanthaivel BALAKRISHNAN drafted the paper. Ramasamy DHANALAKSHMI helped organize the paper. Kulanthaivel BALAKRISHNAN revised and finalized the paper.

Corresponding author

Correspondence to Ramasamy Dhanalakshmi.

Ethics declarations

Kulanthaivel BALAKRISHNAN and Ramasamy DHANALAKSHMI declare that they have no conflict of interest.

Additional information

This project was supported by the Department of Science and Technology under the Interdisciplinary Cyber-Physical Systems Scheme (No. T-54).

About this article

Cite this article

Balakrishnan, K., Dhanalakshmi, R. Feature selection techniques for microarray datasets: a comprehensive review, taxonomy, and future directions. Front Inform Technol Electron Eng 23, 1451–1478 (2022). https://doi.org/10.1631/FITEE.2100569

  • DOI: https://doi.org/10.1631/FITEE.2100569
