A genetic algorithm–support vector machine method with parameter optimization for selecting the tag SNPs

https://doi.org/10.1016/j.jbi.2012.12.002Get rights and content
Under an Elsevier user license
open archive

Abstract

SNPs (Single Nucleotide Polymorphisms) include millions of changes in human genome, and therefore, are promising tools for disease-gene association studies. However, this kind of studies is constrained by the high expense of genotyping millions of SNPs. For this reason, it is required to obtain a suitable subset of SNPs to accurately represent the rest of SNPs. For this purpose, many methods have been developed to select a convenient subset of tag SNPs, but all of them only provide low prediction accuracy. In the present study, a brand new method is developed and introduced as GA–SVM with parameter optimization. This method benefits from support vector machine (SVM) and genetic algorithm (GA) to predict SNPs and to select tag SNPs, respectively. Furthermore, it also uses particle swarm optimization (PSO) algorithm to optimize C and γ parameters of support vector machine. It is experimentally tested on a wide range of datasets, and the obtained results demonstrate that this method can provide better prediction accuracy in identifying tag SNPs compared to other methods at present.

Highlights

► We developed a new method to select the tag SNPs. ► The method benefits from SVM and GA to predict SNPs and to select tag SNPs, respectively. ► In addition, PSO is used to optimize C and γ parameters of support vector machine. ► The method has considerably higher prediction accuracy than other methods.

Keywords

Single Nucleotide Polymorphisms (SNPs)
Tag SNPs
Genetic algorithm (GA)
Support vector machine (SVM)
Particle swarm optimization (PSO)

Cited by (0)

1

Fax: +90 332 241 06 35.