Abstract
Genomics and proteomics technologies have created a paradigm shift in the drug discovery process, with bioinformatics having a key role in the exploitation of genomic, transcriptomic, and proteomic data to gain insights into the molecular mechanisms that underlie disease and to identify potential drug targets. We discuss the current state of the art for some of the bioinformatic approaches to identifying drug targets, including identifying new members of successful target classes and their functions, predicting disease relevant genes, and constructing gene networks and protein interaction networks. In addition, we introduce drug target discovery using the strategy of systems biology, and discuss some of the data resources for the identification of drug targets.
Although bioinformatics tools and resources can be used to identify putative drug targets, validating targets is still a process that requires an understanding of the role of the gene or protein in the disease process and is heavily dependent on laboratory-based work.
Similar content being viewed by others
References
Drews J. Genomic sciences and the medicine of tomorrow. Nat Biotechnol 1996; 14: 1516–8
Drews J. Drug discovery: a historical perspective. Science 2000; 287: 1960–4
Terstappen GC, Reggiani A. In silico research in drug discovery. Trends Pharmacol Sci 2001; 22: 23–6
International Human Genome Sequence Consortium. Finishing the euchromatic sequence of the human genome. Nature 2004; 431: 931–45
Zambrowicz BP, Sands AT. Knockouts model the 100 best-selling drugs: will they model the next 100? Nat Rev Drug Discov 2003; 2: 38–51
Searls DB. Using bioinformatics in gene and drug discovery. Drug Discov Today 2000; 4: 135–43
Whittaker PA. What is the relevance of bioinformatics to pharmacology? Trends Pharmacol Sci 2003; 24: 434–9
Dahl SG, Kristiansen K, Sylte I. Bioinformatics: from genome to drug targets. Ann Med 2002; 34: 306–12
Allison DB. Statistical methods for microarray research for drug target identification 2002. Proceedings of the American Statistical Association, Bi-opharmaceutical Section [CD-ROM]. Alexandria (VA): American Statistical Association, 2002
Sands AT. The rule of three: selecting the best new drug targets. Drug Discovery & Development 2003 Mar [online]. Available from URL: http://www.ddd-mag.com/ [Accessed 2005 Oct 10]
Drews J, Ryser S. Classic drug targets. Nat Biotechnol 1997; 15: 1318–9
Bouvier M. Structural and functional aspects of G protein-coupled receptor oligomerization. Biochem Cell Biol 1998; 76: 1–11
Nambi P, Aiyar N. G protein-coupled receptors in drug discovery. Assay Drug Dev Technol 2003; 1: 305–11
Fredriksson R, Schioth HB. The repertoire of G-protein-coupled receptors in fully sequenced genomes. Mol Pharmacol 2005; 67: 1414–25
Palczewski K, Kumasaka T, Hori T, et al. Crystal structure of rhodopsin: a G protein-coupled receptor. Science 2000; 289: 739–45
Clapham DE. TRP channels as cellular sensors. Nature 2003; 426: 517–24
Terstappen GC. Ion channel screening technologies today. Drug Discov Today 2005; 2: 133–40
Cahalan MD, Chandy KG. Ion channels in the immune system as targets for immunosuppression. Curr Opin Biotechnol 1997; 8: 749–56
Bennett PB, Guthrie HRE. Trends in ion channel drug discovery: advances in screening technologies. Trends Biotechnol 2003: 21, 563–9
Duarte J, Perrière G, Laudet V, et al. NUREBASE: database of nuclear hormone receptors. Nucl Acids Res 2002; 30: 364–8
Inpharmatica successfully closes GBP13.9m (USD25m) third round financing. 2004 Nov 8 [press release; online]. Available from URL: http://www.inpharmatica.com/news/2004/081104.htm [Accessed 2005 Nov15]
Docherty AJ, Crabbe T, O’Connell JP, et al. Proteases as drug targets. Biochem Soc Symp 2003; 70: 147–61
Leung D, Abbenante G, Fairlie DP. Protease inhibitors: current status and future prospects. J Med Chem 2000; 43: 305–41
Cohen P. Protein kinases: the major drug targets of the twenty-first century. Nat Rev Drug Discov 2002; 1: 309–15
Shears SB. Rounding up the usual suspects: protein kinases as therapeutic targets. Drug Discovery Today 2005; 10: 240–2
Lau LF, Seymour PA, Sanner MA, et al. Cdk5 as a drug target for the treatment of Alzheimer’s disease. J Mol Neurosci 2002; 19(3): 267–73
Smith C. Drug target identification: a question of biology. Nature 2004; 428: 225–31
Augen J. The evolving role of information technology in the drug discovery process. Drug Discov Today 2002; 7: 315–23
Head-Gordon T, Wooley JC. Computational challenges in structural and functional genomics. IBM Systems J 2001; 40(2): 265–96
Horn F, Bettler E, Oliveira L, et al. GPCRDB information system for G proteincoupled receptors. Nucleic Acids Res 2003; 31: 294–7
Le Novère N, Changeux J-P. LGICdb: the ligand-gated ion channel database. Nucleic Acids Res 2001; 29: 294–5
IUPHAR receptor database [online]. Available from URL: http://www.iuphar-db.org/iuphar-rd [Accessed 2005 Oct 10]
Rawlings ND, O’Brien E, Barrett AJ. MEROPS: the protease database. Nucleic Acids Res 2002; 30: 343–6
Krupa A, Abhinandan KR, Srinivasan N. KinG: a database of protein kinases in genome. Nucleic Acids Res 2004; 32: 153–5
Horn F, Vriend G, Cohen FE. Collecting and harvesting biological data: the GPCRDB and NucleaRDB information systems. Nucleic Acids Res 2001; 29: 346–9
Duarte J, Perrière G, Laudet V, Robinson-Rechavi M. NUREBASE: database of nuclear hormone receptors Nucleic Acids Res 2002; 30: 364–368
Chen X, Ji ZL, Chen YZ. TTD: therapeutic target database. Nucleic Acids Res 2002; 30: 412–5
Aureus Pharma website [online]. Available from URL: http://www.aureus-pharma.com [Accessed 2005 Oct 10]
LifeSpan Biosciences website [online]. Available from URL: http://www.lsbio.com [Accessed 2005 Oct 10]
Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 2000; 28: 27–30
Zheng CJ, Zhou H, Xie B, et al. TRMP: a database of therapeutically relevant multiple-pathways. Bioinformatics 2004; 20(14): 2236–41
Krieger CJ, Zhang P, Mueller LA, et al. MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res 2004; 32: 438–42
Goto S, Okuno Y, Hattori M, et al. LIGAND: database of chemical compounds and reactions in biological pathways. Nucleic Acids Res 2002; 30: 402–4
Hamosh A, Scott AF, Amberger J, et al. Online mendelian inheritance in man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 2002; 30: 52–5
Human Protein Interaction Database [online]. Available from URL: http://www.hpid.org [Accessed 2005 Oct 10]
PhRMA website [online]. Available from URL: http://www.phrma.org/ [Accessed 2005 Oct 10]
Bader GD, Betel D, Hogue CWV. BIND: the biomolecular interaction network database. Nucleic Acids Res 2003; 31: 248–50
Heart High-Performance 2-DE Database [online]. Available from URL: http://www.mdc-berlin.de/~emu/heart/ [Accessed Nov 15]
Hewett M, Oliver DE, Rubin DL, et al. PharmGKB: the pharmacogenetics knowledge base. Nucleic Acids Res 2002; 30: 163–5
Foord SM. Receptor classification: post genome. Curr Opin Pharmacol 2002; 2: 561–6
Manning G, Whyte DB, Martinez R, et al. The protein kinase complement of the human genome. Science 2002; 298: 1912–34
Lapinsh M, Gutcaits A, Prusis P, et al. Classification of G-protein coupled receptors by alignment-independent extraction of principal chemical properties of primary amino acid sequences. Protein Sci 2002; 11: 795–805
Takeda S, Kadowaki S, Haga T, et al. Identification of G protein-coupled receptor genes from the human genome sequence. FEBS Lett 2002; 520: 97–101
Karchin R, Karplus K, Haussler D. Classifying G-protein coupled receptors with support vector machines. Bioinformatics 2002; 18: 147–59
Bhasin M, Raghava GP. GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors. Nucleic Acids Res 2004; 32: W383–9
Sugiyama Y, Poluliakh N, Shimizu T. Identification of transmembrane protein functions by binary topology patterns. Protein Eng 2003; 16: 479–88
Inoue Y, Ikeda M, Shimizu T. Proteome-wide classification and identification of mammalian-type GPCRs by binary topology pattern. Comput Biol Chem 2004; 28: 39–49
Zhou YH, Yang L, Wang H, et al. Prediction of eukaryotic gene structures based on multilevel optimization. Chin Sci Bull 2004; 49(4): 321–8
Adams MD, Kerlavage AR, Fields C, et al. 3,400 new expressed sequence tags identify diversity of transcripts in human brain. Nat Genet 1993; 4: 256–67
Allikmets R, Gerrard B, Glavac D, et al. Characterization and mapping of three new mammalian ATP-binding transporter genes from an EST database. Mamm Genome 1995; 6: 114–7
Boguski MS, Lowe TM, Tolstoshev CM. dbEST: database for “expressed sequence tags”. Nat Genet 1993; 4(4): 332–3
Wittenberger T, Schaller HC, Hellebrand S. An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors. J Mol Biol 2001; 307: 799–813
Marvanová M, Törönen P, Storvik M, et al. Synexpression analysis of ESTs in the rat brain reveals distinct patterns and potential drug targets. Mol Brain Res 2002; 104: 176–83
Zhou YH, Jing H, Li YE, et al. Identification of true EST alignments and exon regions of gene sequences. Chin Sci Bull 2004; 49(23): 2463–9
Aggarwal K, Lee KH. Functional genomics and proteomics as a foundation for systems biology. Brief Funct Genomic Proteomic 2003; 2: 175–84
Gabaldon T, Huynen MA. Prediction of protein function and pathways in the genome era. Cell Mol Life Sci 2004; 61: 930–44
Hennig S, Groth D, Lehrach H. Automated gene ontology annotation for anonymous sequence data. Nucleic Acids Res 2003; 31: 3712–5
Jensen LJ, Gupta R, Staerfeldt HH, et al. Prediction of human protein function according to gene ontology categories. Bioinformatics 2003; 19: 635–42
Schug J, Diskin S, Mazzarelli J, et al. Predicting gene ontology functions from ProDom and CDD protein domains. Genome Res 2002; 12: 648–55
Guffanti A. Modeling molecular networks: a systems biology approach to gene function. Genome Biol 2002; 3(10): 4031.1–4031.3
Meltzer PS. Spotting the target: microarrays for disease gene discovery. Curr Opin Genet Dev 2001; 11: 258–63
Marton MJ, De Risi JL, Bennet HA, et al. Drug target validation and identification of secondary drug target effects using DNA microarrays. Nat Med 1998; 4: 1293–301
Miklos GL, Maleszka R. Microarray reality checks in the context of a complex disease. Nat Biotechnol 2004; 22: 615–21
Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A 2001; 98: 5116–21
Schulze A, Downward J. Navigating gene expression using microarrays: a technology review. Nat Cell Biol 2001; 3: E190–5
Draghici S, Khatri P, Martins RP, et al. Global functional profiling of gene expression. Genomics 2003; 81: 98–104
Volinia S, Evangelisti R, Francioso F, et al. GOAL: automated Gene Ontology analysis of expression profiles. Nucleic Acids Res 2004; 32: W492–9
Wang WQ, Zhou YH, Bi R. Correlating genes and functions to diseases by systematic differential analysis of expression profiles. Lecture Notes Comput Sci 2005; 3645: 11–20
Pericak-Vance MA, Yamaoka LH, Haynes CS, et al. Genetic linkage studies in Alzheimer’s disease families. Exp Neurol 1988; 102: 271–9
Corder EH, Saunders AM, Strittmatter WJ, et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 1993; 261: 921–3
Freudenberg J, Propping P. A similarity-based method for genome-wide prediction of disease-relevant human genes. Bioinformatics 2002; 18Suppl. 2: S110–5
Van Driel MA, Cuelenaere K, Kemmeren PP, et al. A new web-based data mining tool for the identification of candidate genes for human genetic disorders. Eur J Hum Genet 2003; 11: 57–63
Perez-Iratxeta C, Bork P, Andrade MA. Association of genes to genetically inherited diseases using data mining. Nat Genet 2002; 31: 316–9
Zhou YH, Zhou QX, Liu HL, et al. Predicting disease genes for familial dilated cardiomyopathy based on the codon usage bias. Chin Sci Bull 2005; 50(18): 2028–32
Pettipher R, Mangion J, Hunter MG. et al. Identification of G-protein-coupled receptors involved in inflammatory disease by genetic association studies. Curr Opin Pharmacol 2005: 5, 412–7
Richard M, Bruskiewich AB, Cosico WE, et al. Linking genotype to phenotype: the International Rice Information System (IRIS). Bioinformatics 2003; 19: 63–5
Lindpaintner K. Science and society: the impact of pharmacogenetics and pharmacogenomics on drug discovery. Nature Rev Drug Discovery 2002; 1: 463–9
Lindpaintner K. Pharmacogenetics and pharmacogenomics in drug discovery and development: an overview. Clin Chem Lab Med 2003; 41(4): 398–410
Campbell DA, Valdes AM, Spurr N. Making drug discovery a SN(i)P. Drug Discov Today 2000; 5: 388–96
Evans WE, McLeod HL. Pharmacogenomics: drug disposition, drug targets, and side effects. N Engl J Med 2003; 348: 538–49
Cabusora L, Sutton E, Fulmer A, et al. Differential network expression during drug and stress response. Bioinformatics 2005; 21: 2898–905
Shmulevich I, Dougherty ER, Kim S, et al. Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks. Bioinformatics 2002; 18: 261–74
Akutsu T, Miyano S, Kuhara S. Inferring qualitative relations in genetic networks and metabolic pathways. Bioinformatics 2000; 16: 727–34
Imoto S, Savoie CJ, Aburatani S, et al. Use of gene networks for identifying and validating drug targets. J Bioinform Comput Biol 2003; 1: 459–74
Gardner TS, di Bernardo D, Lorenz D, et al. Inferring gene networks and identifying compound mode of action via expression profiling. Science 2003; 301: 102–5
Stagljar I, Hottiger MO, Auerbach D, et al. Protein-protein interactions as a basis for drug target identification. Innov Pharm Technol 2002; 1: 66–9
Rain JC, Selig L, De Reuse H, et al. The protein-protein interaction map of Helicobacter pylori. Nature 2001; 409: 211–5
Luan Y, Li H. Model-based methods for identifying periodically expressed genes based on time course microarray gene expression data. Bioinformatics 2004; 20: 332–9
Nariai N, Kim S, Imoto S, et al. Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks. Pac Symp Sympos Biocomput World Sci 2004; 9: 336–47
Bader JS, Chaudhuri A, Rothberg JM, et al. Gaining confidence in high-throughput protein interaction networks. Nat Biotechnol 2004; 22: 78–85
Butcher EC, Berg EL, Kunkel EJ. Systems biology in drug discovery. Nat Bi-otechnol 2003; 22: 1253–9
Hood L, Galas D. The digital code of DNA. Nature 2003; 421: 444–8
Nicholson JK, Wilson ID. Understanding ‘global’ systems biology: metabonomics and the continuum of metabolism. Nat Rev Drug Discov 2004; 2: 668–76
Parsons AB, Brost RL, Ding H, et al. Integration of chemical-genetic and genetic interaction data links bioactive compounds to cellular target pathways. Nat Biotechnol 2003; 22: 62–9
Acknowledgments
This work was supported by the National Natural Science Foundation of China (grants no. 90203011 and no. 30370354), and the Ministry of Education of China (grant no. 505010).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jiang, Z., Zhou, Y. Using Bioinformatics for Drug Target Identification from the Genome. Am J Pharmacogenomics 5, 387–396 (2005). https://doi.org/10.2165/00129785-200505060-00005
Published:
Issue Date:
DOI: https://doi.org/10.2165/00129785-200505060-00005