System Biology and Network-Based Computational Model Approaches in Biomarker Discovery in Reference to Neurological Disorder

Neurodegenerative diseases are irredeemable and incapacitating conditions that result in progressive degeneration. It is difficult to define the complexity of neuro-system quantitatively or meaningfully from a system standpoint. Thus, inclined towards the progress in developing new and effective therapeutic intervention, it is important to understand the underlying molecular mechanism and significance of neuro system and their complex molecular interaction. A biomarker discovery is an important need for early disease diagnosis, prognosis and monitoring of new therapy for neurological disorders. The emergence of system biology and network-based computational model approaches provides the underlying molecular mechanism and significance of disease and their complex molecular interaction. Thus, it becomes quite easy to understand the specific nature of neuro system as well as it plays a significant role in integrating the omics data at multiple levels that lead to key success in the development of more accurate and efficient biomarker for neurological disorders. The current review focused on significant contributions of system biology and network-based computational model approaches in biomarker discovery with special reference to neurological disorders.


Introduction
Neurodegenerative disease in human includes the wide range of complex disorders including -Alzheimer Disease (AD), Parkinson Disease (PD), Motor Neuron Disease (MND), and Huntington's Disease (HD) etc.Among these, many of them share common symptom & neuro-pathological condition.Thus, it becomes quite challenging to diagnosis the particular disorder [1].Therefore, it is essential to understand the molecular mechanism and pathological symptom of disease in order to predict the disease.Biomarker discovery provides a powerful and progressive approach to the mechanistic insight of neurological disorders along with the same level of specificity and sensitivity for evaluation and diagnosis of disease and also used to identify and develops potential drug target as well as a novel compound for the treatment of disease [2].Identification of novel biomarker will play key success in diagnosis and therapeutic intervention for the wide class of neurological disorders including neurodegenerative and neurodevelopmental disease, from which millions of people are affected around the world every year [3].
Biomarkers are specific pharmacological and physiological biochemical measurement in the body that presence is used for measuring the progress and diagnosis of disease and also in monitoring the treatment [4].Biomarker acts as an indicator for normal biological as well as pathogenic processes.They are used to indicate presence or onset of disease.Biomarkers discovery to aid accurate diagnosis, predict progression and for use in clinical trials has become a major research need in the current scenario [5].With the recent advances in the system biology approaches, help to computationally assimilate omics data with network and pathway to understand new underlying biological mechanisms at the system Volume 2018; Issue 02 Int J Genom Data Min, an open access journal ISSN: 2577-0616 level that leads to biomarker discovery and its clinical validation.The application of system biology in neurological disorder helps collect information regarding structure and function of neuro system in normal vs. diseases state at different stages in the brain by accommodating omics data including genomics, proteomics, metabolomics & transcriptomics at a different system level.This will provide deeper insight mechanism of the complex feature of neuro system that caused by various factors and changes in the biological system.Thus, the current review focused on system biology and network-based computational model approaches in novel biomarker discovery associated with neurological disorder based on data and knowledge-driven approaches.

Omics Data Analysis
The comprehensive analysis of omics data aims to identify significant biological process and pathway from the large dataset and also to find out key gene/protein/ metabolites as a target candidate biomarker.It provides a genome-wide molecular basis for diseases and used to identify a disease-specific biomarker for diagnosis and monitoring of diseases [6].The omics studies produced from a large amount of high-throughput experimental data is often becoming difficult to interpreted result.With the recent advances in the bioinformatics and the system biology, offers opportunities to interpret data from existing knowledgebase approaches in order to understand the whole mechanism of biological process and disease at the system level.Hence the integration of omics data derived from disease-affected cell and tissue provides the molecular basis for identification of network-based novel biomarker and drug target [7].Various public repository databases such as the Gene Expression Omnibus (GEO) repository (http://www.ncbi.nlm.nih.gov/geo) and the Array Express archive (http://www.ebi.ac.uk/ microarray-as/ae) are available for depositing human diseaseoriented omics data.These databases include biological significant information regarding disease network and biomarker which can be analyzed through bioinformatics approaches followed by experimental validation [8].

Gene Expression Analysis
Gene expression analysis is an inventive approach that provides the information regarding the role of the differential expressed gene in the normal biological process and disease state.It compares the expression level of genes in two or more sample.With the use of DNA microarray technology, it's become more convenient to monitor the genome-wide expression pattern of the gene in disease-affected tissue and cell [9].DNA microarray analysis of omics data can be analyzed by the various representing method such as R-statistical computing program.R statistical programming language is an open source developmental tool for analysis of high-throughput genomic data.It contain the utility for pre-processing Affymetrix, identifying Differentially Expressed Genes (DEGs); followed by statistical analysis using the t-test for comparison between two group or Analysis of Variance (ANOVA test) for comparison between more than three group followed by Multiple Comparison Test (MCT), controlling False Discovery Rate (FDR) and hierarchical clustering analysis.Various R-software packages are available such as Bioconductor 3.5 (htttp://www.bioconductor.com)[10], GENESPRING software (http//www.agilent.com)[11], The Comprehensive R Archive Network (CRAN) (http://cran.r-project.org) to carry out gene expression analysis in silico.

Network Analysis
Network analysis provides the key approach to high thought put data interpretation.It analyzes functionally related genes and networks that are common and biological relevance in response to biomarker discovery from large data scale [12].The emerging role of system biology approaches provides researcher to link diverse data into knowledgebase to understand the insight mechanism of disease (Figure 1).Network analysis can help to understand the underlying mechanism of molecular and cellular interaction between genes/ proteins with the surrounding environment.In the network, entities represent nodes i.e. gene, protein & enzyme and edges represent the biologically significant interaction between the two nodes based on experimental, database, text mining, co-expression data etc.A network can be constructed on the basis of i) Gene co-expression data -two genes are similar if their expression level is same throughout the gene expression study.ii) Pathway data -two-gene product are linked if they participated in the same reaction in the pathway.iii) Physical interaction or Protein-Protein Interaction Network (PPI) -the two-gene product is linked if they are found to interact in PPI network [13].Various pathway analysis tools and databases are available publicly for analysis (Table1).It is a database of functional associations that derived from a wide range of sources such as high-throughput experimental data, literature and database mining, analyses of co-expressed genes and computational predictions [16].Structural analysis of complex network can be performed based on the various topological parameter such betweenness centrality (shortest paths between all nodes through which a given node pass) and node degree (number of edges connected to the node), the degree distribution (probability of a node having a specific number of edges) the clustering coefficient (the degree to which nodes within a network cluster together), shortest path length (minimal distance, in number of edges, required to connect two nodes), robustness, etc.These biological network parameters provide useful information about the response of the whole system under study [20].Network analysis also includes identification of clusters in the network (densely connected nodes in the network) and enrichment analysis.Cytoscape (http://www.cytoscape.org) is an open source, software used for biological network visualization, data integration, and interactive network generation.It also includes various plugins that perform network analysis from different data sources based on advance topological parameters.Hence, the construction and analysis of network help to understand the biological mechanism at the system level that leads to playing important role in biomarker discovery.

Functional Enrichment Analysis
Functional analysis is used to identify set of an enriched gene with significant function in entire candidate gene list derived from network analysis.Serval tools & software are available for analysis among them some widely used are Gene Ontology (GO [21]; http:// www.geneontology.org),provides core biological knowledge representation for modern biologists, based computationally or experimentally.It represents the gene and its product in term of their biological process, cellular process, and metabolic process.The Database for Annotation, Visualization and Integrated Discovery (DAVID [22]; http:// david.ncifcrf.gov)provides functional enrichment analysis, functional annotation, clustering, bio-Carta and keg pathway mapping, identifying functionally related genes that provide biological significant function derived from the large dataset.Serval Cytoscape plugins such as BiNGO [23], ClueGO [24], Enrichment Analysis and Visualization (ENViz) [25] etc are also available for analysis based on interaction network and topological parameter (Table2).

Biomarker Evaluation
The efficacy and accuracy of the biomarker is an important Volume 2018; Issue 02 Int J Genom Data Min, an open access journal ISSN: 2577-0616 step from the clinical perspective.It provides enlighten as to whether the new biomarker is safe for clinical use to the patient.The major parameter for evaluation includes Area Under the Curve (AUC) of Receiver Operating Characteristics (ROC) curve, sensitivity & specificity.ROC curve analysis is used to analyze that whether the biomarker is capable of selecting between disease onset and healthy individual [26].It predicts whether an individual will experience even if he/she estimated risk is above the given threshold value c.The c-statistic is the area under the ROC curve [27] that estimates for positive and negative.It includes sensitivity/ specificity for all the possible prediction.Default parameter of cstatistic for the diagonal line would be 0.5; perfect discrimination related to c-statistic is 1.Experimental data can be used as a standard parameter for proper evaluation of biomarker.The In-Silico data can be cross-validated by the Precision Rate (PPV)the percentage of Positive Predictive Value (PPV) if disease state is present and Negative Predictive Value (NPV) if disease state is absent (Table 3).

Application of System Biology in Biomarker Discovery in Reference to Neurological Disorder
System biology plays an important role in understanding the complex nature of the disease.Emerging role of system biology and network-based computational model approaches helps to integrate computationally, omics data with pathway and network analysis to find out new biological mechanism that leads to biomarker discovery and their experimental validation.Implementation of system biology approaches in neurological disorder start with integrating omics data from the different system level.This includes genomics, transcriptomics, metabolomics, and proteomics that collect information regarding structure and function of neuro system in normal and diseases state at the different level in the brain.Autism Spectrum Disorder (ASD) is a neurodevelopmental disease that typically appears during early childhood and affect a person's ability to communicate.Etiology of ASD is still not clear.
The study investigated the expression profile of serum miRNA from ASD in disease vs. normal state using Taq Man Low-Density Array technology [28].It reveals upregulated miR-140-3p in ASD vs. normal state.Network functional analysis shows that CD38 and NRIP1 nodes controlled by miR-140-3p, involved in dysregulation in ASD.Further Biomarker analysis proved serum miR-140-3p (ASD vs. normal; Area under ROC curve, AUC: 0.70; sensitivity: 63.33%; specificity: 68%) as a potential biomarker for ASD.In the study, the integrated microarray study and networkbased approaches were used to investigates the functional link between Parkinson's Disease (PD) and Type 2 Diabetics Mellitus (T2DM) by comprising 478 genes that are closely associated with PD and T2DM [29].
Their finding reveals seven genes that dysregulate in the blood of PD and T2DM patients.Among them, the gene expression level of APP significantly upregulated in the blood of PD and T2DM patient in comparison to the healthy patient.Thus, the study suggests, the increased level of APP in blood with T2DM patient act as an indicator of neurodegeneration and maybe use as the potential biomarker for PD.A combined study of 2D and Mass Spectrometry (MS) was conducted to screen out protein biomarkers for Traumatic Brain Injury (TBI) in a rat model through proteomic followed by system biology analysis [30].System biology analysis identified Ubiquitin carboxyl-terminal isozyme 1, tyrosine hydroxylase, and syntaxin-6 as potential biomarker candidate for TBI.Further pathway analysis shows protein take part in neurite outgrowth and cell differentiation.The further result confirmed through semi-quantitative Western blotting analysis compare to control case.In the study [31], develops a network-based model of mutated and differentially expressed disease genes of neurological and psychiatric diseases to validate their association with aging.Further, the approach was used to identify disease-specific biomarkers for diagnosis and treatment (Table 4).

Limitation
Although system biology plays a key role in finding potential biomarker with good accuracy, it also has a little limitation regarding biomarker prediction.In neurological disorder, the disease is very specific to its pathological condition, thus obtained biomarker should be specific to the particular disorder.Regardless of this, the biomarker obtained from PPI network may have the possibility to be specific to other disease or disorder also.Thus, to overcome such a problem, all the disorder should be included in network and cross-validated to confirm the disease-specific.

Conclusion
The prediction and diagnosis of most of the neurological disorders are still difficult to examine.They only process through serval neurological test or examination.With emerging significance of system biology and network-based computational model approaches, based on interpreting large expression data from omics study and constructing on protein-protein interaction network specific to particular disease or disorder, has provided far most possibility for obtaining a potential novel biomarker through system biology approaches.The role of a biomarker is well established in various other diseases such as cancer, cardiovascular disease etc. Neuro-researcher is now also focusing, their area of interest on biomarker discovery for better prediction, diagnosis & treatment of neurological disorders.Thus, the system biology approaches may play a significant role in understanding the underlying biological and functional mechanism of complex neurobiology of disease at the system level that may lead to the discovery of potential biomarkers for early detection, diagnosis & monitoring of neurological disorders at different stages.

Figure 1 :
Figure 1: A road map to in-silico biomarker discovery: Form high thought put experimental omics data analysis to molecular network analysis and in-silico validation.

Table 1 :
It is a web-based functional analysis tool for comprehensive omics data.Use to Identify the most relevant signaling and metabolic pathways, molecular networks, and biological functions for the list of genes[17].Tools & resources of System Biology available for Pathway & Network analysis.
MANIA http//www.genemania.orgIt is a flexible, user-friendly web interface for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays.Volume 2018; Issue 02 Int J Genom Data Min, an open access journal

Table 2 :
Publicly available functional enrichment analysis tools.

Table 3 :
List of few software & resources available for evaluation & discovery of Biomarker.

Table 4 :
Lists of few specific biomarkers for various neurological disorders discover through OMICS studies.