Noise Analysis of Gene Regulatory Networks Using Particle Filter

C l i n M e d International Library Citation: Wang H, Aberra D (2015) Noise Analysis of Gene Regulatory Networks Using Particle Filter. Int J Clin Biostat Biom 1:006 Received: August 10, 2015: Accepted: October 01, 2015: Published: October 03, 2015 Copyright: © 2015 Wang H. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Wang and Aberra. Int J Clin Biostat Biom 2015, 1:2


Introduction
One of the most important prosperities in gene expression is the stochasticity.Gene expression process is noisy and fluctuant.

The noise sources
Generally, noise sources can be partitioned into two categories [1][2][3][4].Firstly, gene expression is a sequence of biochemical reactions which have inherent stochasticity.Those biochemical reactions depend on the molecular events and the difference in the internal states of cells like random births, deaths, and collisions of molecules [5].The inherent stochasticity in the system is named as intrinsic noise.Secondly, variabilities in factors external to the system also contribute the noise.The environment is complicated and the subtle environment difference may result in fluctuations in gene expression.Those kinds of noise sources are referred to as extrinsic noise.Generally, the noise effects of gene expression are the joint effects of intrinsic noise and extrinsic noise.
The concept of intrinsic noise and extrinsic noise has been proved experimentally [5,3].Although experimentally, it is difficult to distinguish intrinsic noise from extrinsic noise in vivo [1].Experiments have shown that both intrinsic and extrinsic noise contribute substantially to the overall variation.Rapid fluctuations in mRNA are the sources of intrinsic noise [6].The experiments showed that noise in gene expression level caused the fluctuation in protein level in a clonal population of E. coli [3].Extrinsic noise is the of even a single component may affect the performance of the whole system.The mechanisms through which a natural genetic network can operate reliably despite noisy environments and stochasticity in gene expression are not known and remain a difficult challenge ion genetic network engineering [6].

Modeling of Gene Regulatory Networks with Noise
Gene expression data raise the possibility for functional understanding of genome dynamics by means of mathematical modeling.Many models have been proposed in the literature for modeling GRNs, such as Boolean networks, Probabilistic Boolean networks, Bayesian networks, linear additive regulation model, and neural networks [9][10][11][12][13].

Boolean networks model with noise
Boolean networks model is a binary model [9,[14][15][16].The basic assumption for this model is that a gene has two states, 1 for active and 0 for inactive.A Boolean function is used to describe the influence of other genes on a gene.Define a set of genes , , ,  , where Here, i e represents the state of the th i gene and N is the number of genes.Let the set of Boolean functions be defined by , , ,  .Then the dynamics of a GRN is determined through a set of discrete equations In [17,18], different noise levels are considered in the Boolean network model.The random noise is added to the binary data generated by the Boolean networks.The general equation is ( ) where p ε is an added noise.The Boolean network can reduce the error if the gene expression data has a lot of error inside.In [19], a Boolean network model with noise is proposed.The p ε is defined as the probability of

Bayesian networks model with noise
A Bayesian network is a graph model to estimate a complicated multivariate joint probability distribution through local probabilities [10,20].Figure 1 shows a 5 N = nodes Bayesian network.The vertices represent genes or other components.They are random variables.The edges represent the conditional dependence relation and interactions among genes.For the set of parent nodes of a node i X , a conditional distribution | ( ( )) i i P X Parents X is defined, where ( ) i Parents X denotes the variables corresponding to the direct regulators of th i node.
In [18], the dynamic Bayesian network with external noise from time series is introduced.The influence of external noise on the systems dynamics is due to flipping of the value of a gene i X at each time step with a probability P ε .The result shows that increasing the value of external noise can reduce the overall performance.

Linear additive regulation model with noise
In this model the expression level of a gene at a certain point can be calculated by the weighted sum of the expression levels of all genes in the network at a previous time point [11].It may be represented by ODEs Where i e is the gene expression level of the th i gene.N is the number of genes in the GRN.ij w represents the effect of the th j gene on the th i gene.Negative ij w means inhibition while positive ij w represents activation.k u is the th k external (control) variable.ik ν represents the effect of the th k external variable on the th i gene.k is the number of external variable.β is a bias term.In [21], the noise in the input data is considered in the linear additive regulation model.For each new input data, there is small amount of Gaussian noise with the same standard deviation added.

Neural networks model with noise
Neural networks model uses differential equations to describe GRN [13].
Where f is usually a nonlinear function, such as a sigmoidal function.w is the weight matrix.k u is the th k external (control) variable.ik ν represents the effect of the th k external variable on the th i gene.The constant i λ represents the rate constant of degradation of the gene product i .i β represents an external input.Neural networks can be used to assimilate the microarray data and construct GRNs [22].In [23], Hierarchical Bayesian Neural Network model was introduced.Two kinds of noise are considered: independent parameters with Gaussian noise and correlated parameters with Multivariate Normal distribution.In [24], stochastic neural network models are presented for gene regulatory networks.The Poisson random noise is used to represent chance events in the process of synthesis.For expression data with normalized concentrations, exponential or normal random noise is used to generate the synthetic data.

Continuous Nonlinear Ordinary Differential Equations Model with Noise
A continuous nonlinear ordinary differential equation model is adopted.It includes random noise parameters from intrinsic noise and external noise which come from the noise source from gene regulating process.Compared to linear models, identification of the nonlinear ordinary equation model is computationally more intensive and can require more data; however, the range of nonlinear behaviors exhibited by GRNs can be more thoroughly understood with nonlinear differential equations.With more time-series data become available owing to advances in microarray or other biological technologies, and assuming continued improvement in computational capacity, it can be expected that continuous dynamic model will play a critical role in revealing complicated gene behavior.
Assuming there are N genes of interest and i x denotes the state (such as the microarray reading) of the th i gene, then the dynamics of the GRN may be modeled as where i ν is external noise.
In this study we assume the functions ( i f i , ∀ ) are in the form where

ISSN: 2469-5831
The proposed model includes all the major characteristics of a gene regulatory network: it is nonlinear, dynamic, and noisy.The rationale behind the proposed model are two-fold: first, the proposed model is general and sufficiently flexible to include many well-known models and new models yet to be found; second, the noisy nature of GRNs is modeled explicitly.The deterministic model (without noise) corresponds to the nominal case, while the various stochastic effects are included as noise disturbances.Previous work has modeled these noise types by Gaussian white noise processes [7].The inclusion of noise also enables the proposed model to provide interpretation of the fact that GRNs are robust to noise, by which it is meant that the relationships among genes are not greatly affected by small changes caused by noise.
The nonlinear functions ( i f i , ∀ ) need to be identified from timeseries microarray measurements such that the identification error is minimized and the simplest model structure is selected.Both synthetic data and experimental data from microarray measurements are used to evaluate the proposed method.Note that although the proposed method is tested only using polynomials as the nonlinear terms, it is expected that it should perform similarly well for other choices of nonlinear terms in the proposed model, dependent of course on sufficient data for more complex nonlinear models.

The Algorithm for Noise Analysis in Polynomial Model, S-system model, and Microarray Data
In this section, the polynomial model, S-system model, and particle filter are described in details.The algorithm for noise analysis inside gene regulatory networks is proposed on the basis of particle filter.The key steps of the proposed algorithm are shown in figure 2. The proposed algorithm is shown in details below.

Polynomial model
In this research, polynomials are chosen as the nonlinear components ( ) in the proposed model and ODEs with dynamic polynomials are used in our test cases.The polynomials are utilized as universal approximators.In order to mitigate the effect of "the curse of dimensionality", only second-degree polynomials are selected.Note that an advantage of using low-degree polynomial models is that even when there exists some model mismatch, these models may be sufficiently accurate to represent many real systems, and thus are widely utilized in practice [25].We note that a similar GRN model has been adopted by [26], but without noise being included in the model.The polynomial model is given by: where N is the number of genes, i x is the th i gene, i α , i j β , , and i j k γ , , are constant factors.

S-system model
Inference of GRNs using S-system model from time-series microarray measurement data has attracted a lot of attention recently [27][28][29][30].The S-system model is given by: where i is the index for the genes , , i x are the state variables, i α and i β are the positive rate constants, and i j g , and i j h , are the exponential parameters called kinetic orders.If gene j will induce the expression of gene i .On the contrary, gene j will inhibit the expression of gene i if 0 i j g , < .i j h , will have the opposite effects on controlling gene expressions compared to i j g , .
The S-system is a quantitative model which is characterized by powerlaw functions.It has a rich structure capability of capturing various dynamics in many biochemical systems.In addition, the S-system model has been proven to be successful in modeling GRNs [30,28].Hence, the S-system model is adopted for modeling GRNs in this study.Because the microarray data usually contain noise, it is very hard to pinpoint the exact values of the parameters.Hence, the determined parameters in the system were seen as some kind of distributions [30].

Particle filter
Particle filter is also called Sequential Monte Carlo (SMC) methods.Particle filter is a set of genetic-type particle Monte Carlo methodologies to solve the filtering problem [31].It is a probabilitybased filter.The key idea is to generate a given number M state vectors based on the probability density function (pdf) [31].In particle filter,

ISSN: 2469-5831
the microarray data is represented in terms of the variables k y , which are also a set of noisy observations.A state space model is given below.
The covariance of i µ is R , and the covariance of i ν is Q .
The general procedure of particle filter is given by Algorithm 1.
Algorithm 1: The noise process of GRNs model on the basis of particle filter Input: S-system function , , , i j i f µ ν , the initial stage pdf (x 0 ) 1: Generate M initial posteriori particles on the basis of the pdf (x 0 ): 0, ( 1,...., ).
M is the number of particles.
2: for 1 i K = → do 3: Perform the time propagation step to obtain a priori particles ....., ) Where w − is generated on the basis of pdf of 1 k w − .
4: Compute the relative likelihood q j of each particle x − conditioned on the measurement of y i on the basis of non linear measurement equation (Polynomial and S-system models) and the pdf of the measurement noise Where y * is a specific measurement, and L is the number of elements.

5: Scale the relative likelihoods
6: Resampling step: generate posteriori particles , i j x + on the basis of the relative likelihoods q i 7: Compute the mean and covariance on the basis of , i j x + which are distributed according to the pdf ( ) x y 8: end for 9: Estimate the RMS error where * i X is one specific measurement.

Noise analysis of microarray data using particle filter
The operations of proposed method to analyze noise in GRNs model using particle filter is graphically depicted in figure 2. The corresponding pseudocode of our method is summarized in Algorithm 2 Algorithm 2: The simulation procedure of GRNs model using particle filter 1: Generate data without any noise using Runge Kutta algorithm ( ) .., x a a a a Where h is the time step.
2: Generate data with system noise (Covariance is Q) and measurement noise (Covariance is R) 3: Apply particle filter on the noisy data to estimate the GRNs 4: Estimate the RMS error

Synthetic Simulation Results
The noise analysis method for GRNs models is tested by synthetic data using polynomial and S-system GRNs models.

Synthetic data simulation using polynomial model
In order to test the noise analysis procedure using particle filter, the synthetic network using polynomial model can be approximated as:           x , 2 x , and 3 x can be inferred under external and internal noise environment which make sure to find the relations among genes.
In order to further analyze the noise relation to final results, RMS error is used to testify the effects of different noise level with (0 20) Q ∈ , and (0 0 1) R ∈ , . .The result is shown in figure 6.From the data and figure 6, we can see that RMS error is distributed within the interval [11 34 18 10]   ., .
when Q is in the range[0 20] , .We concluded that the effect of noise R is larger than the effect of noise Q on the final results.In the research of [3], the observation that the time scale for intrinsic noise fluctuations is much shorter than that for extrinsic noise suggests that extrinsic noise may affect cellular phenotype more strongly than intrinsic noise at least in E. coli.Our results further confirmed this conclusion.

Synthetic data simulation using S-system model
In order to examine the effectiveness of the proposed procedures for noise analysis in S-system using particle filter, a synthetic S-System model is used.The original S-system model is given as follows [30].
2.51 0.12 2 1 2 x x x x x x − = − = − Figure 7 and figure 8 show simulation results using particle filter with noise levels by Q = 10 and R = 0.1.In the simulation, K is equal to 200 and the number of particles M is equal to 300 .We notice that particle filter can predict the system with an error so negligible that it cannot be realized by linear filters.Figure 9 shows the relations among RMS error, process noise covariance, and measurement noise covariance.When measurement noise covariance increases, the RMS error increases more.This indicates that the increase of measurement noise covariance can incur much more error than the process noise covariance.

Microarray Data Simulation
During this part of the simulation, time-series gene-expression data corresponding to yeast protein synthesis [32]     Particle filter is applied to analyze the data with noise.The particle filter estimate is the mean of particles which caused the time shift or delay on the curve.It is inherent in particle filter and is called mean shift.The result is shown by figure 11, figure 12, figure 13, figure 14, and figure 15 for genes HAP1, CYB2, CYC7, CYT1 and COX5A.The branch pathway model is shown by figure 16 on the basis of the above results.The relations are in agreement with the biological experiment findings in [32].

Conclusions
In this paper, process noise and measurement noise of gene regulatory networks is analyzed using particle filter.The synthetic models on the basis of polynomial and S-system are studied to analyze the relations among RMS, measurement noise, and process noise.We found out that measurement noise is the main reason to incur the RMS error conforming to results from biological experimental research.Noise inside microarray data is considered and analyzed with five real genes.

Acknowledgement
The work of Haixin Wang and Dawit Aberra was supported by NSF Grant 1435152.

Figure 1 :
Figure 1: A simple Bayesian network structure.

Figure 2 :
Figure 2: Block diagram of noise analysis using particle filter.

Figure 6 :
Figure 6: The relations of Q and R to Standard RMS Error using polynomial model.

Figure 9 :
Figure 9: The relations of Q and R to Standard RMS Error using S-system model.

Figure 3 ,
Figure 3, Figure 4, and Figure 5 show simulation results by particle filter.Genes 1x , 2 x , and 3x can be inferred under external and internal noise environment which make sure to find the relations among genes.

Figure 10 :
Figure 10: The microarry data with noise in S-system model.
Synthetic data are tested in the model because the quality and quantity of the available real data are not enough for the proposed model.