Kalman Filtering for Genetic Regulatory Networks with Missing Values

Lin, Qiongbin; Liu, Qiuhua; Lai, Tianyue; Wang, Wu

doi:https://doi.org/10.1155/2017/7837109

Computational and Mathematical Methods in Medicine

Research Article | Open Access

Volume 2017 | Article ID 7837109 | https://doi.org/10.1155/2017/7837109

Qiongbin Lin,¹ Qiuhua Liu,¹ Tianyue Lai,¹ and Wu Wang¹,²

Academic Editor: Konstantin Blyuss

Received 17 Mar 2017

Accepted 08 Jun 2017

Published 26 Jul 2017

Abstract

This paper addresses the filtering problem for genetic regulatory networks (GRNs) with missing values, where noise enters both the state dynamics and the measurement equation and the process noise is correlated with the measurement noise. A class of discrete-time GRNs with missing values, noise correlation, and time delays is first established. A new observation model is then proposed to reduce the adverse effect of the missing values and to decouple, in theory, the correlation between process noise and measurement noise. Finally, a Kalman filter is used to estimate the states of the GRNs. A typical example verifies the effectiveness of the proposed method: the concentrations of mRNA and protein are estimated accurately.

1. Introduction

According to the central dogma of genetics, a specific protein is generated by a complex gene expression process (including transcription, translation, and other interactions) among DNAs, RNAs, and gene products [1, 2]. For gene expression to proceed correctly, each of its stages must be regulated, and the regulation functions of all stages together form genetic regulatory networks (GRNs). Clearly, gene expression levels are determined by GRNs. Accordingly, many GRN models have been built to track the concentrations of mRNA and protein, such as the Boolean model [3, 4], the Bayesian model [5–7], the differential equation model [8–11], and the state-space model [12, 13]. However, owing to system uncertainties, time-varying delays [14–16], and missing data [17, 18] in real gene expression processes, the measurements obtained from the sensor are usually contaminated by noise and do not represent the true values well. Many filtering methods have therefore been proposed to recover the true values.

In studying the stability of genetic regulatory networks, noise disturbances, which consist mainly of process noise and measurement noise, are a factor that cannot be ignored. To suppress these disturbances, filtering methods such as the H∞ filter [19] and the Kalman filter [20] have been proposed to obtain stable GRNs. Although these methods usually account for both process noise and measurement noise, they ignore the correlation between the two and are therefore less general. In this paper, to make the filtering method more general, the correlation between process noise and measurement noise is taken into consideration and is decoupled in theory.

Gene expression levels (the concentrations of mRNA and protein) are generally measured with DNA microarray technology, but values can go missing for many reasons, such as dust or scratches on the slide, inappropriate thresholds in preprocessing, insufficient resolution of the microarray, experimental errors during laboratory processes, or image corruption [18]. The measured gene expression levels are therefore distorted to some degree, so the measured concentration deviates from the real concentration. To overcome this drawback, set-values filtering for GRNs with missing values was proposed in [17, 21]. Although that method handles the problem well, it does not describe the missing values with an explicit mathematical formula. In this paper, an observation model with missing values is therefore given, and a Kalman filter is designed to obtain stable GRNs with missing values.

In this paper, an estimation problem for a class of discrete-time GRN models with time-varying delays, missing values, and noise correlation is considered. The rest of the paper is organized as follows. In Section 2, a discrete model of genetic regulatory networks is introduced, an observation model with missing values is built to describe them in mathematical form, and the correlation between process noise and measurement noise is decoupled in theory. In Section 3, a Kalman filter is designed to estimate the real concentrations of the GRNs, and its stability is analyzed. In Section 4, a typical example is provided to illustrate the effectiveness of the proposed method.

2. Problem Formulation

A discrete-time model of genetic regulatory networks (GRNs) can be described as follows [21–23]: where the descriptions of the system's parameters are shown in Table 1.

In addition, is a monotonic function in Hill form, which represents the feedback regulation of the protein. Here, , where is the Hill coefficient and is positive constant.

Let and denote the equilibrium points of system (1); define Thus, system (1) can be rewritten as Based on the first-order Taylor expansion, , system (3) can be expressed as
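As an illustration of the linearization step, the sketch below evaluates a Hill function and its first-order Taylor expansion about an equilibrium point. The increasing form (x/β)^H / (1 + (x/β)^H), the function names, and the numerical values are assumptions made for this example and are not taken from the paper's equations.

```python
import numpy as np

def hill(x, H=2.0, beta=1.0):
    """Increasing Hill function (x/beta)^H / (1 + (x/beta)^H), for x > 0."""
    r = (x / beta) ** H
    return r / (1.0 + r)

def hill_slope(x, H=2.0, beta=1.0):
    """Analytic derivative of hill() with respect to x, for x > 0."""
    r = (x / beta) ** H
    return H * r / (x * (1.0 + r) ** 2)

def linearized_hill(x, x_eq, H=2.0, beta=1.0):
    """First-order Taylor expansion of hill() about the point x_eq."""
    return hill(x_eq, H, beta) + hill_slope(x_eq, H, beta) * (x - x_eq)

# Hypothetical equilibrium x_eq = 1.0 and a nearby point x = 1.1.
exact = hill(1.1)
approx = linearized_hill(1.1, x_eq=1.0)
print(f"exact={exact:.4f}, linearized={approx:.4f}")
```

The approximation is only trustworthy close to the equilibrium; the further the state moves from `x_eq`, the larger the neglected higher-order terms become.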

In practice, the actual GRNs might be influenced by the dynamic reaction of the networks, time delays, and molecular noise. Based on system (4), discrete-time GRNs with an observation equation and noises are considered: where , is the sampled output, is the external noise, is the process noise, is the noise driven matrix, and is the observation matrix. In addition,

Then, in order to handle the time delay of system (5), a new state vector is defined as follows: Using the new state variable (7) gives where and are white, zero-mean, correlated noises; furthermore,
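The idea of absorbing a delay into an enlarged state vector can be sketched as follows. This is a minimal example under assumptions that are not from the paper: a constant delay `d`, a noise-free linear recursion x(k+1) = A x(k) + B x(k-d), and hypothetical names `augment_delay`, `A`, and `B`.

```python
import numpy as np

def augment_delay(A, B, d):
    """Build F such that X(k+1) = F X(k) for x(k+1) = A x(k) + B x(k-d).

    The stacked state is X(k) = [x(k), x(k-1), ..., x(k-d)], with constant d >= 1.
    """
    n = A.shape[0]
    F = np.zeros((n * (d + 1), n * (d + 1)))
    F[:n, :n] = A                    # current state contributes to x(k+1)
    F[:n, n * d:] += B               # delayed state x(k-d) contributes to x(k+1)
    F[n:, :n * d] = np.eye(n * d)    # shift register: block i moves to block i+1
    return F

# Illustrative values only.
A = np.array([[0.5, 0.1], [0.0, 0.4]])
B = np.array([[0.0, 0.2], [0.3, 0.0]])
F = augment_delay(A, B, d=2)
print(F.shape)
```

The augmented system has no explicit delay, so a standard state-space filter can be applied to it, at the cost of a state dimension that grows linearly with the delay.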

In the measurement model with missing values, a measurement is lost with a certain probability, so the model can be described as follows [24]: where is received by the estimator, the initial state is independent of , , and and satisfies the fact that , , and obey the Bernoulli distribution, and it is uncorrelated with other random variables. There are two basic properties about : where . If , it means the measurement value is lost at , and there is no missing value with . More properties of the distribution of are shown in [24].
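A Bernoulli dropout of this kind can be simulated in a few lines. The sketch assumes the simple multiplicative form y = γ C x + v, with γ equal to 1 when the measurement arrives and 0 when it is lost; the names `observe_with_dropouts` and `arrival_prob` and all numerical values are hypothetical, and the paper's observation model may contain further terms.

```python
import numpy as np

def observe_with_dropouts(x, C, R, arrival_prob, rng):
    """Return (y, gamma) for y = gamma * C x + v, gamma ~ Bernoulli(arrival_prob)."""
    gamma = int(rng.random() < arrival_prob)             # 1: received, 0: lost
    v = rng.multivariate_normal(np.zeros(C.shape[0]), R)  # measurement noise
    return gamma * (C @ x) + v, gamma

rng = np.random.default_rng(0)
C = np.array([[1.0, 0.0]])
R = np.array([[0.01]])
x = np.array([2.0, -1.0])

# Empirical check of the Bernoulli moments E[gamma] = p and Var[gamma] = p(1 - p).
gammas = np.array([observe_with_dropouts(x, C, R, 0.8, rng)[1] for _ in range(5000)])
print(gammas.mean(), gammas.var())
```

Because γ takes only the values 0 and 1, γ² = γ, which is why the first and second moments coincide.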

Then, substituting the observation equation of system (8) into (10), a discrete-time model of GRNs with an observation equation with missing values is established as follows: where Let denote the autocovariance matrix of , denote the autocovariance matrix of , and denote the cross-covariance matrix of and .

For (12), there is some statistical information: where and where .

To simplify the calculation, , , and can be broken down into some simple separations as follows:

Since the process noises of this system are correlated with the observation noises, to decouple the correlation between and , according to system (12), ; obviously, and then adding (17) to the state equation of (12), we have where . Clearly, the last two terms in (18) are the process noises

Since Kalman filtering requires the process noise and the measurement noise to be white, uncorrelated Gaussian noises, first consider the correlation between process noise and measurement noise: Let , and then is Clearly, if is chosen as (21), and are uncorrelated.

Secondly, we discuss , so if , is a white, zero-mean noise.
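The decoupling idea can be checked numerically with the textbook construction for correlated noises. In the sketch below, `Q`, `R`, and `S` stand for the process-noise covariance, the measurement-noise covariance, and their cross-covariance; these symbols, the function name `decouple`, and the numbers are assumptions for illustration, and the gain in (21) may differ once the missing-value terms are included.

```python
import numpy as np

def decouple(Q, R, S):
    """Return (J, Q_star) such that w_star = w - J v is uncorrelated with v.

    Q = Cov(w), R = Cov(v), S = E[w v^T]; R must be invertible.
    """
    J = S @ np.linalg.inv(R)     # gives E[(w - J v) v^T] = S - J R = 0
    Q_star = Q - J @ S.T         # Cov(w - J v) = Q - S R^{-1} S^T
    return J, Q_star

# Illustrative covariances only.
Q = np.array([[1.0, 0.2], [0.2, 0.5]])
R = np.array([[0.4]])
S = np.array([[0.3], [0.1]])
J, Q_star = decouple(Q, R, S)
print(J.ravel(), np.allclose(S - J @ R, 0.0))
```

The new process noise has a smaller covariance than the original one, since the part of it that could be predicted from the measurement noise has been removed.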

3. Main Results

In this section, a Kalman filter is designed to obtain the minimum variance estimate. First, the expression of the filtering error is calculated; then the Kalman gain is obtained by minimizing the covariance matrix of the filtering error; finally, the recursion of the filtering error is calculated, which completes the design of the Kalman filter.

According to system (12) and (18), the state prediction equation can be calculated as and the measurement update equation is

So, the optimal state estimation is where denotes the Kalman gain.

Then, the posterior estimation error can be computed as follows: and the covariance matrix of estimation error can be described as Substituting (26) into (27) gives Then, is designed to minimize , and thus, can be rewritten as where Let ; the covariance matrix of estimation error is minimized. Thus Furthermore,

According to (18) and (25), the estimation error can be obtained; thus

The linear optimal filter given by (23), (25), (33), and (35) is uniformly asymptotically stable when the linear discrete-time-varying stochastic system (12) is uniformly controllable and observable [24].
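The predict and update steps of a minimum-variance linear filter with randomly missing measurements can be sketched as below. This is not the paper's recursion (23)–(35): it assumes the simplified model x(k+1) = A x(k) + w(k), y(k) = γ(k) C x(k) + v(k), with uncorrelated noises and an estimator that knows only the arrival probability `p`. All names (`predict`, `update`, `M` for the state second moment E[x xᵀ]) are hypothetical.

```python
import numpy as np

def predict(x_hat, P, M, A, Q):
    """Time update of the estimate, the error covariance P, and M = E[x x^T]."""
    return A @ x_hat, A @ P @ A.T + Q, A @ M @ A.T + Q

def update(x_pred, P_pred, M_pred, y, C, R, p):
    """Measurement update when only the arrival probability p is known."""
    innov = y - p * (C @ x_pred)                 # predicted measurement is p C x_pred
    # Innovation covariance: estimation error, dropout uncertainty, sensor noise.
    S = p**2 * C @ P_pred @ C.T + p * (1 - p) * C @ M_pred @ C.T + R
    K = p * P_pred @ C.T @ np.linalg.inv(S)      # gain minimizing the error covariance
    x_hat = x_pred + K @ innov
    P = P_pred - p * K @ C @ P_pred
    return x_hat, P, K

# Illustrative numbers only.
P_pred = np.array([[2.0, 0.3], [0.3, 1.0]])
M_pred = P_pred + np.eye(2)
C = np.array([[1.0, 0.0]])
R = np.array([[0.5]])
x_hat, P, K = update(np.array([1.0, -1.0]), P_pred, M_pred, np.array([1.4]), C, R, p=0.7)
print(K.ravel(), np.trace(P))
```

With `p = 1` the dropout term vanishes and the update reduces to the standard Kalman gain; as `p` decreases, the extra term in `S` makes the filter trust each measurement less.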

4. Numerical Example

In this section, an example is provided to show the effectiveness of the proposed method. The dynamics of the repressilator network have been studied experimentally in Escherichia coli [25], and the model of the 3-gene repressilator is given as follows: where denotes the concentrations of the three mRNAs and denotes the concentrations of the three repressor proteins, is the feedback regulation coefficient, denotes the ratio of the protein decay rate to the mRNA decay rate, and is the Hill coefficient, ; .
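For orientation, the repressilator can be simulated in its commonly used dimensionless form, dm_i/dt = -m_i + α/(1 + p_j^H) and dp_i/dt = -β(p_i - m_i), where gene i is repressed by the protein of the preceding gene in the cycle. The sketch below uses a plain forward-Euler step rather than the discretization method of [26], and the parameter values (α = 2, β = 1, H = 2), step size, and initial states are illustrative assumptions, not the values used in the paper.

```python
import numpy as np

def repressilator_step(m, p, alpha, beta, H, dt):
    """One forward-Euler step of the dimensionless 3-gene repressilator."""
    repressor = np.roll(p, 1)                  # gene i is repressed by protein i-1 (cyclic)
    dm = -m + alpha / (1.0 + repressor ** H)   # mRNA: decay plus repressed transcription
    dp = -beta * (p - m)                       # protein: translation minus decay
    return m + dt * dm, p + dt * dp

def simulate(m0, p0, alpha=2.0, beta=1.0, H=2.0, dt=0.01, steps=10000):
    """Iterate the Euler step from the initial mRNA and protein concentrations."""
    m, p = np.asarray(m0, dtype=float), np.asarray(p0, dtype=float)
    for _ in range(steps):
        m, p = repressilator_step(m, p, alpha, beta, H, dt)
    return m, p

m_final, p_final = simulate([0.2, 0.1, 0.3], [0.1, 0.4, 0.5])
print(m_final, p_final)
```

For these illustrative values the symmetric equilibrium satisfies x(1 + x²) = 2, that is, x = 1 for every mRNA and protein, which gives a reference point for judging a noise-free run.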

The discrete-time GRNs model based on the method in [26] can be obtained as

Let , the Hill coefficient , the time-delay , , and the other parameters are taken as follows:

So, the parameters of system (4) can be obtained:

According to system (3), the mRNAs and proteins regulate each other and also degrade over time, so the GRNs tend to an equilibrium in the absence of noise disturbances. The unique equilibrium can be checked easily when ; the system's states and with are shown in Figures 1 and 2.

Figures 1 and 2 show that the states of the GRNs settle at a fixed point, so the equilibrium can be calculated; that is,

Now, check the states of system (37) under the excitation of external disturbances; let the initial states be and , , and (where , ); the estimated concentrations of mRNA and proteins are shown in Figures 3–8.

In Figures 3–8, the blue lines show the estimated values of the mRNAs and proteins, and the green lines show the equilibrium of the GRNs. The concentrations of the mRNAs and proteins track the equilibrium well under the excitation of external disturbances, so the Kalman filter designed in this paper is effective for GRNs with missing values and noise correlation.

To test the influence of the missing rate, experiments with four missing rates of 10%, 20%, 30%, and 50% are carried out. The normalized root mean squared error (NRMSE) [27] is used to indicate the influence of the missing rate, and the NRMSE is defined as The resulting NRMSE values are shown in Table 2.
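One common way to compute an NRMSE is to divide the root mean squared error by the standard deviation of the reference values, as sketched below. This normalization and the function name `nrmse` are assumptions; the definition used for Table 2 may normalize differently.

```python
import numpy as np

def nrmse(estimate, truth):
    """Root mean squared error divided by the standard deviation of the truth."""
    estimate = np.asarray(estimate, dtype=float)
    truth = np.asarray(truth, dtype=float)
    return np.sqrt(np.mean((estimate - truth) ** 2)) / np.std(truth)

# Illustrative data: a reference trajectory and a slightly perturbed estimate.
truth = np.array([1.0, 1.2, 0.9, 1.1, 1.3])
estimate = truth + np.array([0.02, -0.01, 0.03, 0.0, -0.02])
print(nrmse(estimate, truth))
```

Under this definition a perfect estimate scores 0 and an estimator that always returns the mean of the reference scores 1. The normalization is undefined when the reference is constant, so a different denominator would be needed for a trajectory that sits exactly at an equilibrium.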

As the missing rate increases from 10% to 30%, the NRMSE listed in Table 2 increases only slightly, whereas the NRMSE obtained by the set-membership filtering in [17] increases greatly with the missing rate. At low missing rates the set-membership filtering performs better, but at high missing rates the method proposed in this paper is more appropriate; the cut-off point is roughly 14.66%. Thus, the proposed method is the more effective choice for the filtering problem for GRNs when the missing rate is high.

5. Conclusion

In this paper, a discrete model of genetic regulatory networks is introduced, an observation model with missing values is built to describe them in mathematical form, and the correlation between process noise and measurement noise is decoupled in theory. A Kalman filter is then designed to obtain stable GRNs. The simulation results show that the proposed method is effective for GRNs with missing values and that, compared with set-membership filtering, the Kalman filter performs better when the missing rate is high.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

[1] N. S. Hosseini and S. Ozgoli, “Delay-dependent filtering for stochastic nonlinear genetic regulatory networks with time-varying delays and extrinsic noises,” in Proceedings of the 2013 21st Iranian Conference on Electrical Engineering, ICEE 2013, pp. 1–6, May 2013.
[2] P. Smolen, D. A. Baxter, and J. H. Byrne, “Mathematical modeling of gene networks,” Neuron, vol. 26, no. 3, pp. 567–580, 2000.
[3] S. Huang, “Gene expression profiling, genetic networks, and cellular states: An integrating concept for tumorigenesis and drug discovery,” Journal of Molecular Medicine, vol. 77, no. 6, pp. 469–480, 1999.
[4] R. Somogyi and C. A. Sniegoski, “Modeling the complexity of genetic networks: understanding multigenic and pleiotropic regulation,” Complexity, vol. 1, no. 6, pp. 45–63, 1996.
[5] P. Kellam, X. Liu, N. Martin, C. Orengo, S. Swift, and A. Tucker, “A framework for modelling virus gene expression data,” Intelligent Data Analysis, vol. 6, no. 3, pp. 267–279, 2002.
[6] T.-F. Liu, W.-K. Sung, and A. Mittal, “Model gene network by semi-fixed Bayesian network,” Expert Systems with Applications, vol. 30, no. 1, pp. 42–49, 2006.
[7] K. Murphy et al., “Modelling gene expression data using dynamic Bayesian networks,” Technical report, Computer Science Division, University of California, Berkeley, CA, USA, 1999.
[8] M. De Hoon, S. Imoto, K. Kobayashi, N. Ogasawara, and S. Miyano, “Inferring gene regulatory networks from time-ordered gene expression data of Bacillus subtilis using differential equations,” Pacific Symposium on Biocomputing, p. 17, 2002.
[9] P. D'Haeseleer, X. Wen, S. Fuhrman, and R. Somogyi, “Linear modeling of mRNA expression levels during CNS development and injury,” in Proceedings of the Pacific Symposium on Biocomputing, vol. 4, pp. 41–52, Mauna Lani, Hawaii, USA, 1999.
[10] N. S. Holter, A. Maritan, M. Cieplak, N. V. Fedoroff, and J. R. Banavar, “Dynamic modeling of gene expression data,” Proceedings of the National Academy of Sciences of the United States of America, vol. 98, no. 4, pp. 1693–1698, 2001.
[11] Z. Wang, H. Gao, J. Cao, and X. Liu, “On delayed genetic regulatory networks with polytopic uncertainties: Robust stability analysis,” IEEE Transactions on Nanobioscience, vol. 7, no. 2, article no. 8, pp. 154–163, 2008.
[12] C. Rangel, J. Angus, Z. Ghahramani et al., “Modeling T-cell activation using gene expression profiling and state-space models,” Bioinformatics, vol. 20, no. 9, pp. 1361–1372, 2004.
[13] F. X. Wu, W. J. Zhang, and A. J. Kusalik, “Modeling gene expression from microarray expression data with state-space equations,” in Proceedings of the Pacific Symposium on Biocomputing, vol. 9, pp. 581–592, Hawaii, USA, 2004.
[14] Q. Zhou, X. Shao, H. Reza Karimi, and J. Zhu, “Stability of genetic regulatory networks with time-varying delay: delta operator method,” Neurocomputing, vol. 149, pp. 490–495, 2015.
[15] R. Rakkiyappan, A. Chandrasekar, F. A. Rihan, and S. Lakshmanan, “Exponential state estimation of Markovian jumping genetic regulatory networks with mode-dependent probabilistic time-varying delays,” Mathematical Biosciences, vol. 251, pp. 30–53, 2014.
[16] T. Jiao, G. Zong, and W. Zheng, “New stability conditions for GRNs with neutral delay,” Soft Computing, vol. 17, no. 4, pp. 703–712, 2013.
[17] W. Wang, X. Liu, Y. Li, and Y. Liu, “Set-membership filtering for genetic regulatory networks with missing values,” Neurocomputing, vol. 175, pp. 466–472, 2015.
[18] S. Oba, M.-A. Sato, I. Takemasa, M. Monden, K.-I. Matsubara, and S. Ishii, “A Bayesian missing value estimation method for gene expression profile data,” Bioinformatics, vol. 19, no. 16, pp. 2088–2096, 2003.
[19] A. Liu, L. Yu, W.-A. Zhang, and B. Chen, “H∞ filtering for discrete-time genetic regulatory networks with random delays,” Mathematical Biosciences, vol. 239, no. 1, pp. 97–105, 2012.
[20] Z. Wang, X. Liu, Y. Liu, J. Liang, and V. Vinciotti, “An extended Kalman filtering approach to modeling nonlinear dynamic gene regulatory networks via short gene expression time series,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 6, no. 3, pp. 410–419, 2009.
[21] D. Zhang, H. Song, L. Yu, Q.-G. Wang, and C. Ong, “Set-values filtering for discrete time-delay genetic regulatory networks with time-varying parameters,” Nonlinear Dynamics, vol. 69, no. 1-2, pp. 693–703, 2012.
[22] C. Li, L. Chen, and K. Aihara, “Stability of genetic networks with SUM regulatory logic: Lur’e system and LMI approach,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 53, no. 11, pp. 2451–2458, 2006.
[23] Q. Ye and B. Cui, “Mean square exponential and robust stability of stochastic discrete-time genetic regulatory networks with uncertainties,” Cognitive Neurodynamics, vol. 4, no. 2, pp. 165–176, 2010.
[24] Y. Xu and W. Wang, “Kalman filtering for systems with multiple packet dropouts,” in Proceedings of the 2010 8th World Congress on Intelligent Control and Automation, WCICA 2010, pp. 4996–5001, China, July 2010.
[25] M. B. Elowitz and S. Leibler, “A synthetic oscillatory network of transcriptional regulators,” Nature, vol. 403, no. 6767, pp. 335–338, 2000.
[26] J. Cao and F. Ren, “Exponential stability of discrete-time genetic regulatory networks with delays,” IEEE Transactions on Neural Networks, vol. 19, no. 3, pp. 520–523, 2008.
[27] J. Hu, H. Li, M. S. Waterman, and X. J. Zhou, “Integrative missing value estimation for microarray data,” BMC Bioinformatics, vol. 7, article 449, 2006.

Copyright

Copyright © 2017 Qiongbin Lin et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
