Abstract

Artificial gene circuits have been proposed to be embedded into microbial cells that function as switches, timers, oscillators, and the Boolean logic gates. Building more complex systems from these basic gene circuit components is one key advance for biologic circuit design and synthetic biology. However, the behavior of bioengineered gene circuits remains unstable and uncertain. In this study, a nonlinear stochastic system is proposed to model the biological systems with intrinsic parameter fluctuations and environmental molecular noise from the cellular context in the host cell. Based on evolutionary systems biology algorithm, the design parameters of target gene circuits can evolve to specific values in order to robustly track a desired biologic function in spite of intrinsic and environmental noise. The fitness function is selected to be inversely proportional to the tracking error so that the evolutionary biological circuit can achieve the optimal tracking mimicking the evolutionary process of a gene circuit. Finally, several design examples are given in silico with the Monte Carlo simulation to illustrate the design procedure and to confirm the robust performance of the proposed design method. The result shows that the designed gene circuits can robustly track desired behaviors with minimal errors even with nontrivial intrinsic and external noise.

1. Introduction

The cell is the functional unit of all living things either unicellular or multicellular [1]. A cell can sense many different signals from the internal or external and respond to the constantly changing environment via appropriate cellular processes. Also, cells can interact to each other by cell-cell communication and achieve specific physiological functions essential for life cooperatively [1]. However, there are many fundamental questions: how cellular phenomena arise from the interactions among genes and proteins, what features make the cell operate reliably in diverse conditions, and how cells are responsible for the reliable operation. To gain insight into the questions, one could construct and analyze the underlying mechanisms that constitute the web of interactions. This idea is useful to separate a complicated network into many simpler ones that resemble the modules of gene regulation. These modules, which act as simple switches or oscillators, can work on independently and may be combined eventually into a functional entity. While the concept of synthetic gene networks is still in its infancy, the long-term goal of this work is to design and manufacture the biological systems with predictable functions [2, 3].

In recent years, the use of existing genetic engineering technologies together with concepts in circuit design has developed a new strategy to design gene networks [36]. In this approach, the web of interactions among gene and protein is then synonymous with genetic circuit which contains standard and well-characterized components such as promoters, ribosome binding sites, and regulatory sequences. These biological parts can be combined into devices with different configurations, which can be further integrated into a functional whole [3, 7]. In order to build genetic circuits with the specified purpose, one may need the design process that involves describing the specification with respect to the desired goal, converting the technical details into block diagrams at the abstract stage, evaluating the feasibility via mathematical model and computational simulation, and repeating the earlier stage until the parts in the combination and configuration could exhibit suitable performance. For example, Terzer et al. [8] present the construction of a genetic half adder that comprises two main biological gates, that is, AND gate and XOR gate. Two biological stimuli, light signal and chemical signal, are sensed by respective device. This composition allows the half adder to be in one of three possible states, RFP synthesis, GFP synthesis, or the absence of both reporter proteins. In the case when one of two stimuli signals is present, the XOR gate is activated and produces RFP protein. When two stimuli signals are present, the AND is activated and leads to the synthesis of GFP protein. For lack of both stimuli signals, the state is maintained and none of reporter protein is synthesized. This study includes mathematical and computational analysis in determining the combination of parts among different configurations to obtain the desired function. A similar design process has been used in other applications such as switch [911], oscillator [12], timer [13], counter [14], gene regulation [15], and pattern formation [16].

Although synthetic biologists have accomplished a great deal in a short time, major obstacles remain to be overcome before the practical application of the technology can be realized. One important problem is that the behavior of bioengineered systems remains noisy and uncertain. Synthetic gene circuits also tend to mutate rapidly and become nonfunctional [1720]. Generally, the variability in biological circuits might be classified into two main categories: intrinsic fluctuations, which are associated with fluctuations in transcription and translation, random changes in DNA sequence and variations in molecular concentrations, and extrinsic disturbances, which correspond to interactions with extra cellular environment [2]. If the intrinsic fluctuations and extrinsic disturbances are all considered in mathematical model and computational simulation to mimic the realistic environment in the host cell, the biological circuits will be more robust towards the desired goal under these uncertainty conditions. Therefore, in this work, a nonlinear stochastic system with state-dependent noise is introduced to model the biological systems in vivo with intrinsic perturbations and extrinsic noise in the cellular context. In fact, a number of studies recently have developed methodologies for nonlinear stochastic systems to assist the evaluation of parts in combinations and configurations. Sensitivity analysis [21] is a common way used in genetic circuit design to understand the effect of elements on overall system performance. As compared with the views available to the conventional design strategies for assembling devices by intuition alone, the sensitivity information may offer a possible guideline concerning the selection of the optimal configurations to satisfy the design consideration. Such analysis tool can help the choice of an appropriate element combination for biological circuit that functions well in noisy and uncertain environment. Recently, some robust synthetic gene network design methods have been proposed to select certain kinetic parameters of a gene circuit so that the effect of random parameter fluctuations and environmental noise can be efficiently attenuated by game theory [22] or the synthetic gene circuit can track a desired reference to satisfy design specifications [23].

As yet, however, applying the analysis of nonlinear stochastic system to evaluate the flexibility in combination of biological components is not straightforward in general. In light of natural selection on traits best-suited for environmental change being an important mechanism in evolution [24], the question arises whether a similar strategy can be adapted for genetic circuits design. Inspired by biological evolution such as mutation, recombination, and selection, the evolutionary algorithm is a population-based methodology to solve optimization problems [25]. Recently, evolutionary systems biology focuses on links between evolution and function, that is, evolutionary interplay between the genotype and phenotype, to develop methods to reconstruct and compare transcriptional regulatory network [2629]. Here, an evolutionary systems biology algorithm is employed in tuning the kinetic parameters with respect to component characteristics to maximize the fitness under the parameter variations and molecular noise in the cellular context. In order to mimic the naturally occurring biological systems in the evolutionary process, the fitness function is selected to be inverse proportion to the tracking error so that evolutionary kinetic parameters of biological circuit can achieve the optimal tracking via the maximization of fitness with all speed to mimic the evolution process of a gene circuit. If the adaptations of kinetic parameters of biological circuit are reflected by the proposed evolutionary systems biology algorithm to achieve the optimal fitness, the evolutionary gene circuit will track the desired biologic function in spite of intrinsic parameter fluctuations and extrinsic noise and will behave more robustly inside a living cell.

In this study, the intrinsic parameter fluctuations and external noise are modeled in the stochastic dynamic systems of biologic circuits to mimic the stochastic behaviors of biological system in the host cells. Then, based on the evolutionary systems biology algorithm, the kinetic parameters of biologic circuits can be self-adaptively tuned to optimally track the desired biologic function to achieve the optimal fitness to mimic the natural selection of naturally occurring gene circuits in evolution. Because the optimal tracking of evolutionary gene circuit to a desired logical function is achieved under intrinsic parameter fluctuations and external noise, the designed gene circuit will be more robust when embedded in the host cells with the similar stochastic environments. The remainder of this article is organized as follows. In the next section, we provide a brief description of the biologic model under intrinsic parameter fluctuation and extrinsic noise. Then an evolutionary systems biology design algorithm and several design examples with computer simulation will be presented. Finally, a conclusion is given to summarize the results in the last section.

2. Stochastic Model for Biological System In Vivo under Intrinsic Parameter Fluctuations and Extrinsic Noise

The dynamic behaviors of biological circuits have been modeled by nonlinear differential equations. These nonlinear differential equations contain four types of dynamics to describe constitutive transcription, enzymatic transformation, regulated transcription, and translation [30]. A simple biological module as shown in Figure 1 is given to illustrate the nonlinear dynamic equation of these biochemical processes [8]. Suppose this module generates an output signal only when it gets biochemical signals from both of its inputs. Based on the requirement, the substrate is designed to bind an enzyme essential for the reaction, which can be triggered by the input signals, say and , cooperatively. In the process, the two proteins can combine together to form a complex which then breaks down into the production releasing , that is, the conversion of substrate into production is catalyzed by an enzyme . Here the presence of production alone leads to the synthesis of the output signal, the reporter protein GFP. The following is a more detailed system description about dynamic interaction equations for biological module in synthetic gene circuit.

2.1. Constitutive Transcription

The constitutive transcription is the process of creating message RNA from the stimulation of biochemical signal. In this case, transcription of gene occurs only when the input signal is present. This is similar to gene if only the input signal appears. Both of these reactions result in the accumulation of mRNA molecular with respect to each gene, which can be balanced by the degradation process. Thus the dynamics of mRNA governed by the production and degradation as the equations below: where is the concentration of mRNA transcribed from and is the concentration of mRNA transcribed from . The input signals, say and , represent the number of RNA polymerases per unit of time to transcribe the gene and gene , respectively. The parameters and stand for the reaction rate constant; the parameters and are the degradation constant for mRNA.

2.2. Enzymatic Transformation

Followed by the constitutive transcription of gene and gene , the mRNA molecules can be further translated into substrate and enzyme . The process of enzymatic transformation is for the reaction catalyzed by an enzyme, which helps convert substrates into productions. Here we suppose that the substrate   combined with can form the complex which then dissociates into enzyme and production . The reverse reaction for complex synthesis is also considered here. Thus the change in concentration of substrate , enzyme , complex , and production with respect to time is given by where , , , and denote the concentration of substrate , enzyme , complex , and production , respectively. The parameters and stand for the kinetic constants of translation for substrate and enzyme , respectively. The parameter is the forward reaction rate constant for complex , and is the backward one. The production formation rate constant is . The parameters , , and are the degradation constants for substrate , enzyme , complex and production , respectively.

2.3. Regulated Transcription

The process related to the transcriptional regulation is to create message RNA resulted from the control of transcription factor. Transcription factor can bind to a specific DNA site to increase the rate of transcription when an activator, or to reduce the rate when a repressor. In this example, the production is made to activate the green fluorescent protein gene, rendering the accumulation of mRNA molecules. It means that the transcription of the green fluorescent protein gene is regulated by an activator, that is, the production . Then the mRNA dynamics for the GFP gene is described as the equation below: where denotes the concentration of mRNA for green fluorescent protein gene. The parameters and are the kinetic constant of transcription and degradation constant, respectively. represents the constant of background level. Cooperativeity is denoted by the Hill coefficient and is the Hill constant.

2.4. Translation

Translation corresponds to the process for coding protein from message RNA. Here the mRNA codon with respect to green fluorescent protein gene is translated into the output signal, the reporter protein GFP. It leads to the accumulation of reporter protein, which can be balanced by the degradation process. Then the dynamical evolution of the output signal is affected by synthesis and degradation. Consequently, the term of the GFP rate equation will be as follows: where represents the concentration of the reporter protein. The parameters and are the kinetic constant of translation and degradation constant for GFP, respectively.

From the nonlinear differential equations as mentioned above, it can be known that the dynamics of a biological module depends on some factors, such as the kinetic constants, degradation constant, and background level. However, these factors or parameters are uncertain inherently and the biological circuit also suffers from environmental noise in the context of host cell. In this situation, the dynamic model of biological circuit in vivo should be modified as follows: where , denote the amplitudes of parameter fluctuations around kinetic constants and degradation constants due to genetic variations, respectively. , , , and are random white noise with zero mean and unit variance, which denote the independent random fluctuation sources in transcription, translation, reaction, and degradation process, respectively. The variances of kinetic parameter perturbations are given as , ; that is, and represent the standard deviations of the corresponding stochastic parameter fluctuations. denotes the corresponding external stochastic noise with variance . Note that the perturbation of background level is merged into . Then depending on the stochastic system in (5), a more general stochastic model for any synthetic biological gene circuit with intrinsic parameter fluctuations and extrinsic noise in vivo can be represented as a set of nonlinear stochastic differential equation with state-dependent noise in the form where is a nonlinear nominal interaction vector function concerning the state vector for concentrations of reactant species, kinetic constants , and input signals . are the independent intracellular random fluctuation sources. are fluctuation functions due to random fluctuation source . represents the vector of external disturbances. stands for the output vector. In Figure 1, and .

In real biological system, the gene circuit in (6) could evolve adaptively with kinetic parameters in by natural selection through mutation and genetic variation so that can robustly achieve some desired behavior in spite of intracellular molecular noise and external disturbance. In this study, we will mimic the evolutionary biological system to adapt the kinetic parameters of synthetic gene network in (6) through an evolutionary systems biology algorithm under a fitness function so that the synthetic gene circuit can achieve a desired behavior in spite of intracellular noise and external disturbance specified beforehand to be tolerated in vivo.

3. Robust Design of Biological Circuit via Evolutionary Systems Biology Algorithm

Once a model describing the biological circuit system of interest has been constructed, the next step for synthetic biologists is to select adequate parameters , to achieve some design specifications. For example, in the design of biological logic circuit [8], transcriptional rate constants , , and in (5), which depend on the affinity of binding site in the promoter region of the corresponding target gene, should be selected to achieve the robust synthetic biological circuit design to optimally track a desired logic function under intrinsic parameter fluctuations and extrinsic noise in vivo. Based on the analysis above, the design specifications of robust synthetic biologic circuits for tracking a desired logic circuit are given as follows.

(i)Choose a desired logic circuit to be tracked by the synthetic biologic circuit.(ii)Give the standard deviations , in (6) to represent the stochastic parameter fluctuation to be tolerated and the variances to represent the external stochastic disturbances to be attenuated by the synthetic biologic circuit in host cells.(iii)Give the feasible ranges of design parameters according to the real design condition in vivo: Given the above design specifications, our design objective is to tune the design parameters , , to achieve the optimal tracking under intrinsic parametric fluctuations and extrinsic noise as shown in Figure 2. Suppose the tracking error is defined as where denotes the output of the desired logic circuit. Then, our design purpose is to tune design parameter by evolutionary systems biology algorithm so that the dynamic gene circuit in (6) can achieve the following optimal tracking: where denotes the expectation, denotes the present time, and denotes the transpose of . If the above mean square tracking error can be minimized under the design specifications (i)–(iii), the robust synthetic gene circuit design can be achieved under the stochastic parameter fluctuation and external noise in vivo so that the stochastic biologic circuit can track a desired logic circuit more reliably in the host cells.

In general, it is not easy to solve the constrained optimal tracking design problem in (9) for nonlinear stochastic gene network in (6) by the conventional optimal method directly. Here, to mimic the parametric tuning of biological circuit to achieve a desired function via natural selection in evolution, an artificial evolutionary systems biology algorithm is employed to tune the design parameter to achieve the optimal tracking in (9) but with a faster speed than natural selection. Evolutionary algorithms [25, 31] are a result of an effort to model adaptation phenomena in natural and artificial systems. These evolutionary algorithms will be modified to tune the kinetic parameters of nonlinear stochastic synthetic gene network in (6), that is, the so-called evolutionary systems biology algorithm, to fast evolve to a desired output behavior via a fitness function. In the nonlinear stochastic system of biological circuit in (6), the state vector is considered as phenotype and the kinetic parameter vector is considered as genotype. In the evolutionary systems biology algorithm, the kinetic parameter vector is called chromosome. Let us denote the mean square error in (9) as where the chromosome , for , the feasible parameter space or feasible genotype space. Define the fitness function as that is, a small mean square error means a large fitness and vice versa. If we adapt a parameter vector (chromosome) by evolutionary systems biology algorithm to minimize in (10) or (11), then we achieve the maximization of fitness function in (11) for synthetic gene circuit (6) to meet the natural selection in evolution. Therefore, the robust biological circuit design in (6) with a desired output behavior is equivalent to solving the following fitness maximization problem by the evolutionary systems biology method:

The evolutionary systems biology algorithm is employed to solve the above fitness maximization problem via the genetic operators such as selection, crossover, and mutation to mimic the nature selection in the evolutionary process to tune the kinetic parameter vector of synthetic gene circuit to solve the optimization problem in (12) to achieve robust optimal tracking of the desired behavior. A simple evolutionary systems biology algorithm is proposed as follows [25, 31].

3.1. Initialization

Initialize a population of candidate solutions to the problem, that is, randomly generate a population of candidate chromosomes. In the real coding representation, each chromosome with the same length as the vector of decision variables is encoded as a vector of floating-point numbers. The vector is as a chromosome to represent a solution of optimization problem for the desired behavior tracking of nonlinear synthetic gene circuit. Initialization procedure produces chromosomes , where denotes the population size.

3.2. Fitness

Fitness is a measure to evaluate the suitability of chromosome. By the principle of survival of the fittest, a chromosome with higher fitness value has a higher probability of contributing one or more offspring in the next generation. By employing evolutionary algorithms to our fitness optimization problem, we must relate the chromosomes with their fitness functions . In our synthetic gene circuit design problem, an optimal tracking circuit design is to select a maximum fitness function among these fitness functions .

3.3. Reproduction

Reproduction is a basic operator of evolutionary algorithms to generate more offspring to increase the possibility to search for the optimal fitness. It is operated on the basis of the survival of the fitness. In each generation, the chromosomes of the current population are reproduced or copied in the next generation according to their reproduction probability , which are defined as where is the population size. It is shown that the higher fitness value, the higher reproduction probability. Once the chromosomes are reproduced or copied in the next generation, the other chromosomes stay in a mating pool as shown in Figure 3 and await the action of the other two genetic operators.

3.4. Crossover

Reproduction directs the search of evolutionary algorithms toward the best individuals. Crossover performs to exchange the information of any chromosomes via probabilistic decision in the mating pool and provides a mechanism to mix and match the desirable qualities through a random process. For two chromosomes and randomly selected according to the crossover probability , the resulting offspring is where .

3.5. Mutation

Reproduction and crossover provide the most search power for evolutionary algorithms. However, the mating pool tends to become more and more homogeneous as one better solution begins to dominate after several generations and leads to premature convergence. In the situation, the third operator, mutation, is introduced into the evolutionary algorithm with appropriate probability . For a given chromosome , if the element is randomly selected for mutation, the resulting offspring is . The new gene is where is standard derivation and is a random variable with standard normal distribution function. Mutation is an insurance strategy to ensure that all points in the search space can be ultimately reached. Note that mutation should be used sparingly because it is inherently a random search operator. The evolutionary algorithm could become more similar to random search if the mutation probability is high.

Since the proposed evolutionary systems biology design method not only achieves the best fitness for optimal desired behavior tracking but also robustly tolerates random kinetic parameter fluctuation and external disturbance simultaneously, it will play an important role for synthetic gene network from the evolutionary systems biology perspective. The evolutionary systems biology approach of how to select a parameter vector (or chromosome) to solve the fitness maximization problem in (12) for the optimal behavior tracking is summarized as follows.

Step 1. Give the design specifications (i)–(iii).

Step 2. Model the synthetic gene network as the nonlinear stochastic system in (6).

Step 3. Specify the probabilities and for crossover and mutation, respectively, and the population size .

Step 4. Generate randomly a population of candidate chromosomes.

Step 5. Evaluate the fitness for each candidate solution (chromosome) in the population to find the best fit to achieve the best fitness .

Step 6. If the search goal is achieved, or an allowable generation is attained, then stop. Otherwise, continue.

Step 7. Replace the current population with a new population by applying selection, crossover, and mutation operations on the current population. Go to Step 5.

Remark 3.1. In addition, in the evolutionary systems biology algorithm, the elitist strategy can be incorporated to enhance the convergence of evolutionary systems biology algorithm. This strategy copies the best chromosomes from the old population into the next population to prevent losing the best solutions in the succeeding iterations. It has been shown that the elitist strategy can improve the performance of evolutionary systems biology algorithm.
The proposed evolutionary systems biology algorithms is a powerful search algorithm for parameter selection in the synthetic gene network design procedure based on natural genetics and are inherently parallel, because they simultaneously evaluate many points in the parameter space (genotype space) for the best fitness to achieve the robust synthetic gene network deign with a desired behavior in vivo.

4. Design Example In Silico

After the development of design procedure for the robust gene circuits in the above section, two examples will be given in silico to illustrate the design procedure and to confirm the performance of synthetic gene circuit by the proposed evolutionary systems biology design method.

4.1. Robust Biological AND Gate Design

Consider the biological AND gate as shown in Figure 4 [8]. The biological AND gate generates an output signal only when it gets biochemical signals from both of its inputs. In the process, the input signal leads to the transcription of T7 polymerase gene, containing an early stop codon in the coding sequences that block translation. The input signal leads to the synthesis of a suppressor tRNA, which prevents the premature termination and enables the translation of polymerase. When both of these inputs are present, the functional T7 RNA polymerase leads to the synthesis of the output signal, the reporter protein GFP. The following is a description about dynamic differential equations of biological AND gate in Figure 4 [8]: where , , and are the concentrations of mRNA transcribed from genes T7Pol, tRNA, and gfp, respectively; concentrations of the corresponding proteins are represented by , , and , respectively. , , and are the transcription rate. , , and are the respective degradation rate of mRNA for T7Pol, tRNA, and gfp. Parameters , , and are the translation rates of the proteins from the mRNAs, and , , and represent the degradation rate of protein nonfunctional T7 RNA polymerase, tRNA and GFP, respectively. is the concentration of functional T7 RNA polymerase. is the reaction rate constant and stands for the corresponding degradation constant. is the Hill coefficient and is the Hill constant. is the basal level.

From the nonlinear differential equations as mentioned above, it can be seen that the dynamics of the biological AND gate depends on some biochemical factors, such as the kinetic constant, degradation constant, and basal level. However, these factors or parameters are uncertain inherently and the biological circuit also suffers from environmental noise. In this situation, the dynamic model of synthetic biological circuit in vivo should be modified as follows: where , denote the amplitudes of parameter fluctuations for kinetic constants and degradation constant, respectively. are random white noise with zero mean and unit variance, which denote the independent random fluctuation sources in transcription, translation, reaction, and degradation process, respectively. The variances of parameter perturbations are given as , , that is, and represent the standard deviations of the corresponding stochastic parameter fluctuations to be tolerated by the synthetic gene circuit in vivo. denotes the corresponding external noise with variance . Note that the perturbation of basal level is merged into .

The design specifications and parameters are given as follows. The desired transient behaviors of biological AND gate are as shown in Figure 5. The standard deviations of uncertain fluctuations of kinetic parameters and decay rates to be robustly tolerated are specified as , , , , , , , , , , , , , and . The variance of external disturbance is . The feasible ranges of kinetic parameters to be designed are specified in the range from 0 to 1. The other parameters are set as , , , , and [8].

Given the above design specifications, our design objective is to adapt the design parameters by the proposed evolutionary systems biology method to achieve the optimal tracking under intrinsic parametric fluctuations and extrinsic noise. In this example, the parameters of evolutionary algorithm are chosen as , , and . The software MATLAB is used to perform the simulation. After 100 generations, the best fit design parameters , , , , , , and are obtained by the proposed evolutionary systems biology method with the best fitness . By Monte Carlo simulation with 50 rounds, the output of the synthetic gene network with the design parameters under intrinsic parametric fluctuations and extrinsic noise is shown in Figure 6(a). It can be seen that the synthetic gene network has robust regulation ability to achieve the desired transient behaviors in spite of uncertain initial state, kinetic parameter fluctuations, and disturbances on the host cell. On the contrary, as shown in Figure 6(b), if the design parameters are selected aside the optimal design parameter , for example, with , , , , , , and and the fitness , the expression of the synthetic gene network is with more fluctuation and cannot achieve the desired transient behaviors. Obviously, the robust synthetic gene network by the proposed evolutionary systems biology method has a good robust stability to overcome the uncertain initial conditions and an enough filtering ability to attenuate the disturbances on the host cell and eventually approach the desired transient behaviors.

4.2. Robust Repressilator Design

Consider the biological repressilator (biological oscillator) as shown in Figure 7. The repressilator is a network of three genes, whose products inhibit the transcription of each other in a cyclical manner [32]. The gene lacI (from E. Coli.) expresses protein LacI, which inhibits transcription of the gene tetR. The product of TetR inhibits the transcription of gene cI (from phage), and the protein product CI in turn inhibits expression of lacI, completing the cycle. From the scheme of coupled repressilator in Figure 7, the mRNA dynamics is governed by repressible transcription for three genes [32]: where , , and are the concentrations of mRNA transcribed from genes tetR, cI, and lacI, respectively; concentrations of the corresponding proteins are represented by , and , respectively. , , and are the transcription rate in the absence of repressor and is the repression coefficient. , , and are the respective degradation rate of mRNA for tetR, cI, and lacI, respectively, and is the Hill coefficient. Followed by the transcription, the dynamics of proteins TetR, CI and LacI is given, respectively, as [32] where parameters , , and are the translation rates of the proteins from the mRNAs, and , , and represent the dimensionless degradation rate of protein TetR, CI, and LacI, respectively.

From the nonlinear differential equations as mentioned above, it can be seen that the dynamics of the biological repressilator depends on some biochemical factors, such as the kinetic constant, degradation constant, and basal level. However, these factors or parameters are uncertain inherently and the biological circuit also suffers from environmental noise. In this situation, the dynamic model of synthetic biological circuit in vivo should be modified as follows: where , denote the amplitudes of parameter fluctuation for kinetic constant and degradation constant, respectively. , , and are random white noise with zero mean and unit variance, which denote the independent random fluctuation sources in transcription, translation, and degradation process, respectively. The variances of parameter perturbations are given as , , that is, and represent the standard deviations of the corresponding stochastic parameter fluctuations to be tolerated by the synthetic gene circuit in vivo. denotes the corresponding external stochastic noise with variance .

The design specifications and parameters are given as follows. The desired transient behaviors of biological repressilator are as shown in Figure 8. The standard deviations of random fluctuations of kinetic parameters and decay rates to be robustly tolerated are specified as , , , , , , , , , , , and . The variance of external disturbance is . The feasible ranges of kinetic parameters to be designed are specified in a range from 0 to 2. The other parameters are set as , , , , , , , and .

Given the above design specifications, our design objective is to adapt the design parameters by the proposed evolutionary systems biology method to achieve the optimal tracking under intrinsic parametric fluctuations and extrinsic noise. In this example, the parameters of evolutionary algorithm are chosen as , , and . The software MATLAB is used to perform the simulation. After 100 generations, the best fit design parameters , , , , , and are obtained by the proposed evolutionary systems biology method with the best fitness . By Monte Carlo simulation with 50 rounds, the output of the system with the best fit design parameters under intrinsic parametric fluctuations and extrinsic noise is shown in Figure 8. It can be seen that the synthetic gene network has robust tracking ability to achieve the desired transient behaviors in spite of uncertain initial state and disturbances on the host cell. On the contrary, as shown in Figure 9, if the design parameters are selected aside the best fit parameter , for example, with , , , , , and and the fitness , the expression of the synthetic gene network is with more fluctuation and cannot achieve the desired transient behaviors. Obviously, the robust synthetic gene network by the proposed evolutionary systems biology design method has a good tracking ability to achieve the desired behavior, robust stability to overcome the uncertain initial conditions, and kinetic parameters fluctuations and an enough filtering ability to attenuate the disturbances on the host cell and eventually approach the desired transient behaviors. If the parameters of decay rate are also considered to be designed by the proposed method, then the robust tracking performance can be improved significantly because of more design freedom to choose. However, this will make the design procedure more complicated.

5. Discussion

One of main challenges for synthetic biology is that the engineered biological circuits are influenced by the effects of unavoidable intracellular fluctuations and environmental disturbances [17, 18, 20, 3336]. The noise and uncertainties currently hinder us from engineering synthetic gene circuits with reliable functions. In this study, we propose an evolutionary systems biology methodology to select an adequate parameter set for nonlinear stochastic gene circuit systems. The proposed algorithm can evaluate biological parts in different combinations and configurations for their tracking ability. Therefore, our approach may offer a possible design guideline concerning the selection of the optimal configuration of a robust synthetic gene network with desired behavior. Each circuit is represented by a set of transcription rate vector , which in turn uses a combination of corresponding promoters and ribosome binding site (RBS) from preconstructed libraries [3, 7] to satisfy the design considerations under noisy and uncertain environment in vivo.

In the light of natural selection on traits best suited for environmental change being an important mechanism in evolution [24], the similar evolutionary strategy seems more suitable to gene circuit design if we can speed up the evolutionary process through fast parallel evolutionary computations. Inspired by biological evolution such as reproduction, mutation, recombination, and selection, the evolutionary algorithm is an efficient method to solve optimization problems [25]. An evolutionary systems biology algorithm is employed here to solve the best fitness function to gradually improve the desired behavior tracking ability of a synthetic gene circuit design one generation by one generation to mimic the natural selection in the evolutionary process. Unlike the necessity of some complicated computations in the conventional design strategies, only some simple operators (e.g., selection, crossover, and mutation) and some simple calculations are required for selecting optimal design parameters iteratively by the proposed evolutionary systems biology design method, but with synthetic gene circuit robust enough against intrinsic parameter fluctuation and external disturbance. This attractive property makes the proposed design method being easily implemented in the practical applications. Since the proposed evolutionary systems biology method has included design specifications such as the tolerable variances of intrinsic stochastic parameter variations and external disturbances, the feasible ranges of design kinetic parameters and decay rates, and the desired steady state or transient behaviors of output , the synthetic gene circuit can be guaranteed to achieve all possible design purposes by only solving a constrained optimization problem in (9) by maximizing a corresponding fitness function in (12) through evolutionary algorithm. Though the proposed evolutionary gene circuit designs may not produce the best tracking performance within the finite generations of evolutionary algorithm, they are near optimal gene circuit designs to achieve the desired behavior. Therefore, the proposed design method has potential applications to the synthetic gene circuit design for biotechnological purposes in the near future.

Given the above evolution framework, we are given the foundation to build robust biological parts. To achieve this goal, there are many more interesting problems to be solved. For gene circuits, it may not be feasible to implement the optimal set of transcription rate vector , where limited choices are available. It could be necessary to turn the above proposed algorithm into searching a set of biological parts from libraries of genetic devices, but the algorithm should remain flexible at choosing biological parts. The other design problem comes from the assumed noise level: an accurate estimation may not readily be available for fluctuations of all parameters in biochemical processes. Of course, we may start from basic operation principals and models to calculate the fluctuation level. For example, noise in gene regulation can be modeled from transcription control, alternative splicing, translation, and diffusion to biological modification reactions of transcription factors. Such stochastic process noise can affect significantly the dynamics of biological systems. Knowledge (and multitude in experimental analysis and design) in this regard in fact may help determine the flexibility of biological parts in combinations and configurations. However, absolute accuracy is not required in the above design process, since we can always include design margins into the rate parameters by considering tracking problems outside (near) the boundary of the noise ranges. Also, we cannot always control external environment fluctuations. A different set of gene circuits may be enabled through sensing environmental transcription factors, that are designed to handle wide variety of outside changes. Lastly but not the least, we have to consider the complexity of interacting with other gene operations, that may directly impact the designed circuit and cannot be modeled as pure noise. It is certainly possible to extend the proposed algorithm by considering relational biochemical reactions. Still, at certain level, we would like to abstract more information in the design process for the method to handle larger scale biological systems.

6. Conclusion

Robust design to overcome intrinsic and external molecular noise in vivo to achieve a desired behavior becomes an important topic in synthetic gene networks. The contribution of the paper includes the following. A stochastic synthetic biologic circuit with the intrinsic parameter fluctuations and external noise is modeled as an evolutionary nonlinear stochastic system with the state-dependent noise and external disturbance to mimic the stochastic behavior of synthetic biological circuit in host cells. Based on nonlinear stochastic system and design specifications, the design problem of robust synthetic biological circuit with desired behavior is transformed to an equivalent optimal tracking problem and then a fitness optimization problem in evolution. An evolution systems biology algorithm is proposed to tune the design kinetic parameters (chromosomes) of synthetic gene network to achieve the fitness optimization to some desired behaviors to mimic the evolutionary process of biological genetic circuit to fit the change of environment via nature selection. Finally, several simulation results have also confirmed the robust desired behavior tracking performance of synthetic biologic circuits by the proposed evolutionary systems biology design method.

Acknowledgment

The work was supported by the National Science Council of Taiwan under grant NSC 99-2745-E-007-001-ASP and NSC 100-2745-E-007-001- ASP.