- About this Journal ·
- Abstracting and Indexing ·
- Advance Access ·
- Aims and Scope ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
International Journal of Stochastic Analysis
Volume 2014 (2014), Article ID 603692, 9 pages
Diffusion Processes Satisfying a Conservation Law Constraint
Los Alamos National Laboratory, Los Alamos, NM 87545, USA
Received 12 November 2013; Accepted 5 January 2014; Published 4 March 2014
Academic Editor: Nikolai Leonenko
Copyright © 2014 J. Bakosi and J. R. Ristorcelli. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
We investigate coupled stochastic differential equations governing N nonnegative continuous random variables that satisfy a conservation principle. In various fields a conservation law requires a set of fluctuating variables to be nonnegative and (if appropriately normalized) sum to one. As a result, any stochastic differential equation model to be realizable must not produce events outside of the allowed sample space. We develop a set of constraints on the drift and diffusion terms of such stochastic models to ensure that both the nonnegativity and the unit-sum conservation law constraints are satisfied as the variables evolve in time. We investigate the consequences of the developed constraints on the Fokker-Planck equation, the associated system of stochastic differential equations, and the evolution equations of the first four moments of the probability density function. We show that random variables, satisfying a conservation law constraint, represented by stochastic diffusion processes, must have diffusion terms that are coupled and nonlinear. The set of constraints developed enables the development of statistical representations of fluctuating variables satisfying a conservation law. We exemplify the results with the bivariate beta process and the multivariate Wright-Fisher, Dirichlet, and Lochner’s generalized Dirichlet processes.
1. Introduction and Problem Statement
We investigate the consequences of the unit-sum requirement on nonnegative continuous random variables governed by a diffusion process. Such mathematical description is useful to represent fluctuating variables, , subject to the constraint . We are interested in stochastic diffusion models and statistical moment equations describing the temporal evolutions and their statistics. In particular, we study the consequences of the bounded sample space, required by the nonnegativity of and the unit-sum conservation principle, . A simple physical example is the mixture of different chemical species, represented by mass fractions undergoing reaction in a fluid whose overall mass is conserved. Such mathematical problems also appear in evolutionary theory , Bayesian statistics , geology [3–5], forensics , econometrics , turbulent mixing and combustion , and population biology . Mathematical properties of such random fractions are given in [10–13].
Mathematically, we are interested in the following question. What functions are allowed to represent the drift, , and diffusion, , terms of the system, governing the vector : if must hold for all . In (1) is a vector-valued Wiener process with mean and covariance ; see , and is Kronecker's delta. If the components of satisfy the constraints in (2), we call the event realizable. A consequence of the constraints in (2) imposed on the stochastic system (1) is that for all the following holds: In other words, we are interested in expressions for and , what constraints they must satisfy in addition to (3), and how to implement them so that (1) produces realizable events; that is, satisfies (2) for all .
We study diffusion processes as (1) they are mathematically simple vehicles for representing temporal evolutions of fluctuating fractions (of a unit) and their statistics, (2) they lend themselves to simple Monte-Carlo numerical methods , and (3) they serve as a starting point for representations of statistical moment equations if individual samples and joint probabilities are not required. The Markovian assumption  is made at the outset and jump contributions are ignored. We derive constraints for the drift and diffusion terms that assure that the modeled processes are realizable (i.e., produce nonnegative variables that satisfy the unit-sum constraint) for any realization at all times. We address the problem of the functional forms of the drift and diffusion terms from three perspectives: (1) the Fokker-Planck equation for the probability density function, (2) the stochastic differential equations for the individual realizations, and (3) the evolution equations for the jointly coupled statistics.
The plan of the paper is as follows. Section 2 introduces the geometry of the multidimensional sample space within which realizations of fractions of a unit are allowed and discusses constraints that ensure realizable statistical moments. Section 3 develops the implications of realizability on diffusion processes governing fractions. Section 4 follows by developing realizability constraints on the time evolutions of statistics. Section 5 surveys some existing realizable diffusion processes. A summary is given in Section 6.
2. Realizability due to Conservation
The notion of realizability due to a conservation law constraint was introduced and defined by (2). We now discuss the consequences of realizability pertaining to the individual samples of the state space, Section 2.1, and of their statistics, Section 2.2.
2.1. The Universal Geometry of Allowed Realizations
The geometrical definition of the sample space is given in which the vector is allowed if (2) is to be satisfied. This is used to derive constraints for stochastic diffusions and their moment equations in the subsequent sections.
A realization of the vector, , with coordinates , , specifies a point in the multidimensional sample space. The union of all such points, that satisfy is the space of allowed realizations; see (2). For example, in representing mass fraction constituents of a substance, (4) restricts the possible components of to those that are realizable; those vectors that point outside of the allowed space are not conserved; if (4) is violated, spurious mass is created or destroyed.
Mathematically, the geometry of allowed realizations is a simplex, the generalization of a triangle to multiple dimensions. For variables the -simplex is a bounded convex polytope, , on the -dimensional hyperplane; is the convex hull of its vertices. ’s boundary, , is defined as the closed surface of nonoverlapping hyperplanes of dimensions: plotted in Figure 1 for .
The domain (or support) of the joint probability, , with , , is the -simplex. Of all only are independent due to (4) and without loss of generality we take The same geometry of allowed realizations is discussed by Pope in the -dimensional state space in  in the context of ideal gas mixing in turbulent combustion.
We confine our attention here to dimensions, as one of the variables is determined by the unit-sum requirement; see (6). As a consequence, the -dimensional geometry of realizable events is remarkably simple and universal: it is the bounded convex polytope whose boundary is defined by (5). Consequently, the realizability constraint, (2), uniquely and universally determines the realizable region of the state space: it is the same in all points in space and time for all materials undergoing any physical process that conserves mass; see (4). The ensemble is realizable if and only if all samples reside inside the polytope given by (5). For this means that the support of is the triangle depicted in Figure 1.
2.2. Realizable Statistical Moments
If the fractions are nonnegative and sum to one, required by (2), they are also bounded: whose consequences on some of their statistical moments are now discussed.
Since both the instantaneous variables and their means are bounded, fluctuations about the means are also bounded: As a consequence, the variances and the covariances are also bounded: Multiplying (4) by , and taking the expectation yield that is, the row sums and, due to symmetry, the column sums of the covariance matrix are zero. Expressing , , and so forth, from the first equations of (12), and substituting them into the one yield the weaker constraint: Due to bounded fluctuations, see (9), the third central moments are also bounded: and in general, for we have
Ensuring nonnegativity and unit sum puts constraints on possible time evolutions of , represented by diffusion processes and that of their statistics. Some of these constraints are developed in the following sections.
3. Diffusion Processes for Random Fractions
Implications of the geometry of the realizable state space, discussed in Section 2, on diffusion processes are developed. First, the relevant mathematical properties of Fokker-Planck equations are reviewed in Section 3.1, followed by the constraints on their functional forms, Section 3.2.
3.1. Review of Some Boundary Conditions of Fokker-Planck Equations
The discussion is restricted to Markov processes which by definition obey a Chapman-Kolmogorov equation . Assuming that are continuous in space and time, jump processes are excluded. The temporal evolution of random fractions, , constrained by (2) can then be represented most generally by diffusion processes whose transitional probability, , is governed by the Fokker-Planck equation: where and denote drift and diffusion in state space, respectively, and is symmetric nonnegative semidefinite . Equation (17) is a partial differential equation that governs the joint probability, , of the fractions, , . is excluded from (17) and is determined by (6). Augmented by initial and boundary conditions, (17) describes the transport of probability in sample space whose boundary is with normal vector ; see .
Equation (17) can be written in conservation form as in terms of the probability flux; see [14, Section 5.1]: Using (18) and (19) the following boundary conditions are considered; see [14, Section 6.2].(1)Reflecting barrier. If everywhere on the boundary, is a reflecting barrier: a particle inside cannot cross the boundary and must be reflected there.(2)Absorbing barrier. If everywhere on the boundary, is an absorbing barrier: if a particle reaches the boundary, it is removed from the system.(3)Other types of boundary conditions. Some part of the boundary may be reflecting while some other may be absorbing: a combination is certainly possible. We only consider reflecting and absorbing barriers—other types of boundaries are discussed in .To support the forthcoming discussion, some well-established mathematical properties of multivariable Fokker-Planck equations have been reviewed.
3.2. Realizable Diffusion Processes
As discussed in Section 2, the region of the sample space allowed by the realizability requirement is the polytope defined by its boundary, , (5), in which all samples of must reside at all times. Consequently, the sample space, , of the Fokker-Planck equation (17) must coincide with , which constrains the possible functional forms of and . In the following, these constraints are developed for binary (single-variable) processes first, followed by ternary processes, and then generalized to multiple variables.
3.2.1. Realizable Binary Processes:
The Itô diffusion process , governing the variable , with is equivalent to and derived from (17) with ; see for example, : For the allowed space of realizations is a line with endpoints given by (5): This can be ensured if the drift and diffusion terms in (20) and (21) satisfy In other words, the realizability constraint in (2) on (20) mathematically corresponds to (23). A diffusion process, governed by (20), that satisfies (23), ensures that the fractions and satisfy , provided each event of the ensemble at satisfies . The equal signs in the constraints on the drift in (23) allow for absorbing barriers at and , respectively. The constraints on the diffusion term imply that must either be nonlinear in or for all . In other words, since the diffusion term must be nonnegative, required by (20), it can only be nonzero inside the allowed sample space if it is also nonlinear.
3.2.2. Realizable Ternary Processes:
For variables, the unit-sum-constrained sample space and its boundary are sketched in Figure 1. In this case individual samples of the joint probability, , are governed by the system: The allowed samples space is two dimensional (a triangle) whose boundary, defined by (5), consists of the loop of lines: For , the state vector, governed by (24) augmented by , stays inside the allowed region if The realizability constraint, (2), on the system of (24) mathematically corresponds to (26). The three fractions, , , and , governed by (6) and (24), remain fractions of unity if their drift and diffusion terms satisfy (26). Naturally, an initial ensemble that satisfies , , and is required. The constraints on the diffusion terms in (26) show that both and must either be nonlinear in and , respectively, or and , for all and , respectively. Furthermore, if one were to construct a process with , , and , then either or must be a function of both and if is to be maintained, required by with , , . In other words, the unit-sum constraint couples at least 2 of the 3 fractions, governed by the system given by (6) and (24).
3.2.3. Realizable Multi-Variable Processes:
The multivariate Itô diffusion process, equivalent to the Fokker-Planck equation (17), is  with and the vector-valued Wiener process, , with mean and covariance . The sample space of allowed realizations is now bounded by the nonoverlapping hyperplanes, defined by (5). The conditions, analogous to (23) and (26) that ensure realizability for multiple variables, are The realizability constraint in (2) on the system of (27) mathematically corresponds to (28). A diffusion process, governed by (27), that satisfies (28) ensures that the fractions satisfy , , with , provided each event of the initial ensemble at satisfies . As before, the equal signs in the constraints on the drifts in (28) allow for absorbing barriers at the boundaries. The constraints on the diffusion term imply that for any , must either be nonlinear in or for all . In other words, since the diffusion term must be nonnegative semidefinite, required by (27), it can only be nonzero inside the allowed sample space if it is also nonlinear. Equations (28) also show, that while it is conceivable, that and for a single and all , if is to be satisfied, either or must hold for all . In other words, the unit-sum constraint couples at least equations of the system of (6) and (27) governing , .
Constraints on the functional forms of the drift and diffusion terms of the multivariate Fokker-Planck equation (17), as a temporal representation of random fractions, , have been developed. Equations (28) are our central result which ensure that sample space events, generated by (17) or its equivalent system of diffusion processes, (27), satisfy the realizability constraint at all times, provided the initial ensemble is realizable. Since (17) and (27) govern variables and , the unit-sum requirement, (4), is satisfied at all times. An implication of (28), exemplified in Section 5, is that random fractions represented by diffusion processes must be coupled and nonlinear.
4. Realizable Evolution of Statistics
Some implications of (28) for the first few statistical moments of the joint probability, governed by (17), are now derived. This is useful for statistical moment equation representation of fractions if individual samples and joint probabilities are not required.
4.1. Realizable Evolution of the Means:
Multiplying (17) by and integrating over all sample space, see for example , yield the system of equations governing the means of the fractions: where . The evolution of the means can be made consistent with the realizability constraint in (2) if the means are bounded and sum to one at all times. Equation (29) shows that to keep the means bounded, required by (8), the rate of change of the means, , must be governed by functions that satisfy as the boundary of the state space is approached. In (30) . Equation (30) implies that inside the state space (i.e., away from the boundaries) must either be a function of or for all . The means may also sum to one, required by (8), if at least of (29) are coupled to each other. Consequently, must be a function of for all . Equation (29) shows how the means are governed if a Fokker-Planck equation (17) or a diffusion process (27) governs the underlying joint probability; for example, only the mean of the drift, , affects the evolution of the means.
4.2. Realizable Evolution of the Second Central Moments:
Multiplying the Fokker-Planck equation (17) by and then integrating over all sample space yield the equations governing the covariance matrix of the fractions: with and . The right hand side of (31) is denoted by , the evolution rate of the covariance matrix. Equation (31) shows how the covariances are governed if a Fokker-Planck equation (17) or a diffusion process (27) governs the underlying joint probability; for example, is symmetric at all times. Following the development in Section 2.2, four conditions must be satisfied by the system of second moment equations (31) to ensure an evolution of the covariances that is consistent with the realizability constraint in (2).(1) Symmetric covariance evolution. The symmetry of the covariance matrix can be ensured if is symmetric, as well as its evolution rates: (2) Boundedness of the variances, (10). This condition can be ensured with as the boundary of the state space is approached, indicating that in general the equation governing must either be a function of or for all .(3) Boundedness of the covariances, (11). This condition can be ensured if, for , as the boundary of the state space is approached, indicating that in general the equation governing must either be a function of or for all .(4)Zero row sums, (12). Differentiating (12) in time and using (31) yield the system Performing the same substitutions on (35) that resulted in (13) we obtain the weaker constraint: We see that the trivial specification, , satisfies all the above conditions but also fixes the covariance matrix at its initial state for all , which is of limited applicability.
4.3. Bounded Evolution of the Third Central Moments,
Multiplying the Fokker-Planck equation (17) by and then integrating yield the system governing the third central moments, , as with and . The right hand sides of (37) are the evolution rates of the third moments, denoted by . The boundedness of the third moments, required by (14), can be ensured if as the boundary of the state space is approached, indicating that in general the equation governing must either be a function of or for all . The conditions in (38) only ensure boundedness; consequently, they are necessary but not sufficient conditions for realizability of the third moments as required by (2). Note that the requirement on bounded sample space has no implications on the boundedness of the skewness: since , see (10).
4.4. Bounded Evolution of the Fourth Central Moments,
Multiplying the Fokker-Planck equation (17) by and then integrating yield the system governing the fourth central moments, , as with and . The right hand sides of (40) are the evolution rates of the fourth moments, denoted by . The boundedness of the fourth moments, required by (15), can be ensured if as the boundary of the state space is approached, indicating that in general the equation governing must either be a function of or for all . The conditions in (41) only ensure boundedness; consequently, they are necessary but not sufficient conditions for realizability of the fourth moments as required by (2). Note that, similar to the skewness in (39), the requirement on bounded sample space has no implications on the upper bound of the kurtosis: since , (10).
4.5. Summary on Realizable Statistics of Fractions
The unit-sum constraint, (4), applied to a set of nonnegative random variables, bounds and constrains their statistical moments, as shown in Section 2.2, as well as their time evolutions. We examined the evolution of the moments, , , , and , and showed how they are governed if an underlying diffusion process is known.
Realizability of the means, as defined by (2), can be ensured if (8) and (30) are satisfied. Realizability of the covariances can be ensured if (10)–(12) and (32)–(35) are satisfied. Boundedness of the third moments is ensured by (14) and (38), while boundedness of the fourth moments is ensured by (15) and (41). The procedure outlined above can be continued to derive additional constraints for consistency of the third, fourth, mixed, and higher moments with the unit-sum constraint. The constraints reflect the coupled and nonlinear nature of random fractions, both as instantaneous variables and their statistics.
5. A Survey of Realizable Diffusion Processes
A survey of existing diffusion processes that satisfy the realizability constraints on drift and diffusion on the state-space boundary, (28), is now given.
5.1. Realizable Binary Process: , Beta
An example for , satisfying the realizability constraints on the drift and diffusion terms on the sample-space boundary in (23), is given in , specifying the drift and diffusion as yielding the stochastic differential equation: with , , and excluding, while with allowing for absorbing barriers. In (44) the drift is linear and the diffusion is quadratic in . The invariant distribution of (44) is beta, which belongs to the family of Pearson distributions, discussed in detail by Forman & Sørensen . Of the special cases of the Pearson diffusions, discussed in , only Case 6, equivalent to (44), produces realizable events. A symmetric variant of (44) was constructed in , which does not allow a nonzero skewness in the statistically stationary state; see .
5.2. Realizable Multivariate Process: , Wright-Fisher
A system of stochastic differential equations that satisfies the realizability conditions for variables in (28) is the multivariate Wright-Fisher process , which specifies the drift and diffusion terms as yielding the stochastic process, where and are parameters. Equation (46) is a generalization of (44) for variables. The invariant distribution of (46) is Dirichlet [23, 24].
5.3. Realizable Multivariate Process: , Dirichlet
Another process that satisfies (28), developed in , specifies the drift and diffusion terms as resulting in the system of stochastic differential equations, with parameter vectors , , and , and given by (6). Equation (48) is also a generalization of (44) for variables. The invariant distribution of (48) is also Dirichlet, provided the parameters of the drift and diffusion terms satisfy Note that while there is no coupling among the parameters, , of the drift and diffusion terms in the Wright-Fisher equation (46), the parameters, ,, and , of (48) must be constrained by (49) to keep its invariant distribution Dirichlet.
5.4. Realizable Multivariate Process: , Lochner's Generalized Dirichlet
A generalization of (48) is developed in , where the drift and diffusion terms are given by with and , yielding the stochastic process, The invariant distribution of (51) is Lochner's generalized Dirichlet distribution , if the coefficients, , , , and , with for , , satisfy the conditions developed in . Similar to (48), the parameters of the drift and diffusion terms, , , , and , of (51) must be constrained to keep the invariant distribution generalized Dirichlet. Setting in (51) reduces to the standard Dirichlet process, (48).
All of (46), (48), and (51) have coupled and nonlinear diffusions terms. As discussed earlier, this is required to simultaneously satisfy the realizability conditions in (28), required to represent random fractions by diffusion processes.
We have demonstrated that the problem of fluctuating variables constrained by the unit-sum requirement can be discussed in a reduced sample space of dimensions. This allows working with the unique, universal, and mathematically well-defined realizable sample space which produces samples and statistics consistent with the underlying conservation principle.
We have studied multivariate diffusion processes governing a set of fluctuating variables required to satisfy two constraints: (1) nonnegativity and (2) a conservation principle that requires the variables to sum to one, defined as realizability. Our findings can be summarized as follows.(i)The diffusion coefficients in stochastic diffusion processes, governing fractions, must be coupled and nonlinear.(ii)If the set of constraints, is satisfied as the state-space boundary is approached, the stochastic system, with , ensures that the components of the vector of fractions, , remain nonnegative and sum to one at all times.(iii)Boundedness of the sample space requires boundedness of the moments.The constraints provide a method that can be used to develop drift and diffusion functions for stochastic diffusion processes for variables satisfying a conservation law and thus are inherently realizable.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
- K. Pearson, “Mathematical contributions to the theory of evolution. On a form of spurious correlation which may arise when indices are used in the measurement of organs,” Royal Society of London Proceedings I, vol. 60, pp. 489–498, 1896.
- C. D. M. Paulino and C. A. D. B. Pereira, “Bayesian methods for categorical data under informative general censoring,” Biometrika, vol. 82, no. 2, pp. 439–446, 1995.
- F. Chayes, “Numerical correlation and petrographic variation,” The Journal of Geology, vol. 70, no. 4, pp. 440–452, 1962.
- F. Chayes and W. Kruskal, “An approximate statistical test for correlations between proportions,” The Journal of Geology, vol. 74, no. 5, pp. 692–702, 1966.
- P. S. Martin and J. E. Mosimann, “Geochronology of pluvial Lake Cochise, Southern Arizona, [part] 3, Pollen statistics and Pleistocene metastability,” American Journal of Science, vol. 263, pp. 313–358, 1965.
- K. Lange, “Applications of the Dirichlet distribution to forensic match probabilities,” Genetica, vol. 96, no. 1-2, pp. 107–117, 1995.
- C. Gourieroux and J. Jasiak, “Multivariate Jacobi process with application to smooth transitions,” Journal of Econometrics, vol. 131, no. 1-2, pp. 475–505, 2006.
- S. S. Girimaji, “Assumed β-pdf model for turbulent mixing: validation and extension to multiple scalar mixing,” Combustion Science and Technology, vol. 78, no. 4, pp. 177–196, 1991.
- M. Steinrücken, Y. X. R. Wang, and Y. S. Song, “An explicit transition density expansion for a multi-allelic Wright-Fisher diffusion with general diploid selection,” Theoretical Population Biology, vol. 83, pp. 1–14, 2013.
- J. G. Mauldon, “Random division of an interval,” Mathematical Proceedings of the Cambridge Philosophical Society, vol. 47, no. 2, pp. 331–336, 1951.
- J. G. Mauldon, “A generalization of the beta-distribution,” The Annals of Mathematical Statistics, vol. 30, no. 2, pp. 509–520, 1959.
- I. J. Good, The Estimation of Probabilities, Number 30 in Research Monograph, The MIT Press, Cambridge, Mass, USA, 1965.
- R. Pyke, “Spacings,” Journal of the Royal Statistical Society B, vol. 27, no. 3, pp. 395–449, 1965.
- C. W. Gardiner, Stochastic Methods, A Handbook for the Natural and Social Sciences, Springer, Berlin, Germany, 4th edition, 2009.
- P. E. Kloeden and E. Platen, Numerical Solution of Stochastic Differential Equations, Springer, Berlin, Germany, 1999.
- S. B. Pope, “Accessed compositions in turbulent reactive flows,” Flow, Turbulence and Combustion, vol. 72, no. 2-4, pp. 219–243, 2004.
- N. G. van Kampen, Stochastic Processes in Physics and Chemistry, North Holland, Elsevier B. V., Amsterdam, The Netherlands, 2nd edition, 2004.
- W. Feller, “The parabolic differential equations and the associated semi-groups of transformations,” Annals of Mathematics, vol. 55, no. 3, pp. 468–519, 1952.
- S. B. Pope, “PDF methods for turbulent reactive flows,” Progress in Energy and Combustion Science, vol. 11, no. 2, pp. 119–192, 1985.
- J. Bakosi and J. R. Ristorcelli, “Exploring the beta distribution in variable-density turbulent mixing,” Journal of Turbulence, vol. 11, no. 37, pp. 1–31, 2010.
- J. L. Forman and M. Sørensen, “The Pearson diffusions: a class of statistically tractable diffusion processes,” Scandinavian Journal of Statistics, vol. 35, no. 3, pp. 438–465, 2008.
- G. Q. Cai and Y. K. Lin, “Generation of non-Gaussian stationary stochastic processes,” Physical Review E, vol. 54, no. 1, pp. 299–303, 1996.
- S. Wright, “Adaptation and selection,” in Genetics, Paleontology, and Evolution, E. Mayr, G. L. Jepson, and G. G. Simpson, Eds., pp. 365–389, Princeton University Press, 1949.
- J. Bakosi and J. R. Ristorcelli, “A stochastic diffusion process for the Dirichlet distribution,” International Journal of Stochastic Analysis, vol. 2013, Article ID 842981, 7 pages, 2013.
- J. Bakosi and J. R. Ristorcelli, “A stochastic diffusion process for Lochner's generalized Dirichlet distribution,” Journal of Mathematical Physics, vol. 54, no. 10, Article ID 102701, 2013.
- R. H. Lochner, “A generalized Dirichlet distribution in Bayesian life testing,” Journal of the Royal Statistical Society B, vol. 37, no. 1, pp. 103–113, 1975.