About this Journal Submit a Manuscript Table of Contents
International Journal of Stochastic Analysis
Volume 2011 (2011), Article ID 247329, 89 pages
doi:10.1155/2011/247329
Research Article

Multiresolution Hilbert Approach to Multidimensional Gauss-Markov Processes

1Lewis-Sigler Institute, Princeton University, Carl Icahn Laboratory, Princeton, NJ 08544, USA
2Laboratory of Mathematical Physics, The Rockefeller University, New York, NY 10065, USA
3Mathematical Neuroscience Laboratory, Collège de France, CIRB, 11 Place Marcelin Berthelot, CNRS UMR 7241 and INSERM U 1050, Université Pierre et Marie Curie ED, 158 and Memolife PSL, 75005 Paris, France
4INRIA BANG Laboratory, Paris, France

Received 28 April 2011; Accepted 6 October 2011

Academic Editor: Agnès Sulem

Copyright © 2011 Thibaud Taillefumier and Jonathan Touboul. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

The study of the multidimensional stochastic processes involves complex computations in intricate functional spaces. In particular, the diffusion processes, which include the practically important Gauss-Markov processes, are ordinarily defined through the theory of stochastic integration. Here, inspired by the Lévy-Ciesielski construction of the Wiener process, we propose an alternative representation of multidimensional Gauss-Markov processes as expansions on well-chosen Schauder bases, with independent random coefficients of normal law with zero mean and unit variance. We thereby offer a natural multiresolution description of the Gauss-Markov processes as limits of finite-dimensional partial sums of the expansion, that are strongly almost-surely convergent. Moreover, such finite-dimensional random processes constitute an optimal approximation of the process, in the sense of minimizing the associated Dirichlet energy under interpolating constraints. This approach allows for a simpler treatment of problems in many applied and theoretical fields, and we provide a short overview of applications we are currently developing.

1. Introduction

Intuitively, multidimensional continuous stochastic processes are easily conceived as solutions to randomly perturbed differential equations of the form ̇ 𝐗 𝑡 = 𝑓 𝐗 𝑡 , 𝑡 , 𝝃 𝑡 , ( 1 . 1 ) where the perturbative term 𝝃 𝑡 implicitly defines a probability space and 𝑓 satisfies some ad hoc regularity conditions. If the existence of such processes is well established for a wide range of equations through the standard Itô integration theory (see, e.g., [1]), studying their properties proves surprisingly challenging, even for the simplest multidimensional processes. Indeed, the high dimensionality of the ambient space and the nowhere differentiability of the sample paths conspire to heighten the intricacy of the sample paths spaces. In this regard, such spaces have been chiefly studied for multidimensional diffusion processes [2], and more recently, the development of rough paths theory has attracted renewed interest in the field (see [37]). However, aside from these remarkable theoretical works, little emphasis is put on the sample paths since most of the available results only make sense in distribution. This is particularly true in the Itô integration theory, where the sample path is completely neglected for the Itô map being defined up to null sets of paths.

To overcome the difficulty of working in complex multidimensional spaces, it would be advantageous to have a discrete construction of a continuous stochastic process as finite-dimensional distributions. Since we put emphasis on the description of the sample paths space, at stake is to write a process 𝐗 as an almost surely pathwise convergent series of random functions 𝐗 𝑡 = l i m 𝑁 𝐗 𝑁 𝑡 w i t h 𝐗 𝑁 𝑡 = 𝑁 𝑛 = 0 𝐟 𝑛 ( 𝑡 ) 𝚵 𝑛 , ( 1 . 2 ) where 𝐟 𝑛 is a deterministic function and Ξ 𝑛 is a given random variable.

The Lévy-Ciesielski construction of the 𝑑 -dimensional Brownian motion 𝐖 (also referred to as Wiener process) provides us with an example of discrete representation for a continuous stochastic process. Noticing the simple form of the probability density of a Brownian bridge, it is based on completing sample paths by interpolation according to the conditional probabilities of the Wiener process [8]. More specifically, the coefficients Ξ 𝑛 are Gaussian independent and the elements 𝐟 𝑛 , called the Schauder elements and denoted by 𝐬 𝑛 , are obtained by time-dependent integration of the Haar basis elements: 𝐬 0 , 0 ( 𝑡 ) = 𝑡 𝐈 𝑑 and 𝐬 𝑛 , 𝑘 ( 𝑡 ) = 𝑠 𝑛 , 𝑘 ( 𝑡 ) 𝐈 𝑑 , with for all 𝑛 > 0 𝑠 𝑛 , 𝑘 ( 𝑡 ) = 2 ( 𝑛 1 ) / 2 𝑡 𝑙 𝑛 , 𝑘 , 𝑘 2 𝑛 + 1 𝑡 ( 2 𝑘 + 1 ) 2 𝑛 , 2 ( 𝑛 1 ) / 2 𝑟 𝑛 , 𝑘 𝑡 , ( 2 𝑘 + 1 ) 2 𝑛 𝑡 ( 𝑘 + 1 ) 2 𝑛 + 1 , 0 , o t h e r w i s e . ( 1 . 3 ) This latter point is of relevance since, for being a Hilbert system, the introduction of the Haar basis greatly simplifies the demonstration of the existence of the Wiener process [9]. From another perspective, fundamental among discrete representations is the Karhunen-Loève decomposition giving a representation of stochastic processes by expanding it on a basis of orthogonal functions [10, 11]. The definition of the basis elements 𝑓 𝑛 depends only on the second-order statistics of the considered process and the coefficients 𝜉 𝑛 are pairwise uncorrelated random variables. Incidentally, such a decomposition is especially suited to study the Gaussian processes because the coefficients of the representation are Gaussian and independent. For these reasons, the Karhunen-Loève decomposition is of primary importance in exploratory data analysis, leading to methods referred to as “principal component analysis,” “Hotelling transform" [12] or “proper orthogonal decomposition” [13] according to the field of application. In particular, it was directly applied to the study of the stationary Gaussian Markov processes in the theory of random noise in radio receivers [14].

It is also important for our purpose to realize that the Schauder elements 𝐬 𝑛 have compact supports that exhibit a nested structure: this fact entails that the finite sums 𝐖 𝑁 are processes that interpolate the limit process 𝐖 on the endpoints of the supports, that is, on the dyadic points 𝑘 2 𝑁 , 0 𝑘 2 𝑁 . One of the specific goal of our construction is to maintain such a property in the construction of all multidimensional the Gauss-Markov processes 𝐗 (i.e., processes that are both Gaussian and satisfy the Markov property) of the form: 𝐗 𝑡 = 𝐠 ( 𝑡 ) 𝑡 0 𝐟 ( 𝑠 ) 𝑑 𝐖 𝑠 ( 1 . 4 ) (covering all 1-dimensional Gauss-Markov processes thanks to Doob’s representation of Gauss-Markov processes), being successively approximated by finite-dimensional processes 𝑋 𝑁 that interpolates 𝐗 at ever finer resolution. In that respect, it is only in that sense that we refer to our framework as a multiresolution approach as opposed to the wavelet multiresolution theory [15]. Other multiresolution approaches have been developed for certain Gaussian processes, most notably for the fractional Brownian motion [16].

In view of this, we propose a construction of the multidimensional Gaussian Markov processes using a multiresolution Schauder basis of functions. As for the Lévy-Ciesielski construction, and in contrast with Karhunen-Loève decomposition, our basis is not made of orthogonal functions but the elements are of nested compact support and the random coefficients Ξ 𝑛 are always independent and Gaussian (for convenience with law 𝒩 ( 𝟎 , 𝐈 𝑑 ) , i.e., with zero mean and unitary variance). We first develop a heuristic approach for the construction of stochastic processes reminiscent of the midpoint displacement technique [8, 9], before rigorously deriving the multiresolution basis that we will be using the paper. This set of functions is then studied as a multiresolution Schauder basis of functions: in particular, we derive explicitly from the multiresolution basis an Haar-like Hilbert basis, which is the underlying structure explaining the dual relationship between basis elements and coefficients. Based on these results, we study the construction application and its inverse, the coefficient applications, that relate coefficients on the Schauder basis to sample paths. We follow up by proving the almost-sure and strong convergence of the process having independent standard normal coefficients on the Schauder basis to a Gauss-Markov process. We also show that our decomposition is optimal in some sense that is strongly evocative of spline interpolation theory [17]: the construction yields successive interpolations of the process at the interval endpoints that minimize the Dirichlet energy induced by the differential operator associated with the Gauss-Markov process [18, 19]. We also provide a series of examples for which the proposed Schauder framework yields bases of functions that have simple closed form formulae: in addition to the simple one-dimensional Markov processes, we explicit our framework for two classes of multidimensional processes, the Gauss-Markov rotations and the iteratively integrated Wiener processes (see, e.g., [2022]).

The ideas underlying this work can be directly traced back to the original work of Lévy. Here, we intend to develop a self-contained Schauder dual framework to further the description of multidimensional Gauss-Markov processes, and, in doing so, we extend some well-known results of interpolation theory in signal processing [2325]. To our knowledge, such an approach is yet to be proposed. By restraining our attention to the Gauss-Markov processes, we obviously do not assume generality. However, we hope our construction proves of interest for a number of points, which we tentatively list in the following. First, the almost-sure pathwise convergence of our construction together with the interpolation property of the finite sums allows to reformulate results of the stochastic integration in term of the geometry of finite-dimensional sample paths. In this regard, we found it appropriate to illustrate how in our framework, the Girsanov theorem for the Gauss-Markov processes appears as a direct consequence of the finite-dimensional change of variable formula. Second, the characterization of our Schauder elements as the minimizer of a Dirichlet form paves the way to the construction of infinite-dimensional Gauss-Markov processes, that is, processes whose sample points themselves are infinite-dimensional [26, 27]. Third, our construction shows that approximating a Gaussian process by a sequence of interpolating processes relies entirely on the existence of a regular triangularization of the covariance operator, suggesting to further investigate this property for non-Markov Gaussian processes [28]. Finally, there is a number of practical applications where applying the Schauder basis framework clearly provides an advantage compared to standard stochastic calculus methods, among which first-hitting times of stochastic processes, pricing of multidimensional path-dependant options [2932], regularization technique for support vector machine learning [33], and more theoretical work on uncovering the differential geometry structure of the space of the Gauss-Markov stochastic processes [34]. We conclude our exposition by developing in more detail some of these direct implications which will be the subjects of forthcoming papers.

2. Heuristic Approach to the Construction

In order to provide a discrete multiresolution description of the Gauss-Markov processes, we first establish basic results about the law of the Gauss-Markov bridges in the multidimensional setting. We then use them to infer the candidate expressions for our desired bases of functions, while imposing its elements to be compactly supported on nested sequence segments. Throughout this paper, we are working in a complete probability space ( Ω , , 𝐏 ) .

2.1. Multidimensional Gauss-Markov Processes

After recalling the definition of the multidimensional Gauss-Markov processes in terms of stochastic integral, we use the well-known conditioning formula for the Gaussian vectors to characterize the law of the Gauss-Markov bridge processes.

2.1.1. Notations and Definitions

Let ( 𝐖 𝑡 , 𝑡 , 𝑡 [ 0 , 1 ] ) be an 𝑚 -dimensional Wiener process, consider the continuous functions 𝜶 [ 0 , 1 ] 𝑑 × 𝑑 Γ [ 0 , 1 ] 𝑑 × 𝑚 , and define the positive bounded continuous function Γ = Γ Γ 𝑇 [ 0 , 1 ] 𝑑 × 𝑑 . The 𝑑 -dimensional Ornstein-Uhlenbeck process associated with these parameters is solution of the equation 𝑑 𝐗 𝑡 = 𝜶 ( 𝑡 ) 𝐗 𝑡 𝑑 𝑡 + 𝚪 ( 𝑡 ) 𝑑 𝐖 𝑡 , ( 2 . 1 ) and with initial condition 𝐗 𝑡 0 in 𝑡 0 , it reads 𝐗 𝑡 = 𝐅 𝑡 0 , 𝑡 𝐗 𝑡 0 + 𝐅 𝑡 0 , 𝑡 𝑡 𝑡 0 𝐅 𝑠 , 𝑡 0 𝚪 ( 𝑠 ) 𝑑 𝐖 𝑠 , ( 2 . 2 ) where 𝐅 ( 𝑡 0 , 𝑡 ) is the flow of the equation, namely, the solution in 𝑑 × 𝑑 of the linear equation: 𝜕 𝐅 𝑡 0 , 𝑡 𝜕 𝑡 = 𝜶 ( 𝑡 ) 𝐅 ( 𝑡 ) , 𝐅 𝑡 0 , 𝑡 0 = 𝐈 𝑑 . ( 2 . 3 ) Note that the flow 𝐅 ( 𝑡 0 , 𝑡 ) enjoys the chain rule property: 𝐅 𝑡 0 , 𝑡 = 𝐅 𝑡 1 , 𝑡 𝐅 𝑡 0 , 𝑡 1 . ( 2 . 4 ) For all 𝑡 , 𝑠 such that 𝑡 0 < 𝑠 , 𝑡 , the vectors 𝐗 𝑡 and 𝐗 𝑠 admit the covariance 𝐂 𝑡 0 ( 𝑠 , 𝑡 ) = 𝐅 𝑡 0 , 𝑡 𝑡 𝑠 𝑡 0 𝐅 𝑤 , 𝑡 0 𝚪 ( 𝑤 ) 𝐅 𝑤 , 𝑡 0 𝑇 𝑑 𝑤 𝐅 𝑡 0 , 𝑠 𝑇 = 𝐅 𝑡 0 , 𝑡 𝐡 𝑡 0 ( 𝑠 , 𝑡 ) 𝐅 𝑡 0 , 𝑠 𝑇 , ( 2 . 5 ) where we further defined 𝐡 𝑢 ( 𝑠 , 𝑡 ) the function 𝐡 𝑢 ( 𝑠 , 𝑡 ) = 𝑡 𝑠 𝐅 ( 𝑤 , 𝑢 ) 𝚪 ( 𝑤 ) 𝐅 ( 𝑤 , 𝑢 ) 𝑇 𝑑 𝑤 , ( 2 . 6 ) which will be of particular interest in the sequel. Note that because of the chain rule property of the flow, we have 𝐡 𝑣 ( 𝑠 , 𝑡 ) = 𝐅 ( 𝑣 , 𝑢 ) 𝐡 𝑢 ( 𝑠 , 𝑡 ) 𝐅 ( 𝑣 , 𝑢 ) 𝑇 . ( 2 . 7 ) We suppose that the process 𝐗 is never degenerated, that is, for all 𝑡 0 < 𝑢 < 𝑣 , all the components of the vector 𝐗 𝑣 taking into account 𝐗 𝑢 are nondeterministic random variables, which is equivalent to saying that the covariance matrix of 𝐗 𝑣 taking into account 𝐗 𝑢 , denoted by 𝐂 𝑢 ( 𝑣 , 𝑣 ) is symmetric positive definite for any 𝑢 𝑣 . Therefore, assuming the initial condition 𝐗 0 = 𝟎 , the multidimensional centered process 𝐗 has a representation (similar to Doob’s representation for one-dimensional processes, see [35]) of form 𝐗 𝑡 = 𝐠 ( 𝑡 ) 𝑡 0 𝐟 ( 𝑠 ) 𝑑 𝐖 𝑠 , ( 2 . 8 ) with 𝐠 ( 𝑡 ) = 𝐅 ( 0 , 𝑡 ) and 𝐟 ( 𝑡 ) = 𝐅 ( 𝑡 , 0 ) Γ ( 𝑡 ) .

Note that the processes considered in this paper are defined on the time interval [ 0 , 1 ] . However, because of the time-rescaling property of these processes, considering the processes on this time interval is equivalent to considering the process on any other bounded interval without loss of generality.

2.1.2. Conditional Law and Gauss-Markov Bridges

As stated in the introduction, we aim at defining a multiresolution description of Gauss-Markov processes. Such a description can be seen as a multiresolution interpolation of the process that is getting increasingly finer. This principle, in addition to the Markov property, prescribes to characterize the law of the corresponding Gauss-Markov bridge, that is, the Gauss-Markov process under consideration, conditioned on its initial and final values. The bridge process of the Gauss process is still a Gauss process and, for a Markov process, its law can be computed as follows.

Proposition 2.1. Let 𝑡 𝑥 𝑡 𝑧 two times in the interval [ 0 , 1 ] . For any 𝑡 [ 𝑡 𝑥 , 𝑡 𝑧 ] , the random variable 𝐗 𝑡 conditioned on 𝐗 𝑡 𝑥 = 𝐱 and 𝐗 𝑡 𝑧 = 𝐳 is a Gaussian variable with covariance matrix 𝚺 ( 𝑡 ) and mean vector 𝝁 ( 𝑡 ) given by 𝚺 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 = 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐡 𝑡 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 𝐡 𝑡 𝑡 , 𝑡 𝑧 , 𝝁 ( 𝑡 ) = 𝝁 𝑙 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 𝐱 + 𝝁 𝑟 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 𝐳 , ( 2 . 9 ) where the continuous matrix functions 𝝁 𝑙 ( ; 𝑡 𝑥 , 𝑡 𝑧 ) and 𝝁 𝑟 ( ; 𝑡 𝑥 , 𝑡 𝑧 ) of 𝑑 × 𝑑 are given by 𝝁 𝑙 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 = 𝐅 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑥 𝑡 , 𝑡 𝑧 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝑧 1 , 𝝁 𝑟 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 = 𝐅 𝑡 𝑧 , 𝑡 𝐡 𝑡 𝑧 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑧 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 . ( 2 . 1 0 )

Note that the functions 𝜇 𝑙 and 𝜇 𝑟 have the property that 𝝁 𝑙 ( 𝑡 𝑥 ; 𝑡 𝑥 , 𝑡 𝑧 ) = 𝝁 𝑟 ( 𝑡 𝑧 ; 𝑡 𝑥 , 𝑡 𝑧 ) = 𝐈 𝑑 and 𝝁 𝑙 ( 𝑡 𝑧 ; 𝑡 𝑥 , 𝑡 𝑧 ) = 𝝁 𝑟 ( 𝑡 𝑥 ; 𝑡 𝑥 , 𝑡 𝑧 ) = 𝟎 ensuring that the process is indeed equal to 𝐱 at time 𝑡 𝑥 and 𝐳 at time 𝑡 𝑧 .

Proof. Let 𝑡 𝑥 , 𝑡 𝑧 be two times of the interval [ 0 , 1 ] such that 𝑡 𝑥 < 𝑡 𝑧 , and let 𝑡 [ 𝑡 𝑥 , 𝑡 𝑧 ] . We consider the Gaussian random variable 𝝃 = ( 𝐗 𝑡 , 𝐗 𝑡 𝑧 ) conditioned on the fact that 𝐗 𝑡 𝑥 = 𝐱 . Its mean can be easily computed from expression (2.2) and reads 𝐦 𝑡 , 𝐦 𝑡 𝑧 = 𝐅 𝑡 𝑥 , 𝑡 𝐱 , 𝐅 𝑡 𝑥 , 𝑡 𝑧 𝐱 = 𝐠 ( 𝑡 ) 𝐠 1 𝑡 𝑥 𝐱 , 𝐠 𝑡 𝑧 𝐠 1 𝑡 𝑥 𝐱 , ( 2 . 1 1 ) and its covariance matrix, from (2.5), reads 𝐂 𝑡 , 𝑡 𝐂 𝑡 , 𝑡 𝑧 𝐂 𝑡 𝑧 , 𝑡 𝐂 𝑡 𝑧 , 𝑡 𝑧 = 𝐅 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝐅 𝑡 𝑥 , 𝑡 𝑇 𝐅 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝐅 𝑡 𝑥 , 𝑡 𝑧 𝑇 𝐅 𝑡 𝑥 , 𝑡 𝑧 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝐅 𝑡 𝑥 , 𝑡 𝑇 𝐅 𝑡 𝑥 , 𝑡 𝑧 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝑧 𝐅 𝑡 𝑥 , 𝑡 𝑧 𝑇 = 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐅 𝑡 , 𝑡 𝑧 𝑇 𝐅 𝑡 , 𝑡 𝑧 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐅 𝑡 , 𝑡 𝑧 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝑧 𝐅 𝑡 , 𝑡 𝑧 𝑇 . ( 2 . 1 2 ) From there, we apply the conditioning formula for the Gaussian vectors (see, e.g., [36]) to infer the law of 𝐗 𝑡 conditioned on 𝐗 𝑡 𝑥 = 𝐱 and 𝐗 𝑡 𝑧 = 𝐳 , that is the law 𝒩 ( 𝝁 ( 𝑡 ) , 𝚺 ( 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 ) ) of 𝐁 𝑡 where 𝐁 denotes the bridge process obtained by pinning 𝐗 in 𝑡 𝑥 and 𝑡 𝑧 . The covariance matrix is given by 𝚺 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 = 𝐂 𝑦 , 𝑦 𝐂 𝑦 , 𝑧 𝐂 1 𝑧 , 𝑧 𝐂 𝑧 , 𝑦 = 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝑧 1 𝐡 𝑡 𝑡 𝑥 , 𝑡 = 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑡 𝑥 , 𝑡 𝑧 1 𝐡 𝑡 𝑡 , 𝑡 𝑧 , ( 2 . 1 3 ) and the mean reads 𝝁 ( 𝑡 ) = 𝐦 𝑦 + 𝐂 𝑦 , 𝑧 𝐂 1 𝑧 , 𝑧 𝐳 𝐦 𝑧 = 𝐅 𝑡 𝑥 , 𝑡 𝐈 𝑑 𝐡 𝑡 𝑥 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑥 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 𝐱 + 𝐅 𝑡 𝑧 , 𝑡 𝐡 𝑡 𝑧 𝑡 𝑥 , 𝑡 𝐡 𝑡 𝑧 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 𝐳 , = 𝐅 ( 𝑡 𝑥 , 𝑡 ) 𝐡 𝑡 𝑥 ( 𝑡 , 𝑡 𝑧 ) 𝐡 𝑡 𝑥 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 𝝁 𝑙 ( 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 ) 𝐱 + 𝐅 ( 𝑡 𝑧 , 𝑡 ) 𝐡 𝑡 𝑧 ( 𝑡 𝑥 , 𝑡 ) 𝐡 𝑡 𝑧 ( 𝑡 𝑥 , 𝑡 𝑧 ) 1 𝝁 𝑟 ( 𝑡 ; 𝑡 𝑥 , 𝑡 𝑧 ) 𝐳 , ( 2 . 1 4 ) where we have used the fact that 𝐡 𝑡 𝑥 ( 𝑡 𝑥 , 𝑡 𝑧 ) = 𝐡 𝑡 𝑧 ( 𝑡 𝑥 , 𝑡 ) + 𝐡 𝑡 𝑥 ( 𝑡 , 𝑡 𝑧 ) . The regularity of the thus-defined functions 𝝁 𝑥 and 𝝁 𝑧 directly stems from the regularity of the flow operator 𝐅 . Moreover, since for any 0 𝑡 , 𝑢 1 , we observe that 𝐅 ( 𝑡 , 𝑡 ) = 𝐈 𝑑 and 𝑢 ( 𝑡 , 𝑡 ) = 𝟎 ; we clearly have 𝝁 𝑥 ( 𝑡 𝑥 ) = 𝝁 𝑦 ( 𝑡 ) = 𝐈 𝑑 and 𝝁 𝑥 ( 𝑡 ) = 𝝁 𝑦 ( 𝑡 𝑥 ) = 0 .

Remark 2.2. Note that these laws can also be computed using the expression of the density of the processes but involve more intricate calculations. An alternative approach also provides a representation of Gauss-Markov bridges with the use of integral and anticipative representation [37]. These approaches allow to compute the probability distribution of the Gauss-Markov bridge as a process (i.e., allows to compute the covariances), but since this will be of no use in the sequel, we do not provide the expressions.

2.2. The Multiresolution Description of Gauss-Markov Processes

Recognizing the Gauss property and the Markov property as the two crucial elements for a stochastic process to be expanded to Lévy-Cesielski, our approach first proposes to exhibit bases of deterministic functions that would play the role of the Schauder bases for the Wiener process. In this regard, we first expect such functions to be continuous and compactly supported on increasingly finer supports (i.e., subintervals of the definition interval [ 0 , 1 ] ) in a similar nested binary tree structure. Then, as in the Lévy-Ciesielski construction, we envision that, at each resolution (i.e., on each support), the partially constructed process (up to the resolution of the support) has the same conditional expectation as the Gauss-Markov process when conditioned on the endpoints of the supports. The partial sums obtained with independent Gaussian coefficients of law 𝒩 ( 0 , 1 ) will thus approximate the targeted Gauss-Markov process in a multiresolution fashion, in the sense that, at every resolution, considering these two processes on the interval endpoints yields finite-dimensional Gaussian vectors of the same law.

2.2.1. Nested Structure of the Sequence of Supports

Here, we define the nested sequence of segments that constitute the supports of the multiresolution basis. We construct such a sequence by recursively partitioning the interval [ 0 , 1 ] .

More precisely, starting from 𝑆 1 , 0 = [ 𝑙 1 , 0 , 𝑟 1 , 0 ] with 𝑙 1 , 0 = 0 and 𝑟 1 , 0 = 1 , we iteratively apply the following operation. Suppose that, at the 𝑛 th step, the interval [ 0 , 1 ] is decomposed into 2 𝑛 1 intervals 𝑆 𝑛 , 𝑘 = [ 𝑙 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 ] , called supports, such that 𝑙 𝑛 , 𝑘 + 1 = 𝑟 𝑛 , 𝑘 for 0 𝑘 < 2 𝑛 1 . Each of these intervals is then subdivided into two child intervals, a left-child 𝑆 𝑛 + 1 , 2 𝑘 and a right-child 𝑆 𝑛 + 1 , 2 𝑘 + 1 , and the subdivision point 𝑟 𝑛 + 1 , 2 𝑘 = 𝑙 𝑛 + 1 , 2 𝑘 + 1 is denoted by 𝑚 𝑛 , 𝑘 . Therefore, we have defined three sequences of real 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 , and 𝑟 𝑛 , 𝑘 for 𝑛 > 0 and 0 𝑘 < 2 𝑛 1 satisfying 𝑙 0 , 0 = 0 𝑙 𝑛 , 𝑘 < 𝑚 𝑛 , 𝑘 < 𝑟 𝑛 , 𝑘 𝑟 0 , 0 = 1 and 𝑙 𝑛 + 1 , 2 𝑘 = 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 = 𝑟 𝑛 + 1 , 2 𝑘 = 𝑙 𝑛 + 1 , 2 𝑘 + 1 , 𝑟 𝑛 + 1 , 2 𝑘 + 1 = 𝑟 𝑛 , 𝑘 ( 2 . 1 5 ) with the convention 𝑙 0 , 0 = 0 and 𝑟 0 , 0 = 1 and 𝑆 0 , 0 = [ 0 , 1 ] . The resulting sequence of supports { 𝑆 𝑛 , 𝑘 ; 𝑛 0 , 0 𝑘 < 2 𝑛 1 } clearly has a binary tree structure.

For the sake of compactness of notations, we define the set of indices = 𝑛 < 𝑁 𝑛 w i t h 𝑁 = ( 𝑛 , 𝑘 ) 2 0 < 𝑛 𝑁 , 0 𝑘 < 2 𝑛 1 , ( 2 . 1 6 ) and for 𝑁 > 0 , we define 𝐷 𝑁 = { 𝑚 𝑛 , 𝑘 , ( 𝑛 , 𝑘 ) 𝑁 1 } { 0 , 1 } , the set of endpoints of the intervals 𝑆 𝑁 , 𝑘 . We additionally require that there exists 𝜌 ( 0 , 1 ) such that for all ( 𝑛 , 𝑘 ) m a x ( 𝑟 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 ) < 𝜌 ( 𝑟 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 ) which in particular implies that l i m 𝑛 s u p 𝑘 𝑟 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 = 0 ( 2 . 1 7 ) and ensures that the set of endpoints 𝑁 N 𝐷 𝑁 is everywhere dense in [ 0 , 1 ] . The simplest case of such partitions is the dyadic partition of [ 0 , 1 ] , where the endpoints for ( 𝑛 , 𝑘 ) read 𝑙 𝑛 , 𝑘 = 𝑘 2 𝑛 + 1 , 𝑚 𝑛 , 𝑘 = ( 2 𝑘 + 1 ) 2 𝑛 , 𝑟 𝑛 , 𝑘 = ( 𝑘 + 1 ) 2 𝑛 + 1 , ( 2 . 1 8 ) in which case the endpoints are simply the dyadic points 𝑁 𝐷 𝑁 = { 𝑘 2 𝑁 | 0 𝑘 2 𝑁 } . Figure 1 represents the global architecture of the nested sequence of intervals.

247329.fig.001
Figure 1: A sequence of nested intervals.

The nested structure of the supports, together with the constraint of continuity of the bases elements, implies that only a finite number of coefficients are needed to construct the exact value of the process at a given endpoint, thus providing us with an exact schema to simulate the sample values of the process on the endpoint up to an arbitrary resolution, as we will further explore.

2.2.2. Innovation Processes for Gauss-Markov Processes

For 𝐗 𝑡 , a multidimensional Gauss-Markov process, we call the multiresolution description of a process the sequence of conditional expectations on the nested sets of endpoints 𝐷 𝑛 . In detail, if we denote by 𝑁 the filtration generated by { 𝐗 𝑡 ; 𝑡 𝐷 𝑁 } given the values of the process at the endpoints 𝐷 𝑁 of the partition, we introduce the sequence of the Gaussian processes ( 𝐙 𝑁 𝑡 ) 𝑁 1 defined by: 𝐙 𝑁 𝑡 = 𝔼 𝐗 𝑡 𝑁 = 𝔼 𝑁 𝐗 𝑡 . ( 2 . 1 9 ) These processes 𝐙 𝑁 can be naturally viewed as an interpolation of the process 𝐗 sampled at the increasingly finer times 𝐷 𝑁 since for all 𝑡 𝐷 𝑁 we have 𝐙 𝑁 𝑡 = 𝐗 𝑁 𝑡 . The innovation process ( 𝜹 𝑁 𝑡 , 𝑡 , 𝑡 [ 0 , 1 ] ) is defined as the update transforming the process 𝐙 𝑁 𝑡 into 𝐙 𝑁 + 1 𝑡 , that is, 𝜹 𝑁 𝑡 = 𝐙 𝑁 + 1 𝑡 𝐙 𝑁 𝑡 . ( 2 . 2 0 ) It corresponds to the difference the additional knowledge of the process at the points 𝑚 𝑁 , 𝑘 make on the conditional expectation of the process. This process satisfies the following important properties that found our multiresolution construction.

Proposition 2.3. The innovation process 𝜹 𝑁 𝑡 is a centered Gaussian process independent of the processes 𝐙 𝑛 𝑡 for any 𝑛 𝑁 . For 𝑠 𝑆 𝑁 , 𝑘 and 𝑡 𝑆 𝑁 , 𝑝 with 𝑘 , 𝑝 𝑁 , the covariance of the innovation process reads 𝔼 𝑁 𝜹 𝑁 𝑡 𝜹 𝑁 𝑠 𝑇 = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝚺 𝑁 , 𝑘 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝑇 i f 𝑘 = 𝑝 , 𝟎 i f 𝑘 𝑝 , ( 2 . 2 1 ) where 𝝁 𝑁 , 𝑘 ( 𝑡 ) = 𝝁 𝑟 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 , 𝑡 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 , 𝝁 𝑙 𝑡 ; 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 , 𝑡 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 . ( 2 . 2 2 ) with 𝝁 𝑙 , 𝝁 𝑟 and 𝚺 𝑁 , 𝑘 = 𝚺 ( 𝑚 𝑁 , 𝑘 ; 𝑙 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 ) as defined in Proposition 2.1.

Proof. Because of the Markovian property of the process 𝐗 , the law of the process 𝐙 𝑁 can be computed from the bridge formula derived in Proposition 2.1 and we have 𝐙 𝑁 𝑡 = 𝝁 𝑙 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 𝐗 𝑙 𝑁 , 𝑘 + 𝝁 𝑟 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 𝐗 𝑟 𝑁 , 𝑘 , 𝐙 𝑁 + 1 𝑡 = 𝝁 𝑙 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 𝐗 𝑙 𝑁 , 𝑘 + 𝝁 𝑟 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 𝐗 𝑚 𝑁 , 𝑘 , f o r 𝑡 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 , 𝝁 𝑙 𝑡 ; 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 𝐗 𝑚 𝑁 , 𝑘 + 𝝁 𝑟 𝑡 ; 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 𝐗 𝑟 𝑁 , 𝑘 , f o r 𝑡 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 . ( 2 . 2 3 ) Therefore, the innovation process can be written for 𝑡 𝑆 𝑁 , 𝑘 as 𝜹 𝑁 𝑡 = 𝝁 𝑁 𝑁 , 𝑘 ( 𝑡 ) 𝐗 𝑚 𝑁 , 𝑘 + 𝝂 𝑁 ( 𝑡 ) 𝐐 𝑁 𝑡 , ( 2 . 2 4 ) where 𝐐 𝑁 𝑡 is a 𝑁 measurable process 𝝂 𝑁 ( 𝑡 ) a deterministic matrix function and 𝝁 𝑁 , 𝑘 ( 𝑡 ) = 𝝁 𝑟 𝑡 ; 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 , 𝑡 𝑙 𝑁 , 𝑘 , 𝑚 𝑁 , 𝑘 , 𝝁 𝑙 𝑡 ; 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 , 𝑡 𝑚 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 . ( 2 . 2 5 ) The expressions of 𝝂 and 𝐐 are quite complex but are highly simplified when noting that 𝔼 𝜹 𝑁 𝑡 𝑁 = 𝔼 𝐙 𝑁 + 1 𝑡 𝑁 𝐙 𝑁 𝑡 = 𝔼 𝔼 𝐙 𝑡 𝑁 + 1 𝑁 𝐙 𝑁 𝑡 = 𝟎 ( 2 . 2 6 ) directly implying that 𝝂 ( 𝑡 ) 𝐐 𝑁 𝑡 = 𝝁 𝑁 ( 𝑡 ) 𝐙 𝑁 𝑚 𝑁 , 𝑘 and yielding the remarkably compact expression 𝜹 𝑁 𝑡 = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝐗 𝑚 𝑁 , 𝑘 𝐙 𝑁 𝑚 𝑁 , 𝑘 . ( 2 . 2 7 ) This process is a centered Gaussian process. Moreover, observing that it is 𝑁 -measurable, it can be written as 𝜹 𝑁 𝑡 = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝐗 𝑚 𝑁 , 𝑘 𝑁 𝐙 𝑁 𝑚 𝑁 , 𝑘 , ( 2 . 2 8 ) and the process { 𝐗 𝑚 𝑁 , 𝑘 𝑁 } appears as the Gauss-Markov bridge conditioned at times 𝑙 𝑁 , 𝑘 and 𝑟 𝑁 , 𝑘 , and whose covariance is given by Proposition 2.1 and that has the expression 𝚺 𝑁 , 𝑘 = 𝚺 𝑚 𝑁 , 𝑘 ; 𝑙 𝑁 , 𝑘 , 𝑟 𝑁 , 𝑘 = 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝐡 𝑚 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 . ( 2 . 2 9 ) Let ( 𝑠 , 𝑡 ) [ 0 , 1 ] 2 , and assume that 𝑠 𝑆 𝑁 , 𝑘 and 𝑡 𝑆 𝑁 , 𝑝 . If 𝑘 𝑝 , then, because of the Markov property of the process 𝐗 , the two bridges are independent and therefore the covariance 𝔼 𝑁 [ 𝜹 𝑁 𝑡 ( 𝜹 𝑁 𝑠 ) 𝑇 ] is zero. If 𝑘 = 𝑝 , we have 𝔼 𝑁 𝜹 𝑁 𝑡 𝜹 𝑁 𝑠 𝑇 = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝚺 𝑁 , 𝑘 𝝁 𝑁 , 𝑘 ( 𝑠 ) 𝑇 . ( 2 . 3 0 ) Eventually, the independence property stems from the simple properties of the conditional expectation. Indeed, let 𝑛 𝑁 . We have 𝔼 𝐙 𝑛 𝑡 𝜹 𝑁 𝑠 𝑇 = 𝔼 𝐙 𝑛 𝑡 𝐙 𝑁 + 1 𝑠 𝐙 𝑁 𝑠 𝑇 = 𝔼 𝔼 𝐗 𝑡 𝑛 𝔼 𝐗 𝑇 𝑠 𝑁 + 1 𝔼 𝐗 𝑇 𝑠 𝑁 = 𝔼 𝔼 𝐗 𝑡 𝑛 𝔼 𝐗 𝑇 𝑠 𝑁 + 1 𝔼 𝔼 𝐗 𝑡 𝑛 𝔼 𝐗 𝑇 𝑠 𝑁 = 𝔼 𝐙 𝑛 𝑡 𝐙 𝑛 𝑠 𝑇 𝔼 𝐙 𝑛 𝑡 𝐙 𝑛 𝑠 𝑇 = 𝟎 ( 2 . 3 1 ) and the fact that a zero covariance between two Gaussian processes implies the independence of these processes concludes the proof.

2.2.3. Derivation of the Candidate Multiresolution Bases of Functions

We deduce from the previous proposition the following fundamental theorem of this paper.

Theorem 2.4. For all 𝑁 , there exists a collection of 𝝍 𝑁 , 𝑘 [ 0 , 1 ] 𝑑 × 𝑑 that are zero outside the subinterval 𝑆 𝑁 , 𝑘 such that in distribution one has: 𝜹 𝑁 𝑡 = 𝑘 𝑁 𝝍 𝑁 , 𝑘 ( 𝑡 ) 𝚵 𝑁 , 𝑘 , ( 2 . 3 2 ) where Ξ 𝑁 , 𝑘 are independent 𝑑 -dimensional standard normal random variables (i.e., of law 𝒩 ( 0 , 𝐈 𝑑 ) ). This basis of functions is unique up to an orthogonal transformation.

Proof. The two processes 𝜹 𝑁 𝑡 and 𝐝 𝑁 𝑡 d e f = 𝑘 𝑁 𝝍 𝑁 , 𝑘 ( 𝑡 ) Ξ 𝑁 , 𝑘 are two Gaussian processes of mean zero. Therefore, we are searching for functions 𝝍 𝑁 , 𝑘 vanishing outside 𝑆 𝑁 , 𝑘 and ensuring that the two processes have the same probability distribution. A necessary and sufficient condition for the two processes to have the same probability distribution is to have the same covariance function (see, e.g., [36]). We therefore need to show the existence of a collection of functions 𝝍 𝑁 , 𝑘 ( 𝑡 ) functions that vanish outside the subinterval 𝑆 𝑁 , 𝑘 and that ensure that the covariance of the process 𝐝 𝑁 is equal to the covariance of 𝜹 𝑁 . Let ( 𝑠 , 𝑡 ) [ 0 , 1 ] such that 𝑠 𝑆 𝑁 , 𝑘 and 𝑡 𝑆 𝑁 , 𝑝 . If 𝑘 𝑝 , the assumption fact that the functions 𝜓 𝑁 , 𝑘 vanish outside 𝑆 𝑁 , 𝑘 implies that 𝔼 𝐝 𝑁 𝑡 𝐝 𝑁 𝑠 𝑇 = 𝟎 . ( 2 . 3 3 ) If 𝑘 = 𝑝 , the covariance reads 𝔼 𝐝 𝑁 𝑡 𝐝 𝑁 𝑠 𝑇 = 𝔼 𝝍 𝑁 , 𝑘 ( 𝑡 ) 𝚵 𝑁 , 𝑘 𝚵 𝑇 𝑁 , 𝑘 𝝍 𝑁 , 𝑘 ( 𝑠 ) 𝑇 = 𝝍 𝑁 , 𝑘 ( 𝑡 ) 𝝍 𝑁 , 𝑘 ( 𝑠 ) 𝑇 , ( 2 . 3 4 ) which needs to be equal to the covariance of 𝜹 𝑁 , namely, 𝝍 𝑁 , 𝑘 ( 𝑡 ) 𝝍 𝑁 , 𝑘 ( 𝑠 ) 𝑇 = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝚺 𝑁 , 𝑘 𝝁 𝑁 , 𝑘 ( 𝑠 ) 𝑇 . ( 2 . 3 5 ) Therefore, since 𝝁 𝑁 , 𝑘 ( 𝑚 𝑁 , 𝑘 ) = 𝐈 𝑑 , we have 𝝍 𝑁 , 𝑘 𝑚 𝑁 , 𝑘 𝝍 𝑁 , 𝑘 𝑚 𝑁 , 𝑘 𝑇 = 𝚺 𝐍 , 𝐤 . ( 2 . 3 6 ) We can hence now define 𝝍 𝑁 , 𝑘 ( 𝑚 𝑁 , 𝑘 ) as a square root 𝝈 𝑁 , 𝑘 of the symmetric positive matrix 𝚺 𝐍 , 𝐤 , by fixing 𝑠 = 𝑚 𝑁 , 𝑘 in (2.35) 𝝍 𝑁 , 𝑘 ( 𝑡 ) 𝝈 𝑇 𝑁 , 𝑘 = 𝝁 ( 𝑡 ) 𝝈 𝑁 , 𝑘 𝝈 𝑇 𝑁 , 𝑘 . ( 2 . 3 7 ) Eventually, since by assumption we have that 𝚺 𝑁 , 𝑘 is invertible, so is 𝝈 𝑁 , 𝑘 , and the functions 𝝍 𝑁 , 𝑘 can be written as 𝝍 𝑁 , 𝑘 ( 𝑡 ) = 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝝈 𝑁 , 𝑘 ( 2 . 3 8 ) with 𝝈 𝑁 , 𝑘 being a square root of 𝚺 𝑁 , 𝑘 . Square roots of positive symmetric matrices are uniquely defined up to an orthogonal transformation. Therefore, all square roots of 𝚺 𝑁 , 𝑘 are related by orthogonal transformations 𝝈 𝑁 , 𝑘 = 𝝈 𝑁 , 𝑘 𝐎 𝑁 , 𝑘 , where 𝐎 𝑁 , 𝑘 𝐎 𝑇 𝑁 , 𝑘 = 𝐈 𝑑 . This property immediately extends to the functions 𝝍 𝑁 , 𝑘 we are studying: two different functions 𝝍 𝑁 , 𝑘 and 𝝍 𝑁 , 𝑘 satisfying the theorem differ from an orthogonal transformation 𝐎 𝑁 , 𝑘 . We proved that, for 𝝍 𝑁 , 𝑘 ( 𝑡 ) Ξ 𝑁 , 𝑘 to have the same law as 𝜹 𝑁 ( 𝑡 ) in the interval 𝑆 𝑁 , 𝑘 , the functions 𝝍 𝑁 , 𝑘 with support in 𝑆 𝑁 , 𝑘 are necessarily of the form 𝝁 𝑁 , 𝑘 ( 𝑡 ) 𝝈 𝑁 , 𝑘 . It is straightforward to show the sufficient condition that provided such a set of functions, the processes 𝜹 𝑁 𝑡 and 𝐝 𝑁 𝑡 are equal in law, which ends the proof of the theorem.

Using the expressions obtained in Proposition 2.1, we can make completely explicit the form of the basis in terms of the functions 𝐟 , 𝐠 , and 𝐡 : 𝝍 𝑛 , 𝑘 ( 𝑡 ) = 𝐠 ( 𝑡 ) 𝐠 1 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑡 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 , f o r 𝑙 𝑛 , 𝑘 𝑡 𝑚 𝑛 , 𝑘 , 𝐠 ( 𝑡 ) 𝐠 1 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑡 , 𝑟 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 , f o r 𝑚 𝑛 , 𝑘 𝑡 𝑟 𝑛 , 𝑘 , ( 2 . 3 9 ) and 𝝈 𝑛 , 𝑘 satisfies 𝝈 𝑛 , 𝑘 𝝈 𝑇 𝑛 , 𝑘 = 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝐡 𝑚 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 . ( 2 . 4 0 ) Note that 𝝈 𝑛 , 𝑘 can be defined uniquely as the symmetric positive square root, or as the lower triangular matrix resulting from the Cholesky decomposition of 𝚺 𝑛 , 𝑘 .

Let us now define the function 𝝍 0 , 0 [ 0 , 1 ] 𝑑 × 𝑑 such that the process 𝝍 0 , 0 ( 𝑡 ) Ξ 0 , 0 has the same covariance as 𝐙 0 𝑡 , which is computed using exactly the same technique as that developed in the proof of Theorem 2.4 and that has the expression 𝝍 0 , 0 ( 𝑡 ) = 𝐠 ( 𝑡 ) 𝐡 0 𝑙 0 , 0 , 𝑡 𝐡 0 ( 𝑙 0 , 0 , 𝑟 0 , 0 ) 1 𝐠 1 𝑟 0 , 0 𝝈 0 , 0 , ( 2 . 4 1 ) for 𝝈 0 , 0 , a square root of 𝐂 𝑟 0 , 0 the covariance matrix of 𝐗 𝑟 0 , 0 which from (2.5) reads 𝐅 ( 0 , 1 ) 𝐡 0 ( 1 , 1 ) 𝐅 ( 0 , 1 ) 𝑇 = 𝐠 ( 1 ) 𝐡 0 ( 1 , 1 ) ( 𝐠 ( 1 ) ) 𝑇 . ( 2 . 4 2 )

We are now in position to show the following corollary of Theorem 2.4.

Corollary 2.5. The Gauss-Markov process 𝐙 𝑁 𝑡 is equal in law to the process 𝐗 𝑁 𝑡 = 𝑁 1 𝑛 = 0 𝑘 𝑛 𝝍 𝑛 , 𝑘 ( 𝑡 ) 𝚵 𝑛 , 𝑘 , ( 2 . 4 3 ) where Ξ 𝑛 , 𝑘 are independent standard normal random variables 𝒩 ( 0 , 𝐈 𝑑 ) .

Proof. We have 𝐙 𝑁 𝑡 = 𝐙 𝑁 𝑡 𝐙 𝑁 1 𝑡 + 𝐙 𝑁 1 𝑡 𝐙 𝑁 2 𝑡 + + 𝐙 2 𝑡 𝐙 1 𝑡 + 𝐙 1 𝑡 = 𝑁 1 𝑛 = 1 𝜹 𝑛 𝑡 + 𝐙 1 𝑡 = 𝑁 1 𝑛 = 1 𝑘 𝑛 𝝍 𝑛 , 𝑘 ( 𝑡 ) 𝚵 𝑛 , 𝑘 + 𝝍 0 , 0 ( 𝑡 ) 𝚵 0 , 0 = 𝑁 1 𝑛 = 0 𝑘 𝑛 𝝍 𝑛 , 𝑘 ( 𝑡 ) 𝚵 𝑛 , 𝑘 . ( 2 . 4 4 )

We therefore identified a collection of functions { 𝝍 𝑛 , 𝑘 } ( 𝑛 , 𝑘 ) that allows a simple construction of the Gauss-Markov process iteratively conditioned on increasingly finer partitions of the interval [ 0 , 1 ] . We will show that this sequence 𝐙 𝑁 𝑡 converges almost surely towards the Gauss-Markov process 𝐗 𝑡 used to construct the basis, proving that these finite-dimensional continuous functions 𝐙 𝑁 𝑡 form an asymptotically accurate description of the initial process. Beforehand, we rigorously study the Hilbertian properties of the collection of functions we just defined.

3. The Multiresolution Schauder Basis Framework

The above analysis motivates the introduction of a set of functions { 𝝍 𝑛 , 𝑘 } ( 𝑛 , 𝑘 ) we now study in details. In particular, we enlighten the structure of the collection of functions 𝝍 𝑛 , 𝑘 as a Schauder basis in a certain space 𝒳 of continuous functions from [ 0 , 1 ] to 𝑑 . The Schauder structure was defined in [38, 39], and its essential characterization is the unique decomposition property: namely that every element 𝑥 in 𝒳 can be written as a well-formed linear combination 𝑥 = ( 𝑛 , 𝑘 ) 𝝍 𝑛 , 𝑘 𝝃 𝑛 , 𝑘 , ( 3 . 1 ) and that the coefficients satisfying the previous relation are unique.

3.1. System of Dual Bases

To complete this program, we need to introduce some quantities that will play a crucial role in expressing the family 𝝍 𝑛 , 𝑘 as a Schauder basis for some given space. In (2.39), two constant matrices 𝑑 × 𝑑 appear that will have a particular importance in the sequel for ( 𝑛 , 𝑘 ) in with 𝑛 0 : 𝐋 𝑛 , 𝑘 = 𝐠 𝑇 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 = 𝐡 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝐠 1 𝑚 𝑛 , 𝑘 𝝈 𝑛 , 𝑘 , 𝐑 𝑛 , 𝑘 = 𝐠 𝑇 𝑚 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 = 𝐡 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝐠 1 𝑚 𝑛 , 𝑘 𝝈 𝑛 , 𝑘 , ( 3 . 2 ) where 𝐡 stands for 𝐡 0 . We further define the matrix 𝐌 𝑛 , 𝑘 = 𝐠 𝑇 𝑚 𝑛 , 𝑘 𝝈 1 𝑛 , 𝑘 𝑇 ( 3 . 3 ) and we recall that 𝝈 𝑛 , 𝑘 is a square root of 𝚺 𝑛 , 𝑘 , the covariance matrix of 𝐗 𝑚 𝑛 , 𝑘 , conditionally to 𝐗 𝑙 𝑛 , 𝑘 and 𝐗 𝑟 𝑛 , 𝑘 , given in (2.29). We stress that the matrices 𝐋 𝑛 , 𝑘 , 𝐑 𝑛 , 𝑘 , 𝐌 𝑛 , 𝑘 , and 𝚺 𝑛 , k are all invertible and satisfy the important following properties.

Proposition 3.1. For all ( 𝑛 , 𝑘 ) in , 𝑛 0 , one has:(i) 𝐌 𝑛 , 𝑘 = 𝐋 𝑛 , 𝑘 + 𝐑 𝑛 , 𝑘 (ii) 𝚺 1 𝑛 , 𝑘 = ( 𝐡 𝑚 𝑛 , 𝑘 ( 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 ) ) 1 + ( 𝐡 𝑚 𝑛 , 𝑘 ( 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 ) ) 1 .

To prove this proposition, we first establish the following simple lemma of linear algebra.

Lemma 3.2. Given two invertible matrices 𝐴 and 𝐵 in 𝐺 𝐿 𝑛 ( ) such that 𝐶 = 𝐴 + 𝐵 is also invertible, if one defines 𝐷 = 𝐴 𝐶 1 𝐵 , one has the following properties:(i) 𝐷 = 𝐴 𝐶 1 𝐵 = 𝐵 𝐶 1 𝐴 (ii) 𝐷 1 = 𝐴 1 + 𝐵 1 .

Proof. (i) 𝐷 = 𝐴 𝐶 1 𝐵 = ( 𝐶 𝐵 ) 𝐶 1 𝐵 = 𝐵 𝐵 𝐶 1 𝐵 = 𝐵 ( 𝐼 𝐶 1 𝐵 ) = 𝐵 𝐶 1 ( 𝐶 𝐵 ) = 𝐵 𝐶 1 𝐴 .(ii) ( 𝐴 1 + 𝐵 1 ) 𝐷 = 𝐴 1 𝐷 + 𝐵 1 𝐷 = 𝐴 1 𝐴 𝐶 1 𝐵 + 𝐵 1 𝐵 𝐶 1 𝐴 = 𝐶 1 ( 𝐵 + 𝐴 ) = 𝐶 1 𝐶 = 𝐼 .

Proof of Proposition 3.1. (ii) Directly stems from Lemma 3.2, item (ii) by posing 𝐴 = 𝐡 𝑚 𝑛 , 𝑘 ( 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 ) , 𝐵 = 𝐡 𝑚 𝑛 , 𝑘 ( 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 ) , and 𝐶 = 𝐴 + 𝐵 = 𝐡 𝑚 𝑛 , 𝑘 ( 𝑙 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 ) . Indeed, the lemma implies that 𝐷 1 = 𝐴 1 𝐶 𝐵 1 = 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 = 𝚺 1 𝑛 , 𝑘 . ( 3 . 4 ) (i)We have 𝐋 𝑛 , 𝑘 + 𝐑 𝑛 , 𝑘 = 𝐠 𝑚 𝑛 , 𝑘 𝑇 𝐡 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 + 𝐡 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 = 𝐠 𝑚 𝑛 , 𝑘 𝑇 𝚺 1 𝑛 , 𝑘 𝝈 𝑛 , 𝑘 = 𝐠 𝑚 𝑛 , 𝑘 𝑇 𝝈 1 𝑛 , 𝑘 𝑇 , ( 3 . 5 ) which ends the demonstration of the proposition.

Let us define 𝐋 0 , 0 = ( 𝐡 ( 𝑙 0 , 0 , 𝑟 0 , 0 ) ) 1 𝐠 1 ( 𝑟 0 , 0 ) 𝝈 0 , 0 . With this notations we define the functions in a compact form as follows.

Definition 3.3. For every ( 𝑛 , 𝑘 ) in with 𝑛 0 , the continuous functions 𝝍 𝑛 , 𝑘 are defined on their support 𝑆 𝑛 , 𝑘 as 𝝍 𝑛 , 𝑘 ( 𝑡 ) = 𝐠 ( 𝑡 ) 𝐡 𝑙 𝑛 , 𝑘 , 𝑡 𝐋 𝑛 , 𝑘 , 𝑙 𝑛 , 𝑘 𝑡 𝑚 𝑛 , 𝑘 , 𝐠 ( 𝑡 ) 𝐡 𝑡 , 𝑟 𝑛 , 𝑘 𝐑 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 𝑡 𝑟 𝑛 , 𝑘 , ( 3 . 6 ) and the basis element 𝝍 0 , 0 is given on [ 0 , 1 ] by 𝝍 0 , 0 ( 𝑡 ) = 𝐠 ( 𝑡 ) 𝐡 𝑙 0 , 0 , 𝑡 𝐋 0 , 0 . ( 3 . 7 )

The definition implies that 𝝍 𝑛 , 𝑘 are continuous functions in the space of piecewise derivable functions with piecewise continuous derivative which takes value zero at zero. We denote such a space by 𝐶 1 0 ( [ 0 , 1 ] , 𝑑 × 𝑑 ) .

Before studying the property of the functions 𝝍 𝑛 , 𝑘 , it is worth remembering that their definitions include the choice of a square root 𝝈 𝑛 , 𝑘 of 𝚺 𝑛 , 𝑘 . Properly speaking, there is thus a class of bases 𝝍 𝑛 , 𝑘 and all the points we develop in the sequel are valid for this class. However, for the sake of simplicity, we consider from now on that the basis under scrutiny results from choosing the unique square root 𝝈 𝑛 , 𝑘 that is lower triangular with positive diagonal entries (the Cholesky decomposition).

3.1.1. Underlying System of Orthonormal Functions

We first introduce a family of functions 𝜙 𝑛 , 𝑘 and show that it constitutes an orthogonal basis on a certain Hilbert space. The choice of this basis can seem arbitrary at first sight, but the definition of these function will appear natural for its relationship with the functions 𝝍 𝑛 , 𝑘 and Φ 𝑛 , 𝑘 that is made explicit in the sequel, and the mathematical rigor of the argument lead us to choose this apparently artificial introduction.

Definition 3.4. For every ( 𝑛 , 𝑘 ) in with 𝑛 0 , we define a continuous function 𝜙 𝑛 , 𝑘 [ 0 , 1 ] 𝑚 × 𝑑 which is zero outside its support 𝑆 𝑛 , 𝑘 and has the expressions: 𝜙 𝑛 , 𝑘 ( 𝑡 ) = 𝐟 ( 𝑡 ) 𝑇 𝐋 𝑛 , 𝑘 i f 𝑙 𝑛 , 𝑘 𝑡 < 𝑚 𝑛 , 𝑘 , 𝐟 ( 𝑡 ) 𝑇 𝐑 𝑛 , 𝑘 i f 𝑚 𝑛 , 𝑘 𝑡 < 𝑟 𝑛 , 𝑘 . ( 3 . 8 ) The basis element 𝜙 0 , 0 is defined on [ 0 , 1 ] by 𝜙 0 , 0 ( 𝑡 ) = 𝐟 ( 𝑡 ) 𝑇 𝐋 0 , 0 . ( 3 . 9 )

Remark that the definitions make apparent the fact that these two families of functions are linked for all ( 𝑛 , 𝑘 ) in through the simple relation 𝝍 𝑛 , 𝑘 = 𝜶 𝝍 𝑛 , 𝑘 + 𝚪 𝜙 𝑛 , 𝑘 . ( 3 . 1 0 ) Moreover, this collection of functions 𝜙 𝑛 , 𝑘 constitutes an orthogonal basis of functions, in the following sense.

Proposition 3.5. Let 𝐿 2 𝐟 be the closure of 𝐮 [ 0 , 1 ] 𝑚 𝐯 𝐿 2 [ 0 , 1 ] , 𝑑 , 𝐮 = 𝐟 𝑇 𝐯 , ( 3 . 1 1 ) equipped with the natural norm of 𝐿 2 ( [ 0 , 1 ] , 𝑚 ) . It is a Hilbert space, and moreover, for all 0 𝑗 < 𝑑 , the family of functions 𝑐 𝑗 ( 𝜙 𝑛 , 𝑘 ) defined as the columns of 𝜙 𝑛 , 𝑘 , namely 𝑐 𝑗 𝜙 𝑛 , 𝑘 = 𝜙 𝑛 , 𝑘 𝑖 , 𝑗 0 𝑖 < 𝑚 , ( 3 . 1 2 ) forms a complete orthonormal basis of 𝐿 2 𝐟 .

Proof. The space 𝐿 2 𝐟 is clearly a Hilbert space as a closed subspace of the larger Hilbert space 𝐿 2 ( [ 0 , 1 ] , 𝑚 ) is equipped with the standard scalar product: 𝐮 , 𝐯 𝐿 2 [ 0 , 1 ] , 𝑑 , ( 𝐮 , 𝐯 ) = 1 0 𝐮 ( 𝑡 ) 𝑇 𝐯 ( 𝑡 ) 𝑑 𝑡 . ( 3 . 1 3 ) We now proceed to demonstrate that the columns of 𝜙 𝑛 , 𝑘 form an orthonormal family which generates a dense subspace of 𝐿 2 𝐟 . To this end, we define 𝑀 ( [ 0 , 1 ] , 𝑚 × 𝑑 ) as the space of functions 𝐀 [ 0 , 1 ] 𝑚 × 𝑑 𝑗 0 𝑗 < 𝑑 , 𝑡 𝐀 𝑖 , 𝑗 ( 𝑡 ) 0 𝑖 < 𝑚 𝐿 2 ( [ 0 , 1 ] , 𝑚 ) , ( 3 . 1 4 ) that is, the space of functions that take values in the set of 𝑚 × 𝑑 -matrices whose columns are in 𝐿 2 ( [ 0 , 1 ] , 𝑚 ) . This definition allows us to define the bilinear function 𝒫 𝑀 ( [ 0 , 1 ] , 𝑚 × 𝑑 ) × 𝑀 ( [ 0 , 1 ] , 𝑚 × 𝑑 ) 𝑑 × 𝑑 as 𝒫 ( 𝐀 , 𝐁 ) = 1 0 𝐀 ( 𝑡 ) 𝑇 𝐁 ( 𝑡 ) 𝑑 𝑡 s a t i s f y i n g 𝒫 ( 𝐁 , 𝐀 ) = 𝒫 ( 𝐀 , 𝐁 ) 𝑇 , ( 3 . 1 5 ) and we observe that the columns of 𝜙 𝑛 , 𝑘 form an orthonormal system if and only if ( ( 𝑝 , 𝑞 ) , ( 𝑛 , 𝑘 ) ) × , 𝒫 𝜙 𝑛 , 𝑘 , 𝜙 𝑝 , 𝑞 = 1 0 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝑇 𝜙 𝑝 , q ( 𝑡 ) 𝑑 𝑡 = 𝛿 𝑛 , 𝑘 𝑝 , 𝑞 𝐈 𝑑 , ( 3 . 1 6 ) where 𝛿 𝑛 , 𝑘 𝑝 , 𝑞 is the Kronecker delta function, whose value is 1 if 𝑛 = 𝑝 and 𝑘 = 𝑞 , and 0 otherwise.
First of all, since the functions 𝜙 𝑛 , 𝑘 are zero outside the interval 𝑆 𝑛 , 𝑘 , the matrix 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝜙 𝑝 , 𝑞 ) is nonzero only if 𝑆 𝑛 , 𝑘 𝑆 𝑝 , 𝑞 . In such cases, assuming that 𝑛 𝑝 and, for example, that 𝑛 < 𝑝 , we necessarily have 𝑆 𝑛 , 𝑘 strictly included in 𝑆 𝑝 , 𝑞 : more precisely, 𝑆 𝑛 , 𝑘 is either included in the left-child support 𝑆 𝑝 + 1 , 2 𝑞 or in the right-child support 𝑆 𝑝 + 1 , 2 𝑞 + 1 of 𝑆 𝑝 , 𝑞 . In both cases, writing the matrix 𝒫 ( 𝜙 𝑛 , 𝑘 ( 𝑡 ) , 𝜙 𝑝 , 𝑞 ) shows that it is expressed as a matrix product whose factors include 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝐟 𝑇 ) . We then show that 𝒫 𝜙 𝑛 , 𝑘 , 𝐟 𝑇 = 1 0 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝑇 𝐟 ( 𝑡 ) 𝑇 = 𝐋 𝑇 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 𝐟 ( 𝑢 ) 𝐟 𝑇 ( 𝑢 ) 𝑑 𝑢 𝐑 𝑇 𝑛 , 𝑘 𝑟 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 𝐟 ( 𝑢 ) 𝐟 𝑇 ( 𝑢 ) 𝑑 𝑢 , = 𝐋 𝑇 𝑛 , 𝑘 𝐡 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 𝐑 𝑇 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 = 𝝈 𝑇 𝑛 , 𝑘 𝐠 1 𝑚 𝑛 , 𝑘 𝑇 𝝈 𝑇 𝑛 , 𝑘 𝐠 1 𝑚 𝑛 , 𝑘 𝑇 , ( 3 . 1 7 ) which entails that 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝐟 𝑇 ) = 0 if 𝑛 < 𝑝 . If 𝑛 > 𝑝 , we remark that 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝜙 𝑝 , 𝑞 ) = 𝒫 ( 𝜙 𝑝 , 𝑞 , 𝜙 𝑛 , 𝑘 ) 𝑇 , and we conclude that 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝜙 𝑝 , 𝑞 ) = 𝟎 from the preceding case. For 𝑛 = 𝑝 , we directly compute for 𝑛 > 0 the only nonzero term 𝒫 𝜙 𝑛 , 𝑘 , 𝜙 𝑛 , 𝑘 = 𝐋 𝑇 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 𝐟 ( 𝑢 ) 𝐟 𝑇 ( 𝑢 ) 𝑑 𝑢 𝐋 𝑛 , 𝑘 + 𝐑 𝑇 𝑛 , 𝑘 𝑟 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 𝐟 ( 𝑢 ) 𝐟 𝑇 ( 𝑢 ) 𝑑 𝑢 𝐑 𝑛 , 𝑘 , = 𝝈 𝑇 𝑛 , 𝑘 𝐠 1 𝑚 𝑛 , 𝑘 𝑇 𝐡 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝐠 1 𝑚 𝑛 , 𝑘 𝝈 𝑛 , 𝑘 + 𝝈 𝑇 𝑛 , 𝑘 𝐠 1 𝑚 𝑛 , 𝑘 𝑇 𝐡 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝐠 1 𝑚 𝑛 , 𝑘 𝝈 𝑛 , 𝑘 . ( 3 . 1 8 ) Using the passage relationship between the symmetric functions 𝐡 and 𝐡 𝑚 𝑛 , 𝑘 given in (2.7), we can then write 𝒫 𝜙 𝑛 , 𝑘 , 𝜙 𝑛 , 𝑘 = 𝝈 𝑇 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 + 𝝈 𝑇 𝑛 , 𝑘 𝐡 𝑚 𝑛 , 𝑘 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 1 𝝈 𝑛 , 𝑘 . ( 3 . 1 9 ) Proposition 3.1 implies that 𝐡 𝑚 𝑛 , 𝑘 ( 𝑙 𝑛 , 𝑘 , 𝑚 𝑛 , 𝑘 ) 1 + 𝐡 𝑚 𝑛 , 𝑘 ( 𝑚 𝑛 , 𝑘 , 𝑟 𝑛 , 𝑘 ) 1 = 𝚺 1 𝑛 , 𝑘 = ( 𝝈 1 𝑛 , 𝑘 ) 𝑇 𝝈 1 𝑛 , 𝑘 which directly implies that 𝒫 ( 𝜙 𝑛 , 𝑘 , 𝜙 𝑇 𝑛 , 𝑘 ) = 𝐈 𝑑 . For 𝑛 = 0 , a computation of the exact same flavor yields that 𝒫 ( 𝜙 0 , 0 , 𝜙 0 , 0 ) = 𝐈 𝑑 . Hence, we have proved that the collection of columns of 𝜙 𝑛 , 𝑘 forms an orthonormal family of functions in 𝐿 2 𝐟 (the definition of 𝜙 𝑛 , 𝑘 clearly states that its columns can be written in the form of elements of 𝐿 2 𝐟 ).
The proof now amounts showing the density of the family of functions we consider. Before showing this density property, we introduce for all ( 𝑛 , 𝑘 ) in the functions 𝐏 𝑛 , 𝑘 [ 0 , 1 ] 𝑑 × 𝑑 with support on 𝑆 𝑛 , 𝑘 defined by 𝐏 𝑛 , 𝑘 ( 𝑡 ) = 𝐋 𝑛 , 𝑘 i f 𝑙 𝑛 , 𝑘 𝑡 < 𝑚 𝑛 , 𝑘 , 𝐑 𝑛 , 𝑘 i f 𝑚 𝑛 , 𝑘 𝑡 < 𝑟 𝑛 , 𝑘 , 𝑛 0 , 𝐏 0 , 0 ( 𝑡 ) = 𝐋 0 , 0 . ( 3 . 2 0 ) Showing that the family of columns of 𝜙 𝑛 , 𝑘 is dense in 𝐿 2 𝐟 is equivalent to show that the column vectors of the matrices 𝐏 𝑛 , 𝑘 seen as a function of 𝑡 are dense in 𝐿 2 ( [ 0 , 1 ] , 𝑑 ) . It is enough to show that the span of such functions contains the family of piecewise continuous 𝑑 -valued functions that are to be constant on 𝑆 𝑛 , 𝑘 , ( 𝑛 , 𝑘 ) in (the density of the endpoints of the partition 𝑁 𝐷 𝑁 entails that the latter family generates 𝐿 2 ( [ 0 , 1 ] , 𝑑 ) ).
In fact, we show that the span of functions 𝑉 𝑁 = s p a n 𝑡 𝑐 𝑗 𝐏 𝑛 , 𝑘 ( 𝑡 ) 0 𝑗 < 𝑑 , ( 𝑛 , 𝑘 ) 𝑁 ( 3 . 2 1 ) is exactly equal to the space 𝐾 𝑁 of piecewise continuous functions from [ 0 , 1 ] to 𝑑 that are constant on the supports 𝑆 𝑁 + 1 , 𝑘 , for any ( 𝑁 + 1 , 𝑘 ) in . The fact that 𝑉 𝑁 is included in 𝐾 𝑁 is clear from the fact that the matrix-valued functions 𝐏 𝑁 , 𝑘 are defined constant on the support 𝑆 𝑁 + 1 , 𝑘 , for ( 𝑁 , 𝑘 ) in 𝐼 .
We prove that 𝐾 𝑁 is included in 𝑉 𝑁 by induction on 𝑁 0 . The property is clearly true at rank 𝑁 = 0 since 𝐏 0 , 0 is then equal to the constant invertible matrix 𝐋 0 , 0 . Assuming that the proposition true at rank 𝑁 1 for a given 𝑁 > 0 , let us consider a piecewise continuous function 𝐜 [ 0 , 1 ] 𝑑 in 𝐾 𝑁 1 . Remark that, for every ( 𝑁 , 𝑘 ) in , the function 𝐜 can only take two values on 𝑆 𝑁 , 𝑘 and can have discontinuity jump in 𝑚 𝑁 , 𝑘 : let us denote these jumps as 𝐝 𝑁 , 𝑘 = 𝐜 𝑚 + 𝑁 , 𝑘 𝐜 𝑚 𝑁 , 𝑘 . ( 3 . 2 2 ) Now, remark that for every ( 𝑁 , 𝑘 ) in , the matrix-valued functions 𝐏 𝑁 , 𝑘 take only two matrix values on 𝑆 𝑁 , 𝑘 , namely, 𝐋 𝑁 , 𝑘 and 𝐑 𝑁 , 𝑘 . From Proposition 3.1, we know that 𝐋 𝑁 , 𝑘 + 𝐑 𝑁 , 𝑘 = 𝐌 𝑁 , 𝑘 is invertible. This fact directly entails that there exist vectors 𝐚 𝑁 , 𝑘 , for any ( 𝑁 , 𝑘 ) in , such that 𝐝 𝑁 , 𝑘 = ( 𝐋 𝑁 , 𝑘 + 𝐑 𝑁 , 𝑘 ) ( 𝐚 𝑁 , 𝑘 ) . We then necessarily have that the function 𝐜 = 𝐜 + 𝐏 𝑛 , 𝑘 𝐚 𝑛 , 𝑘 is piecewise constant on the supports 𝑆 𝑁 , 𝑘 , ( 𝑁 , 𝑘 ) in . By recurrence hypothesis, 𝐜 belongs to 𝑉 𝑁 1 , so that 𝐜 belongs to 𝑉 𝑁 , and we have proved that 𝐾 𝑁 𝑉 𝑁 . Therefore, the space generated by the column vectors 𝑃 𝑛 , 𝑘 is dense in 𝐿 2 [ 0 , 1 ] , which completes the proof that the functions 𝑡 [ ( 𝜙 𝑛 , 𝑘 ( 𝑡 ) ) 𝑖 , 𝑗 ] 0 𝑖 < 𝑚 form a complete orthonormal family of 𝐿 2 [ 0 , 1 ] .

The fact that the column functions of 𝜙 𝑛 , 𝑘 form a complete orthonormal system of 𝐿 2 𝐟 directly entails the following decomposition of the identity on 𝐿 2 𝐟 .

Corollary 3.6. If 𝛿 is the real delta Dirac function, one has ( 𝑛 , 𝑘 ) 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝜙 𝑇 𝑛 , 𝑘 ( 𝑠 ) = 𝛿 ( 𝑡 𝑠 ) 𝐼 𝑑 𝐿 2 𝐟 . ( 3 . 2 3 )

Proof. Indeed it easy to verify that, for all 𝐯 in 𝐿 2 𝐟 , we have for all 𝑁 > 0 𝑈 ( 𝑛 , 𝑘 ) 𝑁 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝜙 𝑇 𝑛 , 𝑘 ( 𝑠 ) 𝐯 ( 𝑠 ) 𝑑 𝑠 = ( 𝑛 , 𝑘 ) 𝑁 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝒫 𝜙 𝑛 , 𝑘 , 𝐯 = ( 𝑛 , 𝑘 ) 𝑁 𝑑 1 𝑝 = 0 𝑐 𝑝 𝜙 𝑛 , 𝑘 𝑐 𝑝 𝜙 𝑛 , 𝑘 , 𝐯 , ( 3 . 2 4 ) where ( 𝑐 𝑝 ( 𝜙 𝑛 , 𝑘 ) , 𝐯 ) denotes the inner product in 𝐿 2 𝐟 between 𝐯 and the 𝑝 -column of 𝝍 𝑛 , 𝑘 . Therefore, by the Parseval identity, we have in the 𝐿 2 𝐟 sense 𝑈 ( 𝑛 , 𝑘 ) 𝜙 𝑛 , 𝑘 ( 𝑡 ) 𝜙 𝑇 𝑛 , 𝑘 ( 𝑠 ) 𝐯 ( 𝑠 ) 𝑑 𝑠 = 𝐯 ( 𝑡 ) . ( 3 . 2 5 )

From now on, abusing language, we will say that the family of 𝑚 × 𝑑 -valued functions 𝜙 𝑛 , 𝑘 is an orthonormal family of functions to refer to the fact that the columns of such matrices form orthonormal set of 𝐿 2 𝐟 . We now make explicit the relationship between this orthonormal basis and our functions ( 𝝍 𝑛 , 𝑘 ) derived in our analysis of the multidimensional Gauss-Markov processes.

3.1.2. Generalized Dual Operators

The Integral Operator 𝓚
The basis 𝜙 𝑛 , 𝑘 is of great interest in this paper for its relationship to the functions 𝝍 𝑛 , 𝑘 that naturally arise in the decomposition of the Gauss-Markov processes. Indeed, the collection 𝝍 𝑛 , 𝑘 can be generated from the orthonormal basis 𝜙 𝑛 , 𝑘 through the action of the integral operator 𝒦 defined on 𝐿 2 ( [ 0 , 1 ] , 𝑚 ) into 𝐿 2 ( [ 0 , 1 ] , 𝑑 ) by 𝐮 𝒦 [ 𝐮 ] = 𝑡 𝐠 ( 𝑡 ) 𝑈 𝟙 [ 0 , 𝑡 ] ( 𝑠 ) 𝐟 ( 𝑠 ) 𝐮 ( 𝑠 ) 𝑑 𝑠 , ( 3 . 2 6 ) where 𝑈 [ 0 , 1 ] is an open set and, for any set 𝐸 𝑈 , 𝟙 𝐸 ( ) denotes the indicator function of 𝐸 . Indeed, realizing that 𝒦 acts on 𝑀 ( [ 0 , 1 ] , 𝑚 × 𝑑 ) into 𝑀 ( [ 0 , 1 ] , 𝑑 × 𝑑 ) through 𝐀 𝑀 [ 0 , 1 ] , m × 𝑑 , 𝒦 [ 𝐀 ] = 𝒦 𝑐 0 ( 𝐀 ) , , 𝒦 𝑐 𝑑 1 ( 𝐀 ) , ( 3 . 2 7 ) where 𝑐 𝑗 ( 𝐀 ) denotes the 𝑗 th 𝑚 -valued column function of 𝐀 , we easily see that for all ( 𝑛 , 𝑘 ) in , 0 𝑡 1 , 𝝍 𝑛 , 𝑘 ( 𝑡 ) = 𝐠 ( 𝑡 ) 𝑡 0 𝐟 ( 𝑠 ) 𝜙 𝑛 , 𝑘 ( 𝑠 ) 𝑑 𝑠 = 𝒦 𝜙 𝑛 , 𝑘 ( 𝑡 ) . ( 3 . 2 8 ) It is worth noticing that the introduction of the operator 𝒦 can be considered natural since it characterizes the centered Gauss-Markov process 𝐗 through loosely writing 𝐗 = 𝒦 [ 𝐝 𝐖 ] .
In order to exhibit a dual family of functions to the basis 𝝍 𝑛 , 𝑘 , we further investigate the property of the integral operator 𝒦 . In particular, we study the existence of an inverse operator 𝒟 , whose action on the orthonormal basis 𝜙 𝑛 , 𝑘 will conveniently provide us with a dual basis to 𝝍 𝑛 , 𝑘 . Such an operator does not always exist; nevertheless, under special assumptions, it can be straightforwardly expressed as a generalized differential operator.

The Differential Operator 𝓓
Here, we make the assumptions that 𝑚 = 𝑑 , that, for all 𝑡 , 𝐟 ( 𝑡 ) is invertible in 𝑑 × 𝑑 , and that 𝐟 and 𝐟 1 have continuous derivatives, which especially implies that 𝐿 2 𝐟 = 𝐿 2 ( 𝑑 ) . In this setting, we define the space 𝐷 0 ( 𝑈 , 𝑑 ) of functions in 𝐶 0 ( 𝑈 , 𝑑 ) that are zero at zero and denote by 𝐷 0 ( 𝑈 , 𝑑 ) its dual in the space of distributions (or generalized functions). Under the assumptions just made, the operator 𝒦 𝐷 0 ( 𝑈 , 𝑑 ) 𝐷 0 ( 𝑈 , 𝑑 ) admits the differential operator 𝒟 𝐷 0 ( 𝑈 , 𝑑 ) 𝐷 0 ( 𝑈 , 𝑑 ) defined by 𝐮 𝐷 0 𝑈 , 𝑑 𝒟 [ 𝐮 ] = 𝑡 𝐟 1 ( 𝑡 ) 𝑑 𝑑 𝑡 𝐠 1 ( 𝑡 ) 𝐮 ( 𝑡 ) ( 3 . 2 9 ) as its inverse, that is, when restricted to 𝐷 0 ( 𝑈 , 𝑑 ) , we have 𝒟 𝒦 = 𝒦 𝒟 = 𝐼 𝑑 on 𝐷 0 ( 𝑈 , 𝑑 ) . The dual operators of 𝒦 and 𝒟 are expressed, for any 𝐮 in 𝐷 0 ( 𝑈 , 𝑑 ) , as 𝒟 [ 𝐮 ] = 𝑡 𝐠 1 ( 𝑡 ) 𝑇 𝑑 𝑑 𝑡 𝐟 1 ( 𝑡 ) 𝑇 𝐮 ( 𝑡 ) , 𝒦 [ 𝐮 ] = 𝑡 𝐟 ( 𝑡 ) 𝑇 𝑈