K-Optimal Gradient Encoding Scheme for Fourth-Order Tensor-Based Diffusion Profile Imaging
The design of an optimal gradient encoding scheme (GES) is a fundamental problem in diffusion MRI. It is well studied for the case of second-order tensor imaging (Gaussian diffusion). However, it has not been investigated for the wide range of non-Gaussian diffusion models. The optimal GES is the one that minimizes the variance of the estimated parameters. Such a GES can be realized by minimizing the condition number of the design matrix (-optimal design). In this paper, we propose a new approach to solve the -optimal GES design problem for fourth-order tensor-based diffusion profile imaging. The problem is a nonconvex experiment design problem. Using convex relaxation, we reformulate it as a tractable semidefinite programming problem. Solving this problem leads to several theoretical properties of -optimal design: (i) the odd moments of the -optimal design must be zero; (ii) the even moments of the -optimal design are proportional to the total number of measurements; (iii) the -optimal design is not unique, in general; and (iv) the proposed method can be used to compute the -optimal design for an arbitrary number of measurements. Our Monte Carlo simulations support the theoretical results and show that, in comparison with existing designs, the -optimal design leads to the minimum signal deviation.
Diffusion-weighted MRI is a noninvasive imaging technique to probe microstructures in living tissues, for example, the human brain. It involves acquiring a series of diffusion-weighted images, each corresponding to diffusion sensitization along a particular gradient direction. Non-Gaussian diffusion models have gained wide attention among researchers because of their potential ability to resolve complex multifiber microstructures. Özarslan and Mareci  introduced high order tensors (HOTs) as an alternative to conventional second-order tensor model. In regions with complex microstructures, HOTs can model the apparent diffusion coefficient (ADC) with higher accuracy than the conventional second-order model . Several aspects of HOT-based ADC profile estimation have been addressed in the literature [3–5]. HOTs have also been used to represent orientation distribution functions that are required for tractography [6, 7].
The need for robust estimation of diffusion parameters in a limited acquisition time has given rise to many studies on optimal gradient encoding scheme (GES) design. In the case of the classical second-order model they include [8–14]. However, there are few studies tackling the problem of optimal GES design for non-Gaussian diffusion models [15, 16]. The only study on GES design for HOTs  is limited to comparison of existing GESs mainly devised for second-order tensor imaging, for example, the minimum condition number (MCN) scheme . A caveat here is that the condition number is computed from the design matrix associated with the linear least square estimation of parameters of interest. Thus, by definition, it is model-dependent. This implies that the minimum condition number GES for second-order tensor estimation is not an optimal GES for fourth-order tensor estimation. An experiment design that minimizes the condition number of the design matrix is called -optimal design. In this paper we solve the problem of -optimal GES design for HOT-based ADC profile imaging as follows. First, we reformulate it as a nonconvex experiment design problem. Then, by convex relaxation we obtain a tractable semidefinite programming (SDP) problem. The last step is to extract design points (the gradient encoding directions) from the optimal design matrix. Finally, to show the relevance of the proposed design approach, we evaluate our solutions using the rotational variance test and Monte Carlo simulations. Throughout the paper “experiment design” and “gradient encoding scheme (GES)” are used interchangeably. The former is used in the optimization context while the latter is used in the diffusion MRI (dMRI) community.
2. Problem Statement
This section briefly reviews the basics of HOT-based ADC profile estimation both for the sake of completeness and to define notation. The reader is referred to [4, 5] for more details. For definitions of symmetry, positive semidefiniteness and eigendecomposition of high order tensors () see [5, 17]. The Stejskal-Tanner equation for dMRI signal attenuation is where is the diffusivity function, is the measured signal when the diffusion sensitizing gradient is applied in the direction , is the observed signal in the absence of such a gradient, and is the diffusion weighting taken to be constant over all measurements. The diffusivity function is modeled using even order symmetric positive semidefinite tensors as follows:where contains distinct entries of the th-order tensor. Here we focus on the case of , where . It is worth mentioning that both vectors and are vectors in and is used for simplification. Given measurements in different directions , the least squares estimator (LSE) of the HOT is obtained as follows:where is an design matrix defined as and . The closed-form solution is .
In the framework described above, the precision of the estimation problem is dependent on the experiment designs , . For independent and zero mean measurement noise with constant variance the LSE is unbiased and has the following covariance matrix :where and is usually called the “information matrix.” Optimal experiment design entails making the covariance matrix small in some sense. It is usual to minimize a scalar function of the covariance matrix. One design approach is to minimize the condition number of the information matrix (-optimal design) [11, 12, 19]. In this paper, we solve the -optimal experiment design problem for HOT-based ADC profile imaging.
Remark 1. For isotropic diffusion, it has been shown that (4) holds [9, 20]. We investigate the significance of the noise assumptions in the case of anisotropic diffusion, later in Section 4. Therein we present Monte Carlo simulations for a more realistic case (with anisotropic tensor and Rician distributed noise on ).
3. Proposed GES Design Approach
In Section 3.1 we present mathematical formulations for the -optimal GES design problem. The solutions are given in Section 3.2. Section 3.3 then considers the problem of extracting the design points from the optimal information matrix. Finally, the properties of the obtained solutions and some theoretical results are discussed in the last subsection.
3.1. Mathematical Formulations of the -Optimal Design Problem
The condition number measures the sensitivity of the solution to changes in measurements . Hence, it is desirable to minimize the condition number of (denoted by ) or equivalently to minimize . The -optimal design in HOT-based ADC profile imaging can be performed with respect to either the design matrix or information matrix because where and are the maximum and minimum eigenvalues of , respectively. The -optimal experiment design problem in HOT estimation can be written as follows:This problem, in its current form, is not convex. Our aim here is to reformulate this problem as an SDP problem that can be efficiently solved. Before describing the approach, it is worth mentioning that conventional experiment design problems (as in ) seek to minimize the objective function over a finite and thus countable set , that is, . In the present case, however, is not a countable set but includes the whole set of feasible solutions. Note that the degree of freedom in this design problem is 45. In other words, can be parameterized in 45 independent variables. For example, , , . To reformulate the problem, we first parameterize in 45 distinct variables assuch that we obtain the affine mapping (its range is the set of symmetric positive semidefinite matrices of size fifteen). This can equivalently be expressed aswhere is a symmetric matrix and . To clarify how is decomposed into s, consider the following example:where is the element of placed in the th row and th column. Carefully note the relationship between and the original design variables (s) because this is used in Section 3.3. For example, and .
It is possible to relax the constraints , , and solve the problem by the algorithm given in  to obtain a lower bound on the optimal value of problem (6). However, we instead convert the constraints in (6) to a convex constraint as follows:where has only fifteen nonzero elements. We then have the following relaxed problem:Given that the conversion in (10) is not reversible, the optimal value of the problem in (11) is a lower bound on the optimal value of the problem in (6). The objective function is a quasiconvex function . Thus, an approximate solution of (11) may be obtained by solving a sequence of feasibility problems [21, 22]. Alternatively, this problem can be formulated as an SDP problem:where is the identity matrix, is the condition number, and equals . This is a bilinear matrix inequality problem that can be solved by the line search method. For a constant , it becomes a tractable linear matrix inequality (LMI) problem. The optimal value of (12) can be obtained by performing a line search on . Let the optimal value of the following problem be , where is a real nonnegative constant:Then we have . The problem in (13) can be efficiently solved by LMI solvers.
3.2. Solutions to the -Optimal Design Problem
The -optimal design problem in (13) can be solved for different values of , using the YALMIP  and SDPT3 solvers . By close inspection of the results for different values of , one can conclude the following about -optimal solutions:(i)If is a solution to (13) with , then is a solution to (13) with for any real positive . Thus, the optimal solution () is proportional to .(ii)The minimum condition number is independent of and is given by .(iii)The -optimal solution is(iv)And (where ).
3.3. Extracting Design Points
The task of extracting the design points (, ) from the optimal information matrix is straight forward, as outlined in . By expressing the optimal in terms of the original decision variables, one obtains 45 equations as listed in (15) and (16). Furthermore, equations of the form can be added to guarantee that the resulting solutions belong to the feasible set of the original problem. Thus, one obtains a nonlinear system of equations in unknowns. Given that is required, the system is usually underdetermined. By numerically solving the nonlinear system, one can extract the design points. The odd moments of the optimal design must be zero (). We refer to this fact as symmetry of the optimal design. This property means that the following holds true:The even moments of the optimal design must satisfy the following conditions ():As an example, Table 1 lists the -optimal design points for derived using our proposed method. We solved the above-mentioned nonlinear system of equations using the fsolve command in MATLAB. For a discussion on the uniqueness of this solution, see the next subsection where we explain some properties of the -optimal design.
3.4. Properties and Theoretical Results
(1) Global Optimality. In summary, our approach to find the -optimal experiment design involves the following steps. We begin with the original formulation of the -optimal experiment design problem as in (6). Next we apply the relaxation in (10). Finally we solve the relaxed version of the problem as stated in (13). Any point in the feasible set of the original minimization problem gives an upper bound (UB) on the optimal value of its objective function. The optimal point in the feasible set of the relaxed problem gives a lower bound (LB) on the optimal value of the original objective function. Thus, if an optimal solution of the relaxed problem belongs to the feasible set of the original problem, which implies that UB = LB, then it is a globally optimal solution of the original problem. By construction, this is the case for the proposed solutions in previous subsection.
(2) Relation with Number of Measurements. As mentioned above, elements of the optimal information matrix are proportional to the number of available measurements . However, the optimal value of the objective function (condition number) is constant.
(3) Symmetry. The presented -optimal design is symmetric in the sense specified in (15).
(4) Nonuniqueness. The -optimal design is not unique. Let be the -optimal information matrix when a total of measurements is permitted. Let be the set of corresponding design points. If , then one can easily verify that will result in the same information matrix as . This is because of the linear dependency of the elements of the optimal on , so that . As an example, for one can find the following optimal designs: , , , and even four repetitions of .
(5) Consistency with Previous Studies. The -optimal design problem for second-order DTI has been studied in . Therein, the solution is approximated using the downhill simplex method (a stochastic optimization method). Using the proposed approach for second-order DTI (set , , and repeat the whole process in Section 3) one can see that, for an arbitrary , the optimal condition number of the design matrix is . The -optimal GES for is listed in Table 2. All these findings are in agreement with the results in .
Remark 2. The set of second-order tensors can be seen as a subset of fourth-order tensors. As an example, the equality implies that the second-order tensor can be represented by a fourth-order tensor
, where denotes the elements of . In such cases, the number of free parameters of the fourth-order tensor is reduced to six, and thus it can be estimated using the -optimal designs for second-order DTI. See Table 2 for an example with .
4. Evaluations and Results
In this section we evaluate the proposed -optimal GES in comparison to several existing methods. The evaluation framework is adopted from . More specifically we consider two quality measures: condition number and signal deviation.
4.1. Condition Number
Table 3 shows that the proposed -optimal GES has the minimum condition number. References for the competing GESs are also provided in this table.
4.2. Signal Deviation
Signal deviation is defined in  to measure the rotational variance of a GES. As the diffusion tensor is reoriented, the accuracy and precision of the estimated parameters may vary. Knowledge of the rotational variance is thus very important in the dMRI community. For details see chapter 15 in . Signal deviation is defined as  where is the measured signal and is the signal produced by the estimated tensor . To evaluate the rotational variance of a GES we select 343 rotation matrices. These matrices are obtained by taking equally spaced steps in each of , , and and computing . To rotate fourth-order tensors, we use the approach in . We evaluated the signal deviation using Algorithm 1 given in the Appendix, where we used the following setup for the Monte Carlo simulations: , s/(mm2), (number of Monte Carlo trials), , (number of rotations), and , . All tensors used in the evaluation are listed in Table 4 and are plotted in Figure 1. The software in  is used to plot fourth-order tensors. As it can be seen in Figure 1, three tensors (a)–(c) correspond to single-fiber microstructures while six tensors (d)–(i) represent two crossing fibers (with different crossing angle and weights of the lobes) and the tensor in (j) shows three perpendicular fibers. Crossing angles below 60 degrees are not considered as it is known that fourth-order tensors cannot resolve such fiber architectures . In Figure 2, the average signal deviation over Monte Carlo trials () is plotted as a function of tensor orientation for the top two GESs (based on the condition number). It shows that, for , signal deviation of the -optimal GES is consistently less than that of the DISCOBALL scheme. The mean value and standard deviation of the (over rotations) for all evaluated GESs/tensors are given in Table 5. It can be seen that, in all cases, -optimal GES has the minimum mean value of signal deviation (corresponding numbers are denoted by bold font). Considering , , , and the standard deviation of is almost the same for all GESs. For to , the -optimal GES has the maximum . However, even in these cases mean signal deviation of the -optimal GES is far better than that of others. Thus, the proposed -optimal GES is the most favorable choice in all cases.
The distribution of gradient encoding directions over the unit sphere for the evaluated GESs is plotted in Figure 3. The DISCOBALL and Jones schemes produce an approximately uniform (equidistance) distribution of points while the -optimal, MCN, and Wong schemes produce a nonuniform distribution of points. Regarding the effect of uniform distribution of points, an interesting observation is that uniformly distributed GESs (the Jones and DISCOBALL) have minimum except for and (see Table 5). However, they do not lead to an overall better performance (as the -optimal GES performs far better in terms of mean ).
The proposed approach can be applied in experiment design for other tensors, although the current work focuses on its application and results on fourth-order tensor estimation. In Section 2, the order of the diffusion tensor, , can be any even natural number (to ensure antipodal symmetry). The extension to higher order tensors is possible using the same strategy (the dimension of the the information matrix in (13) will increase to ).
In Section 2, we assumed that the noise (on ) is zero mean and independent with constant variance. We acknowledge that, in the dMRI context, these noise assumptions may not hold, in general. However, our Monte Carlo simulations in Section 4 show that, for realistic cases (Rician noise on and anisotropic tensors), the proposed GES yields the minimum signal deviation and the minimum rotational variance. Moreover, minimizing the condition number of the information matrix is motivated in [11, 12] regardless of the noise distribution. The condition number describes, without any assumptions about the noise distribution, how the noise in measurements propagates to the noise in diffusion tensor .
As we mentioned above the -optimal design is not unique. This raises several questions including (i) why should we favor 60-point -optimal design over four repetitions of 15-point -optimal designs? and (ii) why should we favor a 60-point -optimal design over a union of 40- and 20-point designs? To answer these types of questions further extensive studies and experiments with real data (or Monte Carlo simulations) are required (as in [30–32]) which is beyond the scope of this paper.
We showed that the -optimal GES design for HOT estimation can be formulated as a nonconvex experiment design problem. Next, we solved the problem using convex relaxation and semidefinite programming. We also showed that resulting solutions have the following properties: (i) the proposed solution is the globally optimal solution; (ii) the obtained solutions are not unique, in general; (iii) nonzero entries of the optimal information matrix are proportional to the total number of measurements; (iv) odd moments of the -optimal design must be zero; and (v) union of -optimal solutions with and measurements leads to the -optimal design for measurements. Another advantage of this work is that it establishes a theoretical foundation for the experiment design in even order diffusion tensor imaging.
Pseudocode for the Algorithm Used in Section 4.2
See Algorithm 1.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
A. Ghosh and R. Deriche, “4th order symmetric tensors and positive ADC modelling,” in Visualization and Processing of Tensors and Higher Order Descriptors for Multi-Valued Data (Dagstuhl Seminar 11501), A. Vilanova, C.-F. Westin, and B. Burgeth, Eds., Springer, 2013.View at: Google Scholar
M. Alipoor, I. Y. H. Gu, A. J. H. Mehnert, Y. Lilja, and D. Nilsson, “On high order tensor-based diffusivity profile estimation,” in Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC '13), pp. 93–96, July 2013.View at: Publisher Site | Google Scholar
A. Barmpoutis and B. C. Vemuri, “Unified framework for estimating diffusion tensors of any order with symmetric positive-definite constraints,” in Proceedings of the 7th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI '10), pp. 1385–1388, April 2010.View at: Publisher Site | Google Scholar
A. Barmpoutis, Y. T. Weldeselassie, and S. Atkins, “Symmetric positive-definite Cartesian tensor orientation distribution functions (CT-ODF),” in Medical Image Computing and Computer-Assisted Intervention—MICCAI 2010, vol. 6361 of Lecture Notes in Computer Science, pp. 582–589, Springer, Berlin, Germany, 2010.View at: Publisher Site | Google Scholar
F. J. Chen, Progress in Brain Mapping Research, Nova Science Publishers, Hauppauge, NY, USA, 2006.
P. A. Narayana and K. M. Hasan, “DTI parameter optimization at 3.0 T: potential application in entire normal human brain mapping and multiple sclerosis research,” MedicaMundi, vol. 49, no. 1, pp. 30–45, 2005.View at: Google Scholar
D. K. Jones, Diffusion MRI Theory, Methods, and Applications, Oxford University Press, 2011.
J. Löfberg, “YALMIP: a toolbox for modeling and optimization in MATLAB,” in Proceedings of the IEEE International Symposium on Computer Aided Control System Design (CACSD '04), pp. 284–289, Taipei, Taiwan, September 2004.View at: Google Scholar
K. C. Toh, R. H. Tütüncü, and M. J. Todd, “On the implementation and usage of SDPT3—a Matlab software package for semidefinite-quadratic-linear programming, version 4.0,” 2006.View at: Google Scholar
T. Stöcker, R. Stirnberg, and N. J. Shah, “A new and versatile gradient encoding scheme for DTI: a direct comparison with the Jones scheme,” Proceedings of the International Society for Magnetic Resonance in Medicine, vol. 17, p. 3574, 2009.View at: Google Scholar
H. Ni, V. Kavcic, T. Zhu, S. Ekholm, and J. Zhong, “Effects of number of diffusion gradient directions on derived diffusion tensor imaging indices in human brain,” The American Journal of Neuroradiology, vol. 27, no. 8, pp. 1776–1781, 2006.View at: Google Scholar
A. Yamamoto, Y. Miki, S. Urayama et al., “Diffusion tensor fiber tractography of the optic radiation: analysis with 6-, 12-, 40-, and 81-directional motion-probing gradients, a preliminary study,” American Journal of Neuroradiology, vol. 28, no. 1, pp. 92–96, 2007.View at: Google Scholar
E. Widjaja, S. Z. Mahmoodabadi, D. Rea, R. Moineddin, L. Vidarsson, and D. Nilsson, “Effects of gradient encoding and number of signal averages on fractional anisotropy and fiber density index in vivo at 1.5 tesla,” Acta Radiologica, vol. 50, no. 1, pp. 106–113, 2009.View at: Publisher Site | Google Scholar