- About this Journal ·
- Abstracting and Indexing ·
- Aims and Scope ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
International Journal of Biomedical Imaging
Volume 2011 (2011), Article ID 658930, 17 pages
Sparse Representation of Deformable 3D Organs with Spherical Harmonics and Structured Dictionary
1Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712, USA
2Department of Urologic Surgery, University of Minnesota, Minneapolis, MN 55455, USA
Received 3 March 2011; Revised 28 June 2011; Accepted 29 June 2011
Academic Editor: Kenji Suzuki
Copyright © 2011 Dan Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
This paper proposed a novel algorithm to sparsely represent a deformable surface (SRDS) with low dimensionality based on spherical harmonic decomposition (SHD) and orthogonal subspace pursuit (OSP). The key idea in SRDS method is to identify the subspaces from a training data set in the transformed spherical harmonic domain and then cluster each deformation into the best-fit subspace for fast and accurate representation. This algorithm is also generalized into applications of organs with both interior and exterior surfaces. To test the feasibility, we first use the computer models to demonstrate that the proposed approach matches the accuracy of complex mathematical modeling techniques and then both ex vivo and in vivo experiments are conducted using 3D magnetic resonance imaging (MRI) scans for verification in practical settings. All results demonstrated that the proposed algorithm features sparse representation of deformable surfaces with low dimensionality and high accuracy. Specifically, the precision evaluated as maximum error distance between the reconstructed surface and the MRI ground truth is better than 3 mm in real MRI experiments.
Organ deformation during operations has imposed substantial challenges for performing precise diagnosis and surgery in minimum invasive surgery (MIS), such as natural orifice transluminal endoscopic (NOTES) . The deformation markedly decreases the precision of the prior surgical plan that is based on the preoperative images (e.g. computed tomography (CT) or magnetic resonance imaging (MRI)), so it must be effectively compensated to lower surgical risks. However, this is not a trivial task due to the high degree of freedom of the 3D nonrigid deformation and limited field of view  for observation during MIS. To recovery, the 3D deformation with high resolution in real time, a critical issue is to seek an efficient representation of deformable surface, according to which the sampling and surface recovery strategy can be designed for updating the 3D visualization. This paper focuses on the topic of block sparse representation of deformable surfaces. The later topic of real time tracking an deformable organ with limited access to the organ is explored further in [2–4].
Various techniques have been proposed for surface description, and each has its own advantage and disadvantage according to the application requirements. Broadly speaking, there are two major categories of surface representation methods: local feature-based models and global or parametric models. The work in  based on geometric partial differential equations (PDE) belongs to the former category which derives Euler-Lagrange equation and then a geometric evolution equation (or geometric flow) to describe the surfaces. Similarly, the method in  treats the whole surface as a union of localized patches. Global surface representation methods [7–10], particularly, decomposing surfaces into other primitive shapes, are more appropriate for shape analysis and classification due to the lower dimensionality of the parameter space. This paper falls into the category of parametric global surface description.
Parametric surface representation describes a surface in a single functional form, such that the surface is fully characterized by a set of parameters in a particular domain. Surface harmonics such as spheroidal harmonics, cylindrical harmonics, and spherical harmonics (SH)  are widely used as building blocks for global surface description. Each harmonic does not bear localized features but contributes to the entire shape description. Among those different types of harmonics, a well-known approach is the spherical harmonic decomposition (SHD), which has advantages of smoothness and high accuracy [7, 11]. With proper parameterization [11, 12], any genus-0 surface (The genus of a connected surface is an integer representing the maximum number of cuttings along nonintersecting closed simple curves without rendering the resultant manifold disconnected. It is equal to the number of handles on it, so a sphere is genus 0 and a torus is genus 1.) can be analyzed in the harmonic domain with reduced data dimensions. SHD has been widely used in applications related to surface description, including static modeling of kidney  and brain [9, 10], as well as spatial-temporal modeling of left-ventricular with known motion period. In , hemisphere is also applied to open surfaces.
Sparse signal representation has steadily gained attention over the years in the signal processing community. The aim is to find a representation which is sparse, or compact, such that most of the energy of a signal can be captured with only a few nonzero coefficients in a given dictionary. The first widely applied methods to seek sparse representation are greedy approaches, including matching pursuit (MP) , orthogonal matching pursuit (OMP) , and orthogonal least square (OLS) . Those methods iteratively first select the most correlated element from dictionary and then remove the contribution of that element with decorrelation, before finding the next atom. Iteration terminates when any stopping criteria is met. The second type is global optimization algorithms, such as Basis Pursuit (BP) , FOCUSS  and Iterative Thresholding . Global Optimization, in the approximate sense, relaxes the sparseness constrain, and its sparsity is a side-effect of the optimization. For example, the basis pursuit (BP) method approximates the -norm sparsity constraint with an -norm criteria, which effectively converts the problem into a convex optimization one, solved globally with linear programming. The orthogonal subspace pursuit (OSP) method  used in our paper belongs to the greedy category, which does not require prior knowledge of the dimension of the subspaces and combines the learned subspaces to produce a data-driven dictionary with good sparseness and generalizability.
Besides sparse decomposition algorithms as mentioned above, an equally important issue for sparse representation is how to select a dictionary for an application. The two main groups for dictionary design methods are structured dictionaries built out of common bases, and trained dictionaries that are inferred from the training data. For the common bases, it is well known that the wavelet transform can be used to generate sparse multiscale representations of images, the short-time Fourier transform (STFT) generates sparse time-frequency representation of speech signals, and the DCT is another transform that has been used for compression in audio coding algorithms due to its good compaction property. For dictionary learning, the applicable approaches include maximum likelihood estimation (MLE) , method of optimal directions (MOD) , maximum a-posteriori (MAP) , and so forth. Those methods attempt to generalize the type of considered signal with the basis identified from a representative training data set. The proposed approach is based on trained dictionary, since, to the best of our knowledge, there is no common basis in which random surfaces can be sparsely represented.
Although sparse representation has been widely applied in the fields of signal compression, image denoising, blind source separation, and compressed sensing, there is still very limited application in 3D surfaces . In fact, the main statistically motivated surface modeling methods are based on principle component analysis which is not sparse [25–29]. Those methods first compute the mean shape and then build the model by establishing legal variations learned from a set of training data for a given type of images, such as bone . With PCA, the major variations of the shape populations are described by the first few basis vectors, such that any surface of that shape population can be projected into an orthogonal subspace spanned by the retained vectors. More advanced techniques, such as multiresolution deformable model , are provided to improve the accuracy considering limited sample size.
Most of the previous surface modeling/representation works are designed for either static models [8–10], or particular deformable organs with known physical properties (such as motion cycle) . Further, computation bottleneck caused by large spherical harmonics basis hampered the applicability for real-time application. For PCA-based modeling methods, the resulting space that captures the variation in the population is either a super subspace including all training data or a truncated subspace with sacrificed generalization. In addition, PCA tends to be computational expensive when performing eigenvector decomposition as training data dimension increases, and it does not lead to any structure in the representation.
To bring the demonstrated merits of sparse coding to 3D surface representation, we propose a generally applicable algorithm of parametric sparse representation of deformable surfaces based on SHD and orthogonal subspace pursuit. The main contributions of this paper include the following. (i)Propose an algorithm of sparse representation of deformable surfaces.(ii)Generalize the representation approach for organs involving both interior and exterior surfaces.(iii)Present evaluation results conducted using computer models, ex vivo experiments based on 3D MRI scans of freshly excised porcine kidneys, and in vivo cardiac MRI scans of real patients.
This paper is organized as follows: in Section 2, we describe the proposed algorithm of sparse representation of deformable surface, denoted as SRDS thenceforth. Section 3 presents some experimental results using finite element model (FEM) data, ex vivo and in vivo experimental data. Finally, in Section 4, we finish with a few conclusions.
2. The SRDS Algorithm
The SRDS algorithm consists of three main steps to achieve sparse representations of deformable surfaces, as outlined in Figure 1. Initially, SHD is performed to depict the deformable surfaces in the training set in the harmonic domain. Then OSP is applied in the transformed domain to identify the subspaces in which the SH coefficient vectors of the deformations can be linearly represented. Finally, each deformation is clustered to the proper subspace and represented with the corresponding coefficient vector with block sparsity. This representation method is also extended to organ deformations occurred on both interior and exterior layers as described in Section 2.4. Furthermore, as a practical issue, pixel-wise surface alignment among all the 3D surfaces is also addressed in Section 2.5.
2.1. Step 1: SHD
Spherical harmonics are solutions to Laplace's equation expressed in the spherical coordinate system, defined as where is the polar angle within [0, ], is the azimuthal angel within [0, 2), is the harmonic degree within [0, ], and is the harmonic order varying in [, ]. is the associated Legendre function. After proper parameterization [11, 12], a 3D surface with finite energy can be expanded with SH series as Each harmonic coefficient is calculated using the inner product of the function and basis
Assume that harmonics up to level () are involved in the transformation. Let denote the matrix composed of all discretized harmonics, so has the following formation: Then a surface can be represented in the matrix format as where stands for a surface with samples and is the harmonic coefficient vector. Notice that this equation is not exactly equal but approximately. For simplicity, we still use equal sign with least square estimation in this paper. The linear problem in (5) can be solved with the least square (LS) constraints outputting
Perform SHD for each of the training deformed surfaces in , so the group of deformations can be described by matrix Consequently, the training set of deformations can be totally characterized by columns in as
2.2. Step 2: Subspace Identification with OSP
The aim of the second step is to explore the structures in those training deformations in the transformed harmonic domain and recognize the inherent subspaces in which the SHD coefficient vectors of the training deformations can be projected with high accuracy. The newly developed OSP algorithm  is adopted since it features better generalization and less computational cost compared to the gold standard K-SVD algorithm . OSP is an iterative process that terminates when one of the predefined criteria is met. In this paper, we specify the following two stopping criteria: (1) an error threshold for subspaces selection and (2) a maximum number of iterations for both controlling the subspace dimensions and avoiding deadlock searching. Further, the threshold for vector clustering is denoted as , that is, we declare that a vector lives in a subspace if it can be projected to that subspace with error ( distance) no larger than .
A. Subspace Pursuit
Initially, each vector of length in is normalized by norm. For convenience, we still use to denote SH coefficient matrix even after normalization. The algorithm first identifies a subspace from based on the stop criteria, then finds all the vectors in that can be represented in that subspace with error level below and remove those vectors from to prepare for the next subspace pursuit. The process can be generalized as follows, in which means that elements from are excluded from , and stands for inclusion.
(1)Initialization: , , .(2)Subspace searching and clustering (i); choose a vector from (e.g. first column of ) and let to remove . (ii)Find vectors from for representing with error no larger than within iterations and the vectors form . (iii)Perform SVD decomposition on : ; let contain the first vectors of ; update . (iv)Select vectors from that can be represented by with error no larger than , and then remove them from .(v)Repeat above steps until all the vectors are clustered.
B. Subspace Pruning
One disadvantage of the traditional OSP algorithm is the presence of “spurious” or redundant subspaces especially as the dimension of the training data set increases. Those subspaces identified in the earlier iterations actually can be better represented by the later identified subspaces. Therefore, a postprocessing step is used to identify and then discard the redundancy among the subspaces without decreasing the performance. This is implemented by repartitioning the training data among the initial subspaces and then eliminating subspaces in which very few or no vector is clustered. In some cases, where the subspace size is limited to some constraint, an optimization step can be applied in conjunction with the pruning step. The details of the subspace optimization design is described in .
C. Matrix Factorization
After identifying the inherent subspaces, the coefficient matrix of training set can be partitioned into two-part union of subspaces and corresponding coefficients via the following procedures.
Since each vector has been clustered into the belonging subspace during the subspace identification process, the corresponding coefficients for each can be obtained accordingly. Suppose that lives in subspace which is spanned by orthogonal basis, so its corresponding coefficients can be calculated using LS estimator Then can be characterized by in its subspace
If there are totally subspaces identified from , a structured dictionary constructed by concatenating all deformation subspaces is established as , with dimension . Since each vector lies in one of the subspaces, can also be represented in the structured dictionary with a block sparse vector , which is obtained via extending the coefficients by zero padding in positions corresponding to other subspaces in . Figure 2 provides an example of 3 subspaces to illustrate the sparsity of coefficient vector . If lies in subspace which are spanned by the 5th, 6th, and 7th columns in , then has nonzero values only at index of 5, 6, and 7. Consequently, can be factorized as where is the corresponding coefficient matrix with block sparsity.
2.3. Step 3: Structured Sparse Surface Representation
A. Sparse Representation of Training Surfaces
Integrating the subspace pursuing results in the harmonic domain in (11) with the initial SHD process in (8), the training deformations can be sparsely represented in the original spatial domain as where with size of is the desired structured dictionary in the spatial domain. Since , is inherently structured by subspaces of with of size .
Up to this point, with matrix that captures the deformation features in the considered population, each training deformation in can be fully characterized by a sparse coefficient as The sparsity of has already been illustrated in Figure 2.
B. Sparse Representation of Testing Surfaces
For the testing deformations beyond the training set, we utilized the fact that the dictionary identified from an extensive training data features good generalization such that any deformation of that particular population can be represented in the subspaces with high accuracy. This is justified because organs only deform in limited ways due to their mechanical properties, so the deformation variations can be fully learned from a training data set. This applied structure allows fast deformation representation in subspaces of low dimensionality.
The testing set is denoted as , where is the number of deformations to be represented. The straightforward strategy is to find a best-fit subspace for by projecting it to every subspace and choose the subspace with minimal projection error. Since the number of subspaces and the dimension of each subspace are both small with the postprocessing of subspace pruning, this best-fit strategy still results in low computational cost. However, when the number of subspaces is too large, an alternative threshold approach can be applied by finding a subspace in which can be represented with an error level smaller than . The former best-fit method is implemented in Section 3 for performance validation.
Suppose that is the chosen subspace, so coefficient vector can be estimated with LS as Then block sparse vector is obtained according to the rules described in Section 2.2. Further, the sparsity of or can be increased by trimming off nonzero elements with absolute value lower than a given threshold .
It is worth noting that, different from the traditional learning approaches relying on orthogonal least square (OLS)  or matching pursuit (MP)  algorithms which select atoms from the training set and recombine them for representing each surface in the testing set sparsely, our SRDS algorithm avoids this heavy overload caused by reshuffling all the atoms. Instead, we apply the block structure of the dictionary learned from a representative training data set. This essentially enables the representation of each deformable surface compactly and sparsely with high accuracy and low computational cost.
2.4. Extended Sparse Surface Representation
For an organ with both interior and exterior surfaces, such as bladder, deformations can take place on both layers. The above theory can be extended to achieve sparse surface representation for deformations occurred on both interior and exterior wall of the object.
Initially, spherical parameterization is conducted on interior and exterior parts separately. We denote and () as the corresponding interior and exterior of each training surface . Then each pair of (with vertices) and (with vertices) can be approximated by spherical harmonic basis as where of size and of size denote the spherical harmonic basis for inner and outer surfaces, respectively. is the highest degree of harmonics included. and are the corresponding harmonic coefficient vectors. Therefore, each deformation is represented by vector and all training frames can be characterized by as
The following procedures of subspace identification and sparse surface representation as described in Sections 2.2 and 2.3 can be applied straightforwardly. After identifying subspaces from SH coefficient matrix , each training deformation can be sparsely represented with block sparse coefficient vector as: where is the subspace with size of , and is the desired structured dictionary. Accordingly, is the block sparse vector, and is the nonzero coefficient values in the selected subspace.
2.5. Surface Correspondence
Similar to other surface modeling methods [25, 27, 28], the proposed approach requires point-wise correspondence across difference surfaces besides rigid registration . Specifically, this point-to-point alignment is established, such that an identical spherical parameterization can be applied in the SHD procedure. Figure 3 illustrates the goal of surface correspondence. Same colored vertices on deformation 1 and 2 indicate a matched pair. After established correspondence of the point pairs over the two deformations, vertices on deformation 2 can be numbered in the same order as deformation 1.
Different correspondence methods have been proposed, such as minimum description length , SH-coefficient alignment , and so forth. We applied the SH based method  in this paper as well as ray-casting method for simple surfaces. The former SH-based method is based on the underlying fact that two points with the same parameter pair when mapped to a sphere are considered to be a corresponding pair. Therefore, it fixes parameterization of the template and rotate the other to optimize the surface correspondence by minimizing the root mean squared distance of the two SH coefficient vectors. The latter ray-casting method is introduced in the following section.
2.5.1. Point Correspondence with Ray Casting
For surfaces, if unique intersection exists between a ray starting from its object center and the surface, a ray-casting method can be applied to obtain sample pairs across all the deformations. For illustration, Figures 4(a) and 4(b) depict two different cases of ray-surface intersection in the simplified 2D space. In Figure 4(a), there is only one intersected point (p1 for ray , and p2 for ray ) between each ray and the deformed surface S1. By contrast, Figure 4(b) gives an example when multiple intersections (p1, p1′, p1′′ for ray and p2 for ray ) are involved between rays and the surface S2.
If the condition of single ray-surface intersection applies, deformations can be resampled through the following steps to achieve point correspondence. (i)Construct an icosahedron of vertices with radius large enough to embrace the largest deformation volume among those under consideration; larger results in denser samples to maintain the local details but incurs more computational cost. (ii)Align the center of the 3D surfaces to the origin of icosahedron such that rays casting from the origin can intersect with the surface. (iii)For each ray segment originated from the center to a vertex on the icosahedron, find the triangle on the surface mesh that intersects with the segment and use that intersected point as new surface sample.
Figure 4(c) illustrates the desired sample pairs as and over two deformations S1 and S2. This resampling process also establishes a one-to-one map between a point on the object and a point on the sphere (icosahedron), which naturally meets the purpose of spherical parameterization. As a result, point-wise correspondence can be achieved across all the resampled surfaces, and a uniform spherical harmonic matrix can be applied. As an example, Figures 5(a) and 5(b) compare an original kidney surface with the corresponding resampled surface. We can see that ray-casting procedure well maintains the shape.
Three types of experiments are conducted to demonstrate the feasibility of the proposed SRDS algorithm. The computer-generated FEM data is first used to demonstrate that the SRDS approach matches the accuracy of complex mathematical modeling techniques, then an ex vivo experiment is conducted using 3D MRI scans of porcine kidneys for evaluation in practical settings, and finally in vivo experiment is carried over dynamic cardiac MRI scans for evaluation in real patients.
3.1. Experiment with FEM Data
Three representative organs are employed in this FEM experiment: 3D cortical mesh as an example of complicated shapes, gallbladder as an instance with geometrically simple shape, and bladder consisting of both interior and exterior walls.
3.1.1. Computer Model Setup
The initial 3D models of different organs are fed into a FEM-based surgical simulation tool to generate deformation data for testing. For instance, Figure 6 demonstrates two examples of shape distortions due to the endoscope poking and grasping one side of the gallbladder.
Table 1 lists the FEM experimental setup of the three organs including number of vertices , SH level , number of deformations for training , and testing . “GBL” stands for gallbladder in all the tables. The maximum SH level used for brain model is chosen according to , and the levels for gallbladder and bladder are determined when the SHD representation error is below 0.1% (EOF). The complex brain structure requires more vertices and higher SH level for surface representation to achieve sufficient accuracy. To evaluate the representation precision qualitatively, an evaluation parameter EOF is defined as the normalized Euclidean distance between the original surface and the reconstructed surface All surfaces are centered to the origin of the coordinate system so that EOF will not be heavily affected by the denominator.
With the FEM data, the proposed SRDS algorithm is evaluated from three perspectives: (1) subspace dimensionality, (2) sparsity and accuracy of representations, and (3) the effect of subspace pursuit threshold and coefficient truncation threshold on the performance. The sparsity is defined as the norm of the coefficient vector .
A. Training Results
During training stage, we set for subspace detection, for clustering, as the maximum iteration times, and for coefficient truncation. Subspaces on , , and axis are identified separately. Table 2 shows that the subspace number and dimensions of resulting dictionary () are markedly small relative to or in all three tests. We notice that the subspace dimensions of brain are relatively smaller than the other two. This is because of smaller training data size and minor extent of deformation considered in the brain experiment, which results in smaller dictionary size to capture the deformation features.
B. Sparsity and Accuracy Evaluation
Sparsity is examined in terms of (), where is the average norm of the coefficient vector (training) or (testing) and is the corresponding standard deviation. To verify that whether our method achieves equivalent sparsity and precision when applying the structure of the dictionary, we also test the case without relying on any structure learned from training set, during which sparse representation of each deformation in the testing set is repursued from the training set using OSP approach. In the following tables, we use “OSP” to refer to the results obtained using such a repursuing process. Table 3 summarizes the sparsity of the SRDS representation of three organs for both training and testing set. It illustrates that the number of atoms needed for representing the complex deformations is much smaller than the dimension of spherical harmonic vectors , and particularly the sparsity and accuracy via SRDS is very close to that from OSP repursuit for the testing deformations, which indicates the good generalization of the structured dictionary.
The reconstruction error in terms of EOF is further compared with that from standard SHD method, as shown in Figure 7. In general, the accuracy of SRDS is equivalent to that of SHD method. Specifically, it shows that the SRDS method achieves average EOF of 1.32% (brain) and 0.14% (gallbladder) versus 1.29% (brain) and 0.13% (gallbladder) with SHD method. For bladder model with deformations on multiple layers, the overall representation error with SRDS is 0.07%, very close to 0.06% with SHD. Figures 11 and 12 show typical reconstructed deformations of the testing data for the three organs with SHD and SRDS methods. Figures 12(e) and 12(f) demonstrate the interior and exterior representation of the bladder at a same time instance. From those results, we can see that the SRDS algorithm achieves the accuracy equivalent to complex mathematical modeling techniques while significantly lowers the representation dimensionality.
C. Effect of
The performance of SRDS algorithm is examined as the subspace pursuit threshold varies. Specifically, we study the effect of on the dimensionality () of the structured dictionary , sparsity and accuracy of the surface representation. Figure 8 shows how the subspace dimensions on three axis change during the training stage as increases from 0.001 to 0.01. Figure 9 displays the influence of on the average sparsity of the surface representation results for both training and testing data sets. In general, smaller leads to larger subspace size and less description sparsity, since lower usually leads to more recruited atoms to meet the desired representation accuracy. Therefore, there is a tradeoff between representation accuracy and desired sparsity. Figure 10 reveals the representation EOF as a function of . Not surprisingly, the reconstruction error is increased as becomes larger. An empirical point can be chosen according to the training curve when space dimension expands significantly but only trivial EOF improvement is gained, that is, is a preferred value in this test according to Figure 10.
D. Effect of
The influence of coefficient truncation threshold on the performance of SRDS algorithm is also tested while is varied among . Figure 13 shows the effect of on the average sparsity of the surface representation results. We can see that, as the truncation threshold enlarges, the sparsity of the representation is increased for both training and testing data sets at the price of decreased representation error as shown in Figure 14, so there is tradeoff between sparsity and accuracy. Empirically, one can choose the value when the representation precision remarkably deteriorates while the sparsity is still increasing. Therefore, according to Figures 13 and 14, an appropriate value for is between 0.005 and 0.01.
3.2. Ex vivo Experiment Using MRI
To evaluate the proposed algorithm in real applications, an ex vivo experiment using three porcine kidneys were conducted at the Center for Interdisciplinary Applications in Magnetic Resonance (CIA-MR) of University of Minnesota. Deformations imposed to each kidney were controlled and maintained still during imaging by a customized nonmagnetic mechanical device as shown in Figure 15. Each deformed kidney shape was scanned in 3D MRI mode with spatial resolution of 1.2 mm to generate both training set and testing set. The SH degree of the organ representation is set to be 20, and each 3D kidney mesh after surface correspondence has vertices. Different from computer-generated deformations where surface correspondence is intrinsically established, the shapes from MRI scans are rendered independently, so the method described in Section 2.5 is applied to achieve point-wise correspondence.
Both intramodel and intermodel experiments are conducted. The former uses training and testing deformations from the same kidney; the later utilizes two out of the three kidneys for training and the third one for testing in a cross-evaluation fashion. Besides sparsity and EOF, Hausdorff distance between the represented shape and corresponding MRI surface is also examined as a physical measurement of error. The Hausdorff distance between surface and is defined as where is defined as the distance between a point on surface and the closest point on surface , that is, with denoting the Euclidean norm.
3.2.1. Intramodel Test
In the intramodel experiment, 31 deformations of the same kidney were generated and scanned by the MRI machine, among which 20 frames were randomly selected as training set, and the other 11 were applied for testing the generalization of the learned subspaces.
Table 4 shows the trained subspace dimensions ( as number of subspace, as dictionary size of ), the sparsity of the descriptors in each axis for both training set and testing set, and the corresponding errors in terms of EOF and Hausdorff distance. Similar to the FEM experiment, the sparsity is also evaluated with OSP repursuit process in the testing set for comparison. The table shows that the sparsity and the accuracy achieved with SRDS is very close to that from OSP repursuing process. However, the SRDS method features delay-free surface representation by applying the structure in the identified dictionary. Further results about computational efficiency are shown in Section 3.4. One may notice that the size of training data in the MRI experiment is smaller than that in FEM test due to the less availability of 3D MRI images. As a rule of thumb, larger training set carries richer deformation information and thus leads to better generalization of the dictionary. However, given the size of training data and extent of deformation involved in the ex vivo experiment, high representation precision is still achieved.
Figure 16 illustrates the accuracy of the surface representation in the intramodel test. The average EOF in Figure 16(a) for training set is 0.24% and 0.64% for testing set, with maximum rate less than 1%. Further, error as Hausdorff distance (shown in Figure 16(b)) is mm with 95th percentile error of 0.86 mm for the training set, and mm with 95th percentile error of 0.96 mm for the testing set. This intramodel experiment demonstrated that the SRDS algorithm identifies subspaces generalizable enough to accurately represent deformations beyond the training set for the same object.
Figure 17 visualizes the color-coded error distribution at all vertices on the represented surface with SRDS relative to the actual MRI scans. Figure 17(a) illustrates the error range for different colors. Figures 17(b) and 17(d) show the error distribution for a typical reconstruction in the training and testing set, respectively. Figures 17(c) and 17(e) show maximum 90% level reconstruction errors, that is, 90% of all deformations in the training or testing set have representation point errors less than the values shown in the figures. Consistent with the EOF and Hausdorff distance results, the color diffusion in Figure 17 indicates that the precision in the testing group is relatively lower than that in the training group. However, among all the pixel-wise errors shown in Figure 17(e), less than 3% of all the surface points have error distance larger than 0.5 mm.
3.2.2. Intermodel Test
Three intermodel experiments are performed to further validate the proposed SRDS method applied to organs from different subjects. In the following context, “Ex1” stands for the experiment training on Kidney 2 and 3 plus one initial shape of Kidney 1 while testing on deformations of Kidney 1, and the like for “Ex2” and “Ex3”. In each experiment, both sparsity and accuracy are examined.
The number of subspaces () and dimensions () of the identified dictionary are listed in Table 5. The training results vary among the three experiments but all features low subspace dimensions. Table 6 shows the sparsity of the intermodel experiments using the SRDS algorithm, and the error level is evaluated in terms of EOF and Hausdorff distance. Each testing deformation is also sparsely retrained using OSP for comparison. We can see that the sparsity and representation error resulting from SRDS method is very close to that using OSP.
Figure 18 shows the representation accuracy using SRDS algorithm in training and testing sets for the three tests. In general, the error in testing set is larger than that in the training set. Particularly, as for EOF evaluation, “Ex1” leads to the largest EOF error relative to “Ex2” and “Ex3”, but the average error rate is still as low as 0.3% for training set and 2.0% for testing set. Table 7 lists the specific Hausdorff measurements corresponding to boxplots in Figures 18(c) and 18(d), including minimum, 95th percentile and mean. We can see that the 95th percentile Hausdorff distance across all experiments is below 3 mm, and the mean is belong 2.1 mm. Comparing those error levels with the intramodel test, one can see that the homology existing among the training and testing deformations contributes to better dictionary generalization and, thus, leads to higher representation accuracy.
Figures 19, 20 and 21 show the color-coded error fields of a typical representation and at the maximum 90% level for the three intermodel experiments. In either training set or testing set, it is observed that large errors are mostly distributed around the edge area where local details are rich. Consistent with the previous EOF and Hausdorff distance measurements, the color diffusion in Figures 19–21 indicates that errors in testing set is larger than that in training set and “Ex1” generates relatively larger error comparing to “Ex2” or “Ex3.”
3.3. In Vivo Experiment Using MRI
The proposed approach is also tested over the in vivo cardiac MR images , consisting of automatically segmented images from volumetric MRI scans of a diastole-systole-diastole cycle. For each patient, there are around 22 phases in a cardiac cycle. Surface correspondence of LV shapes within and across patients are accomplished using the approach described in Section 2.5. Since the generated surfaces from automatic segmentation software are quite rough, we use the spherical harmonic representation as a filter to smooth out those surface noises and then apply the smoothed surfaces as training and testing data. Therefore, the demonstrated error in this section is relative to the SHD surfaces, not to the original raw surfaces. The iter-patient results are reported as follows.
Similar to the ex vivo test, we use the segmented left ventricles (LV) of 2 different patients plus an initial LV surface for the third patient as training data, and the remaining LV shapes in a beating cycle of the third patient are used to test the generalization of the identified subspaces. The formulated tests are noted as “Ex1,” “Ex2,” and “Ex3.” Table 8 lists the sparsity test results of the three cross validations for both training and testing sets. We can see that the sparsity in the training set is close to that in the testing set, but the former achieves much higher accuracy. This is because that the identified subspaces generalize perfectly for those elected atoms among the training set after spherical harmonic smoothing. Consistent with the previous experiments, the representation of testing surfaces using SRDS is also compared with that using repursuing OSP. According to the results, SRDS achieves performance slightly worse than but close to that of OSP. However, as demonstrated in Section 3.4, without relying on the structured dictionary learned from the training population, OSP is a computational expensive task, since for each testing surface, it requires to research for atoms from the training set to achieve sparse representation.
Figure 22 provides boxplots for the representation accuracy of the testing set in terms of EOF and Hausdorff distance. Table 9 provides the minimum, 95th percentile, and mean Hausdorff measurements corresponding to Figure 22(b). In coincidence, “Ex1” leads to slightly larger errors than the other two tests, with average EOF of 3.2% (“Ex1”), and mean Hausdorff distance of mm. The 95th percentile Hausdorff distance across all experiments is below 2.2 mm. Figures 23(b) and 23(c) show the color-coded error field of a typical representation and at the maximum 90% level for the testing set in one interpatient experiment (c). As indicated by the color distribution, majority of the point errors are below 0.9 mm. Particularly, in the 90th percentile evaluation in Figure 23, only 3% of all the point-wise errors are above 0.9 mm.
To examine the efficiency of the proposed SRDS method quantitatively, the computational time to represent each surface in the testing set using SRDS method is compared with that resulting from OSP repursuing approach for the above five organs. The results are summarized in Table 10, including training set size , maximum SH level , average time (in seconds) required with SRDS () and OSP (), respectively, and the ratio between the two. As shown in Table 10, the time consumption for seeking sparse representation of the testing surfaces using the SRDS is at least 10 times lower than that using the original OSP method which does not rely on the dictionary structure learned from the training data set. The advantage is more pronounced when the training data size or the SH level is large. For example, in the brain model, the high SH level leads to substantial computational delay during the search for proper atoms for representation, such that the SRDS achieves a speed orders of magnitude faster than the OSP method without training. On the other hand, for the case of gallbladder, the large training size also increases the time used by repursuing OSP, so it runs 65 times slower than SRDS.
To summarize, considering the test results for sparsity, accuracy, and efficiency given in this experiment section, we can see that the proposed SRDS method achieves sparse surface representation with high computational efficiency and accuracy.
4. Conclusions and Discussion
This paper introduced a new algorithm for block sparse representation of deformable organ surfaces with high accuracy. The proposed SRDS design first identifies the deformation subspaces from the training data set in the transformed spherical harmonic domain and then represents each deformed surface with a block sparse vector in the structured dictionary. SRDS is generalized to applications involving organs with multiple surface layers, such as bladder. The algorithm has been validated with FEM data and real 3D MRI scans under both ex vivo and in vivo conditions. The FEM test results demonstrate that SRDS achieves accuracy matching that of complex mathematical modeling techniques. Further, the maximum representation error in ex vivo experiment is below 1 mm for intramodel test and below 3 mm for intermodel test. For the in vivo experiment, the SRDS achieves an accuracy of better than 2.5 mm.
SRDS algorithm has already been used in tracking organ deformations in minimum invasive surgery [2–4]. The structure introduced in the dictionary enables efficient surface recovery from limited samples. In addition, the merits of block sparse surface representation presented here can be applied to various medical organ modeling, shape classification, and similarity retrieval where reduced parameter dimension can potentially speed up the implementations.
- M. S. Wagh and C. C. Thompson, “Surgery insight: natural orifice transluminal endoscopic surgery—an analysis of work to date,” Nature Clinical Practice Gastroenterology & Hepatology, vol. 4, no. 7, pp. 386–392, 2007.
- D. Wang and A. H. Tewfik, “In vivo tracking of 3D organs using spherical harmonics and subspace clustering,” in Proceedings of the IEEE International Conference on Image Processing, (ICIP '09), pp. 817–820, November 2009.
- D. Wang, Y. C. Zhang, and A. H. Tewfik, “Real time tracking of exterior and interior organ surfaces using sparse sampling of the exterior surfaces,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP '10), pp. 1022–1025, March 2010.
- D. Wang and A. H. Tewfik, “Real time tracking of 3D organ surfaces using single MR image and limited optical viewing,” in Proceedings of the 7th IEEE International Symposium on Biomedical Imaging, (ISBI '10), pp. 620–623, April 2010.
- G. L. Xu and Q. Zhang, “A general framework for surface modeling using geometric partial differential equations,” Computer Aided Geometric Design, vol. 25, no. 3, pp. 181–202, 2008.
- G. Zeng, S. Paris, L. Quan, and F. Sillion, “Accurate and scalable surface representation and reconstruction from images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 1, pp. 141–158, 2007.
- A. Matheny and D. B. Goldgof, “The use of three- and fourdimensional surface harmonics for rigid and nonrigid shape recovery and representation,” IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. 17, no. 10, pp. 967–981, 1995.
- J. L. Dillenseger, H. Guillaume, and J. J. Patard, “Spherical harmonics based intrasubject 3-D kidney modeling/registration technique applied on partial information,” IEEE Transactions on Biomedical Engineering, vol. 53, no. 11, pp. 2185–2193, 2006.
- M. K. Chung, K. M. Dalton, L. Shen, A. C. Evans, and R. J. Davidson, “Weighted Fourier series representation and its application to quantifying the amount of gray matter,” IEEE Transactions on Medical Imaging, vol. 26, no. 4, pp. 566–581, 2007.
- M. K. Chung, K. M. Dalton, and R. J. Davidson, “Tensor-based cortical surface morphometry via weighted spherical harmonic representation,” IEEE Transactions on Medical Imaging, vol. 27, no. 8, Article ID 4449087, pp. 1143–1151, 2008.
- C. Brechbhler, G. Gerig, and O. Kbler, “Parametrization of closed surfaces for 3-D shape description,” Computer Vision and Image Understanding, vol. 61, no. 2, pp. 154–170, 1995.
- L. Shen and F. Makedon, “Spherical mapping for processing of 3D closed surfaces,” Image and Vision Computing, vol. 24, no. 7, pp. 743–761, 2006.
- H. Huang, L. Zhang, D. Samaras, et al., “Hemispherical harmonic surface description and applications to medical image analysis,” in Proceedings of the 3rd International Symposium on 3D Data Processing, Visualization, and Transmission, (DPVT '06), pp. 381–388, June 2006.
- P. J. Huber, “Projection pursuit,” The Annals of Statistics, vol. 13, no. 2, pp. 435–475, 1985.
- S. G. Mallat, G. Davis, and Z. Zhang, “Adaptive time-frequency decompositions,” Optical Engineering, vol. 33, no. 7, pp. 2183–2191, 1994.
- S. Chen, S. A. Billings, and W. Luo, “Orthogonal least squares methods and their application to non-linear system identification,” International Journal of Control, vol. 50, no. 5, pp. 1873–1896, 1989.
- S. S. Chen, D. L. Donoho, and M. A. Saunders, “Atomic decomposition by basis pursuit,” SIAM Journal on Scientific Computing, vol. 20, no. 1, pp. 33–61, 1998.
- I. F. Gorodnitsky and B. D. Rao, “Sparse signal reconstruction from limited data using FOCUSS: a re-weighted minimum norm algorithm,” IEEE Transactions on Signal Processing, vol. 45, no. 3, pp. 600–616, 1997.
- T. Blumensath and M. E. Davies, “Gradient pursuits,” IEEE Transactions on Signal Processing, vol. 56, no. 6, pp. 2370–2382, 2008.
- B. V. Gowreesunker and A. H. Tewfik, “Learning sparse representation using iterative subspace identification,” IEEE Transactions on Signal Processing, vol. 58, no. 6, Article ID 5419963, pp. 3055–3065, 2010.
- B. A. Olshausen and D. J. Field, “Sparse coding with an overcomplete basis set: a strategy employed by V1?” Vision Research, vol. 37, no. 23, pp. 3311–3325, 1997.
- K. Engan, S. O. Aase, and J. H. Husy, “Multi-frame compression: theory and design,” Signal Processing, vol. 80, no. 10, pp. 2121–2140, 2000.
- J. F. Murray and K. Kreutz-Delgado, “An improved FOCUSS-based learning algorithm for solving sparse linear inverse problems,” in Proceedings of the 35th Asilomar Conference on Signals, Systems and Computers, pp. 347–351, November 2001.
- M. Mahmoudi and G. Sapiro, Sparse Representations for Three-Dimensional Range Data Restoration, IMA, Minneapolis, Minn, USA, 2009.
- K. T. Rajamania, M. A. Stynerb, H. Taliba, G. Y. Zheng, L. P. Noltea, and M. A. Ballester, “Statistical deformable bone models for robust 3D surface extrapolation from sparse data,” Medical Image Analysis, vol. 11, no. 2, pp. 99–109, 2007.
- C. Basso and T. Vetter, “Statistically motivated 3D faces reconstruction,” in Proceedings of the 2nd International Conference on Reconstruction of Soft Facial Parts, Remagen, Germany, March 2005.
- J. Feng and H. S. Ip, “A multi-resolution statistical deformable model (MISTO) for soft-tissue organ reconstruction,” Journal of Pattern Recognition, vol. 42, no. 7, pp. 1543–1558, 2009.
- T. Albrecht, R. Knothe, and T. Vetter, “Modeling the remaining flexibility of partially fixed statistical shape models,” in Proceedings of the Workshop on the Mathematical Foundations of Computational Anatomy, New York, NY, USA, March 2008.
- G. Y. Zheng, S. Gollmerb, S. Schumanna, X. Dong, T. Feilkasb, and M. A. G. Ballester, “A 2D/3D correspondence building method for reconstruction of a patient-specific 3D bone surface model using point distribution models and calibrated X-ray images,” Medical Image Analysis, vol. 13, no. 6, pp. 883–899, 2009.
- H. Huang, R. Zhang, F. Makedon, L. Shen, and J. Pearlman, “A spatio-temporal modeling method for shape representation,” in Proceedings of the 3rd International Symposium on 3D Data Processing, Visualization, and Transmission, (DPVT '06), pp. 1034–1040, June 2006.
- M. Aharon, M. Elad, and A. Bruckstein, “K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006.
- S. Chen, C. F. Cowan, and P. M. Grant, “Orthogonal least squares learning algorithm for radial basis function networks,” IEEE Transactions on Neural Networks, vol. 2, no. 2, pp. 302–309, 1991.
- S. G. Mallat and Z. Zhang, “Matching pursuits with time-frequency dictionaries,” IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3397–3415, 1993.
- P. J. Besl and N. D. McKay, “A method for registration of 3-D shapes,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 14, no. 2, pp. 239–256, 1992.
- R. H. Davies, C. J. Twining, T. F. Cootes, J. C. Waterton, and C. J. Taylor, “A minimum description length approach to statistical shape modeling,” IEEE Transactions on Medical Imaging, vol. 21, no. 5, pp. 525–537, 2002.
- L. Shen, H. Farid, and M. A. McPeek, “Modeling three-dimensional morphological structures using spherical harmonics,” Evolution, vol. 63, no. 4, pp. 1003–1016, 2009.
- L. Najman, J. Cousty, M. Couprie, et al., “An open, clinically validated database of 3d+t cine-MR images of the left ventricle with associated manual and automated segmentations,” Insight Journal, special issue entitled ISC/NA-MIC Workshop on Open Science, 2007.