Segmentation of Hyperacute Cerebral Infarcts Based on Sparse Representation of Diffusion Weighted Imaging

Zhang, Xiaodong; Jing, Shasha; Gao, Peiyi; Xue, Jing; Su, Lu; Li, Weiping; Ren, Lijie; Hu, Qingmao

doi:https://doi.org/10.1155/2016/2581676

Computational and Mathematical Methods in Medicine

On this page

Abstract Introduction Discussion Conclusion Acknowledgments References Copyright Related Articles

Research Article Corrigendum

!

A Corrigendum for this article has been published. To view the article details, please click the ‘Corrigendum’ tab above.

Research Article | Open Access

Volume 2016 | Article ID 2581676 | https://doi.org/10.1155/2016/2581676

Segmentation of Hyperacute Cerebral Infarcts Based on Sparse Representation of Diffusion Weighted Imaging

Xiaodong Zhang,¹Shasha Jing,¹Peiyi Gao,²Jing Xue,²Lu Su,²Weiping Li,³Lijie Ren,³and Qingmao Hu¹

Academic Editor: Chuangyin Dang

Received27 Jan 2016

Revised02 Aug 2016

Accepted18 Aug 2016

Published22 Sept 2016

Abstract

Segmentation of infarcts at hyperacute stage is challenging as they exhibit substantial variability which may even be hard for experts to delineate manually. In this paper, a sparse representation based classification method is explored. For each patient, four volumetric data items including three volumes of diffusion weighted imaging and a computed asymmetry map are employed to extract patch features which are then fed to dictionary learning and classification based on sparse representation. Elastic net is adopted to replace the traditional -norm/-norm constraints on sparse representation to stabilize sparse code. To decrease computation cost and to reduce false positives, regions-of-interest are determined to confine candidate infarct voxels. The proposed method has been validated on 98 consecutive patients recruited within 6 hours from onset. It is shown that the proposed method could handle well infarcts with intensity variability and ill-defined edges to yield significantly higher Dice coefficient (0.755 ± 0.118) than the other two methods and their enhanced versions by confining their segmentations within the regions-of-interest (average Dice coefficient less than 0.610). The proposed method could provide a potential tool to quantify infarcts from diffusion weighted imaging at hyperacute stage with accuracy and speed to assist the decision making especially for thrombolytic therapy.

1. Introduction

Irreversible infarcts are critical for the assessment of potential risk and benefit pertaining to thrombolysis in hyperacute ischemic stroke [1]. Due to the high sensitivity and specificity of diffusion weighted (DW) imaging (which consists of T2-weighted image () to be denoted as B0, a diffusion weighted image (DWI) with the value being 1000–1500 s/mm², and the calculated apparent diffusion coefficient (ADC) map), it is considered the optimum clinical imaging modality for hyperacute ischemic stroke [2]. Previously, it was reported that DW imaging reversal was not rare [3], putting a question mark on determination of infarcts from DW imaging. Recently, it has been found that the DW imaging reversal is rare [1] and does not translate to permanent tissue salvage [4]. This new finding justifies the urgent need for accurate determination of infarcts from DW imaging.

Ischemic lesions are inhomogeneous in terms of ischemic injury and recovery potential [5] and could be classified into 4 categories on DWIs [6]: single lesion with well-defined edges, single lesion with ill-defined edges, multiple lesions with well-defined edges, and multiple lesions with ill-defined edges. It has been shown that lesions at hyperacute stage (i.e., within 6 hours from ictus) exhibit greatest variability in appearance (Figure 1) which makes expert delineation difficult and inconsistent [6]. The difficulties in identifying infarcts at hyperacute stage were also reflected in an automatic method to segment infarcts from DWIs, where the segmentation was substantially less accurate for patients imaged at the time of admission than for those imaged at 72 hours (mean Dice coefficient (DC) of 0.63 versus 0.81) [7].

(a)

(b)

(c)

There have been efforts on automatic segmentation of infarcts from DW imaging. Tsai and coworkers segmented infarcts from DWIs and ADC maps based on fuzzy C-means (FCM) clustering [8]: it used the most frequent normalized DWI intensity within the brain to remove nonrelevant voxels, classified the remaining voxels into 50 clusters, removed clusters and connected components whose average DWI intensity was not greater than + 0.2, eliminated false positive regions without apparent edges, and got rid of false positive regions due to magnetic inhomogeneity by imposing ADC constraints. It reported an average DC of 0.899 ± 0.065 for 22 ischemic patients with stroke onset within 10 days. Mujumdar et al. [9] segmented infarcts from DWIs through 3 values (, 1000, and 2000 s/mm²): multiple values were employed to impose local contrast constraints; the left candidates were passed to an active contour model to refine segmentation. It reported an average DC of 0.810 ± 0.120 for 41 ischemic stroke patients without specifying stroke onset time. Prakash et al. [7] segmented infarcts from DWIs by first identifying axial slices with ischemic lesions followed by binarization with a global DWI intensity threshold derived from histogram divergence: it was basically a global thresholding method based on the assumption of asymmetry induced by the ischemic lesion; it was tested on 57 datasets with 46 scanned at the time of admission without knowing the exact imaging time and 11 scanned at 72 hours from admission, to yield an average DC of 0.670 ± 0.220.

As images are naturally sparse and have redundant information, sparse representation has been widely used in image processing [10]. The first successful application of sparse representation to computer aided diagnosis was by Liu and coworkers [11]. Their method was extensively validated for both colorectal polyp and lung nodule detection and could achieve superior classification/segmentation performance to existing methods using support vector machine and its variants, boosting, logistic regression, relevance vector machine, or -nearest neighbors. The success of sparse representation based classification/segmentation owes to the fact that a high-dimensional image can be represented or coded by a few representative samples from the same class in a low-dimensional manifold and the recent progress of -norm and -norm minimization technique [12]. Sparse representation could be used as a classifier for voxel-wise classification. Zhang et al. [13] made use of sparse representation to segment cervigram images. Based on traditional constructive dictionary learning method, they proposed a discriminative dictionary learning method. The learned discriminative dictionaries were more suitable for classification and achieved better performance to segment tissues in optical images of the uterine cervix. A prostate segmentation framework based on sparse representation was proposed by Gao et al. [14]. Different from conventional dictionary learning method, discriminative dictionaries were learned through k-means after feature selection. New samples were classified according to reconstruction error computed with the learned discriminative dictionaries. It performed better to segment prostate in computed tomography (CT) images compared with other state-of-the-art methods.

Sparse representation has also been incorporated into atlas-based methods for image segmentation. Wang et al. [15] used sparse representation to build subject-specific atlas from a series of aligned atlases. The subject-specific atlas built based on the reconstruction coefficient was integrated into a level set framework for further accurate segmentation. Following the same principle, Wang et al. [16] built a patient-specific atlas from the patch-based sparse representations for tissue segmentation of cone-beam CT images. Then the built atlas was integrated into a maximum a posteriori probability-based convex segmentation framework for accurate segmentation.

There are other applications of sparse representation in the field of medical image processing. Fang et al. [17] proposed a sparse perfusion deconvolution method to estimate cerebral blood flow in CT perfusion at low radiation dose. The sparse dictionary was built from high-dose perfusion maps. Then the built dictionary was applied to low-dose data to perform deconvolution-based hemodynamic parameters estimation.

On DWIs, infarcts appear as hyperintense and inhomogeneous in the form of intensity variation, with complex shapes and ambiguous boundaries, which makes manual segmentation difficult, time consuming, and rater dependent. As time is critical for hyperacute ischemic stroke patients, especially for those who are potential candidates to have thrombolytic therapy, it is highly desirable to quantify their infarcts with accuracy and speed. To the best of our knowledge, on the one hand, segmenting infarcts based on sparse representation has not been reported; on the other hand, efforts on segmenting hyperacute infarcts with great variability are scarce. These two issues are to be investigated in this study.

2. Sparse Representation Based Classification

2.1. Sparse Representation

Sparse representation is a powerful tool for acquiring, representing, and compressing signals. Given a dictionary, it selects only a few elements in the dictionary under certain constraints to reconstruct best the signal through linear combination of the selected elements. Suppose a dictionary with elements of dimensions; sparse representation of a signal is formulated as follows:where is the sparse code or sparse vector, is -norm, and is the sparsity which constrains the number of nonzero elements in the sparse vector Formula (1) is optimized to find the optimal sparse code that leads to lowest reconstruction error with a fixed sparsity .

The dictionary is generally attained through a dictionary learning process. Given a set of training signals , , the dictionary is constructed such that the following conditions are met:It aims at finding the optimal dictionary that best reconstructs input training signals under -norm constraint on sparse code of signal . The objective function can be optimized by several algorithms such as K-SVD [18] or MOD [19].

As the objective functions defined in formulae (1) and (2) are nonconvex and nonsmooth, finding the solution is NP-hard. -norm could be used as a convex relaxation to replace -norm:where is a parameter that controls the sparsity of sparse vector and is -norm. It can be treated as the LASSO problem and solved effectively by LARS [20].

2.2. Sparse Representation Based Classification

Sparse representation based classification (SRC) has been employed in the pattern recognition field and achieved state-of-the-art results in areas like human face recognition [21]. SRC consists of two stages. At first, given training samples and labelsSubdictionaries are learned with corresponding samples in every class by formula (2) or (4), with being the size of dictionary .

In the second stage, when a new sample is given to be classified, the global dictionary is used to sparsely represent in a competitive manner to select basis elements. The reconstruction error of every class iswhere is a characteristic function that selects the coefficients associated with the th class (). Then the new sample is classified into the class with the lowest reconstruction error.

The SRC framework could be employed for medical image segmentation. However, different from face recognition, medical images are inherently three-dimensional (3D) to be computationally demanding. In addition, there could be strongly correlated samples in the object and background such that the learned subdictionaries could contain strongly correlated basis elements which make the sparse code unstable. We explore the extension of SRC for segmenting cerebral infarcts from DW imaging to be elaborated next: to reduce the computational cost by confining the searching space and to avoid the unstable sparse coding by replacing the -norm/-norm with elastic net.

3. Infarct Segmentation Based on Sparse Representation

The proposed method extends the SRC framework to segment cerebral infraction (Figure 2). It consists of dictionary learning and voxel-wise classification. For a patient, in addition to the 3 volumes of DW imaging, a new volume is calculated to represent the asymmetry feature due to infarction. Local patches of the 4 volumes are employed to extract patch features. During voxel-wise classification, regions-of-interest (ROIs) are extracted based on expansion of the ischemic regions that have been validated previously [22]. Only voxels within ROIs are considered as candidates for classification based on elastic net.

Denote the baseline , , and images as , , and , respectively. The algorithm can be decomposed into preprocessing, feature extraction, dictionary learning, derivation of ROIs, and classification.

3.1. Preprocessing [23]

The original volumes can have large intensity range which is rescaled to [] to facilitate subsequent processing.

The 3 volumes , , and all play a role in differentiating infarct voxels from noninfarct voxels. Specifically, hyperintense could be used for differentiating fresh from old infarction; an infarct will have hyperintense and hypointense . A volume can be generated to emphasize infarcts.As infarction is generally asymmetrical with respect to the midsagittal plane (MSP), a composite volume is formulated and is denoted as in the following way:where and are symmetrical to the midsagittal line (which is the intersection between the MSP and the axial slice ) on the axial slice and is the neighborhood of and is set as neighborhood. MSP is extracted from based on local symmetry and outlier removal [24]. Figure 3 shows an axial slice of , and .

(a)

(b)

(c)

3.2. Feature Extraction

Based on the fact that image patches could capture more anatomical information than a single voxel, patch-based methods have been recommended for label fusion and segmentation [10]. For every sample voxel, four patches centered at the voxel with size are obtained from volumes B0, DWI, ADC, and ASYM. Then intensity values of each patch are rearranged into a column-vector. These column-vectors are concatenated into the final feature vector.

In this study, both two-dimensional (2D) and 3D patches will be explored. Square patches are obtained from axial slice of four volumes to form a feature vector for 2D patches; cuboid patches are obtained from four volumes and concatenated into a feature vector for 3D patches. Here with , being the side length of the patch.

3.3. Dictionary Learning

To stabilize the sparse code, a -norm regularization term is added to form the elastic net [25, 26] for deriving the sparse code in the form of where and are, respectively, parameters to control sparsity and stability. Likewise, the dictionary learning objective function will be in the form of elastic net to replace the -norm or -norm.For classifying voxels into infarct and noninfarct (normal), two subdictionaries are needed, which are denoted, respectively, as and .

As the number of voxels of infarcts is far smaller than that of normal voxels, selection of training samples needs careful attention to achieve representative sampling while avoiding unbalanced samples. For training, the set of infarct voxels forms the positive samples and is denoted as . is iteratively dilated with a structuring element of radius being 1 voxel until the dilated size is not smaller than the original size of . Those noninfarct voxels included in the dilated are then acting as the negative samples. The rationale behind this procedure will be explained in Discussion. Figure 4 shows the positive samples (in red) and negative samples (in yellow).

(a)

(b)

(c)

The objective function (formula (10)) could be optimized by an online algorithm based on stochastic approximations [27].

3.4. Derivation of Regions-of-Interest [23]

Due to the complexity and inhomogeneity of ischemic infarcts, there may be infarct-mimics in DWI volumes, which could be recognized as false positives. The ROIs are to include as many as possible infarct voxels and to exclude most infarct-mimics to reduce subsequent computational cost and enhance reliability. The ROIs are derived from dilation of the initial ischemic regions modified from [22] by thresholding ADC maps with constraints on DWIs. Specifically, denote the most frequent ADC value for all voxels within the brain mask as , and any voxels with not greater than are checked to formulate connected components. Any connected components with average DWI values not smaller than the intensity average plus the intensity standard deviation of brain voxels on DWI at the corresponding axial slice are kept as part of region . Voxels with ADC value within are checked to formulate connected components and are added to if the component has at least one neighboring voxel in . is then increased by morphological dilation with a structuring element of radius . Suppose the coordinates of are in the range of to with ; then the regions on the axial slice and are, respectively, pasted to axial slice and to attain the eventual ROIs. Figure 5 shows an axial slice of DWI, ADC, and the corresponding ROIs (in color), where regions in red are initial ischemic infarcts and those in yellow are added voxels through the described morphological dilation.

(a)

(b)

(c)

3.5. Classification

Once ROIs are determined, voxels within the ROIs are classified as infarct or normal based on SRC according to patch features of voxels. The classification procedure is as follows: (1) A global dictionary is formed by concatenating the two subdictionaries . (2) The sparse code of a new sample is computed by optimizing the elastic net function in formula (9) with respect to the global dictionary . (3) Compute the residue for each class where and is a characteristic function that would select the coefficients associated with class . (4) Classification according to the residue of each class is as follows:

4. Experiments

Tiantan Hospital and Tianjin Huanhu Hospital have been involved in a National Stroke Registry since 2005, which registered patients prospectively with stroke ictus within 6 hours according to a preestablished system [22]. The study included 98 consecutive patients (31 women and 66 men, age range 24–78 years) with confirmed ischemia. The protocol of the research has been approved by the Institutional Review Board of both hospitals. All patients gave written consent and provided permission for scientific and educational purpose.

The baseline DW imaging was carried out with two 3.0 Tesla scanners (Trio-Tim, Siemens, Erlangen, Germany) with a spin-echo, multislice, and single shot echo-planar imaging of and 1000 or 1500 s/mm² and the corresponding ADC map. The DWIs in this study are isotropic whose intensities are the cubic root of the multiplied signal intensities of the three individual images acquired with a diffusion gradient in each of the three orthogonal directions (, , and axes). The imaging covered the whole brain, with 19–24 axial slices of a 5 mm slice thickness and 1–1.5 mm gap, most matrixes being 128 × 128 with few being 156 × 156 or 384 × 384 to have an in-plane resolution of 0.60 mm to 1.80 mm.

A neuroanatomy expert (QH) manually drew the infarction regions using in-house software (that could adjust contrast, enlarge, pan, and undo) as the reference for measuring the accuracy of automatic algorithms. When ischemia boundaries were not clear, both DWI and ADC were checked, and a neuroradiologist was invited for discussion (YZ) to make the drawing as accurate as possible.

The algorithm was implemented with C++ based on SPAMS [27]. All experiments were carried out on a Pentium 4 PC with 2.4 GHz CPU (4 cores) and 4G RAM. The segmentation performance was tested by 2-fold cross validation. All training datasets were randomly divided into 2 groups with equal number of datasets. One group was used as training set while the other was used as testing set and vice versa. To quantify the infarct segmentation, the following measures are adopted as used in other investigations [7, 8]: DC, sensitivity, specificity, positive prediction value (PPV), and negative prediction value (NPV) given below:Here TP, TN, FP, and FN are, respectively, for true positive, true negative, false positive, and false negative.

The execution time for segmenting a dataset includes 5 seconds for determining ROIs and 1.9 seconds for classifying infarcts from ROIs on a Pentium 4 PC with 2.4 GHz CPU (4 cores) and 4 G RAM).

4.1. Parameters

Experiments are carried out to choose the appropriate parameters in dictionary learning and sparse representation in terms of DC. Best performance is achieved with , , , , and . Relevant experiments are conducted to see the dependency of accuracy on these parameters. When changing one or two parameters, the other parameters are fixed as those that attain the best performance.

The two parameters and are changed simultaneously, both taking one of the values of (Table 1). Experiments of accuracy dependency on other parameters are carried out separately, with to be one of the values of (Figure 6), taking a value of (Figure 7), and being one of the values of (Figure 8).

4.2. Role of the Regions-of-Interest

We have conducted experiments to validate the effect of ROIs on ischemic lesion segmentation. To implement the proposed method without ROIs, two minor modifications have been made to the proposed method with ROIs. First, in addition to the negative samples of the proposed method, another equal portion of negative samples from the rest of the brain is included to represent the normal brain structure for learning the dictionary . Second, after every voxel is classified by sparse representation classification as a candidate lesion voxel, a postprocessing step is applied. The postprocessing is to form candidate lesion region through connected component analysis and eliminate candidate lesion regions with less than 3 voxels, as well as eliminate candidate lesion regions with low average DWI (less than the most frequent value of DWI) or high average ADC (greater than the most frequent value of ADC). It was found that the proposed method without ROIs could yield a Dice coefficient of 0.673 ± 0.179, a sensitivity of 0.797 ± 0.168, and a specificity of 0.999 ± 0.001.

In case of taking the whole brain as the ROIs, the average execution time for segmenting a dataset is substantially increased from 7 seconds to 39 seconds.

4.3. Comparison with Existing Methods

The proposed method is to be compared with divergence measure based method (DM method) [7] and FCM method [8]. It is worth noting that the DC values reported in [7] (0.67) and [8] (0.90) are based on their data, and none of them used the data within 6 hours from ictus. As demonstrated in [7], data at admission are more difficult to be segmented due to substantial variability in appearance [6]. For a fair and relevant comparison, we have tried our best to implement the other two methods with minor enhancement to attain the best performance on the 98 datasets within 6 hours from symptom onset.

We are unable to use ROIs derived from Section 3.4 for FCM/DM because the two algorithms depend on the whole volume or slice for its calculation (either for histogram calculation (DM) of the two hemispheres and clustering into 50 clusters (FCM) (ROI may contain too few voxels to be categorized into 50 clusters)). In other words, it is very hard, if not impossible, to implement FCM/DM methods just for data within the ROIs. We have thought out and implemented one way to incorporate the ROIs for comparison: to segment the lesion using the original FCM/DM and remove those lesions outside the ROIs. To simplify denotations, these two implementations are, respectively, denoted as FCM_ROI and DM_ROI. Altogether there are 5 methods to be compared, namely, the proposed, DM, DM_ROI, FCM, and FCM_ROI methods.

Figures 9 and 10 show, respectively, an axial slice with deep inhomogeneous infarcts and an axial slice with inhomogeneous infarcts involving the cortex, ground truth, and the segmented infarcts of the five methods.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

The segmentation performance of the 98 datasets by the proposed, FCM, FCM_ROI, DM, and DM_ROI methods in terms of DC, sensitivity, specificity, PPV, and NPV is summarized in Table 2 and shown in Figure 11.

5. Discussion

An SRC based method has been proposed and validated to segment infarcts from hyperacute ischemic stroke patient data. It consists of dictionary learning and classification. The first stage is carried out offline with elastic net. Then reconstruction residue is used for voxel-wise classification. Experiments are conducted to determine the appropriate parameters, including dictionary size , to control the sparsity of the sparse code, to control the stability of the sparse code, radius of patches , and size of structuring element to dilate the initial ischemic region for deriving the ROIs. The proposed method achieves best accuracy when these parameters are, respectively, 200, 0.3, 0.1, 1, and 2 with features being extracted from 2D patches. With different combinations, the accuracy DC could vary from 0.660 ± 0.163 to 0.755 ± 0.117 (Table 1). As the segmentation accuracy is sensitive to and (Table 1), they need to be determined with care. Once they are fixed, the segmentation is not sensitive to the variation of other parameters (Figures 6–8). In other words, the algorithm is robust to parameters , , and .

Experiments have been conducted to compare the performance of 2D patches and 3D patches which are used to train dictionary and sparse coding. Results show that the segmentation accuracy with 2D patches () is slightly better than that with 3D patches (), which may imply that neighboring axial slices will add confusing information due to the large slice spacing.

In addition, we have carried out extra experiments to compare classification performance between elastic net and -norm constraints. For -norm constraints based on formulas (3) and (4), optimum and are found through experiments to be, respectively, 0.9 and 150. For a fair comparison, both elastic net and -norm constraints are based on 2D patches of being 1 and being 2, with other parameters being optimum. As expected, elastic net yields higher DC than the -norm constraint (0.755 ± 0.117 versus 0.749 ± 0.119), which may imply that elastic net could better balance sparsity and stability of sparse codes than -/-norm at least for classification of ischemic infarcts. Gao and his colleague [14] were the first to advocate elastic net and showed similar difference to ours (difference between DC of elastic net and that of /-norm around 0.007).

Because ischemic infarcts are inhomogeneous and are undergoing variation at hyperacute stage, the boundaries between the infarcts and noninfarcts are usually blurred. We thus hypothesize that samples near the infarct boundaries are more difficult to be differentiated. To validate this assumption, another classification model is derived in a similar way to the proposed one with the only difference being that the negative samples are randomly picked from noninfarct voxels. As expected, the classification model from randomly picked negative sample yields significantly lower accuracy than the proposed one (DC versus , according to the paired -test). This additional experiment justifies the way to pick up negative samples near the boundaries during training. As processing time is critical for hyperacute ischemic stroke data, ROIs are introduced to confine the classification space. Due to the substantial variability of DWI and ADC intensities of infarcts and artifacts with similar DWI and/or ADC intensities to infarcts, it is hard to determine appropriate candidate regions of infarcts or ROIs. For this purpose, we extend the ischemic regions calculated from our previous work that are based on decreased ADC with DWI being not low [22] that can include regions with unclear boundaries on DWI. The initial ROIs modified from [22] could yield an average DC of , sensitivity of , and specificity of , being better than FCM method [8] and DM method [7]. After dilation and pasting the two extreme axial slices, the eventual ROIs contain most infarct voxels to have a sensitivity of , which means that most infarct voxels not included in the initial ROIs are within the neighborhood. From the experiments on changing the neighborhood size (Figure 8), an of 2 attains best balance between inclusion of infarct voxels and exclusion of infarct-mimic voxels. The procedure to derive ROIs is reflected in the derivation of negative samples during training, that is, in the vicinity of positive samples through dilation and pasting. In the future, we will be working on derivation of ROIs with higher sensitivity and higher DC.

The experiments on taking the whole brain as the ROIs (Section 4.2) will yield lower Dice (0.673 versus 0.755), higher sensitivity (0.797 versus 0.758), and equal specificity (0.999) as compared with the proposed algorithm with ROIs. For the proposed method with ROIs, the lower sensitivity reflects the fact that the ROIs do not include all the ischemic lesions, while the higher Dice implies a net gain in accuracy to balance between excluding lesion mimics and missing some real lesions, as compared with the proposed method without the ROIs. We may thus argue that the introduction of ROIs could remove ischemic lesion mimics at the cost of excluding few ischemic lesions to have a net gain in accuracy, as well as speed up the segmentation substantially (from 39 seconds to 7 seconds, Section 4.2).

Quantification of infarcts from DW imaging at baseline within 6 hours from onset is crucial to guide treatment planning such as thrombolysis. As pointed out in [6], infarcts on DWI and ADC imaged within 6 hours are most difficult to be delineated by experts due to substantial variability in intensities and ill-defined edges as compared with those imaged after 6 hours. As the DM method [7] is basically a global thresholding method to determine the DWI threshold based on divergence measure, it is not appropriate for processing data imaged within 6 hours due to the substantial variability in intensities. For the FCM method [8], it is dependent on the prominent edge on DWI for confirmation of infarcts, which may be the case for data within 10 days. As the data in this study are all within 6 hours from ictus, the ischemic lesion is still evolving and may have ill-defined edges on DWI; it is understandable that FCM will have a bad performance for these ischemic data. Derivation of ROIs from Section 3.4 is a way to incorporate prior knowledge to differentiate between ischemic lesions and ischemic lesion mimics based on our previous work [22]. The ROIs are part of the proposed method to enhance the segmentation accuracy from an average DC of 0.673 (Section 4.2, the proposed method without ROIs) to an average DC of 0.755 (Table 2). When the DM and FCM methods are confined by the ROIs as illustrated in Section 4.3, the DM_ROI and FCM_ROI could yield a higher DC (being, resp., 0.417 and 0.606, Table 2), which are still significantly lower than the proposed method (). The proposed method could cope with the variability in intensities and ill-defined edges of DWI and ADC data imaged within 6 hours through learning the pattern from training samples. As such, the proposed method attain significantly higher accuracy in terms of DC, sensitivity, and PPV than [7, 8] all with , while there exists no significant difference for the specificity and FPV among the three methods (Table 2). The superior performance of the proposed method may be ascribed to the following characteristics of the method.

First, it takes into account B0, DWI, ADC, and asymmetry with respect to the MSP, so the artifacts on DWI due to shine-through effect could be eliminated (Figure 12(d)). For the FCM method [8], it imposes ADC constraint to confine infarcts so the shine-through artifact could be removed (Figure 12(e)). As for the DM method [7], it could not get rid of shine-through artifact because it is purely based on DWI (Figure 12(g)). Both DM_ROI and FCM_ROI could handle the shine-through artifact due to the introduction of ROI constraints.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

Second, the proposed method could handle better than [7, 8] for infarcts with intensity variability on DWI, as it is based on learning samples with intensity variation. Figure 13 shows an axial slice with lower intensities around the infarct border on DWI; as such, the FCM [8] and FCM_ROI methods could not include the border (Figures 13(e) and 13(f)) while the proposed method could (Figure 13(d)). The DM method [7] fails to segment infarcts at this axial slice as it has a lower DWI intensity than infarcts at other axial slices.

(a)

(b)

(c)

(d)

(e)

(f)

Third, the proposed method could handle better than [7, 8] infarcts with ill-defined edges on DWI, once again due to the fact that the delineation is based on learning samples with similar edges (Figure 14).

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

Fourth, introduction of the asymmetry map has significantly enhanced the segmentation accuracy. When only B0, DWI, and ADC are employed for SRC based training and classification, the best performance is and sensitivity = . When the asymmetry map is added, the DC and sensitivity have been, respectively, increased to and . We also carried out experiments to segment based on thresholding the asymmetry map to find that the highest Dice achieved is with a sensitivity of and specificity of when the asymmetry threshold is around 40. As the performance based on thresholding the asymmetry map is substantially inferior to the proposed method, we may argue the following: (1) both infarcts and noninfarcts could cause asymmetry (sensitivity greater than 0 and specificity smaller than 1); (2) not all infarcts could be detected by asymmetry map (sensitivity is always smaller than 1); and (3) the proposed sparse learning framework is better than simple thresholding, and there is much complementary information from B0, DWI, and ADC for segmenting the infarcts (recall that the training of dictionaries is from the asymmetry map, DWI, ADC, and B0, Section 3.2).

The proposed method combines the advantages of SRC and our previous work to delineate infarcts based on DWI and ADC [22]. In particular, SRC could help to find sophisticated object patterns (such as infarcts with intensity variation and ill-defined edges), while the ROIs derived from [22] will confine the infarcts within candidate regions to decrease computational cost and exclude infarct-mimics. According to [28], a DC of 0.7 and above indicates a good agreement. As the proposed method could achieve good agreement (DC > 0.70) with speed (within 7 seconds), it could be a potential tool to be used clinically for guiding thrombolytic therapy.

The proposed method has only been validated on Siemens 3T scanners of two hospitals. It is our intension to design different dictionaries for different scanners to account for variability of imaging hardware. We are in the process of designing classifiers for GE scanners.

The hyperacute ischemia sometimes exhibits substantial variability that are hard to be modeled mathematically, which is the major cause of deviation from ground truth infarcts of the proposed method. For these cases, manual delineation of infarcts is difficult and is based on experience, anatomical knowledge, and comprehension of the DWI and ADC. To aid manual delineation, a new volume, that is, , is created, which is similar to DWI but emphasizes infarcts. Figure 15 shows an axial slice with complex image properties and the segmentation of the proposed method (the FCM, FCM_ROI, DM, and DM_ROI methods fail to segment). New tools and methods are yet to be developed for better segmentation of infarcts with complicated imaging features that are even hard to be manually delineated by human experts.

(a)

(b)

(c)

(d)

(e)

6. Conclusion

In this paper, an SRC based cerebral infarct segmentation method is explored and validated against 98 ischemic datasets scanned within 6 hours from ictus. The proposed method could handle well infarcts with intensity variability and ill-defined edges to yield significantly higher DC () than the FCM method [8] (, ) and DM method [7] (, ) and their enhanced versions by confining their segmentations within the ROIs (, ; , ). It could segment infarcts of a patient from baseline DW imaging within 7 seconds on a Pentium 4 PC with 2.4 GHz CPU (4 cores) and 4 G RAM. The superior performance is mainly ascribed to the comprehensive inclusion of the DW imaging and introduced asymmetry map, learning based nature that could learn complex infarct patterns, adoption of elastic net to stabilize sparse code, and introduction of ROIs to speed up the classification procedure as well as exclude lesion mimics. The proposed method could provide a potential tool to quantify infarcts from DW imaging at hyperacute stage with accuracy and speed to assist the decision making especially for thrombolytic therapy.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work has been supported by National Program on Key Basic Research Project (nos. 2013CB733800 and 2013CB733803), National Natural Science Foundation (no. 61671440), National Science and Technology Pillar Program during the Twelfth Five-Year Plan Period (no. 2011BAI08B09), Shenzhen Key Technical Development Grant (no. CXZZ20140610151856719), and Shenzhen Basic Research Grant (no. JCYJ20140414170821262). The authors would like to thank Dr. Yiqun Zhang for her valuable discussion on manually delineating infarct regions without clear boundaries.

References

B. C. V. Campbell, A. Purushotham, S. Christensen et al., “The infarct core is well represented by the acute diffusion lesion: sustained reversal is infrequent,” Journal of Cerebral Blood Flow and Metabolism, vol. 32, no. 1, pp. 50–56, 2012.
View at: Publisher Site | Google Scholar
K. W. Muir, A. Buchan, R. von Kummer, J. Rother, and J.-C. Baron, “Imaging of acute stroke,” The Lancet Neurology, vol. 5, no. 9, pp. 755–768, 2006.
View at: Publisher Site | Google Scholar
R. E. Latchaw, M. J. Alberts, M. H. Lev et al., “Recommendations for imaging of acute ischemic stroke: a scientific statement from the American Heart Association,” Stroke, vol. 40, no. 11, pp. 3646–3678, 2009.
View at: Publisher Site | Google Scholar
B. C. V. Campbell, S. M. Davis, and G. A. Donnan, “How much diffusion lesion reversal occurs after treatment within three-hours of stroke onset?” International Journal of Stroke, vol. 8, no. 5, pp. 329–330, 2013.
View at: Publisher Site | Google Scholar
P. D. Mitsias, J. R. Ewing, M. Lu et al., “Multiparametric iterative self-organizing MR imaging data analysis technique for assessment of tissue viability in acute cerebral ischemia,” American Journal of Neuroradiology, vol. 25, no. 9, pp. 1499–1508, 2004.
View at: Google Scholar
C. S. Rivers, J. M. Wardlaw, P. A. Armitage, M. E. Bastin, P. J. Hand, and M. S. Dennis, “Acute ischemic stroke lesion measurement on diffusion-weighted imaging—important considerations in designing acute stroke trials with magnetic resonance imaging,” Journal of Stroke & Cerebrovascular Diseases, vol. 16, no. 2, pp. 64–70, 2007.
View at: Publisher Site | Google Scholar
K. N. B. Prakash, V. Gupta, H. Jianbo, and W. L. Nowinski, “Automatic processing of diffusion-weighted ischemic stroke images based on divergence measures: slice and hemisphere identification, and stroke region segmentation,” International Journal of Computer Assisted Radiology and Surgery, vol. 3, no. 6, pp. 559–570, 2008.
View at: Publisher Site | Google Scholar
J.-Z. Tsai, S.-J. Peng, Y.-W. Chen et al., “Automatic detection and quantification of acute cerebral infarct by fuzzy clustering and histographic characterization on diffusion weighted MR imaging and apparent diffusion coefficient map,” BioMed Research International, vol. 2014, Article ID 963032, 13 pages, 2014.
View at: Publisher Site | Google Scholar
S. Mujumdar, R. Varma, and L. T. Kishore, “A novel framework for segmentation of stroke lesions in diffusion weighted MRI using multiple b-value data,” in Proceedings of the 21st International Conference on Pattern Recognition (ICPR '12), pp. 3762–3765, November 2012.
View at: Google Scholar
L. Wang, F. Shi, Y. Z. Gao et al., “Integration of sparse multi-modality representation and anatomical constraint for isointense infant brain MR image segmentation,” NeuroImage, vol. 89, pp. 152–164, 2014.
View at: Publisher Site | Google Scholar
M. Z. Liu, L. Lu, X. J. Ye, S. P. Yu, and M. Salganicoff, “Sparse classification for computer aided diagnosis using learned dictionaries,” in Proceedings of the 14th International Conference Medical Image Computing and Computer-Assisted Intervention, Toronto, Canada, September 2011, vol. 6893 of Lecture Notes in Computer Science, pp. 41–48, Springer, 2011.
View at: Google Scholar
M. Yang, L. Zhang, X. C. Feng, and D. Zhang, “Fisher discrimination dictionary learning for sparse representation,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV '11), pp. 543–550, Barcelona, Spain, November 2011.
View at: Publisher Site | Google Scholar
S. Zhang, J. Huang, D. Metaxas, W. Wang, and X. Huang, “Discriminative sparse representations for cervigram image segmentation,” in Proceedings of the 7th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI '10), pp. 133–136, April 2010.
View at: Publisher Site | Google Scholar
Y. Z. Gao, S. Liao, and D. G. Shen, “Prostate segmentation by sparse representation based classification,” Medical Physics, vol. 39, no. 10, pp. 6372–6387, 2012.
View at: Publisher Site | Google Scholar
L. Wang, F. Shi, G. Li, W. L. Lin, J. H. Gilmore, and D. G. Shen, “Patch-driven neonatal brain MRI segmentation with sparse representation and level sets,” in Proceedings of the IEEE 10th International Symposium on Biomedical Imaging (ISBI '13), pp. 1090–1093, IEEE, San Francisco, Calif, USA, April 2013.
View at: Publisher Site | Google Scholar
L. Wang, K. C. Chen, Y. Gao et al., “Automated bone segmentation from dental CBCT images using patch-based sparse representation and convex optimization,” Medical Physics, vol. 41, no. 4, Article ID 043503, 2014.
View at: Publisher Site | Google Scholar
R. Fang, T. Chen, and P. C. Sanelli, “Towards robust deconvolution of low-dose perfusion CT: sparse perfusion deconvolution using online dictionary learning,” Medical Image Analysis, vol. 17, no. 4, pp. 417–428, 2013.
View at: Publisher Site | Google Scholar
M. Aharon, M. Elad, and A. Bruckstein, “K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006.
View at: Publisher Site | Google Scholar
K. Engan, S. O. Aase, and J. H. Husoy, “Frame based signal compression using method of optimal directions (MOD),” in Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '99), vol. 4, pp. 1–4, May-June 1999.
View at: Publisher Site | Google Scholar
B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani, “Least angle regression,” The Annals of Statistics, vol. 32, no. 2, pp. 407–499, 2004.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, “Robust face recognition via sparse representation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210–227, 2009.
View at: Publisher Site | Google Scholar
L. Ma, P.-Y. Gao, Q.-M. Hu et al., “Prediction of infarct core and salvageable ischemic tissue volumes by analyzing apparent diffusion coefficient without intravenous contrast material,” Academic Radiology, vol. 17, no. 12, pp. 1506–1517, 2010.
View at: Publisher Site | Google Scholar
Y. Q. Peng, X. D. Zhang, and Q. M. Hu, “Segmentation of hyper-acute ischemic infarcts from diffusion weighted imaging based on support vector machine,” Journal of Computer and Communications, vol. 3, no. 11, pp. 152–157, 2015.
View at: Publisher Site | Google Scholar
Q. Hu and W. L. Nowinski, “A rapid algorithm for robust and automatic extraction of the midsagittal plane of the human cerebrum from neuroimages based on local symmetry and outlier removal,” NeuroImage, vol. 20, no. 4, pp. 2153–2165, 2003.
View at: Publisher Site | Google Scholar
H. Zou and T. Hastie, “Regularization and variable selection via the elastic net,” Journal of the Royal Statistical Society. Series B. Statistical Methodology, vol. 67, no. 2, pp. 301–320, 2005.
View at: Publisher Site | Google Scholar | MathSciNet
Y. Z. Gao, S. Liao, and D. G. Shen, “Prostate segmentation by sparse representation based classification,” Medical Physics, vol. 39, no. 10, pp. 6372–6387, 2012.
View at: Publisher Site | Google Scholar
J. Mairal, F. Bach, J. Ponce, and G. Sapiro, “Online learning for matrix factorization and sparse coding,” Journal of Machine Learning Research, vol. 11, pp. 19–60, 2010.
View at: Google Scholar | MathSciNet
J. J. Bartko, “Measurement and reliability: statistical thinking considerations,” Schizophrenia Bulletin, vol. 17, no. 3, pp. 483–489, 1991.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2016 Xiaodong Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1454

Downloads

1312

Citations