Abstract

In order to better predict and follow treatment responses in cancer patients, there is growing interest in noninvasively characterizing tumor heterogeneity based on MR images possessing different contrast and quantitative information. This requires mechanisms for integrating such data and reducing the data dimensionality to levels amenable to interpretation by human readers. Here we propose a two-step pipeline for integrating diffusion and perfusion MRI that we demonstrate in the quantification of breast lesion heterogeneity. First, the images acquired with the two modalities are aligned using an intermodal registration. Dissimilarity-based clustering is then performed exploiting the information coming from both modalities. To this end an ad hoc distance metric is developed and tested for tuning the weighting for the two modalities. The distributions of the diffusion parameter values in subregions identified by the algorithm are extracted and compared through nonparametric testing for posterior evaluation of the tissue heterogeneity. Results show that the joint exploitation of the information brought by DCE and DWI leads to consistent results accounting for both perfusion and microstructural information yielding a greater refinement of the segmentation than the separate processing of the two modalities, consistent with that drawn manually by a radiologist with access to the same data.

1. Introduction

Responses to cancer treatment are increasingly differentiated based not only on tumor type, but also on genetic and histochemical biomarkers. Exemplifying the progress in this respect is breast cancer. Biopsy-derived histological biomarkers offer high biological specificity and play an important role in determining the choice of chemotherapeutic agent. As different parts of a tumor often show different histological signatures or have evolved to different stages of tumor progression that may impact on their response to a given therapy, it is important to obtain a complete coverage of the tumor. Biopsies, however, are difficult to localize within the breast, are subject to sampling errors, and can seldom be repeated. Thus, there is growing clinical interest in the possible role of imaging to describe anatomical and physiological heterogeneity of tumors [1, 2].

Magnetic resonance imaging (MRI) methods such as dynamic contrast enhanced (DCE) and diffusion weighted (DW) MRI methods are amongst those of interest as they provide noninvasive digital biomarkers with good spatial coverage and repeatability [3]. DCE-MRI uses serial acquisition of images during and after the injection of intravenous contrast agent and has been shown to reflect tumor vascularity [4, 5]. DWI, on the other hand, generates images that are sensitized to water displacement at the diffusion scale and can be used to calculate a quantitative index reflecting the apparent freedom of diffusion (apparent diffusion coefficient (ADC)). Preclinical and clinical data show that ADC reflects regional cellularity [68].

DCE-MRI has a high sensitivity for breast cancer detection (89–100%), while DWI has shown utility in predicting suitable therapies and monitoring response [9]. A recognized weakness of DCE and DW-MRI is their lack of specificity between tumor types as overlap between the findings of benign and malignant lesions results in variable specificity (37–86%) [9]. This is not entirely surprising given that across cancer types the common features tend to include such processes as cell proliferation, angiogenesis, and necrosis. The ability of DCE- and DW-MRI to provide a spatial depiction of these anatomical and physiological conditions within a tumor makes them natural tools for probing tumor heterogeneity. The reporting of MRI has long relied on visual assessment of several scans having different contrasts, but in relation to breast cancer, few studies have exploited this inherently multiparametric data in a unified manner [1012]. Moreover, the most recent works mainly address the problem of comparing and retrospectively integrating the contributions from the different modalities, without exploiting the conjunct information. Nevertheless, these works have highlighted the potential of combining DCE-MRI and DWI to differentiate the core of the tumor from peritumoral tissues and normal tissues and thus provide an indication of lesion heterogeneity [13].

In this work, we propose the multimodal integration of the information provided by DCE-MRI and DWI of breast cancer lesions for evaluating their heterogeneity, that is, to divide the lesion into zones that share certain similarity when using combined information coming from different imaging domains. The ultimate intention of this protocol is to allow a more extensive, reproducible characterization of heterogeneity in tumors that have been previously identified by a clinician.

In all previous reports on breast lesion segmentation the representation of DCE curves and ADC maps has been that of features in a vector space defined by the image values [1417]. In this work a different approach is followed exploiting dissimilarity-based representations (DBR) [18]. The concept of dissimilarity-based representation consists of focusing on the contrast, or distance, between objects and of measuring it by a suitable criterion. The term object refers, in the present context, to the information represented by each particular voxel. This information need not be of a single type and in this case consists of both signal intensities (i.e., the time-intensity enhancement curve for DCE-MRI) and the ADC parameter value (derived from DW-MRI). A key concept in DBR is that of a proximity relation between two objects, which does not need to be explicitly represented in a feature space. Objects are characterized through pairwise dissimilarities; instead of using an absolute characterization of the objects by a set of features, problem-centric knowledge is used to define a measure that estimates the dissimilarity between objects. Here, both DCE and DWI contribute to such a measure leading to a novel multimodal approach to tissue characterization.

This paper is organized as follows. Section 2 describes the pipeline including the clustering and registration processing steps. Section 3 presents the results, which are then discussed in Section 4, and Section 5 derives conclusions.

2. Materials and Methods

This section provides an overview of the pipeline shown in Figure 1 and details the methodological choices with respect to both clustering and registration. The DCE-MRI data are first visually inspected to identify a time point where the lesion has the higher contrast with respect to the surrounding tissue. Multimodal registration is carried out between DW-MRI and DCE-MRI images, allowing a spatial mapping of both volumes. Dissimilarity-based clustering is then performed integrating information from both acquisition modalities. Statistical analysis, consisting of nonparametric tests, were applied on the ADC distributions defined by the obtained clusters. An assessment of the results was carried out by clinical experts, and, for the sake of completeness, an evaluation of the tightness and separation of the clusters was also performed.

2.1. Multimodal Registration

In order to perform voxelwise dissimilarity-based clustering that incorporates both DCE-MRI and DWI data, it is necessary to first spatially align the two datasets. The problem of registering between DCE-MRI and DWI becomes an increasingly difficult task in a highly compressible and elastic tissues like the breast, with its inhomogeneous anisotropic soft tissue, inherent nonrigid behavior, and lack of solid landmarks to guide the registration as fixed references. A standard registration protocol was used. Due to the highly distinct contrast and intensity characteristics of the two modalities as well as the low resolution of the DWI volumes, the registration process was divided into two steps, each following a standard multiresolution strategy. In the first step, rigid and affine transformations were performed successively in order to align and match the features of the fixed (DCE-MRI) and moving (DWI) images following a 5-level Gaussian scale space. In the second step a multiresolution cubic B-spline transformation with a regularization penalty was performed to elastically refine the alignment. Lesion-specific masks based on regions delineated by clinical experts were used in order to assign a greater weight to the voxels in the lesion area [19]. Normalized mutual information (NMI) was used as registration metric. In order to regularize the deformation, we used a bending energy penalty which is based on the spatial derivatives of the transformation [20]. The methodology used for registration was implemented in Elastix [19], and all the steps have been widely validated in literature [20, 21].

The registration protocol was applied to the b0 images from the DWI dataset and their transformation to the DCE-MRI space validated for each subject through visual inspection by an expert. The resulting transformation was applied to the remaining b-values, and the ADC was estimated on the transformed DWI images.

2.2. Dissimilarity-Based Clustering Methodology

The next step in the processing methodology is the construction of a dissimilarity matrix. This matrix consists of a set of row vectors, one for each voxel. These vectors represent the voxels in a vector space constructed by the dissimilarities to each other voxel. Usually, such a space can be safely treated as an Euclidean space equipped with the standard inner product definition.

Let be a voxel-based dataset. Given a dissimilarity function, a data-dependent mapping is defined as linking to the so-called dissimilarity space [22]. The complete dissimilarity representation yields a square matrix consisting of the dissimilarities between all pairs of objects, such that every object is described by an -dimensional dissimilarity vector .

A distance function based on the adaptive dissimilarity index first proposed in [23] has been exploited in a previous work [24] for calculating the pairwise proximity between DCE-MRI perfusion curves. There are two main approaches to quantifiably compare two time series: one makes use of the distances between the absolute values of their elements while the other focuses on the similarity of their behavior along time. Unlike conventional time-series distance functions, which focus only on the closeness of the values observed at corresponding points in time, ignoring the interdependence relationship between elements that characterize the time-series behavior, the proposed distance function takes into account the proximity with respect to values as well as the temporal correlation for the proximity with respect to behavior. For two voxel-derived perfusion curves and , closeness with respect to behavior is defined as the combination of their monotonicity, that is, if both curves increase or decrease simultaneously, and the closeness of their growth rate over a determined period [23]. Both criteria are quantified by the temporal correlation present in the first term of the distance function , (1). The complete distance function for DCE-MRI derived perfusion curves is defined as follows: where and are two voxel-derived perfusion curves sampled at time instants [23, 25]. Cort is the temporal correlation (2), and is the Hausdorff distance, defined in (3), which is used to measure the distance between both voxelwise perfusion curves: The integration of the diffusion information into the dissimilarity function is accomplished through the addition of an ADC-dependent term (4). This term is defined as a sigmoid function which makes use of the normalized difference between the ADCs ( and ) of the voxels under consideration, which ranges from 0 to 1:

The tuning parameter weights the contribution of to the complete dissimilarity measure by modulating the shape of the sigmoid function. When the value of the normalized difference between ADCs is low, denoting similar ADC values between voxels, the dissimilarity function approaches zero. On the contrary, when the value of the normalized difference between ADCs is high, denoting a large dissimilarity between ADC values between voxels, approaches one, making the overall dissimilarity measure approaches the value of . The impact of the different values of is illustrated in Figure 2.

The complete dissimilarity function is then the product of and : This global measure enables the monitoring of the performance as a function of the relative weight given to the ADC, as well as of different values of .

2.3. Performance Assessment

In each of the patients, a ROI was delineated by an expert around the lesion in the motion-corrected DCE-MRI volumes. Since unsupervised classification is sensitive to the general structure and distribution of the data, the ROI was drawn just exceeding the area of the enhancing lesion, allowing for a clear delineation of the heterogeneity of the lesion inside the ROI. The time-intensity curves normalized to the baseline at and the corresponding ADC values from the voxels inside the ROI were treated as independent objects on a voxel by voxel basis. Using from (5), a dissimilarity matrix was derived on a slicewise basis from the pairwise dissimilarities of the elements in the corresponding ROI. In such a space, each element was represented by a row vector whose dimensionality was defined by the cardinality of the ROI.

Once the dissimilarity space was constructed, the -means algorithm [26] was used to group the voxels in the ROI into clusters. The initial centroids were calculated automatically following a preliminary clustering step with a random subsample, as a strategy to improve the algorithm initialization avoiding a misplacement of the initial seeds. -means minimize the sum over all clusters of the within-cluster sums of point-to-cluster-centroid distances using, in this case, the squared Euclidean distance.

For selecting the number of clusters the standard clinical assessment protocol has been taken into consideration. It considers only three classes (persistent, plateau, and wash-out). An additional has been included for the surrounding tissue considering that the ROI exceeds the estimated limits of the enhancing lesion.

In order to perform a comparison with established methods the clustering procedure was also performed following a morphologic feature-based approach. This method relies on descriptors derived from the voxelwise time-intensity curves, comprising mainly specific characteristics of the shape of such curve. The features extracted from the DCE-MRI voxelwise time-intensity curves are baseline, maximum signal difference, time to peak, area under curve, maximum enhancement, wash-in rate, maximum slope of increase, wash-out rate, and the intercept of the line fitting the tail of the time-intensity curve with the axis . The use and definition of these morphologic features to describe the contrast agent intake can be found in the related literature [14, 17, 27]. Further, the clustering procedure was repeated incorporating the ADC of each voxel as an additional feature to the morphologic descriptor vectors calculated previously. The ADC and the morphologic features were standardized by subtracting their mean and dividing by their standard deviation. The results of these two procedures were compared with our method in order to assess the clustering and data representation outcome.

2.4. Patient Population

Data were acquired from 21 patients (age years). All the patients had been diagnosed to have primary ductal carcinoma.

DWI was acquired with a single-shot spin-echo (SE) echo planar imaging (EPI) sequence in three orthogonal diffusion encoding directions (, , and ) using 4 values (, , and  s/mm2) with parallel imaging (acceleration factor ). Subjects were breathing freely, with no gating applied. The dataset consisted of transverse slices (slice thickness mm, no slice gap) and TR/TE , matrix over the field of view (FOV) mm.

DCE-MRI was performed using a 3D T1-weighted FLASH sequence (TR/TE ) with a flip angle of and an acquisition matrix of and field of view (FOV) . Each -slice set was collected in 90 s at time points for approximately  min of scanning. A catheter placed within an antecubital vein delivered of the contrast agent, gadopentetate dimeglumine, (Magnevist, Wayne, NJ, USA) over (followed by saline flush) after the acquisition of one baseline dynamic scan. The DCE-MRI time series was motion corrected using the scanner manufacturer's in-line procedure.

3. Results

The regions resulting from dissimilarity-based clustering were rendered as colored overlays on the morphological images on each slice. The results from a representative patient are displayed in Figure 3. After clustering was performed on the normalized curves, the resulting clusters were assessed by the radiologists to validate the segmentation of both the central tumoral and surrounding regions. Figure 3(b) shows examples of the clusters obtained, while Figures 3(c) and 3(d) represent the plots of the average time-intensity perfusion curves calculated on the raw and normalized data, respectively. The plots show the impact that the normalization step has in highlighting the intercluster differences. The central region exhibits a characteristic pattern in the DCE-MRI of a high early enhancement followed by a rapid washout, indicative of angiogenesis (Figure 3(d), red line). Typically, surrounding this central region lays a cluster featuring a pattern of rapid enhancement followed by a signal plateau (Figure 3(d), orange line). The outermost cluster surrounding these two central regions features a slow enhancement behavior (Figure 3(d), yellow line). The voxels corresponding to the each cluster were extracted from the spatially registered 3D ADC maps in order to perform statistical analysis. The analysis was carried out in the whole 3D ROI, that is, taking into account the ADC values corresponding to all the clustered slices as a single volume. Normality tests (Jarque-Bera) revealed that the ADC values for the different clusters analyzed were not normally distributed. Accordingly, a nonparametric test (Wilcoxon-signed-rank test) was used () to evaluate whether the tumor's subregions corresponded to regions in the ADC maps with statistically different PDFs. In this way we found that the distributions of the ADC values in the DCE-MRI defined regions were statistically different, in each one of the two conditions, in out of patients.

The radiologist reviewed the overlays in comparison to the DCE seen as a dynamic loop, the DWI images, and the ADC maps derived from them, as well as T2 STIR images. Criteria for the review were whether or not any of the subregions obtained by the method corresponded to a zone of necrosis based on the complete set of images and whether one or more regions that would be classified as either benign or malignant have been subdivided.

Figure 4 illustrates a typical case setting to 1, 3, and 5. From the obtained results it was highlighted by the experts the usefulness of varying the parameter to emphasize different characteristics of the lesion. A high allows the discrimination between the core tumor and the surrounding regions by giving a higher weight to the difference between ADCs. This is mainly due to the fact that there is a progressive increase in ADC from the core of the tumor to peritumor tissues to normal tissues that leads to the possibility to use the ADC for locoregional staging [28]. Lowering allows the subdivision of the core based on DCE-MRI dissimilarity and the evaluation of the heterogeneity of the tumor thanks to the balanced contribution of DCE and DWI in the distance function .

For the sake of cluster comparison and validation among different methods, the silhouette analysis was used in all the clustering results. The silhouette analysis measures how close each point in one cluster is to points in the same cluster and how far away it is to points in the neighboring clusters. This is performed by quantitatively comparing the clusters by their tightness and separation and its average width provides an evaluation of cluster validity [29]. The silhouette analysis highlighted an improved performance of 31% for the clustering performed using with respect to the established approach that employs morphologic features derived from the DCE-MRI time-intensity curves and the ADC as an additional feature (Table 1).

4. Discussion

As a general strategy, we have demonstrated a dissimilarity clustering based on multidimensional data derived from diffusion and perfusion MRI. Extension of the algorithm to additional data is straightforward, though the computational demand rises, and the similarity metric will likely need to incorporate further context-specific knowledge. As examination of tumor heterogeneity is carried out on a tumor by tumor basis, the data space can be restricted to areas containing lesions already located, but not necessarily segmented. For the specific use of DCE and DW-MRI, the lower resolution of the DWI data presents an issue of partial volume effects that affects the clustering of small lesions, but this issue is not specific to any one characterization strategy.

The two free parameters of the protocol: number of clusters (), and relative weighting of the diffusion data (), warrant discussion as the present work provides only a starting approximation to their choice, and the values may well be pathology dependent. For an unsupervised classification as used herein, the number of clusters should follow the actual structure and separation of the data into natural groups.

For breast tumors such as ductal carcinoma, the reporting of DCE-MRI data is currently based on a three-way division, while DWI is binary between normal and abnormal. The three DCE curve types (a rise and fall, a rise to a plateau, and a steady rise) have established clinical utility in predicting tumor malignancy [30]. This is not to say however that only three subgroups are possible, nor that these subgroupings are predictive of treatment response, which is the motive for examining tumor heterogeneity. In fact, works such as [14] have demonstrated that as the temporal resolution increases, a higher number of curve archetypes can be naturally identified and can be used for classification of voxelwise perfusion curves.

We consider it noteworthy, therefore, that when was reduced to just three or four groups, these were identifiable with the 3 enhancement patterns (or these three and non-enhancement) used in clinical practice for the assessment of the breast cancer. As well, the confines of the groups with DCE-MRI time-course patterns consistent with malignant and benign tumors coincided very closely with the tumor margin drawn by a radiologist. Increasing the value showed the expected progressive splitting of these groups as increased, with providing a distinction in the way this splitting proceeded based on the relative weight given to the diffusion data. The benefits of increasing the number of clusters are evident for understanding the heterogeneity of the lesion and the distribution of voxels that share certain similarities; however, the increase of the number of clusters should go hand to hand with cluster and data analysis techniques in order to avoid false or meaningless divisions. The overall protocol would also benefit from an integrating methodology such as cluster ensembles, in order to combine the multiple base clusterings done with different values into a unified consolidated clustering, reaching with this a consensus solution.

The primary criteria for noninvasive assessment of tumors based on DCE MRI involve three enhancement patterns (four including necrosis/nonenhancement). In the clinical data used for this study this assessment criteria have limited the validation to the visual interpretation of enhancement patterns based on the conventional interpretation of DCE curves, with a reader-dependent incorporation of ADC information. Ultimately, the envisaged application is in anticipating and evaluating treatment response. If tumor heterogeneity in terms of both perfusion and diffusion is to be encompassed, the conventional 3-way categorization may not be adequate or appropriate and indeed for other organs this rating is less common. We are now looking into robust methods for further validation of the processing pipeline that would enable a clinical exploitation of the multimodal analysis. Access to ground truth beyond radiological and biopsy evaluation is needed and likely requires voxelwise comparison of with histology of resections, a process that requires modifications to the surgical procedure that were not justified for this first demonstration of the method. Even were histology image data available, a significant task remains in the spatially correlation of individual MRI voxels with the histological results in order to get the requisite voxel-scale validation.

5. Conclusions

In this paper, we presented a general methodology for heterogeneity quantification that integrates information from diffusion (an indicator of cellularity) and perfusion (reflecting blood volume, flow, and vascular permeability) MRI images and illustrated its use in application to ductal carcinoma. The demonstration illustrated that multimodal clustering leads to improved selectivity and yields a greater refinement of the segmentation of tissues within the lesion than the separate processing of the two modalities.

By demonstrating that statistically consistent subgroups can be defined within tumors based on a combination of DCE-MRI and DWI-MRI data, we have indicated a means for objectively segmenting tumors that can be used for larger studies to examine clinical impact. Moreover, the appearance of statistically distinct perfusion regions within the tumor at moderate and low ADC weightings that in turn have statistically distinct ADC distributions suggests there is a useable distinction present that is not capitalized upon in present clinical practice.