Abstract

Tissue segmentation and visualization are useful for breast lesion detection and quantitative analysis. In this paper, a 3D segmentation algorithm based on Kernel-based Fuzzy C-Means (KFCM) is proposed to separate the breast MR images into different tissues. Then, an improved volume rendering algorithm based on a new transfer function model is applied to implement 3D breast visualization. Experimental results have been shown visually and have achieved reasonable consistency.

1. Introduction

Recently, magnetic resonance imaging (MRI) technique has been widely used in diagnosing and detecting diseases. It provides an effective mean of noninvasively mapping the anatomy of a subject. It works better than X-ray computed tomography (CT) at soft tissue, such as breast. The three-dimensional segmentation and visualization of breast are useful for breast lesion detection and quantitative analysis.

Segmentation is applied to extract the interesting tissues in the breast. Several algorithms have been developed for segmenting the breast tissues. Threshold-based method, the gradient method, polynomial approximation method, the active contour models, and classifier segmentation are used in breast skin segmentation. Raba et al. [1] summarized that threshold-based method, the gradient method, polynomial approximation method, the active contour models, and classifier segmentation are the main methods commonly used in breast skin segmentation. Chen et al. [2] introduced the fuzzy clustering algorithm to the tumor region segmentation which had achieved better results. Kannan et al. [3] made the breast region segmentation by introducing new objective function of fuzzy c-means with the help of hypertangent function, Lagrangian multipliers method, and kernel functions. However, these studies did not separate the fat and fibroglandular tissues. Pathmanathan [4] suggested a region-growing method, which required the user to manually choose one or more seed points. This method got satisfying results, but it is inefficient and time consuming. Nie [5] used two steps to segment the breast: firstly, locating the skin border and lungs region by standard FCM algorithm and secondly, extracting the fibroglandular tissue by an adaptive FCM algorithm. However, it is a semiautomated method.

Two kinds of methods are mainly applied in volume visualization, which are surface rendering and volume rendering. For surface rendering, Marching Cubes (MC) algorithm [6] is usually used which was developed by Lorensen and Cline in 1987. MC represents 3D objects by surface representations such as triangular patches or polygonal meshes. However, MC algorithm suffered from a common problem of having to make a binary classification: either a surface passes through the current voxel or it does not [7]. Volume rendering addresses these defects, which can display the dataset by translucent images. By volume rendering, the internal structure can be seen and analyzed conveniently, which is useful for breast disease diagnosis. Most of the volume rendering algorithms are based on the optical theory [8]. The optical model can simulate the propagation of light in real world. Levoy [7] proposed ray-casting algorithm to display surfaces from volume data. Lacroute [9] improved ray casting with shear-warp algorithm. Maximum intensity projection (MIP) is a variant of volume rendering in which the color of the pixel in the final image is determined by the maximum value encountered along a ray.

In this paper, firstly, an automatic segmentation algorithm based on KFCM is developed to separate breast images into different tissues. Secondly, an improved volume rendering algorithm based on a new transfer function model is proposed to visualize the breast on MRI. Figure 1 shows the procedure of the breast segmentation and visualization.

2. Breast Tissue Segmentation

Breast tissue segmentation is used to extract the interesting regions. It separates the breast into three parts: fat tissue, fibroglandular tissue, and air. In this paper, an automatic segmentation algorithm is developed, which consists of three steps described as follows.

2.1. Image Preprocessing

MR image has inhomogeneity, noise, and other factors which affect the continuity and accuracy of the images segmentation results. Therefore, the anisotropic diffusion filter [10] is introduced to reduce the image noise. And then binarization is performed on the image by using Otsu’s thresholding algorithm. Since air produces almost zero MR signals, the background in MR images is virtually black, and the breast-skin boundary is relatively clear. Pectoral muscle appears dark gray in the axial T1-weighted breast images, and the grey level of this area is close to zero. So the direct threshold-based method is used to segment the breast region. Figure 2(a) is the original MR image that is partially enlarged. Figure 2(b) is the image that is processed by anisotropic diffusion filter which is smoother than the original image while the edges are well preserved.

2.2. Kernel-Based Fuzzy C-Means Clustering Algorithm

Typical Fuzzy C-Means (FCM) clustering algorithm is improved from C-means method, in which every iterative makes the sample belong to an exact cluster [11].

FCM introduces a fuzzy membership function which controls the degree of a sample belonging to different classes. The Kernel-based Fuzzy C-Means (KFCM) clustering method was proposed by Zhang [12, 13] based on FCM. It used a kernel function instead of the original Euclidian norm metric in typical FCM algorithm. The KFCM algorithm minimized the following objective function: where is a fuzzy membership matrix. is the number of clustering centers which is usually set by a priori knowledge. is the number of sample points. is the fuzzy center of the th cluster. The parameter is a constant which controls the fuzziness of the resulting partition. In the experiments, is set as 2.

In this paper, the Gaussian function (see (2)) is selected as the kernel function:

So the distance measure and the objective function can be redefined as

The equations of clustering center and membership matrix are similar with FCM as follows:

Because the sum of every column value in the fuzzy membership matrix is 1, the result of KFCM is influenced by the number of the clustering centers. The result will be devious if the number is very different from the reality. However, the classification for breast using KFCM benefits from the good performance of convergence [11] because we know exactly the clustering number.

2.3. Evaluation of the Clustering Centers

The clustering centers of KFCM are initialized randomly without analyzing the original dataset, which probably causes nonaccurate segmentation and needs more iteration times. So we utilize a boundary model to determine the rough boundary of breast tissues and then calculate more accurate clustering centers.

We assume that there are gradual changes in data value between different tissues. For scalar data, the gradient is a first derivative measure which describes the direction of greatest change. The gradient magnitude is a scalar quantity that represents the rate of change in the scalar field. Since precise boundary is unnecessary, the second directional derivative is abstained for reducing complexity. is used to represent the gradient magnitude, where is the scalar function:

We assume that if we order the tissues by data value, then each type touches only types adjacent to it in the ordering [7]. The transition from one tissue to another is smooth. We create a 1D histogram of and find its peaks. The positions where peaks appear are approximately the boundary. We denote the boundary by is the number of positions whose gradient magnitude belongs to the peaks. There are three kinds of tissues in breast MR images (fat, fibroglandular, and air). So , and will be produced. And then compute the one moment of the discrete boundary points set [14]: where .

So the one moment can be used to initialize the clustering centers to perform the KFCM algorithm.

3. Breast Visualization

Volume rendering is useful for exploring the internal structure of the object such as breast. Most of the volume rendering algorithms are based on the optical theory. There are several different optical models for light interaction with volume densities of absorbing, emitting, reflecting, and scattering materials [8]. In this paper only absorption plus emission is considered in which voxel emits light itself and absorbs incoming light. It is the most common one in volume rendering [15]. Each pixel of the image casts a single ray into the dataset. The ray interacts with the scalar value of the dataset which has been virtually mapped to color and optical properties. The mapping is implemented through a transfer function. The optical properties then are used in compositing procedure which is known as the volume rendering integral. The integral is solved numerically to get the color of the pixel at last. This process continues until the color of all the pixels of the image is obtained, and then the final image will be displayed.

3.1. Optical Model and Composition

Every pixel casts a ray from viewing image to the volume, and then we resample the volume scalar data values at equispaced intervals through trilinear interpolation. The optical model and composition are not the main point of this paper and are presented here for completeness.

The process of the light propagation and the composition is parameterized. A ray cast into the volume is represented by , where is the distance from the eye to the current position. The scalar value along is denoted by . Since only absorption and emission are considered, the volume rendering equation integrates absorption coefficients and emissive colors:

Both and are functions of distance instead of scalar . We denote the energy which eventually reaches the eye from by . If is constant along the ray,

However, is usually not constant. It depends on the distance from the eye, so

So, the integral over the absorption coefficients in the exponent is which is also called the optical depth [15]. It is a single point. We should perform integral for all possible positions along the ray to acquire the total amount of radiant energy reaching the eye from direction:

The integral of (12) can be approximated by a Riemann sum: where represents the distance between resampling points.

According to exponential formula, the component of (13) can be redefined as

The approximate evaluation of the volume rendering integral is where is the number of samples, is called opacity, and is the emitted color of the th ray segment:

In our case, the front-to-back order is applied by stepping from 1 to . The following iterative addresses (16):

So we can get one pixel of the final image. After all the pixels of the final image are integrated from all directions, the result of volume rendering will be displayed.

3.2. A New Transfer Function Model

Function which is termed the opacity distribution function is introduced [16], which maps data value in an identical tissue to opacity. The independent variable is the Euclidean distance between data values and clustering centers, which are generated through KFCM. This function can be a piecewise function, polynomial function, or spline. In Figure 3, two opacity distribution functions are demonstrated. In Figure 3(a), the opacity has a linear gradient from the center of the sphere to the outline. In Figure 3(b), the opacity has a quadratic gradient in which we can largely concentrate on the boundary of the material.

3.3. Performance Optimization

According to Figure 4, the histogram of data values of one slice in the dataset implies that the data value of many points is zero. They should deserve little attention. But in traditional ray-cast algorithm, we have to consider the voxels which contribute nothing to the final image. Therefore, a scanline that is perpendicular to the slices is introduced (Figure 5). The scanline consists of gray segments and white segments. The white segments represent transparent parts in the volume, which contribute little to the final image. So, the interpolation in preprocess and the iterative of (19) are abstained in transparent segments of the corresponding scanline. Therefore, the compositing speeds up.

4. Results and Discussion

The breast MR images we used in this paper were acquired from the Military General Hospital of Beijing PLA. There are 28 slices of samples each. The procedure was implemented on a 3.4 GHz CPU, 2 G memory PC.

Figure 6 shows the results of KFCM algorithm of three slices in the dataset. This example indicates the effectiveness of KFCM algorithm in 2D field. Fibroglandular and fat tissues can be relatively separated precisely. The blue region is fibroglandular tissue. The shortcoming of KFCM is that the convergence time cannot be predicted and may be unacceptably long. However, through the initialization of clustering centers in Section 2.3, the convergence time can be made more acceptable.

Figure 7 shows the results of the breast visualization through various algorithms. In Figure 7(c), we can see that the result of Marching Cubes algorithm can well display the breast skin, but the internal structure cannot be seen. A simple linear transfer function was used to produce a result with misty fibroglandular tissue, as shown in Figure 7(d). The green regions represent fibroglandular tissue, and the white regions represent fat tissue. They are mixed together, so we could not easily distinguish them.

Figure 7(e) shows the result of MIP. MIP uses the maximum value encountered along a ray to determine the color, so it is difficult to design the transfer function. However, the region having the maximum value may not be the interested region, and furthermore it does not consider the contribution of the other points along the ray.

Figure 7(f) shows the results of ImageVis3D system which is developed by the Center for Integrative Biomedical Computing, University of Utah. This system can visualize the biological tissue well, but the transfer function should be adjusted manually, and the quality of the rendered image depends on the user’s experience.

Figures 7(a) and 7(b) show the visualization of breast by KFCM algorithm. The number of clustering centers in KFCM is three: fat tissue, fibroglandular tissue, and air. The comparison of convergence time is shown in Table 1. We have tested a single dataset for three times. From Table 1, we can see that the convergence time is stably six minutes with the evaluation of the clustering center proposed in Section 2.3. It is faster and more predicable than KFCM with random clustering centers. Then, using the compositing algorithm described in Section 3.1 and the quadratic opacity distribution function described in Section 3.2, two views are computed. Comparing with the other results, Figures 7(a) and 7(b) show the fibroglandular more clearly. Fibroglandular tissue is separated from fat tissue well. The pink regions in the breast represent fibroglandular tissue, and the white regions in the breast represent fat tissue.

5. Conclusion

A 3D segmentation algorithm based on Kernel-based Fuzzy C-Means (KFCM) is presented to separate the breast images into different tissues. In the KFCM algorithm, we propose to evaluate the clustering centers through the rough boundary of breast tissues before clustering process. Then, an improved volume rendering algorithm based on a new transfer function model is applied to implement 3D breast visualization. Experimental results show that our algorithm can efficiently segment fibroglandular and fat tissues. Also, the visualization performance of our algorithm is better than ray-casting algorithm with a linear transfer function and MIP, which makes it useful for breast lesion detection and quantitative analysis.

Acknowledgment

This project is supported by the National Natural Science Foundation of China (Grant no. 61240010).