Abstract

We proposed a novel framework of multiphase segmentation based on stochastic theory and phase transition theory. Our main contribution lies in the introduction of a constructed function so that its composition with phase function forms membership functions. In this way, it saves memory space and also avoids the general simplex constraint problem for soft segmentations. The framework is then applied to partial volume segmentation. Although the partial volume segmentation in this paper is focused on brain MR image, the proposed framework can be applied to any segmentation containing partial volume caused by limited resolution and overlapping.

1. Introduction

Roughly speaking, multiphase segmentation is more challenging than two-phase segmentation. There are a lot of explanations for this phenomenon. But for energy-based models, one reason is that it is harder to control the convergence in the implementation for multiphase segmentation than for two-phase segmentation. One extreme case is that the implementation may lead to less phases (local minimum) than expected. Recently, multiphase segmentation has been studied extensively. One way is to employ level set method. For example, papers in [1, 2] use multiple level set functions, and papers in [3, 4] use multiple layers for each level set function. Soft segmentation is another way developed for multiphase segmentation [5–7]. Let 𝐼 denote an image containing 𝐾 phases. The purpose of soft segmentation is to find a set of functions 𝑝𝑖,(1≀𝑖≀𝐾) indicating the probabilities that point π‘₯ belongs to 𝑖th phase.

The similarities between image segmentation and the phase transition theory in material sciences and fluid mechanics have inspired people to borrow some ideas in contemporary material sciences, for example, the diffuse interface model of Cahn-Hilliard [8], and its rigorous mathematical analysis in the framework of Ξ“ convergence approximation by Modica and Mortola [9, 10]. The phase-field relaxation consists in approximating the perimeter using a Cahn-Hilliard type penalization functional [8] with the formπ‘ƒπœ–(πœ–πœ)=2ξ€œΞ©||||βˆ‡πœ21𝑑π‘₯+πœ–ξ€œΞ©π‘Š(𝜐)𝑑π‘₯,(1) where π‘ŠβˆΆβ„π‘β†’β„βˆͺ+∞ is a scalar function with exactly two minimizers at 0 and 1 satisfying π‘Š(0)=π‘Š(1)=0. The second term of the penalty functional ensures that the values of the material density 𝜐 converges to 0 or 1 as πœ–β†’0, while the first term controls the perimeter. The parameter πœ– can be interpreted as the width of the diffused edge representation in 𝜐. The phase-field approach has been used in topological optimization problems [11–13]. In [14], the authors used the phase field to approximate sharp edges and a variational phase field model is derived to compute a shape average of a given number of shapes. In [12], the authors used the phase transition theory in a Cahn-Hilliard inpainting model.

Since applying phase-transition theory into a model can enhance pattern separation and make boundaries smooth, the theory has been widely applied to image segmentation [6, 15–17]. The authors in [17] presented a model for image segmentation based upon phase transition theory of Modica and Mortola and discussed its connections to the Mumford-Shah model and some related works. Then Chen et al. extended it under stochastic theory [16]. In [6], Shen proposed a general multiphase stochastic variational fuzzy segmentation model combining stochastic principle and the Modica-Mortola’s phase-transition theory.

There is a common point in the models mentioned above. They all contain a regularity term and a fidelity term and assume that the image is piecewise constant or smooth enough, which makes them vulnerable to noise. Although we can choose large weight for the fidelity term to restrain noise, the segmentation result is sensitive to the choice of the weights between the regularity term and the fidelity term. If the weight of the regularity term is much bigger than that of the fidelity term, then the edge and fine structure can be preserved very well but the noise may not be removed ideally. On the other hand, if we choose the contrary, that is, the weight of fidelity term is much bigger than that of the regularity term, noise will be removed very well but edge may be damaged and some fine structure may be lost. To avoid this dilemma, there have been some studies in recent years aiming at employing nonlocal information of images, such as graph-cut-based method (discrete case) [18] and nonlocal-variation-based method (continuous case) [19]. In these methods, whether a point is an edge depends not only on the local intensity difference, but also on finding how often the similar features of the point have been repeated in the whole domain. By taking nonlocal information, the edge can be well preserved while the noise is smoothed.

Different from using nonlocal information, the framework in this paper uses stochastic theory to restrain noise and improve segmentation result. It can be thought an extension of piecewise constant Mumford-Shah kind models mentioned above. More precisely, we assume that the intensity of each point is a Gaussian distributed random sample. In each phase Ξ©π‘˜, the points follow a same Gaussian distribution with mean π‘π‘˜ and variance πœŽπ‘˜. We assume that the clean true image 𝑒(π‘₯) is still piecewise constant (i.e., inside each phase, the intensities are always a constant equal to π‘π‘˜) but contaminated by a Gaussian noise 𝑛(π‘₯), that is, 𝐼(π‘₯)=𝑒(π‘₯)+𝑛(π‘₯). As a result, the intensities of points in a same phase will not be a constant, but a family of samples from a same Gaussian distribution π‘“π‘˜(π‘₯), that is, a Gaussian distribution PDF (Probability Density Function). By maximizing the likelihood (joint PDF) of all random samples, the fidelity term becomes the following form (see detail in Section 3):πΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ(π‘₯)22𝜎2π‘˜ξƒͺπ‘π‘˜(π‘₯)𝑑π‘₯,(2) where 0β‰€π‘π‘˜(π‘₯)≀1 is a smooth approximation of characteristic function of π‘˜th phase, and π‘π‘˜ and πœŽπ‘˜ are mean and standard variation, respectively. In application, π‘π‘˜(π‘₯) are usually chosen in such a way that βˆ‘πΎβˆ’1π‘˜=0π‘π‘˜(π‘₯)=1 holds in the whole domain. So, the model becomes a standard soft segmentation with π‘π‘˜(π‘₯) as probability functions. Compared with those piecewise constant [1–7, 15, 17], where variants are not involved, for point π‘₯ where (π‘π‘˜βˆ’πΌ(π‘₯))2 is relatively larger than (π‘π‘–βˆ’πΌ(π‘₯))2 for some π‘–β‰ π‘˜ due to noise, the model with fidelity term (2) can still classify it to kth phase (the correct phase) and the noise can therefore be restrained. The reason is that the effect of large (π‘π‘˜βˆ’πΌ(π‘₯))2 will be partly counteracted by the variance. On the other hand, based on probability theory, we know that the probability that π‘†βŠ‚Ξ©π‘˜ for a connected area 𝑆 with big difference |πΌπ‘˜(π‘₯)βˆ’π‘π‘˜| for all π‘₯βˆˆΞ©π‘˜ is much smaller than the probability that π‘₯βˆˆΞ©π‘˜ for one isolated point π‘₯ with big difference |πΌπ‘˜(π‘₯)βˆ’π‘π‘˜|. This fact can guarantee that the model based on (2) can preserve small structure while removing isolated noise.

In this paper, we did not introduce membership functions as an approximation of characteristic function. Instead, we introduce a constructed function β„Ž(π‘₯) so that the composite function β„Žβˆ˜(𝑧(π‘₯)βˆ’π‘˜) has the property of probability function π‘π‘˜(π‘₯). As a result, the model itself is still a soft segmentation. The advantage of applying constructed function in the model lies in the fact that there will be less variables introduced in the model which makes the discussion and the implementation easier. For example, as long as we know 𝑧(π‘₯), the probability π‘π‘˜(π‘₯) is followed by π‘π‘˜(π‘₯)=β„Žβˆ˜(𝑧(π‘₯)βˆ’π‘˜).

In MRI brain image, there are three tissues, that is, white matter, gray matter, and cerebrospinal fluid (CSF). Due to limited resolution and nonregularity of the boundaries of pure matters, there are some partial volumes formed by overlaps of pure matters. Those partial volumes are hard to be classified to pure matters. We assume each partial volume follows a mixed Gaussian distribution generated by two Gaussian distributions which correspond to pure matters. The proposed frame work can then be applied to partial volume segmentation. The rest of the paper is organized as follows. In Section 2, we develop the framework of multiphase segmentation by combining phase transition theory and Gaussian distribution. Section 3 is the implementation and some considerations. Then in Section 4, we apply the frame work to partial volume segmentation. Experiments are carried out in Section 5. We show by examples the advantage of the proposed model by comparing with other multiphase segmentation models. Finally, we close the paper with a short conclusion.

2. Framework Development

In this section, we develop a framework of multiphase segmentation by integrating phase transition theory and statistics method. Let 𝐼∈𝐿2(Ξ©) be an image defined in a bounded, smooth and open domain Ξ©βŠ‚π‘…2. Suppose the image contains 𝐾 phases, and we take the image as a random field with the following assumptions: (i)at each point π‘₯∈Ω, the intensity 𝐼(π‘₯) is a random variable;(ii)all the random variables {𝐼(π‘₯)∣π‘₯∈Ω} are independent;(iii)in each phase πΌβˆ£Ξ©π‘˜,0β‰€π‘˜β‰€πΎβˆ’1, all the random variables {𝐼(π‘₯)∢π‘₯βˆˆΞ©π‘˜} are identically distributed as a Gaussian distribution with same mean π‘π‘˜ and same variance 𝜎2π‘˜ (which are to be determined).

We want to maximize the likelihood, joint pdf of {𝐼(π‘₯),π‘₯∈Ω}, which is equivalent to minimize the following energy (the details can be found in [16]):βˆ’log𝐿(𝑐,𝜎)=πΎβˆ’1ξ“π‘˜=0ξ€œΞ©π‘˜ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒͺ=πΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒͺ1Ξ©π‘˜.(3)

Second, we borrow the length term from [17] which is based on phase transition theory. Let 𝐼 be the image defined above. The signature function 𝑧(π‘₯) is defined by𝑧(π‘₯)=π‘˜,ifπ‘₯βˆˆΞ©π‘˜,π‘˜=0,1,…,πΎβˆ’1.(4) Then the total variation of 𝑧(π‘₯) is ∫Ω|𝐷𝑧|, where 𝐷𝑧 denotes the vectorial Radon measure of the total variation (TV) of 𝑧. We have the following relation:πΎβˆ’1ξ“π‘˜=0||πœ•Ξ©π‘˜||β‰€ξ€œΞ©||||𝐷𝑧(π‘₯)β‰€πΎπΎβˆ’1ξ“π‘˜=0||πœ•Ξ©π‘˜||,(5) where πœ•Ξ©π‘˜ is the boundary of Ξ©π‘˜. Thus, ∫Ω|𝐷𝑧(π‘₯)| works as the length of the edges of all phases. The ideal model is to minimize the sum 𝐸[𝑧,𝑐,𝜎∣𝐼] of the length term and the negative log-likelihood(3)𝐸[]=ξ€œπ‘§,𝑐,𝜎∣𝐼Ω||||𝐷𝑧(π‘₯)+πœ†πΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒͺ1𝑧=π‘˜.(6)

Note that when 𝜎0=𝜎1=β‹―=πœŽπΎβˆ’1β‰ 0, the model is equivalent to piecewise constant Mumford-Shah model. However, this model has intrinsic drawback due to the discreteness of the signature function 𝑧(π‘₯) and the characteristic function 1𝑧=π‘˜, which will impede the application of PDE-based method. So, we need to use some relaxed version. For the signature function, we introduce its relaxed version via the celebrated model of Modica and Mortola on phase transitions in material science and fluid mechanics. We refer the authors to papers [8–10, 17] for further understanding to phase transition theory.

Let ̃𝑧(π‘₯) be a smoothed version of the signature function 𝑧(π‘₯), which is called phase fields. To be simple, we still use the same notation 𝑧(π‘₯) to denote the phase field. Modica and Mortola [9] established thatξ€œΞ©ξ‚ƒπœ–||||βˆ‡π‘§2+1πœ–sin2ξ‚„πœ‹π‘§π‘‘π‘₯(7)𝛾-converges to ∫(4/πœ‹)Ξ©|𝐷𝑧(π‘₯)| in 𝐿1(β„›) for phase fields 𝑧(π‘₯) that ultimately only take integer values. Now we replace the length term in (6) by (7) when πœ– is small enough. For the characteristic function 1𝑧=π‘˜, we can use any smooth function β„Žπ‘˜(π‘₯) as an approximation if only 0β‰€β„Žπ‘˜(π‘₯)≀1 and βˆ‘πΎβˆ’1π‘˜=0 for all π‘₯∈Ω. The ideal smooth function β„Žπ‘˜(π‘₯) should have the following properties:(i)at each point π‘₯∈Ω where 𝑧(π‘₯)=π‘˜, β„Žπ‘˜(π‘₯) is close to 1;(ii)at each point π‘₯∈Ω where |𝑧(π‘₯)βˆ’π‘˜|>0.5, β„Žπ‘˜(π‘₯) is close to 0.

If we can choose β„Ž(π‘₯) satisfying that β„Ž(π‘₯) is close to one at small neighborhood of 0 and close to zero elsewhere, then β„Žπ‘˜(π‘₯) can be denoted as β„Ž(𝑧(π‘₯)βˆ’π‘˜) since 𝑧(π‘₯) is almost integer. Then, the fidelity term (3) becomesπΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒͺβ„Ž(𝑧(π‘₯)βˆ’π‘˜).(8)

Now, as the final step, we integrate the relaxed length term (7) and the relaxed fidelity term (8). The new energy functional isπΈπœ–[]=ξ€œπ‘§,𝑐,πœŽβˆ£πΌΞ©ξ‚ƒπœ–||||βˆ‡π‘§2+1πœ–sin2ξ‚„πœ‹π‘§π‘‘π‘₯+πœ†πΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ©logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒͺβ„Ž(π‘§βˆ’π‘˜)∢=πΉπœ–[𝑧][].+𝐺𝑧,𝐜,𝝈∣𝐼(9)

This is the proposed framework of multiphase segmentation. Compared with most existing models, the proposed model is robust to noise and more applicable; moreover, since the phase function 𝑧(π‘₯) is involved in the model, segmentation can be derived directly from the phase function.

3. Implementation and Considerations

In our framework (9), there are three groups of parameters to be determined, the means 𝐜=(𝑐0,𝑐1,…,π‘πΎβˆ’1), the variances 𝝈𝟐=(𝜎20,𝜎21,…,𝜎2πΎβˆ’1), and the phase field 𝑧(π‘₯). We compute πΈπœ–[𝑧,𝐜,𝝈∣𝐼] regarding 𝑧, 𝐜, and 𝝈 as independent variables. This allows the application of the alternating minimization (AM) scheme, that is, to alternatingly optimize the three conditional energies πΈπœ–[π‘§βˆ£πœ,𝝈,𝐼],πΈπœ–[𝐜∣𝝈,𝑧,𝐼], and πΈπœ–[𝝈∣𝐜,𝑧,𝐼], under the iterations of 𝑧(𝑛)β†’πœ(𝑛)β†’πˆ(𝑛)→𝑧(𝑛+1) given by𝐜(𝑛)=argminπΈπœ–ξ€Ίπœβˆ£πˆ(𝑛),𝑧(𝑛)ξ€»πˆ,𝐼,(10)(𝑛)=argminπΈπœ–ξ€Ίπˆπœ(𝑛),𝑧(𝑛)𝑧,𝐼,(11)(𝑛+1)=argminπΈπœ–ξ€Ίπ‘§βˆ£πœ(𝑛),𝝈(𝑛)ξ€».,𝐼(12)

It is well known (see Vogel [20]) that the AM scheme is monotone: πΈπœ–ξ€Ίπ‘§(𝑛+1),𝐜(𝑛+1),𝜎(𝑛+1)ξ€»βˆ£πΌβ‰€πΈπœ–ξ€Ίπ‘§(𝑛),𝐜(𝑛),𝜎(𝑛)ξ€».∣𝐼(13) For (10) and (11), one can simply have at pixel level,π‘π‘˜(𝑛)=βˆ‘π‘–βˆ‘π‘–πΌπ‘–,π‘—β„Žξ‚€π‘§(𝑛)𝑖,π‘—ξ‚βˆ’π‘˜βˆ‘π‘–βˆ‘π‘—β„Žξ‚€π‘§(𝑛)𝑖,π‘—ξ‚ξ‚€πœŽβˆ’π‘˜,π‘˜=0,1,…,πΎβˆ’1,π‘˜(𝑛)2=βˆ‘π‘–βˆ‘π‘—ξ‚€πΌπ‘–,π‘—βˆ’π‘π‘˜(𝑛)2β„Žξ‚€π‘§(𝑛)𝑖,π‘—ξ‚βˆ’π‘˜βˆ‘π‘–βˆ‘π‘—β„Žξ‚€π‘§(𝑛)𝑖,𝑗,βˆ’π‘˜π‘˜=0,1,…,πΎβˆ’1,(14) where 𝑧(𝑛)𝑖,𝑗 denotes the computational phase field on the Cartesian image domain at step 𝑛.

For argminπΈπœ–[π‘§βˆ£π‘(𝑛),𝜎(𝑛),𝐼], since the model is not an active contour model (the integral of the fidelity term cannot be separated by boundaries), it is not proper to use level set method for implementation. Besides, our purpose is not to find the rough segmentation of 𝑧(π‘₯) but the exact value of 𝑧(π‘₯) so that we can estimate pure matters form the partial volume in MRI brain images using the composition β„Žβˆ˜(𝑧(π‘₯)βˆ’π‘˜) for 0β‰€π‘˜β‰€πΎβˆ’1. Note that the Euler-Lagrange equation of πΈπœ–[π‘§βˆ£π‘(𝑛),𝜎(𝑛),𝐼] with respect to 𝑧 isπœ‹βˆ’2πœ–Ξ”π‘§(π‘₯)+πœ–sin(2πœ‹π‘§(π‘₯))+πœ†πΎβˆ’1ξ“π‘˜=0β„Žξ…žξƒ©(𝑧(π‘₯)βˆ’π‘˜)logπœŽπ‘˜+||π‘π‘˜||βˆ’πΌ22𝜎2π‘˜ξƒͺ=0.(15)

As for most multiphase segmentation models, the energy functional πΈπœ–[𝑧,𝑐,𝜎∣𝐼] is not convex with respect to 𝑧. Thus, simple iteration scheme obtained directly from the Euler-Lagrange equation may not converge. In our application, we adopt the convex-concave procedure (CCCP). Before applying CCCP to our model, we first give a short review on CCCP. We only describe the basic result that is necessary to understand this paper. For more details, we refer the readers to [6, 21]. We also recommend the reader to use selected initial value to help converge (but not guaranteed).

3.1. Convex-Concave Procedure (CCCP)

The convex-concave procedure is a convex splitting method in optimization which was explored by Yuille and Rangarajan [21]. The following theorem is the foundation of CCCP.

Theorem 1. Consider an energy function which is bounded below and is an addition of convex and concave functions: 𝐸⃗π‘₯=𝐸convexξ€·ξ€Έβƒ—π‘₯+𝐸concaveξ€·ξ€Έβƒ—π‘₯.(16) Then, the discrete iterative CCCP algorithm given by βˆ‡πΈconvexξ‚€βƒ—π‘₯(𝑛+1)=βˆ’βˆ‡πΈconcaveξ‚€βƒ—π‘₯(𝑛),𝑛=0,1,…(17) is guaranteed to monotonically decrease the energy 𝐸(βƒ—π‘₯) as a function of time and to converge to a local minimum or a saddle point of 𝐸(βƒ—π‘₯).

Proposition 1. Let ∫𝐹(𝑒)=Ω𝑓(𝑒(π‘₯))𝑑π‘₯, where π‘“βˆˆπΆ2(𝑅). If π‘“ξ…žξ…žβ‰₯βˆ’π›Ύ for some 𝛾β‰₯0. We define the splitting ξ€œπΉ(𝑒)=Ω𝛾2𝑒2ξ‚ξ€œ+𝑓(𝑒)𝑑π‘₯βˆ’Ξ©π›Ύ2𝑒2𝑑π‘₯∢=𝐹1(𝑒)βˆ’πΉ2(𝑒).(18) Then both 𝐹(𝑒) and 𝐹2(𝑒) are convex.

The proof is trivial since 𝐹1ξ€Έξ…žξ…ž(ξ€œπ‘’)=Ω𝛾+π‘“ξ…žξ…ž(𝑒)𝑑π‘₯β‰₯0(19) if π‘“ξ…žξ…žβ‰₯βˆ’π›Ύ for some 𝛾β‰₯0.

3.2. Iteration Scheme for Phase Function 𝑧(π‘₯)

To find argminπΈπœ–[𝑧|𝑐(𝑛),𝜎(𝑛),𝐼], we now apply the CCCP algorithm. In this section, we will temporally suppose the parameters π‘π‘˜,πœŽπ‘˜,1β‰€π‘˜β‰€πΎ are all known for the purpose of statement.

Note that (𝑑2/𝑑𝑧2)sin2πœ‹π‘§β‰₯βˆ’2πœ‹2, we split πΉπœ–[𝑧] asπΉπœ–[𝑧]=ξ‚»πΉπœ–[𝑧]+πœ‹2πœ–ξ€œΞ©π‘§2ξ‚Όβˆ’πœ‹π‘‘π‘₯2πœ–ξ€œΞ©π‘§2𝑑π‘₯∢=𝐹1πœ–[𝑧]βˆ’πΉ2πœ–[𝑧].(20)

Similarly, suppose (𝑑2/𝑑𝑧2)β„Ž(π‘§βˆ’π‘˜)β‰₯βˆ’π›Ύ, where 𝛾 depends on function β„Ž(β‹…). We split 𝐺[𝑧|𝐼] as𝐺[]π‘§πΌβˆΆ=𝐺1[]π‘§πΌβˆ’πΊ2[],𝑧𝐼(21) where𝐺1[]=π‘§βˆ£πΌπΎβˆ’1ξ“π‘˜=0ξ€œΞ©ξƒ¬logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒ­+β„Ž(π‘§βˆ’π‘˜)𝑑π‘₯πΎβˆ’1ξ“π‘˜=0ξ€œΞ©π›Ύ2logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒ­(π‘§βˆ’π‘˜)2𝐺𝑑π‘₯,2[]=π‘§βˆ£πΌπΎβˆ’1ξ“π‘˜=0ξ€œΞ©π›Ύ2logπœŽπ‘˜+ξ€·π‘π‘˜ξ€Έβˆ’πΌ22𝜎2π‘˜ξƒ­(π‘§βˆ’π‘˜)2𝑑π‘₯.(22) We simply denote it byπΈπœ–[]=ξ€½πΉπ‘§βˆ£πΌ1πœ–[𝑧]+𝐺1[]ξ€Ύβˆ’ξ€½πΉπ‘§βˆ£πΌ2πœ–[𝑧]+𝐺2[]ξ€Ύ.π‘§βˆ£πΌ(23)

By Proposition 1, 𝐹1πœ–[𝑧], 𝐹2πœ–[𝑧], 𝐺1[π‘§βˆ£πΌ] and 𝐺2[π‘§βˆ£πΌ] are all convex. In order to apply CCCP, we still need the functional to be bounded below. To do that, we must assume that the image is not constant for any phase. Or in detail, we suppose each variance 𝜎2π‘˜β‰ 0. Thenξ€½πœŽMinπ‘˜ξ€Ύ,0β‰€π‘˜β‰€πΎβˆ’1>0.(24) Thus, the functional πΈπœ–[π‘§βˆ£π‘(𝑛),𝜎(𝑛),𝐼] is lower bounded.

Remark 1. This assumption is true for most applications since if the image has been piecewise constant, we do not need to do any segmentation. Even if some phases are exactly constant, the assumption can still be true by adding some noise or spurious dots in the phases.

By Theorem 1, we can now use the CCCP iteration scheme via Frechet derivative, that is,𝐹1+πœ†πΊ1ξ€Έξ…žξ€·π‘§(𝑛+1)ξ€Έ=𝐹2+πœ†πΊ2ξ€Έξ…žξ€·π‘§(𝑛)ξ€Έ.(25) Under integration by parts (see (15)), the above equation is equivalent to the following PDE.ξ‚ƒβˆ’2πœ–Ξ”π‘§(𝑛+1)+πœ‹πœ–sin2πœ‹π‘§(𝑛+1)ξ‚„+2πœ‹2πœ–π‘§(𝑛+1)+πœ†πΎβˆ’1ξ“π‘˜=0logπœŽπ‘˜+||π‘π‘˜||βˆ’πΌ22𝜎2π‘˜ξƒͺβ„Žξ…žξ€·π‘§(𝑛+1)ξ€Έξƒ­βˆ’π‘˜+πœ†π›ΎπΎβˆ’1ξ“π‘˜=0logπœŽπ‘˜+||π‘π‘˜||βˆ’πΌ22𝜎2π‘˜ξƒͺ𝑧(𝑛+1)ξ€Έ=βˆ’π‘˜2πœ‹2πœ–π‘§(𝑛)+πœ†π›ΎπΎβˆ’1ξ“π‘˜=0logπœŽπ‘˜+||π‘π‘˜||βˆ’πΌ22𝜎2π‘˜ξƒͺ𝑧(𝑛)ξ€Έ,βˆ’π‘˜(26) where the terms in the square brackets come from the Euler-Lagrange equation of πΈπœ–. Then we can use any existing method to solve 𝑧𝑛+1 such as Gauss-Seidel method. Now we have all the three minimizations (10), (11), and (12). We can use the alternating minimization scheme as discussed at the beginning of this section.

3.3. Construct Approximation Function β„Ž(π‘₯)

We construct β„Žπ›Ώ(π‘₯) in such a way that β„Žπ›Ώ(𝑧(π‘₯)βˆ’π‘˜) forms an ownership function of π‘˜th phase, where β„Žπ›Ώ(π‘₯) is defined as follows:β„Žπ›Ώ=⎧βŽͺβŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽͺ⎩(π‘₯)1+𝛿2+π‘₯2βˆ’π›Ώπœ‹cosπœ‹(π‘₯+1)||||2𝛿ifπ‘₯+1≀𝛿,1+π‘₯ifβˆ’1+𝛿<π‘₯<βˆ’π›Ώ,2π›Ώπœ‹cosπ‘₯πœ‹2𝛿+1βˆ’π›Ώif|π‘₯|≀𝛿,1βˆ’π‘₯if𝛿<π‘₯<1βˆ’π›Ώ,1+𝛿2βˆ’π‘₯2βˆ’π›Ώπœ‹πœ‹cos(π‘₯βˆ’1)||||2𝛿ifπ‘₯βˆ’1≀𝛿,0,otherwise,(27) where 𝛿 is a parameter theoretically satisfying 0<𝛿<0.5 and should be small enough. Figure 1 shows different branches of β„Žπ›Ώβˆ˜(𝑧(π‘₯)βˆ’π‘˜) for π‘˜=βˆ’1,0,1,2,3. In Figure 2, a three-phase example shows thatβ„Žπ›Ώ(π‘₯+1)+β„Žπ›Ώ(π‘₯)+β„Žπ›Ώ(π‘₯βˆ’1)=1(28) holds only for π‘₯∈[βˆ’1βˆ’π›Ώ,1+𝛿]. In application, we define β„Žπ›Ώ(π‘₯)∢=1 for π‘₯∈(βˆ’βˆž,βˆ’π›Ώ)βˆͺ(πΎβˆ’1+𝛿,∞). Then βˆ‘πΎβˆ’1π‘˜=0β„Žπ›Ώ(π‘₯βˆ’π‘˜)≑1, and so our model is a standard soft segmentation model.

Note that although the function β„Žπ›Ώ(π‘₯) itself is not a good approximation of 1𝑧=π‘˜ based on its graph, the composition β„Žπ›Ώβˆ˜(𝑧(π‘₯)βˆ’π‘˜) is actually a very good approximation of 1𝑧=π‘˜ for the reason that, with the length term (7), the phase function 𝑧(π‘₯) will ultimately only take integer values which makes the composition β„Žπ›Ώβˆ˜π‘§(π‘₯) mostly evaluated only based on the piece (2𝛿/πœ‹)cos(π‘₯πœ‹/2𝛿)+1βˆ’π›Ώ since |𝑧(π‘₯)βˆ’π‘˜|≀𝛿 for π‘₯ belonging to phase π‘˜. Thus, β„Žπ›Ώβˆ˜(𝑧(π‘₯)βˆ’π‘˜) will ultimately take the value (2𝛿/πœ‹)+1βˆ’π›Ώ which approximates 1 very well when 𝛿 is small enough. At the end, we want to mention that as β„Žπ›Ώ(π‘₯) is defined in an explicit way, its derivative can be calculated easily. So we can finally use the iteration scheme CCCP developed in the above section.

3.4. Segmentation Decision

Finally, once the iterations are stable, we have two ways to determine the segmentation. One way is hard segmentation, that is, we apply to 𝑧(π‘₯) the following hard thresholding decision rule:⎧βŽͺ⎨βŽͺ⎩Ωπ‘₯∈0Ξ©if𝑧(π‘₯)<0.5,πΎβˆ’1Ξ©if𝑧(π‘₯)β‰₯πΎβˆ’1.5,π‘˜βˆ’1ifπ‘˜βˆ’1.5≀𝑧(π‘₯)<π‘˜βˆ’0.5,otherwise.(29)

When the choice of β„Ž(π‘₯) makes β„Žπ›Ώβˆ˜(𝑧(π‘₯)βˆ’π‘˜) an ownership function, the model is a soft segmentation. Which scheme (hard segmentation or soft segmentation) should be chosen depends on the purpose of application. If we only want to know the pieces of segments, we should use the hard segmentation. However, in the application to partial volume estimation, we should choose the soft segmentation scheme.

4. Applied to Partial Volume Segmentation

Since our model is a soft segmentation and the membership functions can be viewed as ownership, we can apply our framework to partial volume estimation.

4.1. Introduction to Partial Volume Segmentation

One important application of multiphase segmentation is MRI brain image segmentation. There are three different tissues in human brains, that is, the white matter, the gray matter, and the cerebrospinal fluid (CSF). It is well known that the volume of gray matter has a close relation to some intelligence diseases. Precisely computing the volume of white matter and gray matter can help to diagnose those diseases earlier. On the other hand, due to the limited spatial resolution of imaging equipment, not all voxels in the image contain the same type of tissue, especially the voxels near the tissue borders, which highly likely contain more than one tissue types, called partial volume (PV). Figure 3 shows the principle of partial volume formulation (which is originally used by [22]). The left image contains two phases with higher resolution. Due to lower resolution (half of the left image in each dimension), four squares in the left image contribute to one square in the right image. As a result, the right image contains more phases (for this example, it contains four different phases). When all four subsquares with the same phase in the left image contribute to one square in the right image, the resulted square with lower resolution will be still the same phase as original one, called pure matter. However, when the four subsquares contain different phases, the resulted square will present a phase between the original two phases. In this case, the resulted square is called partial volume. The intensity of the partial volume is a weighted average of original pure matters with combination ratios depending on the number of subsquares of each pure matter in the original image forming the partial volume.

This kind of partial volume may cause a big error in the estimation of pure tissue volumes. The error is sometimes as big as 40–60 percent [23]. Thus, partial volume segmentation of MRI images has now received considerable attention. The ideal partial volume segmentation should contain two aspects: finding partial volume and deciding its combination ratio of different pure matters. Most recent work on partial volume segmentation based on statistical principal, for example, the expectation-maximization (EM) method [22, 24]. These methods improved the precision of pure matter estimation. However, they do not contain length term, which makes them sensitive to noise. In [25], the author applied a reparameterized level set algorithm to partial volume segmentation. The method does include the length term. It takes the partial volume part as separated classes that are composition of pure tissues. The drawback of the paper is to use fixed ratios (e.g., 50%) of combinations for partial volumes.

4.2. Apply the framework to Partial Volume Segmentation

We can now apply the proposed framework to partial volume segmentation for brain MR images with mixed Gaussian distribution. In order to apply the proposed framework to partial volume segmentation, we treat the brain image as three different phases: white matter, gray matter, and pure CSF. We calculate the pure matter volumes in a natural way:ξ€œVolume(phase(π‘˜))=Ξ©π‘π‘˜(ξ€œπ‘₯)𝑑π‘₯=Ξ©β„Žπ›Ώ(𝑧(π‘₯)βˆ’(π‘˜))𝑑π‘₯.(30)

5. Experiment and Discussion

Since this paper focus on soft segmentation and applications to partial volume segmentation, our experiments are mostly applied to MRI brain images although a natural image is also presented which is not very close to piecewise constant. Because this work is related to the Sine-Sinc model [17] (where the author proposed a multiphase segmentation model based on phase-transition theory but it is not a soft model), we first give a comparison between our model and the Sine-Sinc model in Figure 4 with synthetic image and MRI brain image. Then in Figure 5, we show three membership functions of a natural image.

The rest of the examples give a comparison between adaptive fuzzy c-mean method [26] and the proposed model. Figure 6 is the segmentations for a synthetic biased image which is first used by Bresson and Chan [19]. The contour of segmentations in the third column is obtained by thresholding the membership functions (same thing is true for next two figures). Figures 7 and 8 are for MRI brain images.

Finally, we apply our model to partial volume segmentation using simulated brain images. Then we compare the ground truth of pure matters and our segmentation results. We calculate the errors of partial volume estimation in two ways. One way is based on hard segmentation. Another way is based on soft segmentation.

Figure 9 is a comparison between the ground truth of the original simulated brain MR image and the membership functions obtained using the proposed framework. Figure 9(a) is the original simulated noised image. The corresponding ground truth of white matter, gray matter, and CSF are shown in Figures 9(b), 9(c), and 9(d), respectively. Phase membership functions are shown in Figures 9(e)–9(g),

We carried out the experiment with 35 consecutive 2D slices of a 3D simulated brain MR image. Then we compared the errors between the Sine-Sinc model and the proposed model. As an average, the errors are shown in Table 1.

Finally, we add series of Gaussian noises to the images with zero mean and different variances. Then compare their errors among the AFCM model, the proposed model with hard segmentation by thresholding, and the proposed model with soft segmentation. The errors are shown in Figure 10(a). From the graph, we can see that as the variance of the noise rises, the error will also rise for all the three cases. However, compared with the AFCM model, the errors using the proposed model rises much more slowly as the variance of the noise rises.

We also compared the influence on the errors as the parameter 𝛿 in function β„Žπ›Ώ(π‘₯) changes. This is shown in Figure 10(b). From the graph, pure matter estimation based on phase function 𝑧(π‘₯) looks a little better than the estimation based on the ownership function β„Žπ›Ώ(𝑧(π‘₯)βˆ’π‘˜) when 𝛿 is bigger, while the pure matter estimation based on the ownership function β„Žπ›Ώ(𝑧(π‘₯)βˆ’π‘˜) is a little better than the estimation based on the phase function 𝑧(π‘₯) when 𝛿 is smaller.

6. Conclusions

A critical problem in energy-based multiphase segmentation is the nonconvexity of the energy functional. The problem is more difficult to handle than two-phase case. Level set-based method is proved very successful in multiphase segmentation based on hard segmentation, but they do not work for soft segmentation since the area of different phases may be overlapped and so no clear boundaries between different phases. The soft segmentation is more useful in partial volume segmentation for brain MR images. This paper borrowed the ideas from phase-transition-based methods [6, 15, 17]. The key point of this paper is to construct an approximation function so that the membership functions can be obtained by its composition with phase function. With this constructed function, we can avoid adding new variables for membership functions and so it saves memory space and promotes efficiency. Moreover, since the composition of the constructed function and phase functions forms membership functions, we also avoid the general constraint problem for soft segmentation in implementation. The framework is then applied to partial volume segmentation. The future contains choosing better constructed function β„Ž(π‘₯), and better discretization scheme and iteration scheme.

Acknowledgment

Publication of this article was funded in part by the University of Florida Open-Access Publishing Fund.