Abstract

For a Bi-Exponentiated Weibull model, the authors obtain a general AUC formula and derive the maximum likelihood estimator of AUC and its asymptotic property. A simulation study is carried out to illustrate the finite sample size performance.

1. ROC Curve and AUC for a Bi-Exponentiated Weibull Model

In medical science, a diagnostic test result called a biomarker [1, 2] is an indicator for disease status of patients. The accuracy of a medical diagnostic test is typically evaluated by sensitivity and specificity. Receiver Operating Characteristic (ROC) curve is a graphical representation of the relationship between sensitivity and specificity. The area under the ROC curve (AUC) is an overall performance measure for the biomarker. Hence the main issue in assessing the accuracy of a diagnostic test is to estimate the ROC curve and its AUC.

Suppose that there are two groups of study subjects: diseased and nondiseased. Let be a continuous biomarker. Assume that the larger is, the more likely a subject is diseased. That is, a subject is classified as positive or diseased status if and as negative or nondiseased status otherwise, where is a cutoff point. Let be the disease status: represents diseased population while represents nondiseased population. Sensitivity of is defined as the probability of being correctly classified as disease status and specificity as the probability of being correctly classified as nondisease status. That is, Let and be the biomarkers for diseased and nondiseased subjects with survival functions and , respectively. Then The ROC function (curve) is defined as Notice that ROC curve is a monotonic increasing curve in a unit square starting at and ending at . A good biomarker has a very concave down ROC curve.

We say a bivariate random vector is a Bi-model if and are independent and are from the same distribution family but with different parameters. Pepe [3] summarizes ROC analysis and notices that among Bi-model, Bi-normal model has been the most popular one. The ROC curve method traces back to Green and Swets [4] in the signal detection theory. Bi-normal model assumes the existence of a monotonic increasing transformation such that the biomarker can be transformed into normally distributed for both diseased and nondiseased patients. Pepe [3, Results 4.1 and 4.4] shows that the ROC curve of a biomarker is invariant under a monotone increasing transformation. Hence, a good estimator of ROC curve should satisfy this invariance property.

The literature for ROC analysis under Bi-normal model is extensive and the majority of the results are for a continuous biomarker. For a continuous biomarker, Fushing and Turnbull [5] obtained the estimators and their asymptotic properties of the estimated ROC curve and AUC using the empirical based nonparametric and semiparametric approaches for Bi-normal model. Metz et al. [6] studied properties of maximum likelihood estimator via discretization over finite support points. Zou and Hall [7] developed an MLE algorithm to estimate ROC curve under an unspecific or a specific monotonic transformation. Gu et al. [8] introduced a nonparametric method based on the Bayesian bootstrap technique to estimate ROC curve and compared the new method with several different semiparametric and nonparametric methods. Pepe and Cai [9] considered the ROC curve as the distribution of placement value and then estimated the ROC curve by maximizing the pseudolikelihood function of the estimated placement values. All the above methods require complicated computations. Zhou and Lin [10] proposed a semi-parametric maximum likelihood (ML) estimator of ROC which satisfies the property of invariance. Their method largely reduced the number of nuisance parameters and therefore the computational complexity.

In reality, the distribution of the biomarker may be skewed and the normality assumption is not reasonable. Under this situation, other models such as generalized exponential (GE), Weibull (WE), and gamma, to name a few, provide a reasonable alternative; see Faraggi et al. [11]. Another issue is that in the presence of two different biomarkers, the one with uniformly higher ROC curve is better. However, when two ROC curves cross, we cannot simply compare and find which one is better. Many different numerical indices have been proposed to summarize and compare ROC curves of different biomarkers [1215]. The most popular summary index is AUC, defined as

The estimation of is also of great importance in engineering reliability, typically, in stress-strength model. For example, if is the strength of a system which is subject to a stress , then is a measure of a system performance. The system fails if and only if at any time the applied stress is greater than its strength. During the past twenty years, many results have been obtained on the estimation of the probability under the different parametric models. Kotz et al. [16] give a comprehensive summarization about stress-strength model.

Many research results have been obtained for the estimation of AUC under popular survival models: Bi-exponential model [17], Bi-normal model [5, 18], Bi-gamma model [19], Bi-Burr type X [2023], Bi-generalized exponential [24], Bi-Weibull model [25]. However, all the popular survival models such as generalized exponential (GE), Weibull (WE), and Gamma do not allow nonmonotonic failure rates which often occur in real practice. Furthermore, the common nonmonotonic failure rate in engineering and biological science involves bathtub shapes.

To overcome the aforementioned drawback, many models such as generalized gamma [26], generalized distribution [27], two families [28], a four-parameter family [29], and a three-parameter (IDB) family [30] have been proposed for modeling nonmonotone failure-rate data. Mudholkar and Srivasta [31] proposed an exponentiated Weibull (EW) model, an extension of both GE and WE models. Mudholkar and Hutson [32] studied some properties for the model. EW model not only includes distributions with unimodal and bathtub failure rates but also allows for a broader class of monotonic hazard rates. EW model may fit better than GE for some actual data. For instance, Khedhairi et al. [33] conclude that generalized Rayleigh model (a special case of EW model) is the best among the three models compared: exponential, generalized exponential, and generalized Rayleigh model.

In the research of estimation of under the different survival models, the most recent result in Baklizi [34] focuses the two-parameter Weibull model, which is a special case of our Bi-EW model. Baklizi assumes a common shape parameter and different scale parameter which is also studied in Kundu and Gupta [25]. In contrast to their studies, here we study the estimation of under the Bi-EW model which shares one shape parameter only. Since EW includes Weibull as a submodel, our results extend the ones in Baklizi [34] and Kundu and Gupta [25], as shown in Theorem 1. On the other hand, Baklizi [34] also studied Bayes estimator, which we have not studied. It will be a future project.

In this paper, we adopt AUC as a main summary index of ROC when comparing biomarkers under a Bi-Exponentiated Weibull (Bi-EW) model. We obtain a precise formula for the AUC and derive the MLE of AUC and its asymptotic distribution. The remaining of this paper is organized as follows. Section 2 introduces the Bi-EW model and obtains the theoretical AUC formula. Section 3 studies the maximum likelihood estimation of AUC under the Bi-EW model and derives the asymptotic normality of the estimated AUC. Section 4 reports the simulation results to compare the accuracy of the estimated parameters and estimated AUC for different sample sizes and various parameter settings in terms of absolute relative biases (ARB) and square root of mean square error (RMSE). The conclusion is given in Section 5.

2. A Bi-EW Model and Its AUC

A random variable has an Exponentiated Weibull (EW) distribution if its distribution function is defined as Denote . Mudholkar and Srivasta [31] and Qian [35] show that this distribution allows both monotonic and bathtub shaped hazard rates. The latter shape is useful in reality. In this section, we first derive a general moment formula for the distribution and then obtain AUC formula for a Bi-EW model with a common shape parameter.

2.1. General Moment Formula of Exponentiated Weibull

To fully understand a new family of distributions, it is important to derive its summary properties such as moment formulas.

Theorem 1. Let . One has where and .

Proof. By the definition of the EW model, we have
By change of variables , we have Then the integral part in (7) is equal to Therefore, we have

Remark 2. Theorem 1 covers the following results as special cases.
Case 1. If , and  , then it reduces to the formula in Choudlhury [36], the simple moments of EW family. One has Moreover, if is positive integer, then where .
Case 2. If , and  , it reducces to the moments of Generalized exponential family in Kunda and Gupta [24].
Case 3. If , and , it reduces to the moments of Weibull family in Kunda and Gupta [25].

2.2. AUC under a Bi-EW Model

In this subsection, we derive the general formula of AUC under a Bi-EW model with a common shape parameter. That is, we assume that and are independent and with the following EW distribution functions: where the two distribution functions share a common shape parameter .

Theorem 3. Suppose that follows the Bi-EW model with the above distribution functions. Then

Proof. By Taylor expansion, we have Thus,

Remark 4. (i) Note that the exact expression of AUC is independent of the common parameter . This is similar to the cases in Bi-WE and Bi-GE models; see Kundu and Gupta [24, 25].
(ii) When , (14) reduces to the AUC for a Bi-GE model.
(iii) When , (14) reduces to the AUC for a Bi-WE model. To see (iii), we actually prove the following corollary.

Corollary 5. For and , one has

Proof. Denote Then, on one hand we have On the other hand, we have Let ; then (19) implies that while (20) implies that This completes the proof of the corollary.

3. Maximum Likelihood Estimator of AUC and Its Asymptotic Property under the Bi-EW Model

To estimate AUC under the Bi-EW model, we adopt the plug in method. That is, we first obtain the maximum likelihood estimator of the model parameters and then plug it into (14) to get the MLE of AUC.

3.1. Maximum Likelihood Estimator of AUC under the Bi-EW Model

Let and be independent random samples from and , respectively. Denote the model parameter vector by . Then the log-likelihood function is We take derivatives with respect to the parameters to get the following score equations: The MLE of is the numerical solution of the above score equations. Plugging into (14), we obtain the maximum likelihood estimator of AUC as below:

3.2. Asymptotic Normality of the Estimated AUC

In this section, we obtain the asymptotic distribution of and hence, by the continuous mapping theorem, the asymptotic distribution of the estimated AUC. We need to introduce some notation. Let Then

We denote . Notice that EW family satisfies all the regularity conditions almost everywhere. Now we are ready to state the following theorem.

Theorem 6. Suppose follows the Bi-EW model with the model parameter . If and , but , for , then one has the following. (a) The MLE of is asympotic normal: (b) Part (a), the continuous mapping theorem, and Delta method imply that where

4. A Simulation Study

In this section we present some results based on Monte Carlo simulation to check the performance of the MLE of parameters and AUC. We consider small to moderate sample sizes: , and . We set two scale parameters: . The common shape parameter is . The other shape parameters are and , respectively. The simulation is based on replicates.

Figure 1 illustrates the histograms of the estimated AUC under the sample sizes when the true parameters are . Obviously, as the sample size increases, the histogram tends to be more symmetric bell shaped. Therefore, the MLE of AUC performs well for the moderate sample size.

We report the absolute relative biases (ARB) and the square root of mean squared errors (RMSEs) for the estimators of and AUC defined as below: where is the estimates of for the th replicate.

Recall that The estimator and true value can be computed by Riemann sums respectively. Here, we choose and . The interval is divided evenly with each subinterval length and takes the value of the right end of each subinterval. We have verified that the summation keeps stable for any larger value and smaller value .

Similarly, where is the estimate of AUC for the th replicate.

Tables 1 and 2 report the ARB and the RMSE for the estimators of , and the AUC. Table 1 reports the sensitivity analysis against and sample sizes. In all cases, the two scale parameters have much smaller ARB and RMSE compared with shape parameters. The MLE of AUC behaves well. For two different shape parameters, it is always the common parameter which behaves better than the other two shape parameters and as expected. For two shape parameters and , when always behaves better than . Furthermore, as expected, the ARB and the RMSE decrease as the sample size increases. Table 2 reports the sensitivity analysis against the common shape parameter . The results show the robustness of the MLEs for small sample sizes.

Our second simulations are to check the performance of MLE as the common shape parameter increases. We choose the following sample sizes: , . The two scale parameters are still set to 1, and ,   and   takes three values: 2, 2.5 and 3. The results are reported in Table 2. Once again, one notices that AUC has very small ARB and RMSE. Under all settings, has larger ARB and RMSE than that of when . When increases, we do not find a noticeable change of ARB and RMSE of all parameters.

5. Conclusions

In summary, this paper considers an Exponentiated Weibull model with two shape parameters and one scale parameter. Firstly, we derive a general moment formula which extends many results in the literature. Secondly, we obtain the theoretical AUC formula for a Bi-Exponentiated Weibull model which shares a common shape parameter between the two groups. We have noticed that the formula of AUC is independent of the common shape parameter, which is consistent with existing known results for simpler models such as Bi-generalized exponential and Bi-Weibull. Thirdly, we derive the maximum likelihood estimator of AUC for the Bi-EW model with a common shape parameter and show the asymptotic normality of the estimators. Finally, we conduct a simulation study to illustrate the performance of our estimator for small to moderate sample sizes. The simulation results show that the MLE of AUC can behave very well under moderate sample sizes although the performance of the MLE of some shape parameters is somewhat unsatisfactory but still acceptable.