Model Selection Approaches for Predicting Future Order Statistics from Type II Censored Data

Chiang, Jyun-You; Wang, Shuai; Tsai, Tzong-Ru; Li, Ting

doi:https://doi.org/10.1155/2018/3465909

Mathematical Problems in Engineering

Research Article | Open Access

Volume 2018 | Article ID 3465909 | https://doi.org/10.1155/2018/3465909

Academic Editor: Mohammed Nouari

Received 07 Apr 2018

Revised 04 Jul 2018

Accepted 14 Aug 2018

Published 08 Oct 2018

Abstract

This paper studies a discrimination problem for the location-scale family when predicting from type II censored samples. Three model selection approaches and two types of predictors are proposed to predict future order statistics from censored data when several candidate distributions compete and the best underlying distribution is unclear. Two members of the location-scale family, the normal distribution and the smallest extreme value distribution, are used as candidates to illustrate the model competition under the proposed prediction methods. The performance of correct and incorrect selections under correct specification and misspecification is evaluated using Monte Carlo simulations. Simulation results show that model misspecification affects the prediction precision and that the three proposed model selection approaches perform well when more than one candidate distribution competes for the best underlying distribution. Finally, the proposed approaches are applied to three data sets.

1. Introduction

To save testing time and sample resources, censoring schemes are often used in life tests. Type I and type II censoring are two popular schemes, based on censoring by test time and by number of failures, respectively. Many studies evaluate the reliability of lifetime components using type I or type II censored tests; see, for example, [1–6].

In this study, we restrict our attention to the type II censoring scheme and to predicting the censored observations for reliability evaluation when a discrimination problem is present. Under type II censoring, $n$ identical components are placed on test simultaneously, and the experiment is terminated when the $r$th component fails, so the last $n-r$ components are censored. In many engineering applications, statistical methods cannot be applied directly to censored data. For example, most experimental design methods, such as factorial and fractional factorial designs, cannot be implemented with censored data. In such situations, a reliable procedure for predicting the censored or unobserved observations is required. Moreover, if the unobserved observations can be predicted and the censored data set transformed into a complete data set, parameter estimation becomes easier, especially in cases where the parameter estimators have no analytic solution. Predicting the life length of the $s$th item is equivalent to predicting the life length of an $(n-s+1)$-out-of-$n$ system made up of identical components with independent life lengths. When $s = n$, this is better known as the parallel system. Various methods have been developed to predict censored data. Kaminsky and Nelson [7] provided interval and point prediction of order statistics. Fertig et al. [8] provided Monte Carlo estimates of the distribution percentiles to construct prediction intervals for samples from a Weibull or smallest extreme value distribution (SEV). Kaminsky and Rhodin [9] provided the maximum likelihood predictor (MLP) to predict future order statistics and then estimate the unknown parameters. Wu et al. [10] proposed five new pivotal quantities to obtain prediction intervals of future order statistics from the Pareto distribution. Kundu and Raqab [11] described Bayesian inference and prediction for the two-parameter Weibull distribution. Panahi and Sayyareh [12] proposed parameter estimation and prediction of order statistics for the Burr type XII distribution. Some of these predictors are complex or require constructing complex statistical models, so the existing methods are not easy to apply.
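As a minimal sketch of the type II censoring scheme described above, the following R code sorts a simulated sample and keeps only the smallest failure times. The helper name `type2_censor` and the simulated normal data are illustrative assumptions and are not part of the original study.

```r
# Hypothetical helper: type II censoring keeps the r smallest of n lifetimes.
type2_censor <- function(x, r) {
  stopifnot(r >= 1, r <= length(x))
  sorted <- sort(x)
  list(observed = sorted[seq_len(r)],   # x_(1) <= ... <= x_(r)
       censored = sorted[-seq_len(r)],  # the n - r values that are not observed
       n = length(x),
       r = r)
}

set.seed(1)
full_sample <- rnorm(13)                # assumed complete sample, n = 13
cens <- type2_censor(full_sample, r = 10)
```

In a real life test only `cens$observed` would be available; `cens$censored` holds the order statistics that the prediction methods aim to recover.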

To address this problem, Raqab [13] modified the MLP method and proposed four modified MLPs (MMLPs) to predict future order statistics from the normal distribution (ND). To simplify the estimating equations, Raqab considered four types of modification that approximate the hazard rate and extended hazard rate functions of an ND with unknown mean and known standard deviation. Yang and Tong [14] used the MMLP method to predict type II censored data from factorial experiments and derived simple explicit solutions for the parameters of an ND with unknown mean and unknown standard deviation. Chiang [15] used three other MMLP procedures to predict type II censored data under the Weibull distribution; in these procedures, it is difficult to find a unique root of the estimating equations. Simple explicit parameter estimates under the MMLP method are available only for the ND. For other commonly used distributions, the MMLP likelihood equations may be nonlinear and may not admit explicit solutions, so the MMLP method loses its advantage.

Another important problem in life testing experiments is model selection based on the available sample. In practice, many statistical distributions are much alike, especially under censoring, and the underlying distribution of a product quality characteristic is usually unknown. Several distributions may fit the data well, yet their predictions may differ significantly. Correctly identifying the underlying distribution is therefore an important issue that has long been studied. Dumonceaux and Antle [16] applied the ratio of maximized likelihood (RML) to discriminate between the lognormal and Weibull distributions. Kundu and Manglick [17] proposed statistical methods to discriminate between the lognormal and gamma distributions. Kundu and Raqab [18] proposed a selection procedure to discriminate between the generalized Rayleigh and lognormal distributions. Yu [19] provided a misspecification analysis method to discriminate between the ND and SEV for the design of experiments. Dey and Kundu [20] studied the discrimination problem between the lognormal and log-logistic distributions. Elsherpieny et al. [21] considered the discrimination problem between the Weibull and log-logistic distributions. Ashour and Hashish [22] provided a numerical comparison of the RML-procedure, S-procedure, and F-procedure in failure model discrimination. Pakyari [23] presented diagnostic tools based on the likelihood ratio test and the minimum Kolmogorov distance method to discriminate between the generalized exponential, geometric extreme exponential, and Weibull distributions. Elsherpieny et al. [24] provided a method to discriminate between the gamma and log-logistic distributions based on progressive type II censored data. Although the inference methods in these studies are valuable, the impact of model misspecification on predicting future order statistics has not been well studied.

Among model discrimination problems, discrimination within the location-scale family of distributions is particularly important and has received much attention, because the theory and inferential procedures for this family are well developed. The main purpose of this paper is to provide satisfactory estimators of the parameters and predictors of future order statistics for type II censored lifetime data when the underlying distribution is unknown but is a member of the location-scale family. The major contributions of this study to censored data prediction are presented in Figure 1.

The rest of this paper is organized as follows. Section 2 presents materials and methods: statistical methods for obtaining approximate predictors of type II right-censored variables are studied, and two prediction methods based on the AMLEs are proposed. The ND and SEV are considered as the candidate distributions competing for the best distribution when obtaining the predictors. Section 3 provides three algorithms that implement the three proposed model selection approaches for handling the discrimination problem. An intensive simulation study is conducted in Section 4 to evaluate the performance of the proposed approaches. Three examples in Section 5 demonstrate applications of the proposed methodologies. Concluding remarks are provided in Section 6.

2. Methods for Approximate Predictors

2.1. Approximate Maximum Likelihood Estimation

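Let $X_1, \dots, X_n$ denote the failure times of $n$ items, each following a member of the location-scale family with probability density function (PDF) and cumulative distribution function (CDF)

$$f(x;\mu,\sigma) = \frac{1}{\sigma}\, g\!\left(\frac{x-\mu}{\sigma}\right) \tag{1}$$

and

$$F(x;\mu,\sigma) = G\!\left(\frac{x-\mu}{\sigma}\right), \tag{2}$$

respectively, where $\mu$ is the location parameter, $\sigma$ is the scale parameter, and $g$ and $G$ are the PDF and CDF of the standard member of the location-scale family. Denote the sample size by $n$ and the type II censored sample with $r$ failures by $\mathbf{x} = (x_{1:n}, \dots, x_{r:n})$, the realizations of $X_{1:n} \le \dots \le X_{r:n}$, where $r < n$. Our goal is to predict $X_{s:n}$ for $s = r+1, \dots, n$. To simplify the notation, let $X_s = X_{s:n}$ and $x_i = x_{i:n}$ from here on. Kaminsky and Rhodin [9] considered the prediction of $X_s$ having observed $\mathbf{x}$. The predictive likelihood function (PLF) of $X_s$, $\mu$, and $\sigma$ is

$$L(X_s,\mu,\sigma;\mathbf{x}) = \frac{n!}{(s-r-1)!\,(n-s)!} \prod_{i=1}^{r} f(x_i)\,\bigl[F(X_s)-F(x_r)\bigr]^{s-r-1} f(X_s)\,\bigl[1-F(X_s)\bigr]^{n-s}. \tag{3}$$

Note that the capital $X_s$ in (3) is unknown and is predicted from the sample $\mathbf{x}$. Following Raqab [13], the PLF in (3) can be written as the product of two likelihood functions: the PLF of $\mu$ and $\sigma$, denoted by $L_1$, and the PLF of $X_s$, denoted by $L_2$. They are given, respectively, by

$$L_1(\mu,\sigma;\mathbf{x}) = \frac{n!}{(n-r)!} \prod_{i=1}^{r} f(x_i)\,\bigl[1-F(x_r)\bigr]^{n-r} \tag{4}$$

and

$$L_2(X_s;\mu,\sigma,\mathbf{x}) = \frac{(n-r)!}{(s-r-1)!\,(n-s)!}\, \frac{\bigl[F(X_s)-F(x_r)\bigr]^{s-r-1}\bigl[1-F(X_s)\bigr]^{n-s} f(X_s)}{\bigl[1-F(x_r)\bigr]^{n-r}}. \tag{5}$$

In practice, the MLEs of $\mu$ and $\sigma$, denoted by $\hat\mu$ and $\hat\sigma$, are obtained by maximizing $L_1$ in (4); they then replace $\mu$ and $\sigma$ as plug-in parameters in (5) to predict $X_s$. Let $z_i = (x_i-\mu)/\sigma$ for $i = 1, \dots, r$ and $Z_s = (X_s-\mu)/\sigma$ for $s = r+1, \dots, n$. Then (4) and (5) can be rewritten as

$$L_1 = c_1\, \sigma^{-r} \prod_{i=1}^{r} g(z_i)\,\bigl[1-G(z_r)\bigr]^{n-r} \tag{6}$$

and

$$L_2 = c_2\, \sigma^{-1}\, \frac{\bigl[G(Z_s)-G(z_r)\bigr]^{s-r-1}\bigl[1-G(Z_s)\bigr]^{n-s} g(Z_s)}{\bigl[1-G(z_r)\bigr]^{n-r}}, \tag{7}$$

where $c_1 = n!/(n-r)!$ and $c_2 = (n-r)!/[(s-r-1)!\,(n-s)!]$. After straightforward computations, the MLEs of $\mu$, $\sigma$, and $X_s$ can be obtained as the solutions of

$$\sum_{i=1}^{r} \frac{g'(z_i)}{g(z_i)} - (n-r)\,h_1(z_r) = 0, \tag{8}$$

$$r + \sum_{i=1}^{r} z_i\,\frac{g'(z_i)}{g(z_i)} - (n-r)\,z_r\,h_1(z_r) = 0, \tag{9}$$

and

$$(s-r-1)\,h_2(z_r,Z_s) - (n-s)\,h_1(Z_s) + \frac{g'(Z_s)}{g(Z_s)} = 0, \tag{10}$$

where $h_1(z) = g(z)/[1-G(z)]$ is the hazard rate function and $h_2(z_r,Z_s) = g(Z_s)/[G(Z_s)-G(z_r)]$ is the extended hazard rate function. Because $\hat\mu$ and $\hat\sigma$ have no closed-form expression, a numerical method such as the Newton-Raphson method is needed to solve (8) and (9). To obtain good starting values for the numerical method, we use the approximate MLEs (AMLEs) of $\mu$ and $\sigma$ from Hossain and Willan [25].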
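The first estimation step, maximizing the likelihood of the location and scale parameters from a type II censored sample, can be sketched in R as follows. This is an illustration under stated assumptions rather than the authors' implementation: the function name `loglik_type2`, the simulated normal data, the log-scale parameterization of the scale parameter, and the use of `optim` with its default Nelder-Mead method (instead of Newton-Raphson) are all choices made here for simplicity.

```r
# Censored log-likelihood, up to an additive constant:
#   sum(log f(x_i)) + (n - r) * log(1 - F(x_r))
# par = c(mu, log(sigma)); working with log(sigma) keeps sigma positive.
loglik_type2 <- function(par, x_obs, n, dfun = dnorm, pfun = pnorm) {
  mu <- par[1]
  sigma <- exp(par[2])
  r <- length(x_obs)
  sum(log(dfun(x_obs, mu, sigma))) +
    (n - r) * log(1 - pfun(max(x_obs), mu, sigma))
}

set.seed(2)
n <- 13
r <- 10
x_obs <- sort(rnorm(n, mean = 1, sd = 2))[1:r]  # simulated censored sample

start <- c(mean(x_obs), log(sd(x_obs)))         # crude starting values
fit <- optim(start, function(p) -loglik_type2(p, x_obs, n))
mu_hat <- fit$par[1]
sigma_hat <- exp(fit$par[2])
```

Passing other density and distribution functions through `dfun` and `pfun` gives the same fit for another member of the location-scale family.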

2.2. Approximate Maximum Likelihood Predictors

Once the MLEs $\hat\mu$ and $\hat\sigma$ are available, $X_s$ can be predicted by two approximation methods: the expected value prediction method and the Taylor series prediction method. We refer to the resulting predictors of $X_s$ as the expected value predictor and the Taylor series predictor, respectively. The two methods differ mainly in how they approximate the hazard rate term $h_1$ and the extended hazard rate term $h_2$ in (10). Mehrotra and Nanda [26] proposed approximate maximum likelihood estimators for the ND and gamma distribution by replacing such terms with their respective expected values, and compared the efficiencies of the resulting estimators with those of the best linear unbiased estimators for these distributions. Balakrishnan and Cohen [27] used Taylor series expansions of such terms at the points $G^{-1}(p_i)$, where $p_i = i/(n+1)$ for $i = 1, \dots, n$, to obtain modified MLEs of the parameters of the ND and Rayleigh distribution. The main point of their approach is that the likelihood equations involve complicated terms, so an explicit form for the MLE is not available. We follow their ideas to find an explicit form for the predictor of $X_s$.

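Under the expected value prediction method, $(\mu, \sigma)$ is replaced with $(\hat\mu, \hat\sigma)$, and the hazard rate term $h_1$ and extended hazard rate term $h_2$ are replaced by their respective expected values in (10). According to Raqab [13], these expected values are given by (15) and (16), respectively.

Under the Taylor series prediction method, $(\mu, \sigma)$ is replaced with $(\hat\mu, \hat\sigma)$, and $h_1$ and $h_2$ are replaced in (10) by their Taylor series approximations at the points $G^{-1}(p_s)$ and $(G^{-1}(p_r), G^{-1}(p_s))$, respectively, where $p_i = i/(n+1)$. In this study, the expected value predictor and the Taylor series predictor of $X_s$ are indexed by the candidate distribution under which they are computed.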

The location-scale family contains many common distributions; widely used members include the ND, SEV, and logistic distribution. It is impractical to list the inference formulas for predicting $X_s$ under every widely used member of the family. In this study, we use the ND and SEV as candidates to illustrate the proposed methods, although the suggested algorithms also apply when more than two candidate members are considered. The ND and SEV are chosen because the lognormal and Weibull distributions are two widely used distributions in life testing applications, and a log-transformation turns lognormal data into ND data and Weibull data into SEV data.
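The log-transformation link between these distributions can be checked numerically. The short R sketch below compares CDF values on a few arbitrary time points; the parameter values and the helper name `psev` are assumptions chosen only for illustration. For a Weibull distribution with given shape and scale, the logarithm of the lifetime follows an SEV with location log(scale) and scale 1/shape.

```r
# CDF of the smallest extreme value distribution (location mu, scale sigma).
psev <- function(x, mu, sigma) 1 - exp(-exp((x - mu) / sigma))

t_grid <- c(0.5, 1, 2, 5)   # arbitrary lifetimes
shape <- 2                  # assumed Weibull shape
scale <- 3                  # assumed Weibull scale

# Weibull lifetime <-> SEV log-lifetime
weibull_ok <- isTRUE(all.equal(
  pweibull(t_grid, shape = shape, scale = scale),
  psev(log(t_grid), mu = log(scale), sigma = 1 / shape)))

# Lognormal lifetime <-> normal log-lifetime
lognormal_ok <- isTRUE(all.equal(
  plnorm(t_grid, meanlog = 1, sdlog = 0.5),
  pnorm(log(t_grid), mean = 1, sd = 0.5)))
```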

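If the underlying distribution is normal, its PDF is given by

$$f(x;\mu,\sigma) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\!\left[-\frac{(x-\mu)^2}{2\sigma^2}\right], \quad -\infty < x < \infty. \tag{17}$$

The MLEs of the normal parameters are denoted by $\hat\mu_N$ and $\hat\sigma_N$. Replacing $\mu$ and $\sigma$ with $\hat\mu_N$ and $\hat\sigma_N$ in (6) expresses (6) in terms of $\Phi$, the CDF of the standard ND. According to (15) and (16), the hazard rate and extended hazard rate terms in (10) can be replaced with their respective expected values, and (10) can be rewritten accordingly. The required expected values of normal order statistics are available and have been tabulated by Teichroew [28]. Hence, the expected value predictor of $X_s$ for the ND can be derived in closed form as (20). Because $\hat X_s \ge x_r$ is a necessary condition, we modify (20) by taking the larger of the predictor and $x_r$, and use the modified predictor in (21) to predict $X_s$ for $s = r+1, \dots, n$.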

Under the Taylor series prediction method, the hazard rate and extended hazard rate terms are expanded in Taylor series around the points $\Phi^{-1}(p_s)$ and $(\Phi^{-1}(p_r), \Phi^{-1}(p_s))$, respectively. Following Raqab [13], both terms are approximated by their first-order expansions, whose constants are given in Appendix A. Substituting these approximations into (10) yields an explicit Taylor series predictor of $X_s$.

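If the underlying distribution is SEV, its PDF is given by

$$f(x;\mu,\sigma) = \frac{1}{\sigma} \exp\!\left[\frac{x-\mu}{\sigma} - \exp\!\left(\frac{x-\mu}{\sigma}\right)\right], \quad -\infty < x < \infty.$$

Under the expected value prediction method, the MLEs of $\mu$ and $\sigma$ obtained from (8) and (9) are denoted by $\hat\mu_E$ and $\hat\sigma_E$, respectively. Replacing $\mu$ and $\sigma$ with $\hat\mu_E$ and $\hat\sigma_E$ in (6) expresses (6) in terms of the CDF of the standard SEV. The hazard rate and extended hazard rate terms are then replaced with their respective expected values in (10), and the expected value predictor of $X_s$ is obtained from the rewritten equation for $s = r+1, \dots, n$.

Under the Taylor series prediction method, the hazard rate and extended hazard rate terms are expanded in Taylor series at the points $G^{-1}(p_s)$ and $(G^{-1}(p_r), G^{-1}(p_s))$, respectively, where $G$ is the CDF of the standard SEV. The constants of the expansions are given in Appendix B. Substituting the expansions into (10) yields the Taylor series predictor of $X_s$ for $s = r+1, \dots, n$.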

3. Three Model Selection Approaches

When several candidate distributions compete for the best underlying distribution and the user cannot identify the best one, we suggest three approaches to discriminate among the candidates and obtain the predictor of $X_s$: the ratio of the maximized likelihood (RRML) approach, the modified $D_{sp}$ approach (the $D_{sp}$ approach for short), and the modified $D$ approach (the $D$ approach for short). The $D_{sp}$ and $D$ approaches are based on goodness-of-fit test methods. All three approaches can be implemented using Algorithms 1–3.

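Algorithm 1 (the RRML approach).
Step 1. Collect a type II censored sample of size $n$ with $r$ observed failure times, and specify the candidate distributions.
Step 2. For each candidate distribution, obtain the MLEs $(\hat\mu, \hat\sigma)$ and the maximized likelihood. Obtain the predictor of $X_s$ under each candidate distribution for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3. Based on the method proposed by Dumonceaux and Antle [16], take as the predicted value of $X_s$ the predictor obtained under the candidate distribution with the largest maximized likelihood.
If the candidate distributions are the ND and SEV, Steps 2 and 3 in Algorithm 1 reduce to Steps 2’ and 3’, respectively:
Step 2’. Obtain $(\hat\mu_N, \hat\sigma_N)$, $(\hat\mu_E, \hat\sigma_E)$, and the ratio of the maximized likelihoods under the ND and SEV. Obtain the predictor of $X_s$ under the ND and under the SEV for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3’. Take the ND predictor as the predicted value of $X_s$ if the maximized likelihood under the ND exceeds that under the SEV; otherwise take the SEV predictor.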
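The selection rule of Algorithm 1 for the ND and SEV can be sketched as follows, using the log failure times of Example 1. The function names `dsev`, `psev`, and `max_loglik` are hypothetical, and `optim` with crude starting values stands in for the Newton-Raphson step with AMLE starting values described in the text. The combinatorial constant of the likelihood is omitted because it is the same for both candidates.

```r
# Smallest extreme value density and CDF (location mu, scale sigma).
dsev <- function(x, mu, sigma) exp((x - mu) / sigma - exp((x - mu) / sigma)) / sigma
psev <- function(x, mu, sigma) 1 - exp(-exp((x - mu) / sigma))

# Maximized censored log-likelihood for one candidate distribution.
max_loglik <- function(x_obs, n, dfun, pfun) {
  r <- length(x_obs)
  negll <- function(p) {
    s <- exp(p[2])                        # p[2] is log(sigma)
    -(sum(log(dfun(x_obs, p[1], s))) +
        (n - r) * log(1 - pfun(max(x_obs), p[1], s)))
  }
  -optim(c(mean(x_obs), log(sd(x_obs))), negll)$value
}

# Log failure times of the 10 observed failures in Example 1 (n = 13).
x_obs <- log(c(0.22, 0.50, 0.88, 1.00, 1.32, 1.33, 1.54, 1.76, 2.50, 3.00))
n <- 13

ll <- c(ND  = max_loglik(x_obs, n, dnorm, pnorm),
        SEV = max_loglik(x_obs, n, dsev, psev))
selected <- names(ll)[which.max(ll)]      # candidate with the larger likelihood
```

The predictor of the censored order statistics would then be computed under the distribution named in `selected`.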

Algorithm 2 (the $D_{sp}$ approach).
Step 1. Collect a type II censored sample of size $n$ with $r$ observed failure times.
Step 2. Obtain the MLEs $(\hat\mu, \hat\sigma)$ for each candidate distribution, and then obtain the predictor of $X_s$ under each candidate distribution for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3. Based on the method proposed by Castro-Kuriss et al. [29], compute the modified $D_{sp}$ statistic for censored observations. Here $F$ is defined as in (2) and represents the CDF of the assumed distribution in model selection. Evaluate $D_{sp}$ under each candidate distribution.
Step 4. Take as the predicted value of $X_s$ the predictor obtained under the candidate distribution with the smallest $D_{sp}$.
If the candidate distributions are the ND and SEV, Steps 2, 3, and 4 in Algorithm 2 reduce to Steps 2’, 3’, and 4’, respectively:
Step 2’. Obtain $(\hat\mu_N, \hat\sigma_N)$ and $(\hat\mu_E, \hat\sigma_E)$. Obtain the predictor of $X_s$ under the ND and under the SEV for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3’. Compute the modified $D_{sp}$ statistic for censored observations, with $F$ defined as in (2) as the CDF of the assumed distribution. Evaluate it under the ND and under the SEV.
Step 4’. Take the ND predictor as the predicted value of $X_s$ if $D_{sp}$ under the ND is smaller than under the SEV; otherwise take the SEV predictor.
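The goodness-of-fit statistic used in Algorithm 2 can be sketched as below. The formula mirrors the `Dsp1` and `Dsp2` lines of the Appendix C listing, which compare arcsine-transformed plotting positions with arcsine-transformed fitted CDF values over the observed failures. The function name `dsp_censored` and the toy inputs are assumptions made for illustration.

```r
# Statistic for a type II censored sample, following the Appendix C listing:
#   max over j = 1..r of (2/pi) * | asin(sqrt((j - 0.5)/n)) - asin(sqrt(F(x_j))) |
# x_obs: the r observed failure times; n: full sample size;
# cdf: fitted CDF of the candidate distribution (a function of x only).
dsp_censored <- function(x_obs, n, cdf) {
  x_sorted <- sort(x_obs)
  j <- seq_along(x_sorted)
  u <- cdf(x_sorted)
  max((2 / pi) * abs(asin(sqrt((j - 0.5) / n)) - asin(sqrt(u))))
}

# Toy check: observations placed exactly at the uniform plotting positions
# give a statistic of zero under the uniform CDF.
n <- 13
x_perfect <- (seq_len(10) - 0.5) / n
d_perfect <- dsp_censored(x_perfect, n, punif)

# A poorer fit gives a larger value.
d_shifted <- dsp_censored(x_perfect, n, function(x) punif(x, min = 0, max = 2))
```

With fitted ND and SEV parameters, the candidate with the smaller value would be selected.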

Algorithm 3 (the $D$ approach).
Step 1. Collect a type II censored sample of size $n$ with $r$ observed failure times.
Step 2. Obtain the MLEs $(\hat\mu, \hat\sigma)$ for each candidate distribution, and then obtain the predictor of $X_s$ under each candidate distribution for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3. Based on the method proposed by Castro-Kuriss et al. [29], compute the modified $D$ statistic for censored observations under each candidate distribution.
Step 4. Take as the predicted value of $X_s$ the predictor obtained under the candidate distribution with the smallest $D$.
If the candidate distributions are the ND and SEV, Steps 2, 3, and 4 in Algorithm 3 reduce to Steps 2’, 3’, and 4’, respectively:
Step 2’. Obtain $(\hat\mu_N, \hat\sigma_N)$ and $(\hat\mu_E, \hat\sigma_E)$. Obtain the predictor of $X_s$ under the ND and under the SEV for $s = r+1, \dots, n$, using either of the two prediction methods.
Step 3’. Compute the modified $D$ statistic for censored observations under the ND and under the SEV.
Step 4’. Take the ND predictor as the predicted value of $X_s$ if $D$ under the ND is smaller than under the SEV; otherwise take the SEV predictor.

4. Monte Carlo Simulations

A Monte Carlo simulation study was conducted in R to evaluate the performance of the three proposed approaches with the two prediction methods. We consider the ND and SEV as the candidate distributions competing for the best lifetime model. The type II censored samples used in the simulation were randomly generated from the ND and SEV with fixed location and scale parameters. The order statistic $X_s$ is then predicted for $s = r+1, \dots, n$ for several sample sizes, including $n = 60$. For comparison, the bias and mean square error (MSE) of the predictor $\hat X_s$ are evaluated using $N$ Monte Carlo runs:

$$\text{Bias} = \frac{1}{N} \sum_{i=1}^{N} \bigl(\hat X_s^{(i)} - X_s^{(i)}\bigr)$$

and

$$\text{MSE} = \frac{1}{N} \sum_{i=1}^{N} \bigl(\hat X_s^{(i)} - X_s^{(i)}\bigr)^2,$$

where $\hat X_s^{(i)}$ is the predicted value of $X_s$ obtained in the $i$th iteration of the simulation for $i = 1, \dots, N$. All simulation results are displayed in Tables 1 and 2 with the ND and SEV as candidate distributions. Tables 1 and 2 show that the bias and MSE are large when the misspecified model is used. The impact of misspecification depends on the values of $n$ and $r$: as $n$ or $r$ increases, the simulated bias and MSE decrease. We also find that the MSE of the Taylor series prediction method is smaller than that of the expected value prediction method when the sample size is 30 or larger.
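The bias and MSE computation can be sketched in R as follows. To keep the example self-contained, a deliberately naive placeholder predictor (the last observed failure time) is used instead of the predictors proposed in the paper; the function names `mc_bias_mse` and `naive_predictor`, the standard normal generator, and the values of n, r, s, and the number of runs are all assumptions made for illustration.

```r
# Monte Carlo estimate of the bias and MSE of a predictor of X_s.
# predict_fun(x_obs, n, s) must return a single predicted value of X_s.
mc_bias_mse <- function(predict_fun, n, r, s, n_runs = 1000, rgen = rnorm) {
  err <- replicate(n_runs, {
    x <- sort(rgen(n))                   # complete ordered sample
    x_obs <- x[seq_len(r)]               # type II censored part
    predict_fun(x_obs, n, s) - x[s]      # prediction error for X_s
  })
  c(bias = mean(err), mse = mean(err^2))
}

# Placeholder predictor: carry the last observed failure forward.
naive_predictor <- function(x_obs, n, s) max(x_obs)

set.seed(123)
res <- mc_bias_mse(naive_predictor, n = 20, r = 15, s = 18, n_runs = 500)
```

Because the placeholder never exceeds the true order statistic, its estimated bias is negative; substituting a predictor built from the fitted model shows how model choice changes both quantities.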

To evaluate the performance of the three proposed model selection approaches for the MLP, Tables 3–5 report the simulation results for the three approaches when the data come from the ND, and Tables 6–8 report the corresponding results when the data come from the SEV. The column “correct (%)” in Tables 3–8 is the rate of correct model selection over all simulation runs. Tables 3–8 show that the three model selection approaches identify the correct underlying distribution with high probability. Moreover, the MSEs of the three approaches are close to the simulated MSEs obtained when the true underlying distribution is used. Overall, the correct model selection rates of the $D_{sp}$ approach and the $D$ approach are higher than that of the RRML approach when the sample size is smaller than 30. When the sample size is 30 or larger, the performance of the RRML approach improves and its correct model selection rate is higher than those of the $D_{sp}$ and $D$ approaches. Comparing the two MLPs, the MSEs of the expected value prediction method are smaller than those of the Taylor series prediction method when the sample size is smaller than 30. The proposed approaches perform well for large sample sizes.

5. Illustrative Examples

In this section, three numerical examples are presented to illustrate the proposed approaches in Sections 2–4.

5.1. Example 1

Mann and Fertig [30] provided a failure time data set for airplane components in which 13 components were placed on test and the test was terminated at the time of the 10th failure. The failure times (in hours) of the 10 components that failed were: 0.22, 0.50, 0.88, 1.00, 1.32, 1.33, 1.54, 1.76, 2.50, 3.00.

Let $x_1, \dots, x_{10}$ be the logarithms of the ten observed failure times. Figure 2 presents the histogram and the estimated PDFs of the ND and SEV. From Figure 2, it is difficult to decide on the best distribution for lifetime fitting because both candidate distributions fit this data set well. In this example, we use the $D_{sp}$ approach to discriminate between the competing models and apply the Taylor series prediction method to predict the censored future order statistics. The R source code for Example 1 can be found in Appendix C, and code for other designs can be obtained from the authors upon request.

Using the Newton-Raphson algorithm, we obtained the MLEs of $\mu$ and $\sigma$ under the ND and under the SEV.

The $D_{sp}$ values under the ND and SEV are 0.223 and 0.212, respectively. Because the value obtained under the SEV is smaller than that obtained under the ND, we select the SEV as the best distribution for this data set. The Taylor series predictions of the censored order statistics $X_{11}$, $X_{12}$, and $X_{13}$ are then obtained under the SEV from the censored sample.

5.2. Example 2

In this example, the data on endurance tests of deep groove ball bearings, reported by Lieblein and Zelen [31] and further studied by Meeker and Escobar [32], are used to illustrate the methodologies developed in this paper. The data are the numbers of million revolutions before failure for each of the 23 ball bearings in the life test. Meeker and Escobar [32] pointed out that this data set follows a lognormal or Weibull distribution. Hence, the log-transformed data follow an ND or SEV. The data are as follows: 17.88, 28.92, 33.00, 41.52, 42.12, 45.60, 48.40, 51.84, 51.96, 54.12, 55.56, 67.80, 68.64, 68.64, 68.88, 84.12, 93.12, 98.64, 105.12, 105.84, 127.92, 128.04, 173.40.

For more information about this ball bearing data set, see Meeker and Escobar [32]. In this example, we assume that the censoring proportion is 0.8696 ($r = 20$, $n = 23$). Figure 3 presents the histogram and the estimated PDFs of the ND and SEV based on the type II right-censored data set. From Figure 3, it is difficult to decide which of the two candidate distributions is best.

In Example 2, we select the model with a goodness-of-fit statistic and use the expected value prediction method to predict the censored future order statistics. The MLEs of $\mu$ and $\sigma$ under the ND and under the SEV are obtained using the Newton-Raphson algorithm. The values of the selection statistic under the ND and SEV are 0.181 and 0.297, respectively. Because the value obtained under the ND is smaller than that obtained under the SEV, we select the ND as the best model and compute the expected value predictions of the censored order statistics under the ND. In addition, we compare our predictions with the MMLP values proposed by Yang and Tong [14]. Our predicted results are close to theirs, even though we do not initially assume which of the ND or SEV is the best distribution.

5.3. Example 3

In this example, the experiment on pull-off performance for use in automotive engine components, reported by Byrne and Taguchi [33] and further studied by Yang and Tong [14], is used to illustrate the methodologies developed in this study. The experiment was conducted to find a method to maximize the pull-off force. Four control factors that could influence the assembly’s pull-off force were identified. Each run was repeated 8 times and the pull-off force was recorded in pounds. Table 9 lists the four control factors with their levels and the complete data of this experiment. In this example, we assume that the censoring proportion is 0.75 ($r = 6$, $n = 8$). Note that experimental design methods cannot be applied directly to censored data, so the unobserved data must be predicted and a pseudo-complete data set used instead.

We use the RRML approach for model selection and the Taylor series prediction method to predict the future order statistics in this example. After combining the uncensored data and the predicted censored data, the pseudo-complete data are shown in Table 10.

6. Conclusions

It can be difficult to identify the best model among several candidate distributions. The sample size, estimation methods, and goodness-of-fit testing methods can all affect the final result of model selection. In this study, we focus on providing reliable methods for predicting censored data that reduce the impact of model misspecification. Three model selection approaches are proposed for predicting future order statistics from type II censored data in which the quality characteristic is assumed to follow a member of the location-scale family. The ND and SEV are considered as the candidate members of the location-scale family competing for the best underlying distribution. The ND arises as the log transformation of the lognormal distribution and the SEV as the log transformation of the Weibull distribution, so discriminating between the lognormal and Weibull distributions is equivalent to discriminating between the ND and SEV. Hence, both the ND and SEV are widely used in practical reliability applications.

With any one of the three proposed approaches, robust predictions can be obtained even under model uncertainty. Three examples are used to illustrate the methodologies. Moreover, the performance of the three proposed approaches is evaluated using Monte Carlo simulations. Numerical results show that the three proposed model selection approaches are robust and effective in obtaining good predicted values for the censored future order statistics.

Comparing the three proposed approaches, we recommend using the $D_{sp}$ approach or the $D$ approach for model selection together with the expected value prediction method for small samples, that is, samples of size less than 30. For large samples (sample size larger than 30), we recommend using the RRML approach for model selection together with the Taylor series prediction method. Simulation results show that the proposed approaches are robust and can substantially reduce the impact of model uncertainty. The proposed approaches also work well when more than two candidate distributions compete for the best distribution.

Model selection methods other than the three proposed approaches could also be competitive. How to employ new model selection methods for type II censored data prediction can be studied in the future.

Appendix

A.

For the normal distribution case, the functions and can be expanded by using Taylor series at the points and (), respectively. We obtainandin which the constants can be taken to bewhere , and for .

B.

For the smallest extreme value distribution case, the functions and can be expanded by using Taylor series at the points and (), respectively. We obtainandThe above constants can be taken to beandwhere .

C.

See Algorithm 1.

# Random generator, density, and CDF of the smallest extreme value distribution
rextreme=function(n,mu,sig) mu+sig*(log(-log(1-runif(n))))
dextreme=function(x,mu,sig) (1/sig)*exp((x-mu)/sig-exp((x-mu)/sig))
pextreme=function(x,mu,sig) 1-exp(-exp((x-mu)/sig))
n=13
r=10
data=log(c(0.22, 0.50, 0.88, 1.00, 1.32, 1.33, 1.54, 1.76, 2.50, 3.00))
data1=sort(data)[1:r]
Xr=data1[r]
# AMLE of normal (starting values for the MLE)
pr=r/(n+1)
qr=1-pr
invpr=qnorm(pr,0,1)
alpha=dnorm(invpr)*((1+invpr^2)*qr-invpr*dnorm(invpr))/(qr^2)
beta=dnorm(invpr)*(dnorm(invpr)-invpr*qr)/(qr^2)
A=sum(data1)+beta*(n-r)*Xr
M=r+beta*(n-r)
C=(n-r)*alpha
D=A*C/M-C*Xr
E=sum(data1^2)+(n-r)*beta*(Xr^2)-A^2/M
sigma_hat=(-D+(D^2+4*r*E)^(1/2))/(2*r)
u_hat=A/M+C*sigma_hat/M
## MLE of normal: minimize the negative censored likelihood
L2=function(x){
  u=x[1]
  sigma=x[2]
  -(prod(dnorm(data1,u,sigma))*(1-pnorm(Xr,u,sigma))^(n-r))
}
Ans1=optim(c(u_hat,sigma_hat),L2)
uN=Ans1$par[1]
sigmaN=Ans1$par[2]
## AMLE of extreme value (starting values for the MLE)
prE=r/(n+1)
qrE=1-prE
alphar=1+log(qrE)-log(qrE)*log(-log(qrE))
betar=-log(qrE)
SUMbeta=SUMbetaX=SUMalpha=SUMalphaX=SUMbetaX2=0
for(h in 1:r){
  # named p_i rather than pi so that the built-in constant pi used below is not masked
  p_i=h/(n+1)
  qi=1-p_i
  alphai=1+log(qi)-log(qi)*log(-log(qi))
  betai=-log(qi)
  SUMbeta=SUMbeta+betai
  SUMbetaX=SUMbetaX+betai*data1[h]
  SUMalpha=SUMalpha+alphai
  SUMalphaX=SUMalphaX+alphai*data1[h]
  SUMbetaX2=SUMbetaX2+betai*((data1[h])^2)
}
M=SUMbeta+betar*(n-r)
B=(SUMbetaX+(n-r)*betar*Xr)/M
C=(SUMalpha-(n-r)*(1-alphar))/M
D=-(n-r)*Xr+(n-r)*alphar*Xr+SUMalphaX-B*C*M
E=(n-r)*betar*(Xr^2)+SUMbetaX2-M*(B^2)
sigma_hat=(-D+(D^2+4*r*E)^(1/2))/(2*r)
u_hat=B-C*sigma_hat
## MLE of extreme value: minimize the negative censored likelihood
L9=function(x){
  u=x[1]
  sigma=x[2]
  -(prod(dextreme(data1,u,sigma))*(1-pextreme(Xr,u,sigma))^(n-r))
}
Ans9=optim(c(u_hat,sigma_hat),L9)
uE=Ans9$par[1]
sigmaE=Ans9$par[2]
## Model Selection Approaches
Dsp1=Dsp2=D1=D2=array()
for(j in 1:r){
  # maximized likelihoods under the ND (LN) and SEV (LE)
  L7=factorial(n)/(factorial(n-r))
  LN=L7*prod(dnorm(data1,uN,sigmaN))*(1-pnorm(Xr,uN,sigmaN))^(n-r)
  LE=L7*prod(dextreme(data1,uE,sigmaE))*(1-pextreme(Xr,uE,sigmaE))^(n-r)
  Dsp1[j]=(2/pi)*abs(asin(sqrt((j-0.5)/n))-asin(sqrt(pnorm(data1[j],uN,sigmaN))))
  Dsp2[j]=(2/pi)*abs(asin(sqrt((j-0.5)/n))-asin(sqrt(pextreme(data1[j],uE,sigmaE))))
  D1[j]=(2/pi)*abs(((j-0.5)/n)-pnorm(data1[j],uN,sigmaN))+0.5/n
  D2[j]=(2/pi)*abs(((j-0.5)/n)-pextreme(data1[j],uE,sigmaE))+0.5/n
}
DspN=max(Dsp1)
DspE=max(Dsp2)
#DN=max(D1)
#DE=max(D2)
## The Taylor series prediction under the extreme value distribution
AMLP_E=function(r,n){
  pr=r/(n+1)
  qr=1-pr
  Xs2=array()
  for(i in 1:(n-r)){
    s=r+i
    ps=s/(n+1)
    qs=1-ps
    alphas=1+log(qs)-log(qs)*log(-log(qs))
    betas=log(qs)
    gamma1=qs*log(qs)*((qr-qs)*(-1+(1+log(qs))*log(-log(qs)))+qs*log(qs)*log(-log(qs))-
      qr*log(qr)*log(-log(qr)))/((qr-qs)^2)
    rou1=qs*log(qs)*(-(1+log(qs))*(qr-qs)-qs*log(qs))/((qr-qs)^2)
    v1=qs*log(qs)*qr*log(qr)/((qr-qs)^2)
    A=s-r-1
    B=A*rou1+A*v1+betas+(n-s)*betas
    C=A*gamma1+alphas-(n-s)+(n-s)*alphas
    D=A*rou1+betas+(n-s)*betas
    Xs2[i]=-A*v1*data1[r]/D+uE*B/D-sigmaE*C/D
  }
  # a predicted value cannot fall below the last observed failure time
  Xs2[which(Xs2<=data1[r])]=data1[r]
  Xs2
}
AMLP_E(r,n)

Data Availability

The data used in the examples of this study are taken from the cited reference papers. Each example cites its source, and the cited papers are listed in the references.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

References

[1] W. Q. Meeker Jr., “A comparison of accelerated life test plans for Weibull and lognormal distributions and type-I censoring,” Technometrics, vol. 26, no. 2, pp. 157–171, 1984.
[2] Y. Dai, Y. F. Zhou, and Y. Z. Jia, “Distribution of time between failures of machining center based on type I censored data,” Reliability Engineering & System Safety, vol. 79, no. 3, pp. 377–379, 2003.
[3] R. Sundberg, “Comparison of confidence procedures for type I censored exponential lifetimes,” Lifetime Data Analysis. An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, vol. 7, no. 4, pp. 393–413, 2001.
[4] N. Ahmad and A. Islam, “Optimal accelerated life test designs for Burr type XII distributions under periodic inspection and type I censoring,” Naval Research Logistics (NRL), vol. 43, no. 8, pp. 1049–1077, 1996.
[5] G. K. Bhattacharyya, “The asymptotics of maximum likelihood and related estimators based on type II censored data,” Journal of the American Statistical Association, vol. 80, no. 390, pp. 398–404, 1985.
[6] T.-R. Tsai, J.-Y. Chiang, T. Liang, and M.-C. Yang, “Efficient Bayesian sampling plans for exponential distributions with type-I-censored samples,” Journal of Statistical Computation and Simulation, vol. 84, no. 5, pp. 964–981, 2014.
[7] K. S. Kaminsky and P. I. Nelson, “Prediction of order statistics,” in Handbook of Statistics 17, Order Statistics: Applications, N. Balakrishnan and C. R. Rao, Eds., pp. 431–450, New York, NY, USA, 1998.
[8] K. W. Fertig, M. E. Meyer, and N. R. Mann, “On constructing prediction intervals for samples from a Weibull or extreme value distribution,” Technometrics, vol. 22, no. 4, pp. 567–573, 1980.
[9] K. S. Kaminsky and L. S. Rhodin, “Maximum likelihood prediction,” Annals of the Institute of Statistical Mathematics, vol. 37, no. 3, pp. 507–517, 1985.
[10] J.-W. Wu, H.-L. Lu, C.-H. Chen, and -H. Yang, “A note on the prediction intervals for a future ordered observation from a Pareto distribution,” Quality & Quantity, vol. 38, pp. 217–233, 2004.
[11] D. Kundu and M. Z. Raqab, “Bayesian inference and prediction of order statistics for a Type-II censored Weibull distribution,” Journal of Statistical Planning and Inference, vol. 142, no. 1, pp. 41–47, 2012.
[12] H. Panahi and A. Sayyareh, “Parameter estimation and prediction of order statistics for the Burr type XII distribution with type II censoring,” Journal of Applied Statistics, vol. 41, no. 1, pp. 215–232, 2014.
[13] M. Z. Raqab, “Modified maximum likelihood predictors of future order statistics from normal samples,” Computational Statistics & Data Analysis, vol. 25, no. 1, pp. 91–106, 1997.
[14] C.-H. Yang and L.-I. Tong, “Predicting type II censored data from factorial experiments using modified maximum likelihood predictor,” The International Journal of Advanced Manufacturing Technology, vol. 30, no. 9-10, pp. 887–896, 2006.
[15] J.-Y. Chiang, “Modified maximum likelihood prediction for type II censored data under the Weibull distribution,” International Journal of Intelligent Technologies and Applied Statistics, vol. 3, no. 1, pp. 17–32, 2010.
[16] R. Dumonceaux and C. E. Antle, “Discrimination between the log-normal and the Weibull distributions,” Technometrics, vol. 15, no. 4, pp. 923–926, 1973.
[17] D. Kundu and A. Manglick, “Discriminating between the log-normal and gamma distributions,” Journal of Applied Statistical Science, vol. 14, no. 1-2, pp. 175–187, 2005.
[18] D. Kundu and M. Z. Raqab, “Discriminating between the generalized Rayleigh and log-normal distribution,” Statistics. A Journal of Theoretical and Applied Statistics, vol. 41, no. 6, pp. 505–515, 2007.
[19] H.-F. Yu, “Mis-specification analysis between normal and extreme value distributions for a screening experiment,” Computers & Industrial Engineering, vol. 56, no. 4, pp. 1657–1667, 2009.
[20] A. K. Dey and D. Kundu, “Discriminating between the log-normal and log-logistic distributions,” Communications in Statistics - Theory and Methods, vol. 39, no. 1-2, pp. 280–292, 2010.
[21] A. E. Elsherpieny, N. S. Ibrahim, and U. N. Radwan, “Discriminating between Weibull and log-logistic distributions,” International Journal of Innovative Research in Science, Engineering and Technology, vol. 2, no. 8, pp. 3358–3371, 2013.
[22] S. K. Ashour and A. M. Hashish, “A numerical comparison of three procedures used in failure model discrimination,” Pakistan Journal of Statistics and Operation Research, vol. 10, no. 1, pp. 107–119, 2014.
[23] R. Pakyari, “Discriminating between generalized exponential, geometric extreme exponential and Weibull distributions,” Journal of Statistical Computation and Simulation, vol. 80, no. 12, pp. 1403–1412, 2010.
[24] E. A. Elsherpieny, H. Z. Muhammed, and N. U. Mohamed Mohamed Radwan, “On discriminating between gamma and log-logistic distributions in case of progressive type II censoring,” Pakistan Journal of Statistics and Operation Research, vol. 13, no. 1, pp. 157–183, 2017.
[25] A. Hossain and A. R. Willan, “Approximate MLEs of the parameters of location-scale models under type II censoring,” Statistics. A Journal of Theoretical and Applied Statistics, vol. 41, no. 5, pp. 385–394, 2007.
[26] K. G. Mehrotra and P. Nanda, “Unbiased estimation of parameters by order statistics in the case of censored samples,” Biometrika, vol. 61, pp. 601–606, 1974.
[27] N. Balakrishnan and A. C. Cohen, Order Statistics and Inference, Statistical Modeling and Decision Science, Academic Press, Inc., Boston, MA, 1991.
[28] D. Teichroew, “Tables of expected values of order statistics and products of order statistics for samples of size twenty and less from the normal distribution,” Annals of Mathematical Statistics, vol. 27, pp. 410–426, 1956.
[29] C. Castro-Kuriss, D. M. Kelmansky, V. Leiva, and E. J. Martinez, “A new goodness-of-fit test for censored data with an application in monitoring processes,” Communications in Statistics - Simulation and Computation, vol. 38, no. 6-7, pp. 1161–1177, 2009.
[30] N. R. Mann and K. W. Fertig, “Tables for obtaining Weibull confidence bounds and tolerance bounds based on best linear invariant estimates of parameters of the extreme-value distribution,” Technometrics. A Journal of Statistics for the Physical, Chemical and Engineering Sciences, vol. 15, pp. 87–101, 1973.
[31] J. Lieblein and M. Zelen, “Statistical investigation of the fatigue life of deep-groove ball bearings,” Journal of Research of the National Bureau of Standards, vol. 57, no. 5, pp. 273–316, 1956.
[32] W. Q. Meeker and L. A. Escobar, Statistical Methods for Reliability Data, John Wiley and Sons, New York, NY, USA, 1998.
[33] D. M. Byrne and S. Taguchi, “Taguchi approach to parameter design,” Quality Progress, vol. 20, no. 12, pp. 19–26, 1987.

Copyright

Copyright © 2018 Jyun-You Chiang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
