Abstract

Geometric extreme exponential (GE-exponential) is one of the nonnegative right-skewed distribution that is suitable for analyzing lifetime data. It is well known that the maximum likelihood estimators (MLEs) of the parameters lead to likelihood equations that have to be solved numerically. In this paper, we provide explicit estimators through an approximation of the likelihood equations based on progressively Type-II-censored samples. The approximate estimators are then used as starting values to find the MLEs numerically. The bias and variances of the MLEs are calculated for a wide range of sample sizes and different progressive censoring schemes through a Monte Carlo simulation study. Moreover, formulas for the observed Fisher information are given which could be used to construct asymptotic confidence intervals. The coverage probabilities of the confidence intervals and the percentage points of pivotal quantities associated with the MLEs are also calculated. A real dataset has been studied for illustrative purposes.

1. Introduction

Progressive censoring is one of the important sampling techniques that was first introduced by Herd [1] and its importance in life testing reliability experiments was discussed by Cohen [2].

The progressive Type-II censoring is as follows. Suppose 𝑛 units are placed on test. At the time of the first failure, 𝑅1 units are randomly removed from the 𝑛1 surviving units. Next, at the time of the second failure, 𝑅2 units are randomly removed from the 𝑛𝑅12 surviving units, and so on. Finally, after the 𝑚th failure, all remaining 𝑅𝑚 units are removed. Thus, we observe 𝑚 complete failures and 𝑅1+𝑅2++𝑅𝑚 items are progressively censored from the 𝑛 units under test, and so 𝑛=𝑚+(𝑅1+𝑅2++𝑅𝑚). The vector 𝑅=(𝑅1,,𝑅𝑚) is called the censoring scheme and is fixed prior to the study. If 𝑅=(0,,0), no withdrawals are made which correspond to the complete sample and the ordinary order statistics will be obtained. If 𝑅=(0,,0,𝑛𝑚), we obtain the conventional Type-II right censoring. We will denote the progressive Type-II censored data by 𝑋1𝑚𝑛<𝑋2𝑚𝑛<<𝑋𝑚𝑚𝑛. For an excellent discussion on progressive Type-II censoring technique see the monograph of Balakrishnan and Aggarwala [3] and the recent discussion paper by Balakrishnan [4].

There are numerous articles in the literature dealing with inferential procedures based on the progressively Type-II censoring data for a wide variety of lifetime distributions. See for example Mann [5, 6], Balakrishnan et al. [7], Balakrishnan and Kannan [8], Balakrishnan et al. [9], Kim and Han [10], Lin et al. [11], and Asgharzadeh [12] among others who study the inferential procedures for the Weibull, extreme value, logistic, normal, half-logistic, log-gamma, and generalized exponential models, respectively.

GE-exponential distribution was introduced by Marshal and Olkin [13] and further studied extensively by Adamidis and Loukas [14] and Adamidis et al. [15]. Recently, this model was of interest of authors as a natural competitor to the other well-established lifetime models such as Weibull, lognormal, and gamma distributions (see, e.g. [16, 17]).

In this paper, we study the inference for the parameters of the GE-exponential distribution based on progressively Type-II censored data.

The rest of the paper is organized as follows. The MLEs of the parameters are studied in Section 2. In Section 3, we provide an approximation to the likelihood function which leads to explicit approximate estimators of the parameters. The observed Fisher information is studied in Section 4. The performance of the maximum likelihood estimators is evaluated in terms of bias and variance through a large number of Monte Carlo simulations and the results are given in Section 5. Using the asymptotic normality of the proposed estimators, the coverage probabilities of the asymptotic confidence intervals and the percentage points for the pivotal quantities are calculated by simulations and the results are provided in Section 6. Finally, Section 7 is devoted to study a real dataset for illustration of the proposed methodology.

2. Maximum Likelihood Estimation

Let the failure times follow a geometric extreme exponential distribution with shape parameter 𝛾 and scale parameter 𝜆 and with probability density function:𝑓(𝑥;𝜆,𝛾)=𝛾𝜆𝑒𝜆𝑥1𝛾𝑒𝜆𝑥2,(𝑥>0),(2.1) where 𝛾=1𝛾. The cumulative distribution function is give by𝐹(𝑥;𝜆,𝛾)=1𝑒𝜆𝑥1𝛾𝑒𝜆𝑥.(2.2)

The GE-exponential reduces to the exponential distribution if we take 𝛾=1. When 0<𝛾<1, the distribution reduces to exponential geometric distribution studied by Adamidis and Loukas [14]. The GE-exponential distribution has decreasing failure rate when 0<𝛾<1 and increasing failure rate when 𝛾1. The density function is unimodal with modal value equal to 𝜆𝛾{4𝛾}1 when 𝛾>2. Hereafter, we assume 𝛾1.

Let 𝑋1𝑚𝑛,,𝑋𝑚𝑚𝑛 be a progressively Type-II censored sample from density function (2.1). The likelihood function based on the censored sample is given by (see [3]) 𝐿(𝜆,𝛾)=𝐶𝑚𝑖=1𝛾𝜆𝑒𝜆𝑥𝑖1𝛾𝑒𝜆𝑥𝑖211𝑒𝜆𝑥𝑖1𝛾𝑒𝜆𝑥𝑖𝑅𝑖=𝐶(𝛾𝜆)𝑚𝑚𝑖=1𝛾𝑅𝑖𝑒𝜆(𝑅𝑖+1)𝑥𝑖1𝛾𝑒𝜆𝑥𝑖𝑅𝑖+2,(2.3) where 𝐶=𝑛𝑛1𝑅1𝑛2𝑅1𝑅2𝑛𝑚+1𝑅1𝑅𝑚1.(2.4) The log-likelihood function may be written as 𝐿(𝜆,𝛾)=𝐾+𝑚log𝛾+𝑚log𝜆+log𝛾𝑚𝑖=1𝑅𝑖𝜆𝑚𝑖=1𝑅𝑖𝑥+1𝑖𝑚𝑖=1𝑅𝑖+2log1𝛾𝑒𝜆𝑥𝑖,(2.5) where 𝐾 is a constant. Let 𝑧𝑖=𝜆𝑥𝑖, now the log-likelihood function may be rewritten as 𝐿(𝜆,𝛾)=𝐾+𝑚log𝛾+𝑚log𝜆+log𝛾𝑚𝑖=1𝑅𝑖𝑚𝑖=1𝑅𝑖𝑧+1𝑖𝑚𝑖=1𝑅𝑖+2log1𝛾𝑒𝑧𝑖.(2.6) The likelihood equations are then given by 𝜕𝐿𝜕𝜆𝑚(𝜆,𝛾)=𝜆1𝜆𝑚𝑖=1𝑅𝑖𝑧+1𝑖1𝜆𝑚𝑖=1𝑅𝑖+2𝛾𝑧𝑖𝑒𝑧𝑖1𝛾𝑒𝑧𝑖𝜕=0,(2.7)𝐿𝜕𝛾𝑚(𝜆,𝛾)=𝛾+𝑚𝑖=1𝑅𝑖𝛾𝑚𝑖=1𝑅𝑖𝑒+2𝑧𝑖1𝛾𝑒𝑧𝑖=0,(2.8) which may be simplified as 𝑚𝑚𝑖=1𝑅𝑖𝑧+1𝑖𝑚𝑖=1𝑅𝑖+2𝛾𝑧𝑖𝑒𝑧𝑖1𝛾𝑒𝑧𝑖=0,(2.9)𝑚+𝑚𝑖=1𝑅𝑖𝛾𝑚𝑖=1𝑅𝑖𝑒+2𝑧𝑖1𝛾𝑒𝑧𝑖=0.(2.10) The MLEs of the parameters 𝜆 and 𝛾 are the solutions of the system of nonlinear (2.9) and (2.10) which cannot be expressed explicitly. Standard numerical procedures such as Newton’s method needed to solve the above system of nonlinear equations. However, as is necessary for every numerical method, are starting values are needed to implement the numerical procedure. In the following section, we present an approximate MLEs for the parameters 𝜆 and 𝛾 to be used as starting values for implementation of the Newton’s method to solve the system of nonlinear (2.9) and (2.10). The approximation method is based on first finding a moment estimator for the shape parameter 𝛾 and based on this estimator finding an approximate MLE of the scale parameter 𝜆 and using that to find an approximate MLE of 𝛾 by either (2.9) or (2.10).

3. Approximate Maximum Likelihood Estimators

As was studied in the previous section, the maximum likelihood estimators of the parameters are given only implicitly by solving the system of nonlinear equations (2.9) and (2.10) by an appropriate numerical method. In this section, we provide approximate maximum likelihood estimators to be used as starting values to solve the system of nonlinear equations numerically. We propose a two-step approximation method, in which the moment estimator of the shape parameter 𝛾 is used to find the approximate maximum likelihood estimator of the scale parameter 𝜆. The approximate MLE of 𝛾 is then can be found by either the likelihood equations (2.9) or (2.10) and the value of the approximate MLE of 𝜆.

To find a moment estimator of 𝛾, we consider the coefficient of variation (C.V.) which is given by C.V.=𝛾2Φ𝛾,2,1𝛾ln2𝛾/𝛾21/2,𝛾ln𝛾(3.1) where Φ(𝑎,𝑏,𝑐) is the Lerch’s transcendent (see, for example, [18]).

Note that C.V. is independent from the scale parameter 𝜆 and depends only on the shape parameter 𝛾. Table 1 provides the value of the C.V. for different values of the shape parameter 𝛾. This table may be used to find the moment estimator of the shape parameter 𝛾 for the given value of the sample C.V. when the shape parameter is unknown.

Let 𝑋1𝑚𝑛,,𝑋𝑚𝑚𝑛 be a progressively Type-II censored sample from 𝑓(𝑥;𝜆,̃𝛾), where ̃𝛾 is the moment estimator of 𝛾. The likelihood equation (2.9) may now be rewritten in terms of ̃𝛾 as 𝑚𝑚𝑖=1𝑅𝑖𝑧+1𝑖𝑚𝑖=1𝑅𝑖+2̃𝛾𝑧𝑖𝑒𝑧𝑖1̃𝛾𝑒𝑧𝑖=0,(3.2) where ̃𝛾=1̃𝛾. The MLE of the scale parameter 𝜆 is the solution of the nonlinear equation (3.2) which can not be expressed explicitly.

Let us approximate the likelihood equation (3.2) by approximating the function (𝑧)=𝑒𝑧/(1̃𝛾𝑒𝑧) in a Taylor series expansion around 𝐸(𝑍𝑖𝑚𝑛)=𝜈𝑖𝑚𝑛. From Balakrishnan and Sandhu [19] we have 𝐹𝑍𝑖𝑚𝑛=𝑈𝑖𝑚𝑛,(3.3) where 𝑈𝑖𝑚𝑛 is the 𝑖th order statistic of a progressively Type-II censored sample from the U (0,1) distribution.

We then have 𝑍𝑖𝑚𝑛=𝐹1𝑈𝑖𝑚𝑛=log̃𝛾𝑈𝑖𝑚𝑛1𝑈𝑖𝑚𝑛,1(3.4) and hence 𝜈𝑖𝑚𝑛𝑍=𝐸𝑖𝑚𝑛=log̃𝛾𝛼𝑖𝑚𝑛1𝛼𝑖𝑚𝑛,1(3.5) where 𝛼𝑖𝑚𝑛=𝐸(𝑈𝑖𝑚𝑛) and 𝛼𝑖𝑚𝑛=1𝑚𝑗=𝑚𝑖+1𝑗+𝑅𝑚𝑗+1++𝑅𝑚𝑗+1+𝑅𝑚𝑗+1++𝑅𝑚,𝑖=1,,𝑚.(3.6) Expanding (𝑧𝑖) around 𝜈𝑖𝑚𝑛 by Taylor series expansion and keeping the first two terms, we deduce 𝑧𝑖𝜈𝑖𝑚𝑛+𝑧𝑖|𝑧=𝜈𝑖𝑚𝑛𝑧𝑖𝜈𝑖𝑚𝑛=𝛼𝑖+𝛽𝑖𝑧𝑖,(3.7) where 𝛼𝑖=𝑒𝜈𝑖𝑚𝑛1̃𝛾𝑒𝜈𝑖𝑚𝑛+𝜈𝑖𝑚𝑛1̃𝛾𝑒𝜈𝑖𝑚𝑛2,𝛽𝑖=𝑒𝜈𝑖𝑚𝑛1̃𝛾𝑒𝜈𝑖𝑚𝑛2.(3.8) Therefore, the likelihood equation (3.2) may be approximated by 𝑚𝑚𝑖=1𝑅𝑖𝑧+1𝑖̃𝛾𝑚𝑖=1𝑅𝑖𝑧+2𝑖𝛼𝑖+𝛽𝑖𝑧𝑖=0.(3.9) Equation (3.9) may be rewritten as ̃𝛾𝑚𝑖=1𝑅𝑖𝛽+2𝑖𝑥2𝑖𝜆2+̃𝛾𝑚𝑖=1𝑅𝑖𝛼+2𝑖𝑥𝑖+𝑚𝑖=1𝑅𝑖𝑥+1𝑖𝜆𝑚=0(3.10) or 𝐴𝜆2+𝐵𝜆𝑚=0,(3.11) where 𝐴=̃𝛾𝑚𝑖=1𝑅𝑖𝛽+2𝑖𝑥2𝑖,𝐵=̃𝛾𝑚𝑖=1𝑅𝑖𝛼+2𝑖𝑥𝑖+𝑚𝑖=1𝑅𝑖𝑥+1𝑖.(3.12) Equation (3.11) is a quadratic equation in terms of 𝜆 with the following two solutions:𝐵±𝐵2+4𝐴𝑚.2𝐴(3.13) However, one of the solutions is inadmissible since 𝐴>0. Hence, the approximate MLE of the scale parameter 𝜆 is given explicitly by ̆𝜆=𝐵+𝐵2+4𝐴𝑚.2𝐴(3.14) The approximate MLE of 𝛾, say ̆𝛾 is then maybe found by plugging in the value of ̆𝜆 in either (2.9) or (2.10) and solve the equation by a numerical method. Once the values of ̆𝜆 and ̆𝛾 are found, we could use them as good starting values to solve the likelihood (2.9) and (2.10) numerically in order to find the MLEs of 𝜆 and 𝛾.

4. Fisher Information

In this section, we derive the asymptotic variance-covariance matrix of the maximum likelihood estimators of the parameters 𝜆 and 𝛾.

The observed Fisher information may be obtained from the likelihood equations (2.7) and (2.8) as 𝐼𝐼=11𝐼12𝐼21𝐼22,(4.1) where 𝐼11𝜕=2𝜕𝜆2𝐿=𝑚𝜆21𝜆2𝑚𝑖=1𝑅𝑖+2𝛾𝑧2𝑖𝑒𝑧𝑖1𝛾𝑒𝑧𝑖2,𝐼12=𝐼21𝜕=2𝐿𝜕𝜆𝜕𝛾1=𝜆𝑚𝑖=1𝑅𝑖𝑧+2𝑖𝑒𝑧𝑖1𝛾𝑒𝑧𝑖2,𝐼22𝜕=2𝜕𝛾2𝐿=𝑚𝛾2+𝑚𝑖=1𝑅𝑖𝛾2𝑚𝑖=1𝑅𝑖𝑒+22𝑧𝑖1𝛾𝑒𝑧𝑖2.(4.2) The asymptotic variance-covariance matrix of the MLEs is then given by inverting the observed Fisher information matrix: 𝑉̂=𝐼𝜆,̂𝛾11𝐼12𝐼12𝐼221=𝐼11𝐼12𝐼12𝐼22,(4.3) where 𝐼11=𝐼22𝐼11𝐼22𝐼212,𝐼12=𝐼12𝐼11𝐼22𝐼212,𝐼22=𝐼11𝐼11𝐼22𝐼212.(4.4) These quantities are estimated by evaluating them at MLEs 𝐼11=𝐼22𝐼11𝐼22𝐼212||||𝜆=̂𝜆,𝛾=,𝐼̂𝛾12=𝐼12𝐼11𝐼22𝐼212||||𝜆=̂𝜆,𝛾=,𝐼̂𝛾22=𝐼11𝐼11𝐼22𝐼212||||𝜆=̂𝜆,𝛾=̂𝛾(4.5) where ̂𝜆 and ̂𝛾 are the MLEs of 𝜆 and 𝛾, respectively. We will use these asymptotic variances on Section 6 to derive asymptotic confidence intervals for the scale parameter 𝜆 and the shape parameter 𝛾.

5. Simulation Study

In this section, we assess the performance of the proposed maximum likelihood estimators by a large number of Monte Carlo simulation experiments. All the simulations were carried out in R using the pseudorandom generator in that software package. We generated the progressively Type-II censored samples from the GE-exponential distribution with 𝜆=1.0 and 𝛾=3.0 using the algorithm presented by Balakrishnan and Sadhu [19]. The choice of 𝛾=3.0 provides unimodality and increasing failure rate.

Five values of the sample sizes, say 𝑛=15,20,30,40,50, different choices of the effective sample size 𝑚, and different progressive schemes were considered. For brevity in notation, we follow the Balakrishnan, et al. [7] notation to denote the censoring scheme. For example, (27,40) denotes the censoring scheme (7,7,0,0,0,0).

We first calculated the moment estimator of the shape parameter 𝛾 using sample C.V. and Table 1. Then the approximate MLE of the scale parameter 𝜆 was calculated using (3.14). Approximate MLE of 𝛾 was then calculated by solving (2.8) numerically. The MLEs of 𝜆 and 𝛾 were calculated by solving the system of normal equations (2.7) and (2.8) simultaneously using BB R package of Varadhan and Gilbert [20]. We repeated this procedure 10000 times.

Tables 2 and 3 give the average of absolute bias, variance, and estimated variance calculated from observed Fisher information for MLEs ̂𝜆 and ̂𝛾, respectively. The values of the absolute biases and variances are reasonably small for almost all the sample sizes and censoring schemes indicating the validity of the MLEs ̂𝜆 and ̂𝛾. Also, as one would expect, both the bias and the variance decrease as the effective sample proportion that is, 𝑚/𝑛 increases. Moreover, the observed Fisher information tends to be close to the variance of the estimator for both the MLEs when the effective sample proportion increases.

The tables also show that for all sample sizes 𝑛 and effective sample sizes 𝑚, the censoring scheme 𝑅=(𝑛𝑚,0,,0) tends to the smallest bias and variance, whilst the censoring scheme 𝑅=(0,0,,𝑛𝑚) tends to the largest results. Incidentally, this is in coincidence with the findings of Balakrishnan et al. [9] for Gaussian distribution.

6. Asymptotic Confidence Intervals

In this section, we study the asymptotic confidence intervals for the parameters 𝜆 and 𝛾 using the estimation for the observed Fisher information matrix. Note that the asymptotic distributions of the pivotal quantities 𝑃1=̂𝜆𝜆𝐼11,𝑃2=̂𝛾𝛾𝐼22,(6.1) are standard normal distribution. The asymptotic 100(1𝛼)% confidence intervals for the parameters are then given by ̂𝜆±𝑧𝛼/2𝐼11,̂𝛾±𝑧𝛼/2𝐼22,(6.2) where 𝑧𝛼/2 is the upper 𝛼/2 critical value of the standard normal distribution.

Table 4 provides the coverage probabilities through 10000 Monte Carlo simulations for the parameters 𝜆 and 𝛾. It is observed that the coverage probabilities are very close to their nominated values even for small sample sizes for both confidence intervals.

We also calculated the unconditional percentage points of these pivotal quantities and the results are given in Table 5. The values of the percentage points associate with 𝑃1 and 𝑃2 support the standard normality of these pivotal quantities specifically for large values of the sample size and the effective sample proportion. Indeed, the percentage points in Table 5 could be used to construct another asymptotic confidence intervals for the unknown parameters.

7. Real Data Analysis

In this section, we consider a real dataset to illustrate the inferential procedure developed in this article. The data studied by various authors are the number of the million revolutions before failure of the 25 ball bearings provided by Lieblein and Zelen [21]. The data are one of the most frequently used datasets in literature although incorrectly as a complete dataset, however, the data are indeed censored (see [22]).

Table 6 shows the ball bearings data. The six data values, as marked with asterisks, were discarded from analysis (censored). In Table 7, we presented the censored data as well as the progressively Type-II censoring scheme.

The sample C.V. of the data is found to be 0.4917 and Table 1 suggests the value of the moment estimator of the shape parameter to be ̃𝛾=23.

We found the approximate estimators for the parameters to be ̆𝜆=0.0410 and ̆𝛾=19.1628. These estimates were then used as a starting values to find the MLEs from the system of normal equations (2.7) and (2.8) numerically. We obtained ̂𝜆=0.04 and ̂𝛾=17.67. Figure 1 depicts the histogram of the data and the fitted GE-exponential (0.04,17.67).