Abstract

In applied work, the two-parameter exponential distribution gives useful representations of many physical situations. Confidence interval for the scale parameter and predictive interval for a future independent observation have been studied by many, including Petropoulos (2011) and Lawless (1977), respectively. However, interval estimates for the threshold parameter have not been widely examined in statistical literature. The aim of this paper is to, first, obtain the exact significance function of the scale parameter by renormalizing the π‘βˆ—-formula. Then the approximate Studentization method is applied to obtain the significance function of the threshold parameter. Finally, a predictive density function of the two-parameter exponential distribution is derived. A real-life data set is used to show the implementation of the method. Simulation studies are then carried out to illustrate the accuracy of the proposed methods.

1. Introduction

The two-parameter exponential distribution with density:1𝑓(π‘₯;πœ‡,𝜎)=πœŽξ‚†βˆ’expπ‘₯βˆ’πœ‡πœŽξ‚‡,(1.1) where πœ‡<π‘₯ is the threshold parameter, and 𝜎>0 is the scale parameter, is widely used in applied statistics. For example, Lawless [1] applied the two-parameter exponential distribution to analyze lifetime data, and Baten and Kamil [2] applied the distribution to analyze inventory management systems with hazardous items. Petropoulos [3] proposed two new classes of confidence interval for the scale parameter 𝜎. Lawless [1] obtained a prediction interval for a future observation from the two-parameter exponential distribution.

In this paper, we consider a sample (π‘₯1,…,π‘₯𝑛) from the two-parameter exponential distribution with density (1.1). In Section 2, we show that by renormalizing the π‘βˆ—-formula, the exact significance function of the scale parameter 𝜎 can be obtained. In Section 3, the approximate Studentization method, based on the significance function of 𝜎 obtained in Section 2, is applied to obtain the significance function of the threshold parameter πœ‡. In Section 4, we combine the results of the previous two sections and derive a predictive density for a future observation from the two-parameter exponential distribution. Some concluding remarks are given in Section 5. Throughout this paper, a real-life data set is used to show the implementation of the proposed methods, and simulation results are presented to illustrate the accuracy of the proposed methods.

2. Confidence Interval for the Scale Parameter

For the two-parameter exponential distribution with density (1.1), it can be shown that the marginal density of 𝑋(1)=min(𝑋1,…,𝑋𝑛) isπ‘“π‘šξ€·π‘₯(1)ξ€Έ=𝑛;πœ‡,πœŽπœŽξ‚†βˆ’π‘›expπœŽξ€·π‘₯(1)π‘₯βˆ’πœ‡(1)>πœ‡.(2.1) With an observed sample (π‘₯1,…,π‘₯𝑛), the log conditional likelihood function that depends only on 𝜎, can be written asℓ𝑐(𝜎)=β„“(πœ‡,𝜎)βˆ’β„“π‘šπ‘›(πœ‡,𝜎)=π‘Žβˆ’(π‘›βˆ’1)logπœŽβˆ’πœŽξ€·π‘₯βˆ’π‘₯(1)ξ€Έ,(2.2) where π‘Ž is an additive constant and βˆ‘π‘₯=𝑛𝑖=1π‘₯𝑖. Without loss of generality, in this paper, π‘Ž is set as 0. Moreover, denote 𝑠=(π‘₯βˆ’π‘₯(1)) and πœ“=1/𝜎, then the log conditional likelihood function can be rewritten asℓ𝑐(πœ“)=(π‘›βˆ’1)logπœ“βˆ’π‘›π‘ πœ“.(2.3) Note that (2.3) has the same form as a log likelihood function of an exponential family model with canonical parameter πœ“. The maximum likelihood estimate of πœ“, ξπœ“=(π‘›βˆ’1)/𝑛𝑠 is obtained by solving (πœ•β„“π‘(πœ“))/πœ•πœ“|πœ“=ξπœ“=0. Furthermore, an estimate of the variance of ξπœ“ is Μ‚π‘—βˆ’1, where ̂𝑗=βˆ’(πœ•2ℓ𝑐(πœ“))/πœ•πœ“2|πœ“=ξπœ“=(𝑛2𝑠2)/(π‘›βˆ’1) is the observed information.

Under regularity conditions as stated in DiCiccio et al. [4], both the standardized maximum likelihood estimateξ€·ξ€ΈΜ‚π‘—π‘ž=π‘ž(πœ“)=ξπœ“βˆ’πœ“1/2,(2.4) and the signed log likelihood ratio statistic:ξ€·ξ€Έξ”π‘Ÿ=π‘Ÿ(πœ“)=signξπœ“βˆ’πœ“2ξ€Ίβ„“π‘ξ€·ξ€Έξπœ“βˆ’β„“π‘ξ€»(πœ“)(2.5) have limiting standard normal distribution with rate of convergence 𝑂(π‘›βˆ’1/2). Hence the significance function of 𝜎 can be approximated byξ‚΅π‘žξ‚΅1𝑝(𝜎)=Ξ¦πœ“ξ‚Άξ‚Ά,(2.6) orξ‚΅π‘Ÿξ‚΅1𝑝(𝜎)=Ξ¦πœ“ξ‚Άξ‚Ά,(2.7) where Ξ¦(β‹…) is the cumulative distribution function of the standard normal distribution.

Since the conditional log likelihood function given in (2.3) in exponential family form with πœ“ being the canonical parameter, the modified signed log likelihood statistic by Barndorff-Nielsen [5, 6] can be simplified intoπ‘Ÿβˆ—=π‘Ÿβˆ—1(πœ“)=π‘Ÿβˆ’π‘Ÿπ‘Ÿlogπ‘ž,(2.8) where π‘ž and π‘Ÿ are defined in (2.4) and (2.5) respectively, and π‘Ÿβˆ— has limiting standard normal distribution with rate of convergence 𝑂(π‘›βˆ’3/2). Hence the significance function of 𝜎 can be approximated byξ‚΅π‘Ÿπ‘(𝜎)=Ξ¦βˆ—ξ‚΅1πœ“ξ‚Άξ‚Ά.(2.9)

Barndorff-Nielsen [7] derived the π‘βˆ—-formulaβ€”an approximate density for the maximum likelihood estimator. In this case, we haveπ‘βˆ—ξ€·ξ€ΈΜ‚π‘—ξπœŽ;𝜎=π‘˜1/2ξ€½β„“exp𝑐(𝜎)βˆ’β„“π‘ξ€·ξπœŽξ€Έξ€Ύ=π‘˜ξπœŽπ‘›βˆ’2πœŽπ‘›βˆ’1ξ‚»βˆ’exp(π‘›βˆ’1)ξπœŽπœŽξ‚Ό,(2.10) where π‘˜ is the renormalizing constant. Since 𝜎=𝑛𝑠/(π‘›βˆ’1)=1/ξπœ“, by change of variable and renormalization, we haveπ‘βˆ—(𝑠;𝜎)=Ξ“βˆ’1ξ‚€(π‘›βˆ’1)π‘›π‘ πœŽξ‚π‘›βˆ’11π‘ ξ‚†βˆ’expπ‘›π‘ πœŽξ‚‡,(2.11) which is the exact density of 𝑆=π‘‹βˆ’π‘‹(1), and it is free of πœ‡. Therefore, inference concerning 𝜎 can be based on the distribution of 𝑆. Thus, the exact confidence interval for 𝜎 can be obtained using π‘βˆ—(𝑠;𝜎) and isξ€œπ‘(𝜎)=𝑠0π‘βˆ—(𝑑;𝜎)𝑑𝑑.(2.12)

Grubbs [8] reported the following data set (see Table 1).

We have𝑛=19,π‘₯(1)=162,𝑠=835.21.(2.13)

Table 2 recorded the 90%, 95%, and 99% confidence interval for 𝜎. The proposed method (π‘Ÿβˆ—) and the exact method (exact) give approximately the same confidence intervals, whereas the results obtained by the standardized maximum likelihood estimate method (mle) and the signed log likelihood ratio method (π‘Ÿ) are quite different. Moreover, from Table 2, it is clear that π‘Ÿ gives the narrowest confidence intervals and mle gives the widest confidence intervals. The significance functions of 𝜎 obtained from the four methods are plotted in Figure 1.

In order to compare the accuracies of the four methods, Monte Carlo simulation studies with 10,000 replicates are performed. For each simulation study, we generate sample of size 𝑛 from the two-parameter exponential distribution with scale parameter 𝜎 and threshold parameter πœ‡. Then for each sample, the 95% confidence intervals for 𝜎 is calculated from the four methods discussed in this section.

Table 3 recorded the results of the simulation studies for some combinations of 𝑛, πœ‡, and 𝜎. The β€œLower Error” is the proportion of the true 𝜎 that falls outside the lower limit of the 95% confidence intervals while the β€œUpper Error” is the proportion of the true 𝜎 that falls outside the upper limit of the 95% confidence intervals, and (1β€”Lower Errorβ€”Upper Error) is recorded as β€œCentral Coverage”. The nominal values for the β€œLower Error”, β€œUpper Error”, and β€œCentral Coverage” are 0.025, 0.025, and 0.95, respectively. Moreover, for 10,000 Monte Carlo simulations, the standard errors for the β€œLower Error” and the β€œUpper Error” are the same and are √0.025(1βˆ’0.025)/10000=0.0016. Similarly, the corresponding standard error for the β€œCentral Coverage” is 0.0022.

From Table 3, even for the smallest possible sample size (𝑛=2), the proposed method and the exact method give almost identical results and have excellent coverage properties. The results obtained by the other two methods are not as satisfactory especially when the sample size is small.

Since mle method gives the poorest coverage, we excluded it from further investigation. Table 4 recorded the average width of the confidence interval for 𝜎 for the simulation study with (𝑛,𝜎,πœ‡)=(10,0.5,5). We can note that even though π‘Ÿ has the shortest average width of the confidence interval for 𝜎, it also has the poorest coverage properties as demonstrated in Table 3. For inference purpose, coverage properties are more important than width of the confidence interval. Hence the proposed method is recommended for this problem.

3. Confidence Interval for the Threshold Parameter

For the two-parameter exponential distribution, Petropoulos [3] showed that (𝑋(1),𝑆) is a sufficient statistic. Note that πœ‡ is completely contained in the marginal density of 𝑋(1), but it also depends 𝜎. In Section 2, the significance function of 𝜎, 𝑝(𝜎), was obtained. Hence, we can apply the approximate Studentization method, which is discussed in Fraser and Wong [9], to eliminate the dependence of 𝜎 from the marginal density of 𝑋(1). More specifically, the approximate Studentized marginal density of 𝑋(1) is𝑓𝑆π‘₯(1)ξ€Έξ€œ;πœ‡=π‘βˆž0π‘“π‘šξ€·π‘₯(1)ξ€Έ||||ξ€œ;πœ‡,πœŽπ‘‘π‘(𝜎)=π‘βˆž0Ξ“βˆ’1𝑛(π‘›βˆ’1)π‘›π‘ π‘›βˆ’1πœŽπ‘›+1ξ‚†βˆ’π‘›expπœŽξ€·π‘₯(1)ξ€Έξ‚‡βˆ’πœ‡+π‘ π‘‘πœŽ,(3.1) where βˆ«π‘=βˆžπœ‡βˆ«βˆž0π‘“π‘š(π‘₯(1);πœ‡,𝜎)|𝑑𝑝(𝜎)|𝑑π‘₯(1) is the normalizing constant. Hence, we have𝑓𝑆π‘₯(1)ξ€Έ=;πœ‡π‘›βˆ’1𝑠π‘₯1+(1)βˆ’πœ‡π‘ ξ‚Άβˆ’π‘›π‘₯(1)>πœ‡,(3.2) and the corresponding significance function of πœ‡ isξ€œπ‘(πœ‡)=π‘₯(1)πœ‡π‘“π‘†ξ‚΅π‘₯(𝑣;πœ‡)𝑑𝑣=1βˆ’1+(1)βˆ’πœ‡π‘ ξ‚Άβˆ’(π‘›βˆ’1).(3.3) Thus, the explicit (1βˆ’π›Ό)100% confidence interval for πœ‡ obtained by the approximate Studentization method isξ‚΅π‘₯(1)ξ‚Έξ‚€π›Όβˆ’π‘ 2ξ‚βˆ’1/(π‘›βˆ’1)ξ‚Ήβˆ’1,π‘₯(1)ξ‚Έξ‚€π›Όβˆ’π‘ 1βˆ’2ξ‚βˆ’1/(π‘›βˆ’1)βˆ’1ξ‚Ήξ‚Ά.(3.4)

Applying the approximate Studentization method to the Grubbs [8] data set, the 90%, 95%, and 99% confidence intervals for the threshold parameter πœ‡ are (βˆ’27.97,160.82), (βˆ’68.22,161.42), and (βˆ’157.87,161.88), respectively.

To illustrate the accuracy of the proposed method, we performed a Monte Carlo simulation study. Table 5 records the results from this study. The proposed approximate Studentization method gives extremely good coverage properties even when the sample size is extremely small.

4. Prediction Interval for a Future Observation

The density of a future observation from the two-parameter exponential distribution is given in (1.1), which depends on both parameters πœ‡ and 𝜎. With observed data (π‘₯1,…,π‘₯𝑛), or equivalently, observed sufficient statistic (𝑠,𝑑), 𝑝(𝜎) was obtained by the method in Section 2, and it gives us as much information about 𝜎 as we can extract from the observed data in the absence of knowledge of πœ‡. Moreover, 𝑝(πœ‡) was obtained by the approximate Studentization method in Section 3 gives us as much information about πœ‡ as we can extract from the observed data after averaging out the effect of 𝜎. Therefore, to eliminate πœ‡ and 𝜎 from the density of a future observation, we apply the approximate Studentization method to obtain a predictive density of 𝑋: π‘“π‘ƒξ€œ(π‘₯)=π‘₯(1)βˆ’βˆžξ€œβˆž0||||||||=ξ€œπ‘“(π‘₯;πœ‡,𝜎)𝑑𝑝(𝜎)𝑑𝑝(πœ‡)π‘₯(1)βˆ’βˆžξ‚Έξ€œβˆž01πœŽξ‚†βˆ’expπ‘₯βˆ’πœ‡πœŽξ‚‡Ξ“βˆ’1(π‘›βˆ’1)(𝑛𝑠)π‘›βˆ’1πœŽπ‘›ξ‚†βˆ’expπ‘›π‘ πœŽξ‚‡ξ‚Ή||||π‘‘πœŽπ‘‘π‘(πœ‡)=(π‘›βˆ’1)2π‘›π‘›βˆ’1𝑠2π‘›βˆ’2ξ€œπ‘₯(1).1βˆ’βˆžξƒ©1(ξ€·π‘₯π‘₯βˆ’πœ‡+𝑛𝑠)(1)ξ€Έξƒͺβˆ’πœ‡+π‘ π‘›π‘‘πœ‡.(4.1) Hence, the corresponding predictive cumulative distribution function isπΉπ‘ƒξ€œ(π‘₯)=π‘₯βˆ’βˆžπ‘“π‘ƒ(𝑒)𝑑𝑒.(4.2) Although the explicit form of the predictive interval is not available, it can be obtained numerically from softwares like Maple or Matlab.

Applying the proposed method to the Grubbs [8] data set, the predictive cumulative distribution function obtained in (4.2) is plotted in Figure 2, and the corresponding 90%, 95% and 99% predictive intervals are (161, 2980), (128, 3714), and (47, 5530), respectively. The corresponding intervals obtained by the method discussed in Lawless [1] are (161.00, 2982.23), (129.21, 3715.96), and (48.02, 5532.63), respectively. The two methods give almost identical results. Lawless’s method is easy to apply but the derivation is more difficult. The derivation of the proposed method is easy to follow but it requires good numerical integration methods to carry out the calculation.

5. Conclusion

In this paper, by renormalizing the π‘βˆ—-formula, the exact significance function of the scale parameter of the two-parameter exponential distribution is obtained. This significance function is then used in the approximate Studentization method to obtain the significance function of the threshold parameter. Simulation results illustrated that these two significance functions have excellent coverage properties even when the sample size is extremely small. Finally, these two significance functions are used in the approximate Studentization method to obtain a predictive density and hence a predictive cumulative distribution function, of a future observation from the two-parameter exponential distribution.