Abstract

We consider the stress-strength reliability based on record values from the Weibull distribution. The Bayes estimator based on squared error loss and the maximum likelihood estimator are derived and their bias and mean squared error performance are studied. Likelihood-based confidence intervals as well as some bootstrap intervals are developed. We derived also the highest posterior density interval. Simulation studies are conducted to investigate and compare the performance of the estimators and intervals.

1. Introduction

Chandler [1] introduced and studied some properties of record values. Since then a considerable amount of the literature is devoted to the study of records. Ahsanullah [2] and Arnold et al. [3] provided a detailed account of theory of records and the inference issues associated with records. In this paper, we will consider confidence interval estimation of the stress-strength reliability based on record data when the underlying distribution is Weibull. The Weibull model is quite popular in analyzing reliability and life-testing data. Its flexibility and capability of modeling various forms of failure mechanisms and hazard function types gave him an important role in reliability literature; see Johnson et al. [4] and Murthy et al. [5]. The probability density function (pdf) and the cumulative distribution function (cdf) of the Weibull distribution 𝑊(𝜃,𝛽) are given by 𝑓(𝑥)=𝜃𝛽𝑥𝛽1𝑒𝜃𝑥𝛽,𝑥>0,𝜃>0,𝛽>0𝐹(𝑥)=1𝑒𝜃𝑥𝛽.(1.1)

Let 𝑋1,𝑋2, be an infinite sequence of 𝑖𝑖𝑑 random variables. An observation 𝑋𝑗 is called a record if its value is greater than all previous observations, that is 𝑋𝑗>𝑋𝑖 for every 𝑖<𝑗. We want to estimate the stress-strength reliability Pr(𝑋<𝑌) using the data on records. The stress-strength reliability Pr(𝑋<𝑌) arises in life-testing experiments when 𝑋 and 𝑌 represent the lifetimes of two devices, and it gives the probability that the device with life time 𝑋 fails before the other. As an example, Hall [6] studied a situation where the breakdown voltage 𝑌 of a capacitor must exceed the voltage output 𝑋 of a power supply in order for the component to work properly. Weerahandi and Johnson [7] considered another example on rocket motors. This probability has other interpretations in other disciplines, for example, in medical sciences; it is used as a measure of treatment effectiveness in studies involving comparison between control and treatment groups. Various other examples may be found in Kotz et al. [8].

Estimation of the stress-strength reliability based on record data was considered by Baklizi [9] for the exponential distribution with record values and by Baklizi [10] for the generalized exponential distribution. Kundu and Gupta [11] investigated this problem for simple random samples from the Weibull distribution. In Section 2 we derive the maximum likelihood estimator and the associated large sample intervals. Bayesian procedures based on records are derived in Section 3. Simulations to investigate the performance of the asymptotic inference procedures and to compare them with the bootstrap intervals are described in Section 4. The results and conclusions are given in Section 5.

2. Likelihood Inference

We shall consider the case when the shape parameters are equal. Let 𝑋𝑊(𝜃1,𝛽) and 𝑌𝑊(𝜃2,𝛽) be independent random variables. Let 𝑅=Pr(𝑋<𝑌) be the stress strength reliability. Let 𝑟=𝑟0,,𝑟𝑛 be a set of records from 𝑊(𝜃1,𝛽), and let 𝑠=𝑠0,,𝑠𝑚 be an independent set of records from 𝑊(𝜃2,𝛽). The likelihood functions are given by [3] as follows: 𝐿1𝛽,𝜃1𝑟𝑟=𝑓𝑛𝑛1𝑖=0𝑓𝑟𝑖𝑟1𝐹𝑖,𝐿2𝛽,𝜃2𝑠𝑠=𝑔𝑚𝑚1𝑖=0𝑔𝑠𝑖𝑠1𝐺𝑖,(2.1) where 𝑓 and 𝐹 are the pdf and cdf of 𝑋𝑊(𝜃1,𝛽) and 𝑔 and 𝐺 are the pdf and cdf of 𝑌𝑊(𝜃2,𝛽). The likelihood function of (𝛽,𝜃1,𝜃2) based on 𝑟,𝑠 is given by 𝐿𝛽,𝜃1,𝜃2𝑟,𝑠=𝜃1𝛽𝑛+1𝑒𝜃1𝑟𝛽𝑛𝑛𝑖=0𝑟𝑖𝛽1𝜃2𝛽𝑚+1𝑒𝜃2𝑠𝛽𝑚𝑚𝑖=0𝑠𝑖𝛽1.(2.2)

Taking the natural logarithm we get the log-likelihood function 𝑙𝛽,𝜃1,𝜃2=(𝑛+𝑚+2)ln𝛽+(𝑛+1)ln𝜃1+(𝑚+1)ln𝜃2𝜃1𝑟𝛽𝑛𝜃2𝑠𝛽𝑚+(𝛽1)𝑛𝑖=1ln𝑟𝑖+𝑚𝑖=1ln𝑠𝑖.(2.3)

The first partial derivatives are given by 𝜕𝑙𝛽,𝜃1,𝜃2=𝜕𝛽(𝑛+𝑚+2)𝛽𝜃1𝑟𝛽𝑛ln𝑟𝑛𝜃2𝑠𝛽𝑚ln𝑠𝑚+𝑛𝑖=1ln𝑟𝑖+𝑚𝑖=1ln𝑠𝑖,(2.4)𝜕𝑙𝛽,𝜃1,𝜃2𝜕𝜃1=(𝑛+1)𝜃1𝑟𝛽𝑛,𝜕𝑙𝛽,𝜃1,𝜃2𝜕𝜃2=(𝑚+1)𝜃2𝑠𝛽𝑚.(2.5)

Equating these partial derivatives to zero and solving simultaneously we obtain ̂𝛽=(𝑛+𝑚+2)(𝑛+1)ln𝑟𝑛+(𝑚+1)ln𝑠𝑚𝑛𝑖=0ln𝑟𝑖+𝑚𝑖=0ln𝑠𝑖,̂𝜃1=𝑛+1𝑟̂𝛽𝑛,̂𝜃2=𝑚+1𝑠̂𝛽𝑚.(2.6)

Hence the MLE of 𝑅 is given by  ̂𝜃𝑅=2/̂𝜃1+̂𝜃2. The study of the exact distribution of 𝑅 is apparently rather complicated, so we will consider the asymptotic distribution. We need the asymptotic joint distribution of ̂𝛽, ̂𝜃1 and ̂𝜃2. The second partial derivatives of the log-likelihood function are given by 𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝛽2=(𝑛+𝑚+2)𝛽2𝜃1𝑟𝛽𝑛ln𝑟𝑛2𝜃2𝑠𝛽𝑚ln𝑠𝑚2,𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝜃21(=𝑛+1)𝜃21,𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝜃22=(𝑚+1)𝜃22,𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝛽𝜕𝜃1=𝑟𝛽𝑛ln𝑟𝑛,𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝛽𝜕𝜃2=𝑠𝛽𝑚ln𝑠𝑚,𝜕2𝑙𝛽,𝜃1,𝜃2𝜕𝜃1𝜕𝜃2=0.(2.7)

The Fisher information matrix is given by 𝐼𝐈=11𝐼12𝐼13𝐼21𝐼22𝐼23𝐼31𝐼32𝐼33𝐸𝜕=2𝑙𝜕𝛽2𝐸𝜕2𝑙𝜕𝛽𝜕𝜃1𝐸𝜕2𝑙𝜕𝛽𝜕𝜃2𝐸𝜕2𝑙𝜕𝛽𝜕𝜃1𝐸𝜕2𝑙𝜕𝜃21𝐸𝜕2𝑙𝜕𝜃1𝜕𝜃2𝐸𝜕2𝑙𝜕𝛽𝜕𝜃2𝐸𝜕2𝑙𝜕𝜃1𝜕𝜃2𝐸𝜕2𝑙𝜕𝜃22.(2.8)

Now we will find the entries of the information matrix. We need to find 𝐸(𝑟𝛽𝑛ln𝑟𝑛), 𝐸(𝑟𝛽𝑛(ln𝑟𝑛)2), 𝐸(𝑠𝛽𝑚ln𝑠𝑚) and 𝐸(𝑠𝛽𝑚(ln𝑠𝑚)2). Using result (2.4.3) of Arnold et al. [3] we have 𝑈=𝜃1𝑟𝛽𝑛Γ(𝑛+1,1), and it follows that𝐸𝑟𝛽𝑛ln𝑟𝑛=1𝛽𝜃1𝐸𝜃1𝑟𝛽𝑛ln𝜃1𝑟𝛽𝑛ln𝜃1=1𝛽𝜃1𝐸(𝑈ln𝑈)ln𝜃1=𝐸(𝑈)(𝑛+1)𝛽𝜃1𝜓(𝑛+2)ln𝜃1,(2.9)𝐸𝑟𝛽𝑛ln𝑟𝑛2=1𝜃1𝛽2𝐸𝜃1𝑟𝛽𝑛ln𝜃1𝑟𝛽𝑛ln𝜃12=1𝜃1𝛽2𝐸𝜃1𝑟𝛽𝑛ln𝜃1𝑟𝛽𝑛22ln𝜃1𝑙𝑛𝜃1𝑟𝛽𝑛+ln𝜃12=1𝜃1𝛽2𝐸𝑈(ln𝑈)22ln𝜃1𝐸(𝑈ln𝑈)+ln𝜃12=𝐸(𝑈)(𝑛+1)𝜃1𝛽2𝜓(𝑛+2)2+𝜓(𝑛+2)2ln𝜃1𝜓(𝑛+2)+ln𝜃12.(2.10)

Similarly, since 𝑉=𝜃2𝑠𝛽𝑚Γ(𝑚+1,1), we have 𝐸𝑠𝛽𝑚ln𝑠𝑚=(𝑚+1)𝛽𝜃2𝜓(𝑚+2)ln𝜃2,𝐸𝑠𝛽𝑚ln𝑠𝑚2=(𝑚+1)𝜃2𝛽2𝜓(𝑚+2)2+𝜓(𝑚+2)2ln𝜃2𝜓(𝑚+2)+ln𝜃22.(2.11)

Let 𝑅=𝜃1/(𝜃1+𝜃2)=(𝜃1,𝜃2), the maximum likelihood estimator of 𝑅 is given by ̂𝜃𝑅=1̂𝜃/(1+̂𝜃2̂𝜃)=(1,̂𝜃2), and it follows that 𝑛=𝑅𝑅𝑛̂𝜃1,̂𝜃2𝜃1,𝜃2𝐷𝑁0,𝜂2,(2.12) as 𝑛, 𝑚 such that 𝑚/𝑛𝑝, 0<𝑝<1. The asymptotic variance is given by 𝜂2𝜃𝜕1,𝜃2𝜕𝜃12var̂𝜃1+𝜃𝜕1,𝜃2𝜕𝜃22var̂𝜃2𝑝̂𝜃+2cov1,̂𝜃2𝜃𝜕1,𝜃2𝜕𝜃1𝜃𝜕1,𝜃2𝜕𝜃2.(2.13)

A (1𝛼)% confidence interval for 𝑅 based on this asymptotic result is given by 𝑅𝑧1𝛼/2̂𝜂,𝑅+𝑧1𝛼/2,̂𝜂(2.14) where ̂𝜂 is obtained by substituting 𝑚/𝑛 for 𝑝 and the MLEs of 𝜃1 and 𝜃2 in the asymptotic standard deviation 𝜂. Another interval can be obtained by using the matrix of minus the second partial derivatives of the log-likelihood function. This matrix can be used in place of the Fisher information matrix to obtain ̂̂𝜂 as an estimator of the variance of 𝑅. The new interval is given by 𝑅𝑧1𝛼/2̂̂𝜂,𝑅+𝑧1𝛼/2.̂̂𝜂(2.15) Many authors suggested that parameter transformation may improve the performance of intervals based on the asymptotic normality of the maximum likelihood estimator. For parameters representing probabilities, the logit transformation seems appropriate. Let 𝜆=ln(𝑅/1𝑅), and the maximum likelihood of 𝜆 is given by ̂𝜆=ln(𝑅/1𝑅). The asymptotic variance of ̂𝑛(𝜆𝜆) is given by 𝜂2(𝑑𝜆/𝑑𝑅)2. This variance can be estimated by substituting the maximum likelihood estimator instead of the parameter. A (1𝛼)% confidence interval for 𝜆 is given by (̂𝜆𝐿,̂𝜆𝑈), where ̂𝜆𝐿=̂𝜆𝑧1𝛼/2̂𝜂𝑑𝜆/𝑑𝑅𝑅=𝑅 and ̂𝜆𝑈=̂𝜆+𝑧1𝛼/2̂𝜂𝑑𝜆/𝑑𝑅𝑅=𝑅, 𝑧1𝛼/2 is the (1𝛼/2) quantile of the standard normal distribution. Therefore A (1𝛼)% confidence interval for 𝑅 is given by 𝑒̂𝜆𝐿̂𝜆1+𝑒𝐿,𝑒̂𝜆𝑈̂𝜆1+𝑒𝑈.(2.16)

3. Bayesian Inference

As in Kundu and Gupta [11], we will use the conjugate gamma priors for the scale parameters 𝜃1 and 𝜃2 and a squared error loss function. The prior density of 𝜃𝑗 therefore is given by 𝜋𝑗𝜃𝑗𝑎𝑗,𝑏𝑗=𝑏𝑎𝑗𝑗Γ𝑎𝑗𝜃𝑎𝑗𝑗1𝑒𝑏𝑗𝜃𝑗,𝑗=1,2,(3.1) where 𝑎𝑗>0, 𝑏𝑗>0 are the parameters of the prior distributions. Assuming that 𝜃1 and 𝜃2 are independent, the joint prior distribution of 𝜃1 and 𝜃2 is given by 𝜋(𝜃1,𝜃2)=2𝑗=1𝜋𝑗(𝜃𝑗). Assume that the prior distribution of 𝛽, denoted by 𝜁(𝛽), has a support on (0,). Recall that the likelihood function of (𝛽,𝜃1,𝜃2) based on 𝑟=𝑟0,,𝑟𝑛 and 𝑠=𝑠0,,𝑠𝑚 is given by 𝐿𝛽,𝜃1,𝜃2𝑟,𝑠=𝜃1𝛽𝑛+1𝑒𝜃1𝑟𝛽𝑛𝑛𝑖=0𝑟𝑖𝛽1𝜃2𝛽𝑚+1𝑒𝜃2𝑠𝛽𝑚𝑚𝑖=0𝑠𝑖𝛽1.(3.2)

The joint posterior density function of (𝛽,𝜃1,𝜃2) therefore is given by; 𝜋𝛽,𝜃1,𝜃2=𝜋𝜃1,𝜃2𝜁(𝛽)𝐿𝛽,𝜃1,𝜃2𝑟,𝑠0𝜋𝜃1,𝜃2𝜁(𝛽)𝐿𝛽,𝜃1,𝜃2𝑟,𝑠𝑑𝛽𝑑𝜃1𝑑𝜃2,(3.3) where the numerator is given by 𝛽𝑛+𝑚+2𝑏𝜁(𝛽)𝑎11Γ𝑎1𝜃1𝑛+𝑎1𝑒𝜃1(𝑟𝛽𝑛+𝑏1)𝑛𝑖=0𝑟𝑖𝛽1𝑏𝑎22Γ𝑎2𝜃2𝑚+𝑎2𝑒𝜃2(𝑠𝛽𝑚+𝑏2)𝑚𝑖=0𝑠𝑖𝛽1.(3.4)

This expression for 𝜋(𝛽,𝜃1,𝜃2) is difficult or even impossible to find in closed form. A simulation technique is needed. It is clear that the conditional posterior distributions of 𝜃1 and 𝜃2 given 𝛽 are given by 𝜋1𝜃1=𝑟𝛽𝛽𝑛+𝑏1𝑛+𝑎1+1Γ𝑛+𝑎1𝜃+11𝑛+𝑎1𝑒𝜃1(𝑟𝛽𝑛+𝑏1),(3.5)𝜋2𝜃2=𝑠𝛽𝛽𝑚+𝑏2𝑚+𝑎2+1Γ𝑚+𝑎2𝜃+12𝑚+𝑎2𝑒𝜃2(𝑠𝛽𝑚+𝑏2).(3.6)

Note that given 𝛽, 𝜃1, and 𝜃2 are independent. From the joint posterior distribution of 𝜃1, 𝜃2, and 𝛽 we obtain 𝜁(𝛽)𝜁(𝛽)𝛽𝑛+𝑚+2𝑛𝑖=0𝑟𝑖𝑚𝑖=0𝑠𝑖𝛽𝑟𝛽𝑛+𝑏1(𝑛+𝑎1+1)𝑠𝛽𝑚+𝑏2(𝑚+𝑎2+1).(3.7)

Taking 𝑎1=𝑎2=0 and 𝑏1=𝑏2=0 in the prior distributions we obtain 𝜁(𝛽)𝜁(𝛽)𝛽𝑛+𝑚+2𝑟𝑛(𝑛+1)𝑠𝑚𝑛(𝑚+1)𝑖=0𝑟𝑖𝑚𝑖=0𝑠𝑖𝛽.(3.8)

If we can take 𝜁(𝛽)=𝑒𝛽, we obtain 𝜁(𝛽)𝛽𝑛+𝑚+2𝑒1𝑟𝑛(𝑛+1)𝑠𝑚𝑛(𝑚+1)𝑖=0𝑟𝑖𝑚𝑖=0𝑠𝑖𝛽.(3.9)

It follows that the posterior distribution of 𝛽 is Γ𝑒𝑛+𝑚+3,ln1𝑟𝑛(𝑛+1)𝑠𝑚𝑛(𝑚+1)𝑖=0𝑟𝑖𝑚𝑖=0𝑠𝑖1.(3.10)

We can use the following Monte Carlo method to find approximate point estimates. (1)Generate 𝛽1 from 𝜁(𝛽) given by (3.10).(2)Generate 𝜃1,1 and 𝜃2,1 from 𝜋1(𝜃1𝛽1) and 𝜋2(𝜃2𝛽1) given by (3.5) and (3.6), respectively, and calculate 𝑅1=𝜃1,1/𝜃1,1+𝜃2,1.(3)Repeat steps (1) and (2)   𝑀 times to get 𝑅1,,𝑅𝑀.(4)Calculate the approximate Bayes estimator and the approximate posterior variance; 𝐸𝑅𝑟,𝑠=𝑀𝑖=1𝑅𝑖𝑀=𝑉𝑅,𝑅𝑟,𝑠=𝑀𝑖=1𝑅𝑖𝑅2𝑀.(3.11)

Approximate highest posterior density (HPD) intervals for the stress-strength reliability 𝑅 may be found using the algorithm of Chen and Shao [12].

4. A Simulation Study

We conducted simulations to compare the performance of the point and interval estimators developed in this paper. In our simulations we also included some bootstrap intervals [13], namely, the percentile interval and the bootstrap-t interval based on the logit transformation of 𝑅. We used 𝑛=𝑚=6, 8, 10, 15. We fixed 𝜃1=1 and used 𝜃2 = 1, 3, 5, 7, 9. The confidence level taken is (1𝛼)=0.95. In each simulation run we generated 2000 samples of records from the distributions of 𝑋 and 𝑌. For each pair of samples we calculated the following intervals:(1)ANE: the interval based on the asymptotic normality of the MLE with variance estimate based on the Fisher information matrix, given by (2.14),(2)ANO: the interval based on the asymptotic normality of the MLE with variance estimate based on the observed information matrix, given by (2.15),(3)ANT: the interval based on the asymptotic normality of the MLE of the transformed parameter with variance estimate based on the observed information matrix, given by (2.16),(4)HPD: the approximate highest posterior density interval of Chen and Shao [12],(5)Perc: the percentile interval, [13], and(6)Boot: the bootstrap-t interval based on the logit transformation of the stress-strength reliability parameter, [13].

The biases and mean squared error of the Bayes and the maximum likelihood estimator are simulated and the results are given in Table 1. The lower (L), upper (U), and total (T) error rates and the expected widths (W) of the intervals are approximated using the results of the 2000 simulation replications. For the percentile and bootstrap-t intervals we used 1000 bootstrap resamples. The results of our simulations are given in Table 2.

5. Conclusions

Results concerning the performance of the Bayes estimator and the maximum likelihood estimator are given in Table 1. It appears that the MLE is negatively biased while the Bayes estimator is positively biased. However, the biases are very small, especially for large sample sizes. The mean squared errors of the estimators are very close, especially for larger samples. However, it appears that the MLE has higher MSE for values of 𝑅 near 0.5 and small sample sizes. The situation is reversed for values of 𝑅 close to the extremes.

The simulation results in Table 2 show that the widths of all intervals are maximized when 𝑅=0.5, and they become narrower as the true value of 𝑅 approaches the extremes. As expected, increasing the sample sizes also results in shorter intervals. The performance of the intervals (ANO) and (ANE) based on the asymptotic normality of the MLE are very similar. They tend to be anticonservative and are short. However, for the transformed intervals, the situation completely changed, and the results are very encouraging. This is true for (ANT) interval and the bootstrap-t interval based on the transformed parameter. The performance of bootstrap-t intervals for the original parameter is also investigated, but the results are very poor and not included in Table 2. The Bayes interval (HPD) appears to have better performance than (ANO), and (ANE). However it is dominated by the intervals based on the transformed parameter (ANT) and (Boot). The same is true for the percentile interval which tends to be anticonservative. Concerning the symmetry of lower (L) and upper (U) error rates, it appears that the (ANE), (ANO) and (Perc) intervals are highly asymmetric, especially for extreme values of the stress-strength reliability. On the other hand, the (ANT), (Boot), and (HPD) tend to be symmetric even for small samples. In conclusion, we would recommend the use of the bootstrap-t interval (Boot) based on the transformed parameter followed by (ANT) for all sample sizes.