Statistical Inferences and Applications of the Half Exponential Power Distribution

Gui, Wenhao

doi:https://doi.org/10.1155/2013/219473

Journal of Quality and Reliability Engineering

On this page

Abstract Introduction Appendix References Copyright Related Articles

Research Article | Open Access

Volume 2013 | Article ID 219473 | https://doi.org/10.1155/2013/219473

Statistical Inferences and Applications of the Half Exponential Power Distribution

Wenhao Gui¹

Academic Editor: Kai Yuan Cai

Received23 Oct 2012

Revised13 Dec 2012

Accepted04 Feb 2013

Published05 Jun 2013

Abstract

We investigate the statistical inferences and applications of the half exponential power distribution for the first time. The proposed model defined on the nonnegative reals extends the half normal distribution and is more flexible. The characterizations and properties involving moments and some measures based on moments of this distribution are derived. The inference aspects using methods of moment and maximum likelihood are presented. We also study the performance of the estimators using the Monte Carlo simulation. Finally, we illustrate it with two real applications.

1. Introduction

The well-known exponential power (EP) distribution or the generalized normal distribution has the following density function: where is the shape parameter. This family consists of a wide range of symmetric distributions and allows continuous variation from normality to nonnormality. It includes the normal distribution as the special case when and the Laplace distribution when . Nadarajah [1] provided a comprehensive treatment of its mathematical properties.

Its tails can be more platykurtic () or more leptokurtic () than the normal distribution (). The distribution has been widely used in the Bayes analysis and robustness studies (see Box and Tiao [2], Genc [3], Goodman and Kotz [4], and Tiao and Lund [5].)

On the other hand, since the most popular models used to describe the lifetime process are defined on nonnegative measurements, which motivate us to take a positive truncation in the model (1) and develop a half exponential power (HEP) distribution. As far as we know, this model has not been previously studied although, we believe, it plays an important role in data analysis. The resulting nonnegative half exponential power distribution generalizes the half normal (HN) distribution, and it is more flexible. In our work, we aim to investigate the statistical features of the nonnegative model and apply them to fit the lifetime data.

The rest of this paper is organized as follows: in Section 2, we present the new distribution and study its properties. Section 3 discusses the inference, moments, and maximum likelihood estimation for the parameters. In Section 4, we discuss a useful technique, a half normal plot with a simulated envelope, to assess the model adequacy. Simulation studies are performed in Section 5. Section 6 gives two illustrative examples and reports the results. Section 7 concludes our work.

2. The Half Exponential Power Distribution

2.1. The Density and Hazard Function

Definition 1. A random variable has a half exponential power slash distribution if its density function with scale parameter takes where and . We denote it as .
Figure 1(a) displays some plots of the density function of the half exponential power distribution with various parameters.
The cumulative distribution function of the half exponential power distribution is given as follows. For , where is the lower incomplete gamma function, defined as .
The hazard rate function (also known as the failure rate function) of the half exponential power distribution is given by, for ,
Since , as , we obtain . Therefore, the hazard rate function is increasing for and decreasing for . Figure 1(b) displays some plots of the hazard rate function of the half exponential power distribution with various parameters.

(a) Density function

(b) Hazard function

2.2. Moments and Measures Based on Moments

Proposition 2. Let , for ; the th noncentral moments are given by

The following results are immediate consequences of (5).

Corollary 3. Let . The mean and variance of are given by

Corollary 4. Let . The skewness and kurtosis coefficients of are given by

Figure 2 shows the skewness and kurtosis coefficients with various parameters for the HEP model.

(a) Skewness coefficient

(b) Skewness coefficient in log scale

(c) Kurtosis coefficient

(d) Kurtosis coefficient in log scale

3. Inference

3.1. Moment Estimation

Let be a random sample from the distribution . From (5), we have and . Replacing and with the corresponding sample estimators, we obtain the moment equations

The estimate is the solution to which can be solved numerically. And the estimate is given by

It is clear that, for the special case when is known, estimator is unbiased and its mean squared error (MSE) is given by

In the following proposition, we present the asymtotic property of the moment estimators.

Proposition 5. Let be a random sample of size from the distribution , and let ; then, if and is the moment estimator of , one has as , where and is given by
whose entries are given by

where is the digamma function defined as the logarithmic derivative of the gamma function, .

Remark 6. A consistent estimator for the asymptotic covariance matrix can be obtained by replacing parameters with their corresponding moment estimators.

3.2. Maximum Likelihood Estimation

In this section, we consider the maximum likelihood estimation about the parameter of the model defined in (2). The log likelihood for a random sample is

By taking the partial derivatives of the log-likelihood function with respect to and , respectively, and equalizing the obtained expressions to zero, the following maximum likelihood estimating equations are obtained:

In general, there are no explicit solutions for the above maximum likelihood estimating equations. The estimates can be obtained by means of numerical procedures such as the Newton-Raphson method. The program provides the nonlinear optimization routine optim for solving such problems.

For asymptotic inference of , we need the Fisher information matrix . It is known that its inverse is the asymptotic variance matrix of the maximum likelihood estimators. For the case of a single observation (), we take the second-order derivatives of the log-likelihood function in (15).

Consider, Using the facts we can obtain the elements of the Fisher information matrix:

Proposition 7. Let be a random sample of size from the distribution , let , and is the maximum likelihood estimator of , one has

4. Assessment of Model Adequacy

In this section, we introduce a useful tool, a half normal plot with a simulated envelope which will be used to evaluate the HEP model in Section 6. The advantage of this technique is its ease of interpretation without knowing the distribution of the residuals.

Atkinson [6] proposed this diagnostic plot to detect potential outliers and influential observations in linear regression models. A simulated envelope is added to the plot to aid overall assessment, whereby the observed residuals are expected to lie within the boundary of the envelope if the presumed model has been correctly specified.

The method of simulated envelope and its corresponding transformations have been widely applied in many applications (see Flack and Flores [7], Ferrari and Cribari-Neto [8], da Silva Ferreira et al. [9], and so forth.) The simulated envelope technique compares the observed statistics with those of the data generated from the proposed model. Any sizeble departure of the observed residuals from the simulated quantities may be thought as evidence against the adequacy of the proposed model. Here is the procedure to produce the half normal plot with simulated envelopes. (1)Fit the model to the observed data (sample size = ). (2)Generate a sample of observations based on the fitted model. (3)Fit the model to the above generated sample and compute the ordered absolute values of the standard residuals. (4)Repeat the above steps times.(5)Consider the sets of the -ordered statistics; calculate the average, minimum, and maximum values across each set. (6)Plot these values together with the ordered residuals from the original data against the half normal scores .

The minimum and maximum values of the -ordered statistics constitute a simulated envelope to guide assessment of the model adequacy. Atkinson [6] suggested using since there is a 5% chance to detect the largest residual being outside the boundary of the simulated envelope. Moreover, other types of residuals such as deviance or score residual may be used in the procedure. For example, da Silva Ferreira et al. [9] used the Mahalanobis distance to assess their models. The horizontal axis can also show other variables such as index.

5. Simulation Study

In this section, we conduct some simulations and study the properties of the estimators numerically.

We perform a simulation to illustrate the behaviors of the moment and MLE estimators for parameters , respectively. The simulation is conducted by the software . We generate 1000 samples of size , , and from the distribution for fixed parameters and .

The random numbers can be generated as follows. We first generate random numbers from an exponential power distribution with , , and , the procedures can be found in Chiodi [10]; then we take the absolute value of the random numbers, . It follows that .

The estimators are computed using the results in Section 3. The empirical means and standard deviations of the estimators are presented in Tables 1 and 2, respectively. The simulation studies show that the parameters are well estimated, and the estimates are asymptotically unbiased. The empirical MSEs decrease as sample size increases as expected. Further, MLEs are more efficient than moment estimators.

6. Real Data Illustration

In this section, we analyze two real datasets to fit with the proposed model. The applications demonstrate that the HEP model fits the data better than the HN model.

6.1. Application 1

The data are the plasma ferritin concentration measurements of 202 athletes collected at the Australian Institute of Sport. This dataset has been studied by several authors (see Azzalini and Dalla Valle [11], Cook and Weisberc [12], and Elal-Olivero et al. [13].)

The descriptive statistics for the dataset are shown in Table 3, where and are the sample skewness and kurtosis coefficients. Notice that the dataset presents nonnegative measurements.

We fit the dataset with the half normal and the half exponential power distribution, respectively, using maximum likelihood method. The MLE estimators are computed using , and the results are reported in Table 4. The usual Akaike information criterion (AIC) and Bayesian information criterion (BIC) to measure of the goodness of fit are also computed: and , where, is the number of parameters in the distribution and is the maximized value of the likelihood function. The results indicate that HEP model has the lower values for the AIC and BIC statistics, and thus it is a better model. Figures 3(a) and 3(b) display the fitted models using the MLE estimates.

(a) Histogram and fitted curves

(b) Empirical and fitted CDF

The diagnostic procedure introduced in Section 4 is implemented for both models. The simulated envelope plots are shown in Figures 4(a) and 4(b). Most of the observed residuals are either near or outside the boundary of the envelope, indicating inadequacy of the fitted HN model. On the other hand, the observed residuals corresponding to the HEP model in Figure 4(b) are well within the simulated envelope, indicating that the HEP model provides a better fit to the data.

(a) Half normal

(b) Half exponential power

6.2. Application 2

We consider the stress-rupture dataset and the life of fatigue fracture of Kevlar 49/epoxy that are subject to the pressure at the 90% level. The dataset has been previously studied by Andrews and Herzberg [14], Barlow et al. [15], and Olmos et al. [16].

Table 5 summarizes the dataset. This dataset also shows nonnegative asymmetry. Same as before, we fit the dataset with the half normal and the half exponential power distribution, respectively, using maximum likelihood method. The results are reported in Table 6. The AIC and BIC are presented as well, and the results show that HEP model fits better. Figures 5(a) and 5(b) display the fitted models using the MLE estimates.

(a) Histogram and fitted curves

(b) Empirical and fitted CDF

The diagnostic procedure introduced in Section 4 is implemented for both models. The simulated envelope plots are shown in Figures 6(a) and 6(b). The observed residuals corresponding to the HEP model in Figure 6(b) are well within the simulated envelope, indicating that the HEP model provides a better fit to the data.

(a) Half normal

(b) Half exponential power

7. Concluding Remarks

In this paper, we have studied the half exponential power distribution in detail. This nonnegative distribution contains the half normal distribution as its special case. Probabilistic and inferential properties are studied. A simulation is conducted and demonstrates the good performance of the moment and maximum likelihood estimators. We apply the model to two real datasets, illustrating that the proposed model is appropriate and flexible in real applications. There are a number of possible extensions of the current work. Mixture modeling using the proposed distributions is the most natural extension. Other extensions of the current work include a generalization of the distribution to multivariate settings.

Appendix

Proofs of Propositions

Proof of Proposition 2. Consider,

Proof of Proposition 5. This result follows directly by using standard large sample theory for moment estimators, as discussed in Sen and Singer [17].

Proof of Proposition 7. It follows directly by using the large sample theory for maximum likelihood estimators and the Fisher information matrix given above.

References

S. Nadarajah, “A generalized normal distribution,” Journal of Applied Statistics, vol. 32, no. 7, pp. 685–694, 2005.
View at: Publisher Site | Google Scholar
G. Box and G. Tiao, “A further look at robustness via bayes's theorem,” Biometrika, vol. 49, no. 3-4, pp. 419–432, 1962.
View at: Google Scholar
A. I. Genç, “A generalization of the univariate slash by a scale-mixtured exponential power distribution,” Communications in Statistics, vol. 36, no. 5, pp. 937–947, 2007.
View at: Publisher Site | Google Scholar
I. R. Goodman and S. Kotz, “Multivariate θ-generalized normal distributions,” Journal of Multivariate Analysis, vol. 3, no. 2, pp. 204–219, 1973.
View at: Google Scholar
G. Tiao and D. Lund, “The use of olumv estimators in inference robustness studies of the location parameter of a class of symmetric distributions,” Journal of the American Statistical Association, vol. 65, pp. 370–386, 1970.
View at: Google Scholar
A. Atkinson, Plots, Transformations, and Regression: An Introduction to Graphical Methods of Diagnostic Regression Analysis, Clarendon Press Oxford, 1985.
V. F. Flack and R. A. Flores, “Using simulated envelopes in the evaluation of normal probability plots of regression residuals,” Technometrics, vol. 31, no. 2, pp. 219–225, 1989.
View at: Google Scholar
S. L. P. Ferrari and F. Cribari-Neto, “Beta regression for modelling rates and proportions,” Journal of Applied Statistics, vol. 31, no. 7, pp. 799–815, 2004.
View at: Publisher Site | Google Scholar
C. da Silva Ferreira, H. Bolfarine, and V. H. Lachos, “Skew scale mixtures of normal distributions: properties and estimation,” Statistical Methodology, vol. 8, no. 2, pp. 154–171, 2011.
View at: Publisher Site | Google Scholar
M. Chiodi, “Procedures for generating pseudo-random numbers from a normal distribution of order p ( $P > 1$ ),” Statistica Applicata, vol. 1, pp. 7–26, 1986.
View at: Google Scholar
A. Azzalini and A. Dalla Valle, “The multivariate skew-normal distribution,” Biometrika, vol. 83, no. 4, pp. 715–726, 1996.
View at: Google Scholar
R. Cook and S. Weisberc, “An introduction to regression graphic?” Methods, vol. 17, article 640, 1994.
View at: Google Scholar
D. Elal-Olivero, J. F. Olivares-Pacheco, H. W. Gómez, and H. Bolfarine, “A new class of non negative distributions generated by symmetric distributions,” Communications in Statistics—Theory and Methods, vol. 38, no. 7, pp. 993–1008, 2009.
View at: Publisher Site | Google Scholar
D. Andrews and A. Herzberg, Data: A Collection of Problems from Many Fields for the Student and Research Worker, vol. 18, Springer, New York, NY, USA, 1985.
R. Barlow, R. Toland, and T. Freeman, “A bayesian analysis of the stress-rupture life of kevlar/epoxy spherical pressure vessels,” in Accelerated Life Testing and Experts Opinions in Reliability, C. A. Clarotti and D. V. Lindley, Eds., 1988.
View at: Google Scholar
N. M. Olmos, H. Varela, H. W. Gómez, and H. Bolfarine, “An extension of the half-normal distribution,” Statistical Papers, pp. 1–12, 2011.
View at: Publisher Site | Google Scholar
P. Sen and J. M. Singer, Large Sample Methods in Statistics: An Introduction with Applications, Chapman and Hall/CRC, 1993.

Copyright

Copyright © 2013 Wenhao Gui. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1566

Downloads

728

Citations