A New Class of Distributions Generated by the Extended Bimodal-Normal Distribution

Cortés, Milton A.; Elal-Olivero, David; Olivares-Pacheco, Juan F.

doi:https://doi.org/10.1155/2018/9753439

Journal of Probability and Statistics

On this page

Abstract Introduction Conclusion Data Availability Disclosure Conflicts of Interest References Copyright Related Articles

Research Article | Open Access

Volume 2018 | Article ID 9753439 | https://doi.org/10.1155/2018/9753439

A New Class of Distributions Generated by the Extended Bimodal-Normal Distribution

Milton A. Cortés,¹David Elal-Olivero,¹and Juan F. Olivares-Pacheco¹

Academic Editor: Steve Su

Received21 Jun 2018

Accepted27 Sept 2018

Published01 Nov 2018

Abstract

In this study, we present a new family of distributions through generalization of the extended bimodal-normal distribution. This family includes several special cases, like the normal, Birnbaum-Saunders, Student’s , and Laplace distribution, that are developed and defined using stochastic representation. The theoretical properties are derived, and easily implemented Monte Carlo simulation schemes are presented. An inferential study is performed for the Laplace distribution. We end with an illustration of two real data sets.

1. Introduction

Although the normal distribution is the most popular probability model in statistics, several random phenomena in nature cannot be described by the normal distribution. In this regard, Azzalini [1] introduced an extension of the normal distribution called skew-normal distribution, where this model shares some properties with the standard normal model; it is mathematically tractable and it has a wide range of the coefficients of skewness and kurtosis. From this work, an important line of research focusing on finding new distributions that offered greater flexibility is generated.

More recently, Elal-Olivero [2] introduced a new class of skew-normal distribution called alpha-skew-normal distribution. In doing so, he first defined a new bimodal-symmetric normal distribution with probability density function given by where is the standard normal density, which is defined as the bimodal-normal (BN) distribution. Furthermore, he studies some properties of this distribution and presents its stochastic representation as the product of two independent random variables and , where and is a discrete random variable such that ; that is, has the distribution BN. On the other hand, an extension of the BN density is given bywhere is the shape parameter. Note that this density function is symmetric and is characterized by incorporating bimodality into the normal distribution, which is controlled by the parameter . Elal-Olivero [2] presents this extension as the symmetric-component of the alpha-skew-normal distribution. Furthermore, (2) also can be deduced from the model presented in Elal-Olivero et al. [3]. In this regard, Gui et al. [4] incorporated (2) into the slash distribution, developed its properties, and performed inferential studies, whereas Gómez and Guerrero [5] incorporated (2) into the Birnbaum-Saunders distribution, tested its bimodality, and demonstrated its principal properties.

The objective of this article is to present a new family of distributions through generalization of (2). This generalization can be applied to any density function, thereby producing a more flexible model incorporating a shape parameter. Depending on the density at which we apply this generalization, it is observed that the new model is flexible enough to support uni- and bimodal shapes. Furthermore, Gui et al. [4] and Gómez and Guerrero [5] are particular cases of the generalization proposal.

This article is organized as follows. In Section 2, we present a generalization of (2) and review some particular cases (normal, Birnbaum-Saunders, Student’s , and Laplace distribution). In Section 3, we develop the basic properties of the cases from Section 2 and study the effects of this new generalization. In Section 4, we study some inferential aspects of the extended Laplace distribution using maximum likelihood estimation and perform a Monte Carlo simulation study. We conclude in Section 5 with a discussion.

2. A General Class of Distributions

This section describes a general class of distributions generated by (2), presents its basic properties, and derives explicit expressions for the normal, Birnbaum-Saunders, Student’s , and Laplace distribution.

2.1. Characterization and Properties

Theorem 1 (general class of distributions). Let be a probability density function and a positive continuous function such that , where . Then,is a probability density function with shape parameter .

Proof. If we note can be represented as a mixture of two densities, then the result follows immediately; that is, , where .

Remark 2. On the basis of Theorem 1, we can make the following observations: (1)If , then , .(2)If , then , .

Theorem 3 (stochastic representation). Let and be independent random variables. Ifthen .

Proof. Since can be represented as a mixture, the result follows immediately.

Remark 4. If , then (1)The cumulative distribution function is given by where and are the cumulative distribution functions of and , respectively.(2)The moment generating function is given by where and are the moments generating functions of and , respectively, if both exist.(3)The -th moment of the random variable is given by

2.2. Special Cases

In this section, explicit expressions are provided for the probability density function in (3) for the normal, Birnbaum-Saunders, Student’s , and Laplace distribution and different choices of . These models are selected to show the benefits of the proposed extension, and the choice of the function is conditioned upon a positive function with finite expectation.

Corollary 5 (normal case). If and , then has the probability density function given byand we say that has an “extended normal distribution,” which is denoted as .

Corollary 6 (Birnbaum-Saunders case). Let . If and where is the derivative with respect to , with and , then has the probability density function given byand we say that has an “extended Birnbaum-Saunders distribution,” which is denoted as .

Corollary 7 (Student’s case). If and where , then has the probability density function given byand we say that has an “extended Student’s distribution,” which is denoted as .

Corollary 8 (Laplace case). If and , then has the probability density function given byand we say that has an “extended Laplace distribution,” which is denoted as .

As we notice, in the Corollaries 5–8 and Figure 1, when the function is a symmetric density, the effect of the extension is that the model supports uni- and bimodal shapes. On the other hand, if the model has positive support, the bimodality depends on the choice of parameters, as seen in the Birnbaum-Saunders distribution case.

(a)

(b)

(c)

(d)

3. Some Results of the Special Cases

In this section, we develop some properties associated with the models defined in Corollaries 5–8. The cumulative distribution function, moment, and stochastic representation will be presented when they correspond to the cases at hand. Some proofs are straightforward and are, therefore, omitted.

3.1. Extended Normal Distribution

The extended normal distribution is the basis for the development of the specific cases discussed previously. If and , then, from Theorem 3, we know that and . Note that the distribution of corresponds to the bimodal-normal distribution, for which the stochastic representation was presented in Section 1.

The stochastic representation of is obtained through Theorem 3. Table 1 shows an alternative way to generate random variables . Furthermore, the stochastic representation has a form that is similar to the representation given in Henze [6] for the skew-normal distribution presented in Azzalini [1].

Remark 9. If , then (1)The cumulative distribution function is given by (2)The -th moment of the random variable is given by (3)The expected value and variance of the random variable is given by

3.2. Extended Birnbaum-Saunders distribution

The Birnbaum-Saunders (BS) distribution (see Birnbaum and Saunders [7, 8]) describes the lifetime of components exposed to fatigue caused by cyclical stress and tension. Since 1969, the number of studies that have investigated this distribution and discussed the development of both its theoretical properties and its applications has increased dramatically. Because of its significance, this distribution has been extended in a variety of manners to relax its behavior and thus make it applicable to a wide range of situations. For example, see Birnbaum and Saunders [7, 8], Mann et al. [9], Desmond [10, 11], Chang and Tang [12], Díaz-García and Leiva-Sánchez [13], Gómez et al. [14], and Olmos et al. [15]. The BS distribution with parameters and has density function given by where is defined in Corollary 6 and is denoted as . If and , then with a stochastic representation given by where . From Theorem 1, the extended Birnbaum-Saunders distribution has density (9) and from Theorem 3 we can generate random variables . An alternative way to generate this random variable can be seen in Table 1.

Theorem 10. Let with , and . Then (1), for .(2).

Proof. The proofs are immediate from the theorem of the change of variable.

Remark 11. Like the Birnbaum-Saunders distribution observes that the property established in Theorem 10 implies that the EBS distribution belongs to the scale family, whilst the property implies that it also belongs to the family of random variables closed under reciprocation; see Saunders [16]. Furthermore, based on properties and , we can have the two-parameter EBS distribution: .

Remark 12. If , then (1)The cumulative distribution function is given by where is the cumulative distribution function of the Birnbaum-Saunders distribution.(2)The -th moment of the random variable is given by where .(3)The expected value and variance of the random variable is given by

3.3. Extended Student’s -Distribution

The Student’s -distribution serves as a robust alternative when it is desired to model data sets with atypical values and with a coefficient of kurtosis that is greater than of the normal distribution. The Student’s -distribution with parameter has a density function given by If and , then, with a stochastic representation given by , where and are independent random variables. From Theorem 1, the extended Student’s -distribution has density (10) and from Theorem 3 we can generate random variables . An alternative way to generate this random variable can be seen in Table 1.

Remark 13. If , then (1)The cumulative distribution function is given by where and are the cumulative distribution function and probability density function of the -Student distribution, respectively.(2)The -th moment of the random variable is given by where .(3)The expected value and variance of the random variable is given by

3.4. Extended Laplace Distribution

The Laplace (L) or double exponential distribution, which was originally published by Pierre Laplace in 1774, is a symmetric distribution with density function given by If and , then, with a stochastic representation given by , where and are independent random variables. From Theorem 1, the extended Laplace distribution has density (11) and from Theorem 3 we can generate random variables . An alternative way to generate this random variable can be seen in Table 1.

Remark 14. If , then (1)The cumulative distribution function is given by (2)The -th moment of the random variable is given by (3)The expected value and variance of the random variable are given by Table 1 shows an alternative way to generate random variables for the special cases defined in Corollaries 5–8. We can see that the extended normal distribution is the basis for the development of the specific cases discussed previously.

4. Inferential Aspects of the EL Distribution

In this section, we will study some inferential properties of the extended Laplace distribution defined in Corollary 8. We will explore maximum likelihood estimators and Monte Carlo simulation and will apply there results to two real data sets, comparing the fit with the Laplace distribution using the likelihood ratio and the Akaike Information Criterion (AIC).

4.1. Maximum Likelihood Estimator

In practice, it is common to work with a location and scale transformation , where , , and with . Hence, the density for the random variable , denoted as , isAssume that is a random sample of size from an distribution. From (31), the log-likelihood function is where , which is a continuous function in each parameter, but it is not differentiable at , for . Thus, by assuming , for , we have that elements of the score vector are , where , given bywhere denotes the sign function.

Hence, the maximum likelihood estimator solves the score equations . Which must be obtained through a numerical method. A lot of software, including optimization toolbox, can be used for obtaining the maximum likelihood estimates. To achieve the maximization of log-likelihood function, we used the function optim on R (see R Core Team [17]), the specific method being Nelder-Mead (see Nelder and Mead [18]), that uses only function values and is robust but relatively slow. It will work reasonably well for nondifferentiable functions.

For obtaining the standard errors of the maximum likelihood estimates one should compute the information matrix . It is well known that the elements of are given by Since expectation over EL distribution is not straightforward, numerical methods should be performed to obtain the explicit form of the information matrix. This matrix can be approximated by the observed information matrix , which is defined as minus the Hessian matrix evaluated at ; that iswhere the second derivatives are given below: Thus, we use the observed information matrix for computing the standard errors in the rest of the paper. Note that this approximation of the observed information matrix is obtained under a less stringent supposition, this is, assuming that the density function is absolutely continuous, as is the case with the Laplace distribution (see Kotz et al. [19], remark 2.6.1).

4.2. Numerical Study

We shall use Monte Carlo simulation to evaluate the finite sample performance of the maximum likelihood estimator. The number of Monte Carlo replications was from simulated samples of the EL distribution for several samples sizes. Each sample was generated using the stochastic representation of the EL distribution, described above. For each generated sample, we obtain the maximum likelihood estimates using the function optim on R, the specific method being Nelder-Mead.

In order to analyze the point estimation results, we computed, for each sample size and for each estimator, the standard error from the observed information matrix defined in (35). The result can be seen in Table 2. From the results, we can see that the estimates are quite stable and estimates are asymptotically unbiased as expected, that is, it is observed that the bias becomes smaller as the sample size increases.

4.3. Data Illustration

In this section we shall examine the application of the EL distribution to two real data sets. The first data set is related to the project WHO MONICA (World Health Organization Multinational Monitoring of Trends and Determinants in Cardiovascular Disease). This data set has been previously analyzed and studied in Kuulasmaa et al. [20], Kulathinal et al. [21], and de Castro et al. [22] and corresponds to the average annual rate of occurrence of cardiovascular mortality or the presence of coronary disease. The data are as follows: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , . The second data set consists of the heights in inches of students from University of Pennsylvania. This data set has been previously analyzed and studied by Hassan and Hijazi [23] and Gui et al. [4]. The data are as follows: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , . Table 3 shows a descriptive summary of the data sets analyzed.

From these data sets, the maximum likelihood estimators of the parameters associated with the Laplace () and the extended Laplace () distributions are obtained; the results of the comparison are summarized in Table 4 and Figure 2 shows the histograms and the fitted curves for the data set.

(a) WHO MONICA data set

(b) HEIGHT data set

The AIC is used to compare the estimated models. As one can see, our model with the smallest values of AIC is preferable. In addition, we can use the likelihood ratio (LR) test statistic to confirm our claim. To do this, we consider the following hypotheses: The value of the LR test statistic for the data WHO MONICA is and for the data set of the height of students is and comparing this quantity with , the null hypothesis is rejected.

Figure 3 shows the graphs of the QQ-plots for the WHO MONICA and HEIGHT data sets calculated with the Laplace and extended Laplace models fitted with the maximum likelihood estimates of the parameters. These also show the good agreement of the EL distribution for the two data sets.

(a) QQ-plot Laplace

(b) QQ-plot extended Laplace

(c) QQ-plot Laplace

(d) QQ-plot extended Laplace

5. Conclusion and Final Comments

We have presented a generalization of the extended bimodal-normal distribution that depends on a shape parameter that controls the effect of bimodality when the density is symmetric. But, generally speaking, it produces a more flexible model in terms of asymmetry and kurtosis coefficients. Additionally, we have demonstrated its basic properties and stochastic representation, the latter of which played a significant role in the development of this work. The family of distributions includes a large number of distributions. For example, four of them were presented as corollaries, leaving the Laplace distribution for last, and used to develop some inferential aspects and Monte Carlo simulation schemes, which were facilitated by the use of stochastic representation in the generation of random variables. Finally, using two real data sets, we demonstrated that the proposed model resulted in better behavior relative to the standard Laplace model. Moreover, in the statistical literature, there are a variety of extensions of the Laplace distribution, in order to achieve greater flexibility, but without the effect of bimodality that fitted the data analyzed. Although bimodality can be achieved through a mixture of distributions, the proposed model is more parsimonious in terms of the number of parameters. It is important to emphasize that Theorem 1 can be extended, as demonstrated below.

Theorem 15. Let and be a probability density functions and a positive continuous function such that , where . Thenis a probability density function with shape parameter .

Note that when we have Theorem 1. Furthermore, this new extension has stochastic representation given by the following theorem.

Theorem 16. Let and be independent random variables. Ifthen .

Proof. Since is a mixture, the result follows immediately.

This new extension includes, for examples, the case of the slash distribution Rogers and Tukey [24], Mosteller and Tukey [25], and Kafadar [26], if we consider the following definitions: (i) , (ii) , and (iii) , with and , with density given by

Data Availability

The WHO MONICA and HEIGHT data sets used to support the findings of this study are included within the article.

Disclosure

Preliminary results of this manuscript were presented as a poster in “Flexible Statistical Models for a Skewed World of Data: A Workshop in Honor of Reinaldo B. Arellano-Valle’s 65th Birthday” 2017.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Funding

The research of M. A. Cortés was supported in part by Grant DIUDA 221230. The research of D. Elal-Olivero was supported in part by Grant DIUDA 22287. The research of J. F. Olivares-Pacheco was supported in part by Grant DIUDA 22257.

References

A. Azzalini, “A class of distributions which includes the normal ones,” Scandinavian Journal of Statistics, vol. 12, no. 2, pp. 171–178, 1985.
View at: Google Scholar | MathSciNet
D. Elal-Olivero, “Alpha-skew-normal distribution,” Proyecciones Journal of Mathematics, vol. 29, no. 3, pp. 224–240, 2010.
View at: Publisher Site | Google Scholar | MathSciNet
D. Elal-Olivero, H. W. Gómez, and F. A. Quintana, “Bayesian modeling using a class of bimodal skew-elliptical distributions,” Journal of Statistical Planning and Inference, vol. 139, no. 4, pp. 1484–1492, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
W. Gui, P.-H. Chen, and H. Wu, “A symmetric component alpha normal slash distribution: properties and inferences,” Journal of Statistical Theory and Applications, vol. 12, no. 1, pp. 55–66, 2013.
View at: Publisher Site | Google Scholar | MathSciNet
H. W. Gómez and G. A. Guerrero, Bimodal Birnbaum-Saunders Distribution, COMCA 2014: Congreso de Matemática Capricornio, Copiapó, Chile, 2014.
N. Henze, “A probabilistic representation of the 'skew-normal' distribution,” Scandinavian Journal of Statistics, vol. 13, no. 4, pp. 271–275, 1986.
View at: Google Scholar | MathSciNet
Z. W. Birnbaum and S. C. Saunders, “A new family of life distributions,” Journal of Applied Probability, vol. 6, no. 2, pp. 319–327, 1969.
View at: Publisher Site | Google Scholar | MathSciNet
Z. W. Birnbaum and S. C. Saunders, “Estimation for a family of life distribution with applications to fatigue,” Journal of Applied Probability, vol. 6, no. 2, pp. 328–347, 1969.
View at: Publisher Site | Google Scholar | MathSciNet
N. R. Mann, R. E. Schafer, and N. D. Singpurwalla, Methods for Statistical Analysis of Reliability and Lifetime Data, John Wiley & Sons, New York, NY, USA, 1974.
View at: MathSciNet
A. Desmond, “Stochastic models of failure in random environments,” Canadian Journal of Statistics, vol. 13, no. 2, pp. 171–183, 1985.
View at: Publisher Site | Google Scholar | MathSciNet
A. F. Desmond, “On the Relationship Between Two Fatigue-Life Models,” IEEE Transactions on Reliability, vol. 35, no. 2, pp. 167–169, 1986.
View at: Publisher Site | Google Scholar
D. S. Chang and L. C. Tang, “Graphical analysis for Birnbaum-Saunders distribution,” Microelectronics Reliability, vol. 34, no. 1, pp. 17–22, 1994.
View at: Publisher Site | Google Scholar
J. A. Díaz-García and V. Leiva-Sánchez, “A new family of life distributions based on the elliptically contoured distributions,” Journal of Statistical Planning and Inference, vol. 128, no. 2, pp. 445–457, 2005.
View at: Publisher Site | Google Scholar | MathSciNet
H. W. Gómez, J. F. Olivares-Pacheco, and H. Bolfarine, “An extension of the generalized Birnbaum-Saunders distribution,” Statistics & Probability Letters, vol. 79, no. 3, pp. 331–338, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
N. M. Olmos, G. Martínez-Flórez, and H. Bolfarine, “Bimodal Birnbaum-Saunders distribution with applications to non negative measurements,” Communications in Statistics—Theory and Methods, vol. 46, no. 13, pp. 6240–6257, 2017.
View at: Publisher Site | Google Scholar | MathSciNet
S. C. Saunders, “A family of random variables closed under reciprocation,” Journal of the American Statistical Association, vol. 69, no. 346, pp. 533–539, 1974.
View at: Publisher Site | Google Scholar
R Core Team, “R: A Language and Environment for Statistical Computing,” R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/, 2018.
View at: Google Scholar
J. A. Nelder and R. Mead, “A simplex algorithm for function minimization,” The Computer Journal, vol. 7, no. 4, pp. 308–313, 1965.
View at: Publisher Site | Google Scholar
S. Kotz, T. J. Kozubowski, and K. Podgórski, The Laplace Distribution and Generalizations: A Revisit with Applications to Communications, Economic, Engineering and Finance, Birkhäuser-Springer, Boston, Mass, USA, 2001.
View at: Publisher Site | MathSciNet
K. Kuulasmaa, H. Tunstall-Pedoe, A. Dobson et al., “Estimation of contribution of changes in classic risk factors to trends in coronary-event rates across the WHO MONICA Project populations,” The Lancet, vol. 355, no. 9205, pp. 675–687, 2000.
View at: Publisher Site | Google Scholar
S. B. Kulathinal, K. Kuulasmaa, and D. Gasbarra, “Estimation of an errors-in-variables regression model when the variances of the measurement errors vary between the observations,” Statistics in Medicine, vol. 21, no. 8, pp. 1089–1101, 2002.
View at: Publisher Site | Google Scholar
M. de Castro, M. Galea, and H. Bolfarine, “Hypothesis testing in an errors-in-variables model with heteroscedastic measurement errors,” Statistics in Medicine, vol. 27, no. 25, pp. 5217–5234, 2008.
View at: Publisher Site | Google Scholar | MathSciNet
M. Y. Hassan and R. H. Hijazi, “A bimodal exponential power distribution,” Pakistan Journal of Statistics, vol. 26, no. 2, pp. 379–396, 2010.
View at: Google Scholar | MathSciNet
W. H. Rogers and J. W. Tukey, “Understanding some long-tailed symmetrical distributions,” Statistica Neerlandica. Journal of the Netherlands Society for Statistics and Operations Research, vol. 26, no. 3, pp. 211–226, 1972.
View at: Publisher Site | Google Scholar | MathSciNet
F. Mosteller and J. Tukey, Data Analysis and Regression: A Second Course in Statistics, Addison-Wesley Pub. Co., 1977.
K. Kafadar, “A biweight approach to the one-sample problem,” Journal of the American Statistical Association, vol. 77, no. 378, pp. 416–424, 1982.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2018 Milton A. Cortés et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2588

Downloads

1254

Citations