Abstract

In this paper, we proposed a new four-parameter Extended Gumbel type-2 distribution which can further be split into the Lehman type I and type II Gumbel type-2 distribution by using a generalized exponentiated distribution. The distributional properties of the proposed distribution have been studied. We derive the th moment; thus, we generalize some results in the literature. Expressions for the density, moment-generating function, and th moment of the order statistics are also obtained. We discuss estimation of the parameters by maximum likelihood and provide the information matrix of the developed distribution. Two life data, which consist of data on cancer remission times and survival times of pigs, were used to show the applicability of the Extended Gumbel type-2 distribution in modelling real life data, and we found out that the new model is more flexible than its submodels.

1. Introduction and Motivation

Statistical distributions are important tools in analysing and predicting real-world phenomena. Several distributions have been developed and studied. There is always room for development in statistical distributions to blend with the current situations which gives room for wider applications which can be achieved by inducing flexibility into the standard probability distribution to allow for fitting specific real-world scenarios. This has motivated researchers to work towards developing new and more flexible distributions. There are several ways to extend standard probability distributions, and one of the most popular methods is the use of distribution generators such as the exponentiated method by Lehmann [1]; the Marshall-Olkin method developed by Marshall and Olkin [2]; the beta distribution method proposed by Alexander et al. [3] and Eugene et al. [4]; the gamma distribution method by Cordeiro et al. [5], Ristic and Balakrishnan [6]), and Zografos and Balakrishnan [7]; the McDonald method proposed and studied by MacDonald [8]; and the exponentiated generalized method developed by De Andrade et al. [9]. The cubic rank transmutation map was proposed and studied by Granzotto et al. [10].

The Gumbel type-2 distribution is a very important distribution method from the theory of extreme value distribution. The distribution found applications in modelling extreme events in the field of meteorology, life testing, fracture roughness, seismology, and reliability analysis. It can also be used for modelling lifetime data sets with monotonic failure (or hazard) rates, most especially with a decreasing hazard rate. But in real-life data analysis, the hazard rate function of so many complex phenomena that are regularly encountered in practice are nonmonotonic and cannot be modelled by the Gumbel type-two (GTT) distribution. In other to improve the fit of (GTT) distribution, Okorie et al. [11] proposed and studied the properties of an exponentiated form of the GTT distribution of Lehman type I. Okorie et al. [12] investigated the properties of the Kumaraswamy Exponentiated Gumbel type-two distribution. The cumulative distribution function (cdf) and probability density function (pdf) of the GTT distribution is given bywhere and and are the scale and shape parameters, respectively.

Adding parameters to the GTT model may give rise to new, more flexible models for fitting real-life data. Therefore, we defined in this work an extension of the model above by using the methodology proposed by [6]. Given any given continuous baseline cdf and , those authors defined the exponentiated generalized class of distributions with two extra shape parameters and with cdf and pdf given byrespectively, which have implicit dependence on the parameters of .

Here, and are the two additional shape parameters. It should be noted that there is no complicated function in (3), which is in contrast with the beta generalized family by Alexander et al. [3] and Eugene et al. [4]), which also includes two extra parameters but incorporates the beta incomplete function. Equation (3) has tractable properties especially for simulations, since its quantile function takes a simple form given bywhere is the baseline quantile function.

To illustrate the tractability and flexibility of the exponentiated generalized model, Cordeiro et al. [13] applied (3) to extend some well-known distributions such as the gamma, frechet, Gumbel, and normal distributions. Several properties for the exponentiated generalized class were discussed, which motivate the use of this generator. In fact, for and also taking , (3) reduces to and which correspond to the cdf’s of the Lehmann type I and II families, respectively. For this reason, the model proposed by Cordeiro et al. [13] contains both Lehmann type I and type II classes. For this reason, the exponentiated generalized family can be derived from a double transformation using these classes. The two extra parameters and in density (4) can control both tail weights, allowing the generation of flexible distributions, with heavier or lighter tails.

The above properties and many others have been examined and explored in recent works for the generator proposed and studied by Cordeiro et al. [14]: Silva et al. [15] examined the properties of the Dagum distribution, De Andrade et al. [9] investigated the properties of the exponential distribution, Elbatal and Muhammed [16] studied the properties of the Inverse Weibull distribution, De Andrade and Zea [17] studied the properties of the Gumbel model, Cordeiro et al. [14] studied the properties of the arcsine distribution, De Andrade et al. [18] examined the properties of the extended Pareto distribution, De Andrade et al. [19] studied the properties of the exponentiated generalized extended Gompertz distribution, etc.

2. Exponentiated Generalized Gumbel Type-Two (EGTT) Distribution

Given that a random variable with support on the set of positive real numbers and EGTT distribution, say, , is defined by inserting (1) in equation (3). Thus, the cdf of is given bywhere , , , , and .

The graph of the cumulative distribution function is plotted in Figure 1 drawn below by taking the values of , , and and varying the value of .

And the associated pdf is given bywhere , , , , and .

The graphs of the pdf of EGTT distribution are drawn in Figure 2 with various parameter values.

The graphs drawn in Figure 2 indicate that and control the skewness and kurtosis of EGTT distribution, and as such, the distribution can be used to model real-life data which are mesokurtic, leptokurtic, and platykurtic.

The reliability and hazard functions of EGGT, respectively, are given by

Note that and .

The plots of the reliability and hazard rate functions are shown in Figures 3 and 4, respectively.

The diagrams in Figures 3 and 4 indicate that the hazard rate function of the EGTT distribution could be shaped as unimodal, bathtub, or upside-down bathtub. These properties suggest that the EGTT distribution is suitable for modelling data sets with nonmonotonic hazard rate behaviour which are mostly encountered in practical situations.

2.1. Useful Expansion of the Probability Density Function

To motivate analytical derivation of some basic distributional properties of the EGTT distribution, we present the series representation of its pdf and cdf by using the generalized binomial expansion. If is a positive real noninteger and , we consider the power series expansion given by

Applying (9) in (4) and using the binomial expansion for a positive real power yield

Equation (10) can be rewritten as

Thus, we havewhereare the weights and is the pdf of a Gumbel type-two distribution with a scale parameter of . Consequently, the Extended Gumbel type-two density can be written as a linear combination of Gumbel type-two density functions. The mathematical properties of the EGTT follow directly from those of the GTT distribution.

3. Statistical Properties

Here, we study the statistical properties of the (EGTT) distribution, specifically quantile function, moments, and moment-generating function.

3.1. Quantile Function and Median

The quantile function corresponding to (4) is given by

Taking to be a uniform variate on the unit interval . Thus, by means of the inverse transformation method, we consider the random variable given by

The median in particular can be derived by taking the value of in equation (14); then, we have

Expression for the lower quartiles and upper quartiles can also be developed by taking the value of to be and , respectively.

Invariably, the quantile function for certain fractile values provides an alternative measure of skewness and kurtosis such that the limitations of the classical measures are well known. The Bowley skewness due to Bowley [20] is given by

The Moors kurtosis due to Moors [21] is given by

The advantage of the Bowley skewness and the Moors kurtosis over the classical measures of skewness and kurtosis is that they can be obtained even in situations where the moments of the distribution do not exist, and they are not reasonably affected by extreme values.

Table 1 gives the values of the Bowley skewness and the Moors kurtosis for various values of parameters.

From Table 1, it can be deduced that the EGTT distribution is positively skewed for various values of the parameters considered.

3.2. Moments

Here, we consider the th moment for (EGTT) distribution. Moments are important features in any statistical analysis, especially in applications. They can be used to study the characteristics of a distribution, e.g., dispersion, skewness, and kurtosis.

Theorem 1. If has EGTT , then the th moment of is given by the following:

Proof. Let be a random variable with density function (11). The ordinary th moment of the EGTT distribution is given byThis completes the proof.

Table 2 gives the first four moments, variance, and coefficient of variation (CV) for various values of the parameters of EGTT distribution.

Based on the first four moments of the EGTT distribution, the limitations of the classical kurtosis measure are known by the fact that some heavy-tailed distributions for which this measure is designed may be infinite. The Bowley skewness of Bowley [20] was based on quartiles of a distribution.

3.3. Moment-Generating Function

Here, we obtain an expression for the moment-generating function of (EGTT) distribution.

Theorem 2. If has (EGTT) distribution, then the moment-generating function is given by

Proof. The moment-generating function of a random variable can be obtained using a relation that exists between a moment and the moment-generating function given byBy inserting equation (21) into (23), we complete the proof.

4. Order Statistics

Here, we derive closed form expressions for the pdf of the th order statistic of (EGTT) distribution. Let be a simple random sample from (EGTT) distribution with cdf and pdf given by (6) and (7), respectively. Let denote the order statistics obtained from this sample. The probability density function of is given bywhere and are the cdf and the pdf of the EGGT distribution given in equations (6) and (7), respectively. Thus, we have

Using the binomial series expansion in (9), can be expressed as

Also applying (9) to the last term, we obtain

Finally, we havewhere

By substituting equations (1) and (2), we obtain the th order statistic of the EGTT distribution.

5. Estimation and Inference

Here, we obtain the maximum likelihood estimates (MLEs) of the parameters of the EGTT distribution from complete samples only. Let be a random sample of size from , . The log likelihood function for the vector of parameters can be written as

The log likelihood can be maximized either directly or by solving the nonlinear likelihood equations obtained by differentiating (30). The components of the score vector are given by

We can obtain the estimates of the unknown parameters by the maximum likelihood method by setting these above nonlinear equations (31), (32), (33), and (24) to zero and solving them simultaneously. Therefore, statistical software can be employed in obtaining the numerical solution to the nonlinear equations such as R and MATLAB. For the four-parameter Extended Gumbel type-two pdf, all the second-order derivatives can be obtained. Thus, the inverse dispersion matrix is given bywhere with . This gives the approximate variance covariance matrix. By solving for the inverse of the dispersion matrix, the solution will give the asymptotic variance and covariance of the MLs for , , , and . The approximate confidence intervals for , , , and, can be obtained, respectively, as

5.1. Percentile Estimator (PE)

Let be a random sample for the EGTT distribution and let be the corresponding order statistics. Based on the PR method of estimation technique, the estimators of the set of parameters can be obtained by minimizing the following:with respect to , where denotes some estimates of , and .

5.2. The Cramer-von Mises Minimum Distance Estimators

The CV estimator is a type of minimum distance estimator which is based on the difference between the estimate of the cdf and the empirical cdf, as shown by D’Agostino and Stephens [22] and Luceno [23]. The CV estimators are obtained by minimizing

D’Agostino and Stephens [22] mentioned that the choice of CV method type minimum distance estimators provides the empirical evidence that the bias of the estimator is smaller than the other minimum distance estimators.

6. Applications to Lifetime Data

Here, we present two examples that demonstrate the flexibility and the applicability of the EGTT distribution in modelling real-world data. We fit the density functions of the EGTT distribution and its submodels such as the Exponentiated Gumbel type-two (EGTT) distribution and the Gumbel type-two (GTT) distribution. The first data set represents the survival times (in days) of 72 guinea pigs infected with virulent tubercle bacilli, observed and reported by Bjerkedal [24]. The starting point of the iterative processes for the guinea pig data set is (1 : 0; 0 : 009; 10 : 0; 0 : 1; and 0 : 1). For the survival times (in days) of guinea pigs infected with virulent tubercle bacilli, the following data are given: 0.1, 0.33, 0.44, 0.56, 0.59, 0.72, 0.74, 0.77, 0.92, 0.93, 0.96, 1, 1, 1.02, 1.05, 1.07, 1.07, 1.08, 1.08, 1.08, 1.09, 1.12, 1.13, 1.15, 1.16, 1.2, 1.21, 1.22, 1.22, 1.24, 1.3, 1.34, 1.36, 1.39, 1.44, 1.46, 1.53, 1.59, 1.6, 1.63, 1.63, 1.68, 1.71, 1.72, 1.76, 1.83, 1.95, 1.96, 1.97, 2.02, 2.13, 2.15, 2.16, 2.22, 2.3, 2.31, 2.4, 2.45, 2.51, 2.53, 2.54, 2.54, 2.78, 2.93, 3.27, 3.42, 3.47, 3.61, 4.02, 4.32, 4.58, and 5.55. Table 3 gives the MLEs, and Table 4 gives the selection criterion statistics for the pig data. Figure 5 gives the graph of the total time on test plot and the graph of the kernel density of the pig data. Figure 6 gives the fitted densities of the pig data.

The element of the information matrix is given by

6.1. Cancer Remission Time Data

The second data set consists of data of cancer patients. The data represents the remission times (in months) of a random sample of 128 bladder cancer patients from Lee and Wang [25]. The starting point of the iterative processes for the cancer patient data set is (1 : 0; 0 : 009; 10 : 0; 0 : 1; 0 : 1). The following data are given: 0.08, 2.09, 3.48, 4.87, 6.94, 8.66, 13.11, 23.63, 0.20, 2.23, 3.52, 4.98, 6.97, 9.02, 13.29, 0.40, 2.26, 3.57, 5.06, 7.09, 9.22, 13.80, 25.74, 0.50, 2.46, 3.64, 5.09, 7.26, 9.47, 14.24, 25.82, 0.51, 2.54, 3.70, 5.17, 7.28, 9.74, 14.76, 26.31, 0.81, 2.62, 3.82, 5.32, 7.32, 10.06, 14.77, 32.15, 2.64, 3.88, 5.32, 7.39, 10.34, 14.83, 34.26, 0.90, 2.69, 4.18, 5.34, 7.59, 10.66, 15.96, 36.66, 1.05, 2.69, 4.23, 5.41, 7.62, 10.75, 16.62, 43.01, 1.19, 2.75, 4.26, 5.41, 7.63, 17.12, 46.12, 1.26, 2.83, 4.33, 5.49, 7.66, 11.25, 17.14, 79.05, 1.35, 2.87, 5.62, 7.87, 11.64, 17.36, 1.40, 3.02, 4.34, 5.71, 7.93, 11.79, 18.10, 1.46, 4.40, 5.85, 8.26, 11.98, 19.13, 1.76, 3.25, 4.50, 6.25, 8.37, 12.02, 2.02, 3.31, 4.51, 6.54, 8.53, 12.03, 20.28, 2.02, 3.36, 6.76, 12.07, 21.73, 2.07, 3.36, 6.93, 8.65, 12.63, and 22.69. Table 5 gives the MLEs, and Table 6 gives the selection criterion statistics for the cancer data. Figure 7 gives the graph of the total time on test plot and the graph of the kernel density of the cancer data. Figure 8 gives the fitted densities to the cancer data.

The element of the information matrix is given by

We employ statistical tools for model comparison which includes Kolmogorov-Smirnov (KS) statistics, Anderson Darling () statistics, probability value ( value), Akaike Information Criterion (AIC), Hannan-Quinine Information Criterion (HQIC), Consistent Akaike Information Criterion (CAIC), and Bayesian Information Criterion (BIC). The selection procedure will be based on the lowest AIC, BIC, CAIC, HQIC, , and KS, and the largest value will be considered as the best model.

7. Conclusion

In this work, we studied a four-parameter distribution named the Extended Gumbel type-two distribution which is an extension of the Gumbel type-two distribution. This work also provided several mathematical properties of the Extended Gumbel type-two distribution including explicit expressions for the density and quantile functions, ordinary moments, and order statistics. We employed the maximum likelihood estimation method to estimate the model parameters and also to obtain the elements of the information matrix. To evaluate the performance/flexibility of the new distribution, two real-life data applications which include data on cancer remission times of patients and also survival times of pigs clearly illustrate the potential of the Extended Gumbel type-2 distribution in fitting the two survival data because it possesses the lowest AIC, BIC, CAIC, HQIC, , and KS as well as the largest value.

Data Availability

The data used for this research are commonly used in the area of research.

Additional Points

Contribution to Knowledge. This work has been able to provide the authors a more flexible distribution of work, most especially in modelling real-life data which include failure rate increases and decreases and nonmonotonic failure. This is often encountered in real-life phenomenon, where failure is considered throughout the entire life span.

Conflicts of Interest

The authors declare that they have no conflicts of interests.

Acknowledgments

This work is funded mainly by the authors.