Type II Half Logistic Kumaraswamy Distribution with Applications
In this paper, a new distribution with a unit interval named type II half logistic Kumaraswamy (TIIHLKw) distribution is proposed. Its density and distribution functions are presented using alternate expressions. This distribution is obtained by adding an extra parameter in the existing model to rise its ability fitting complex data sets. Some important statistical properties of TIIHLKw distribution are derived. The estimation of the parameters is obtained by numerous well-recognized approaches and simulation study confirmed the efficiencies of estimates such obtained. We apply the related model to practical datasets, and it is concluded that the proposed model is the best by model selection criteria than other competitive models.
Over the last few years, inspired by the increasing demand of probability distributions in many fields, many generalized distributions have been studied. Most of them are proposed by the addition of more parameters to well-known probability distributions that exist in literature to make them flexible. For instance, Haq et al.  proposed and studied the generalized odd Burr III (GOBIII) family of distributions along with its important characterizations. Ahmed  proposed a new model derived by the transformation of the baseline model. Different shapes of failure function can be formed such as increasing and bathtub. Some mathematical properties are obtained. So many research works have been done for proposing more flexible generalized probability distributions such as Alzaatreh et al. , Haq et al. , Hashmi et al. , Elgarhy et al. , and ZeinEldin et al. . Alshenawy  suggested a new one-parameter distribution. Several statistical properties and characteristics of the proposed distribution are derived along with estimation under Type II censoring. The research is concluded on the basis of a simulation study and real data analysis.
The Kumaraswamy (Kw) distribution was introduced and studied by Kumaraswamy  with unit interval, denoted by 𝐾w(a, b), with cumulative distribution function (cdf) is and its related probability density function (pdf) is
The Kw density has one of these shapes depending upon parameter values, unimodal (), bathtub (), increasing ( and ), decreasing (D) ( and ), or steady (both a and b equal to 1).
The behavior of Kw distribution is analogous to the beta distribution but simpler due to closed-form of both its pdf and cdf. Boundary behavior and the major special models are also the same in both Beta and Kw distributions. This distribution could be a good substitute in situations wherein reality the bounds are finite i.e., (0, 1).
The Kw is originally developed as a lifetime distribution. Many authors studied and developed the generalizations of Kw distribution such as exponentiated Kw distribution studied by Lemonte et al. , El-Sherpieny, and Ahmed  proposed Kw Kw distribution, transmuted Kw distribution studied by Khan et al. , Sharma and Chakrabarty  studied size biased Kw distribution, George and Thobias  introduced Marshall-Olkin Kw distribution, exponentiated generalized Kw distribution studied by Elgarhy et al. , type II Topp Leone inverted Kw by ZeinEldin et al. , type I half logistic inverted Kw by ZeinEldin et al. , truncated inverted Kw by Bantan et al. , and Ghosh  introduced bivariate and multivariate weighted Kw distributions.
Hassan et al.  proposed Type II half logistic-G (TIIHL-G). The TIIHL-G distribution is expressed by its cdf given by where is cdf of baseline model with parameter vector and is cdf derived by the T-X generator proposed by Alzaatreh . The pdf of the TIIHL-G family is given as
This article is dedicated to both its mathematical and application features. A significant portion is kept for the estimation of the parameters through various methods including maximum likelihood estimation (MLE), least-square estimation (LSE), weighted least square estimation (WLSE), percentiles estimation (PCE), and Cramer-von Mises estimation (CVE).
The core purpose of this research is to suggest a simpler and more flexible model called Type II Half Logistic Kw (TIIHLKw) distribution. This article is organized in the following manner: Section 2 is dedicated for the proposition of type II half logistic-Kw (TIIHLKw) distribution. Section 3 deals with the leading statistical properties of this model. Important binomial expansions of density and distribution functions are presented which involve binomial expansions. In Section 4, an extensive study of five different methods of estimation is carried out, with all derivations and detailed discussions. A comprehensive simulation study is conducted to compute the biases and efficiency for parameters and compare the performances of five estimation approaches stated above in the next section. Section 6 is devoted to the real data application of TIIHLKwD to show the importance of TIIHLKw distribution. Lastly, the conclusion is given in Section 7.
2. The TIIHLKw Distribution
In this section, we examine the usefulness, and flexibility of a new associate of type II half-logistic-G family having (0, 1), using Kw distribution as the baseline. The pdf and cdf of Kw distribution (2 shape parameters: ) are given as follows
The pdf of TIIHLKw distribution is as follows
The survival function of TIIHLKw distribution is
The failure rate function of TIIHLKwD is
3. Some Mathematical Properties
3.1. Quantile Function
The quantile function of is denoted by , defined as the inverse function of is , given in (6).
Simulated values from TIIHLKwD can be utilized in the simulation study. The r. v. follows rectangular distribution on the interval 0 to 1, i.e., follows TIIHLKw distribution. In particular, the first three quartiles are obtained by putting , and 3/4, respectively, in (11) and (12).
On differentiation, we can have the density of quantile function
3.2. Alternate Representation
Here, we present the expansion of the pdf and cdf for TIIHLKw distribution for further mathematical manipulation.
The binomial theorem, for and , can be expressed as
Inserting this expansion in (17), we have pdf of the following form where .
Proposition 1. Let “h” be a positive integer. The expansion of cdf can be written in following form: where .
Proof. The cdf is obtained, for an integer by using (16).
Once more binomial expansion is applied to and it can be written as
The required result is obtained by combing some expression together, completing the proof.
3.3. Probability Weighted Moments:
The probability-weighted moments (PWMs) are used to study some more characteristics of the probability distribution. Under the specified setting discussed above, PWMs are denoted by , can be defined as
Then, where Also, is beta function.
The moments have a vital role in the study of the distribution and real data applications. Now, we get the moment for the TIIHLKwD. Under the specified assumptions, the moment is obtained as where
The mean () and variance (var) of TIIHLKw distribution can be derived as
Also, the coefficients of skewness and kurtosis of TIIHLKw distribution are given by
The summary measures, mean, variance, skewness () and kurtosis () values are presented in Table 1. The plots of mean and var for the TIIHLKwD are given in Figure 4 and graphs of skewness and kurtosis are presented in Figure 5 for different parameter ranges.
We see a monotonic variation in these measures caused by variation in parameters , and .
3.5. Moment-Generating Function
The moment-generating function of , using moments about the origin (29), is obtained as
3.6. Incomplete Moments
The incomplete moment of TIIHLKwD can be obtained by using (19) is where is incomplete beta function.
3.7. The Mean Deviation
Following are expressions used to get mean deviation about mean and median, respectively where = Median of and is the initial incomplete moment. Now
3.8. Bonferroni and Lorenz Curves
Bonferroni and Lorenz curves are important applications of the first incomplete moment. These curves are mostly used in different fields of life such as economics, reliability analysis, demographic studies, life testing, life insurance, and medical technology. The Lorenz and Bonferroni curves are obtained, respectively, as follows and,
3.9. Order Statistics
In statistical theory, order statistics is widely applied and practiced. Let be r.vs. with their corresponding cdfs . Let be the related ordered r. sample of size n, then the density of order statistic is given as where and β(.,.) is the beta function. The pdf of the order statistic of is obtained by putting (21) and (22) in (38), changing with where
More, the moment of order statistics for TIIHLKwD is given by
3.10. Rényi Entropy
Renyi (1961) proposed and used this measure. It can be obtained by
So, the Rényi entropy of TIIHLKw distribution is as follows where .
3.11. Stress-Strength Reliability
This subsection deals with the stress-strength parameter of TIIHLKw distribution. Let be the strength of a structure with a stress , and if follows and follows , provided and are statistically independent r.vs.,
Proposition 2. Under the assumption discussed above, we have
Proof. The reliability is defined by
Then, we can write where .which completes the proof.
This section is dedicated to estimation aspects of TIIHLKw distribution, assuming that population parameters () are unknown and can be estimated using different methods of estimation including ML, LS, WLS, PC, and CV.
4.1. Maximum Likelihood Estimation
For a random sample of from the distribution, the log-likelihood function for is
The members of are given below equating (52), (53), and (54) to zero and solve them simultaneously give the ML estimates (MLEs) of The iterative algorithm is used to obtain the numerical solution of these nonlinear equations such as the Newton-Raphson method.
4.2. Ordinary and Weighted LS Estimation (LSEs and WLSEs):
The LSEs of , and can be obtained by minimizing the sum of squares of errors with respect to parameters. Suppose is a random sample of size from TIIHLKwD and suppose be the related ordered sample.
The sum is independent of the unknown parameters.
The following function is minimized with respect to and to get WLS estimators of model parameters.
i.e., , and .
4.3. PC Estimators (PCEs)
Under the specification defined above including order statistics having relationship . In the PC method of estimation, the estimators of , and are derived by minimizing the following expression with respect to , and .
4.4. The Cramer-von Mises Minimum Distance Estimation
The CVE is another estimation method based on minimum distance. The CV estimators are obtained by minimizing, with respect to , and .
CV minimum distance estimators provide empirical evidence that the bias of this estimator is smaller than the other minimum distance estimators.
5. Simulation Study
In this section, we present a Monte Carlo (MC) simulation study in order to illustrate the behavior of different estimates. We consider four random sample sizes: , 100, 200, and 500 from the TIIHLKw distribution and the samples are drawn 10,000 times. Four specifications of the parameters are used in this simulation study, given as, , , , . For sample generated, the MLE, LSE, WLSE, CVE, PCE, and MPSE of estimators are computed numerically. Then, the estimates of all methods and their mean square errors (MSEs) are documented in Tables 1–4.
Estimates of parameters and their corresponding MSEs are calculated using the same sample under different methods of estimation for four sets of parameters mentioned above for small sample size () to sufficient large sample size, that is, .
The entries of Table 2 to Table 5 show that estimates are reliable and consistent. MSEs reduce as sample size () increases under each method of estimation. The summary of these four tables is presented in Table 5.
Using the entries of Table 6 for different parametric combinations, we can conclude that the MLE method outperforms than all other estimation methods (with an overall score of 18.5). Therefore, depending on the simulation study, the MLE method performs best for TIIHLKwD.
The TIIHLKw distribution aims at providing an alternative distribution to fit data on the unit interval to other distributions available in the literature. Here, we used the following probability distributions as competitor models; (i)The Kumaraswamy (Kw) distribution (ii)Transmuted Kumaraswamy (TKw) distribution (iii)Size biased Kumaraswamy (SBKw) distribution (iv)Beta distribution
We use the following accuracy measures for model comparison: Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), log-likelihood, Anderson-Darling (A), and Cramer–von Mises (W). R-Language is used for numerical computations.
1st data set. The data set was taken from Dasgupta , and considered on burr (in millimeters), with hole diameter and a sheet thickness of 12 mm and 3.15 mm, respectively.
2nd data set. This data set was taken from , and considered measurements on petroleum rock samples from a petroleum reservoir.
The total test time (TTT) for both datasets are presented in Figure 6. We can observe that the shape of TTT plots is concave for both datasets, which demonstrates an increasing failure rate.
Hence, it is concluded that the new model provides the better fit. Figures 7 and 8 show the estimated densities, cdfs, estimated survival functions, and PP plots for the considered distributions of both data sets, respectively. We note that the proposed model is more appropriated to fit the data than the other competing models.
In this article, a new Type-II Half Logistic Kumaraswamy distribution is proposed. Some characteristics of the TIIHLKw distribution including linear combination expressions for the density function, probability weighted moments, moments, incomplete moments, quantile function, mean deviation about mean and about median, Bonferroni and Lorenz curves, order statistics, stress-strength reliability, and Rényi entropy are derived. The ML method is used to estimate model parameters. An extensive simulation study is conducted to compare several well-known estimation methods, including the method of maximum likelihood estimation, methods of least squares and weighted least squares estimation, and method of Cramer-von Mises minimum distance estimation. The simulation study showed the reliability and efficiency of the estimates. Finally, by considering the method of maximum likelihood estimation, the new model is fitted to two practical data sets. The applications on real data sets validated the significance of the new distribution.
A.1. Data I
0.04, 0.02, 0.06, 0.12, 0.14, 0.08, 0.22, 0.12, 0.08, 0.26, 0.24, 0.04, 0.14, 0.16, 0.08, 0.26, 0.32, 0.28, 0.14, 0.16, 0.24, 0.22, 0.12, 0.18, 0.24, 0.32, 0.16, 0.14, 0.08, 0.16, 0.24, 0.16, 0.32, 0.18, 0.24, 0.22, 0.16, 0.12, 0.24, 0.06, 0.02, 0.18, 0.22, 0.14, 0.06, 0.04, 0.14, 0.26, 0.18, 0.16.
A.2. Data II
0.0903296, 0.2036540, 0.2043140, 0.2808870, 0.1976530, 0.3286410, 0.1486220, 0.1623940, 0.2627270, 0.1794550, 0.3266350, 0.2300810, 0.1833120, 0.1509440, 0.2000710, 0.1918020, 0.1541920, 0.4641250, 0.1170630, 0.1481410, 0.1448100, 0.1330830, 0.2760160, 0.4204770, 0.1224170, 0.2285950, 0.1138520, 0.2252140, 0.1769690, 0.2007440, 0.1670450, 0.2316230, 0.2910290, 0.3412730, 0.4387120, 0.2626510, 0.1896510, 0.1725670, 0.2400770, 0.3116460, 0.1635860, 0.1824530, 0.1641270, 0.1534810, 0.1618650, 0.2760160, 0.2538320, 0.2004470.
Data is present in appendix.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
This work was funded by the Deanship of Scientific Research (DSR), King Abdul Aziz University, Jeddah, under grant No. (DF-287-305-1441). The authors gratefully acknowledge the DSR technical and financial support.
S. Hashmi, M. A. u. Haq, and R. M. Usman, “A generalized exponential distribution with increasing, decreasing and constant shape hazard curves,” Electronic Journal of Applied Statistical Analysis, vol. 12, no. 1, pp. 223–244, 2019.View at: Google Scholar