Abstract

In this paper, a new modified Kumaraswamy distribution is proposed, and some of its basic properties are presented, such as the mathematical expressions for the moments, probability weighted moments, order statistics, quantile function, reliability, and entropy measures. The parameter estimation is done via the maximum likelihood estimation method. In order to show the usefulness of the proposed model, some well-established actuarial measures such as value-at-risk, expected-shortfall, tail-value-at-risk, tail-variance, and tail-variance-premium are obtained. A simulation study is carried out to assess the performance of maximum likelihood estimates. The empirical analysis is carried out to show that our proposed model is better in performance as compared to other competitive models related to the extended Kumaraswamy model. Thus, insurance claim data and engineering related real-life data sets are considered to prove this claim.

1. Introduction

The discipline of actuaries, the actuarial statistics, has also received increased attention in statistical science with the existence of agricultural statistics, mathematical statistics, medical statistics, bio-statistics, computational statistics, reliability analysis, and survival analysis. The actuaries are always looking for ways to model insurance risk data using heavy-tailed and other models. According to some researchers, the insurance risk data may be unimodal [1], positively skewed [2], or having a longer tail [3]. It has also been claimed by many authors that the heavy-tailed distributions are better for estimating risk from insurance risk data and sometimes perform better as compared to other existing models. In order to improve risk assessment, there is always a need for a flexible model that can provide better estimates of well-established actuarial measures, and also provide a better goodness-of-fit to actuarial data sets. Such adaptable models may entice more researchers and practitioners, who are always on the lookout for ways to reduce their losses in terms of insurance risk or risk returns.

Modern distribution theory also emphasizes on the development or proposal of new models, which can be extended, generalized, or modified. Some new models which are applied to claimed data sets have been reported in recent literature, for example, Ahmad et al. [4] defined the exponentiated power Weibull distribution, which is based on heavy-tailed models and has applications in medical care insurance and vehicle insurance. Then, Afify et al. [5] proposed a new heavy-tailed exponential distribution with application to unemployment claim data. Furthermore, some new unit models have been developed to model different phenomenons in [620].

P. Kumaraswamy introduced the well-known Kumaraswamy (Kw) distribution in 1980 with the application to hydrology. The probability density function (pdf) and cumulative distribution function (cdf) for unit support (0, 1) are given by the following equations:andrespectively, where both parameters and are shapes parameters, and the corresponding (rv) having pdf (1) is denoted by . The Kw model exhibits flexible shapes such as unimodal (symmetrical, left-skewed, and right-skewed), bathtub, J, and reversed-J (uniantimodal). The hazard rate shapes of the Kumaraswamy model are increasing and bathtub-shaped which are useful for investigations of the lifetime and reliability phenomenon. Only a few authors [21, 22] have explored some more properties of the Kw model that are not addressed in the original novel paper. If we consider the Kw model to have unit support, then a few models for Kw’s power function distribution have been reported in the literature [23]. The Kumaraswamy exponentiated Weibull was studied by [24]. In addition to that, Tahir et al. and Ramzan et al. [25, 26] also presented new Kumaraswamy models and extended generalized inverse Kumaraswamy models, respectively. There is also a modified Kw distribution introduced by Alshkaki [27]. However, none of the authors has investigated actuarial data using the Kw distribution or some of its modified versions.

As a result, in this article, we attempted to bridge the gap by employing the proposed Kw model to assess actuarial data, and report computation results of actuarial measures. Thus, we propose the first “transformation of the Kw distribution for a new unit distribution” based on novel variable transformation that can be written as “” (further details will be provided later). More specifically, we modified the functionalities of the former Kw distribution in a totally new way, giving new possibilities for the pdf (a quick look to Figure 1 shows a lot) flexibility on the basis of our model. We investigate different phenomenon, including those in actuarial science and engineering to complete the objective of our paper.

The organization of our paper is as follows: In Section 2, the proposal for a new modified Kumaraswamy distribution (MKw) is presented, while in Section 3 some basic mathematical properties of the proposed MKw model are discussed, including the linear representation of the pdf, the quantile function, the expression of moments, probability weighted moments, the pdf of order statistics, stress-strength-reliability, and entropies. In Sections 4 and 5, the parameter estimation of MKw is dealt, and then a simulation study is conducted to assess the parameters performance of the proposed model. In Section 6, some well-established actuarial measures such as value-at-risk (VaR), expected shortfall (ES), tail-variance (TV), tail-value-at-risk (TVaR), and tail-variance premium (TVP) are obtained. The empirical investigation is carried out in Section 7, where the usefulness of the proposed MKw model is shown by analyzing five real-life data sets. Section 8 concludes our paper final remarks.

2. New Modified Kumaraswamy Distribution

The new modified Kw (MKw) distribution is derived from the Kw distribution by using the following original variable: in the cdf. Hence, based on equation (2), the cdf and pdf are, respectively, given by the following equations:

To the best of our knowledge, it is the first “transformation for a new unit distribution” based on the transformation , opening some new horizon of modeling. The rv associated with pdf (4) is denoted by , having same shape parameters and . The survival hazard rate and cumulative hazard rate functions of MKw model are, respectively, given by the following equations:

The possible shapes of the pdf and hrf of the newly proposed model are displayed in Figures 1 and 2. The pdf of MKw distribution exhibits flexible shapes such as unimodal (right-skewed, symmetrical, and left-skewed), reversed-J, J, and bathtub (uniantimodal). The hrf of MKw distribution exhibits flexible shapes such as increasing function and bathtub shape.

3. Properties of the MKw Distribution

Several mathematical properties of the newly investigated MKw distribution are reported in the following section.

3.1. Quantile Function

The quantile function (qf) is important for obtaining information about the median and other positional measures. Furthermore, qf is also an important tool for generating random variates. The qf of the proposed family after inverting equation (3) becomes as follows:

3.2. Analytical Shapes of the pdf and hrf

Ignoring the dependence of parameters, the shapes of the pdf as well as hrf can be viewed analytically. The solutions of the following equations give the critical point of the pdf as follows:and the solutions of the following equations give the critical point of the hrf as follows:

It can be noted that the parameter has no influence on the solutions of the above equations.

3.3. Expansion of the MKw pdf

We derive the linear expansion of the MKw pdf by using the generalized binomial expansion twice in equation (3), which becomes as follows:

By separating the null values for the indices, we obtain the following equation:

Furthermore, we use a result given by Castellares and Lemonte (2014, Proposition 2), which states thatwhere , , , for , and are Stirling polynomials. The first four polynomials are , , , and .

By using equation (11) with in place of , we rewrite equation (10) as follows:where (for and ).

Changing indices , we can rewrite as follows:and by interchanging the sums, we obtain the following equation:

By expanding through binomial and interchanging the sums, we can write as follows:where (for ).

After interchanging sums, we get the following equation:where (for )

By differentiating , we have the following equation:

3.4. Moments

The th raw or ordinary moment of , say , can be yielded by using the following definition:

By using equation (18), the th moment expression for the MKw distribution will be as follows:andwhere denotes the beta function of the first kind. Furthermore, the actual or mean moments and cumulants of yielded from equation (21) are as follows:Here, . However, by using the relationship between mean moments and ordinary moments, the measure of skewness as well as measure of kurtosis can be obtained. The th descending factorial moment of (for r = 1, 2, ...) is as follows:where is the first kind Stirling number.

Table 1 provides the results of the first four raw moments, variance, skewness, and kurtosis under different parametric values . The graphical illustration of skewness and kurtosis is shown in Figures 3 and 4 depending on the parameters and .

The th incomplete moment of MKw distribution can be expressed as follows:

The incomplete moments are used to compute the well-known curves, namely, the Bonferroni and Lorenz curves. Incomplete moments can also be used to calculate mean waiting time and mean residual life.

3.5. Probability Weighted Moments

The authors [28] introduced the idea of computing probability waiting moments (PWMs). PWMs are the expected function of any with existing means. For , the th PWMs of is defined by the following equation:

Inserting equations (3) and (4) in equation (25), we have the following equation:

After using binomial series expansions by using similar fashion in the expansion of the pdf of the MKw distribution, then the expression for can be expressed as follows:where

3.6. Order Statistics

Let be a random sample of size from the MKw distribution. Then, the pdf of the th order statistics is as follows:

Inserting equations (3) and (4) in equation (29), we get the following equation:

After simplification, we get th order statistic density as follows:where

The order statistics of the MKw distribution are expressed in terms of linear expansion. To study the distributional behavior of the set of observations, we can use a minimum and maximum (min-max) plot of the order statistics. Figure 5 represents the min-max plot that depends on extreme order statistics, and it is introduced to capture all information not only about the tails of the distribution, but also about the whole distribution of the data.

3.7. Reliability

Reliability is an important measure, and several applications are documented in the fields of economics, physical science, and engineering. Reliability enables us to determine the failure probability at a certain point in time. Let say and be the two following the MKw distribution. The component fails if the applied stress exceeds its strength, if the component will perform satisfactorily. The reliability is defined by the following expression:

Let , then the above equation will be as follows:

After solving, it gives the result as follows:where

3.8. Entropy

Entropy measures are important for highlighting the uncertainty variation of the . Assume is a with pdf . The two important entropy measures, namely, Rényi and Shannon entropies, can be yielded by the following expressions.

3.8.1. Rényi Entropy

The Rényi entropy is defined by the following equation:where , and .

Inserting equation (4) in as follows:

By using the similar binomial series expansion as in Section 3.3, we have the following equation:

After incorporating the result in equation (37), the expression for Rényi entropy is reduced as follows:where

3.8.2. Shannon Entropy

The Shannon entropy is obtained as follows:wherewherein which

4. Estimation

In this section, we estimate the unknown parameters of the MKw model using the widely used estimation method known as maximum likelihood estimation (MLE). There are several advantages of MLE over other estimation methods, for instance, maximum likelihood estimates fulfill the required properties that can be used in constructing confidence intervals, as well as delivering a simple approximation that is very handy while working with the finite sample. The well-known R package called “adequacymodel” is implemented to estimate the unknown parameters in the application section. The log-likelihood function for the vector of parameters can be expressed as follows:

The score components are as follows:

By setting these equations to zero and solving them simultaneously, the MLE of the model parameters is obtained.

5. Simulation Study

This section yielded a simulation study in order to test the performance of MLEs in the newly proposed MKw distribution. is replicated 1000 times with various sample sizes,  = 50, 100, 200, 300, 400, and 500 of the MKw model by taking ; ; ; ; and .

The calculation of estimates is based on the bias, mean square error (MSE), and average estimate (AE) of the MLEs of the model parameters, namely,

The R programming language is used for the empirical study, and the results of Tables 26 show that as sample sizes increase, both the mean square error and the bias reduce. Thus, MLEs perform well in evaluating the parameters of the MKw distribution.

6. Actuarial Measures

6.1. Value-at-Risk

The Value-at-Risk (VaR), also known as quantile risk or simply “VaR,” is extensively used as a standard final market risk measure. It plays an important role in many business decisions; the uncertainty regarding foreign markets, commodity prices, and government policies can significantly affect firm earnings. The loss portfolio value is defined by a level of confidence, such as . For the MKw model, VaR is defined by the following expression:

6.2. Expected-Shortfall

One of the other measures is called expected shortfall (ES), which is considered as a better measure than VaR introduced by [29]. The ES can be yielded by the following equation:for . Then, we have the following equation:

6.3. Tail-Value-at-Risk

The problem of risk measurement is one of the most important problems in risk management. Tail-value-at-risk (TVaR) or conditional tail expectation is an important measure in finance and insurance that is defined as the expected value of the loss if the loss is greater than the VaR. Its mathematical expression is as follows:

By inserting equation (18) in equation (52), we get the TVaR as follows:

6.4. Tail-Variance

The tail-variance (TV) is defined by the following expression:

Consider . Then, we have the following equation:

Substituting equation (53) and equation (55) in equation (54), we obtain the expression for TV for the MKw model.

6.5. Tail-Variance Premium

The Tail-variance premium (TVP) is a mixture of both central tendency and dispersion statistics. It is defined by the following expression:where . Using expression equation (53) and equation (54) in equation (56), we obtain the TVP for MKw model.

A sample of 100 is randomly drawn and the effect of shape and scale parameters of the proposed models are underlined for both risk measures. Various combinations of the scale and shape parameters are executed I = , II = , III = , and IV = , and change in the curve of VaR and ES are illustrated in Figure 6.

7. Applications

In this section the proposed MKw model is compared to its counterpart models Gamma Kumaraswamy(GaKw) [30], Size-Biased Kumaraswamy (SBKw) [31], Kumaraswamy (Kw), and Beta by using the five data sets. The detailed description of the data sets are given below.

Data Set 1: Drilling Data 1: The first data set based on 50 observation of holes having diameter 12 mm and thickness of sheet 3.15 mm. The data set is also used by [32]. The following are the data observation: 0.040, 0.020, 0.060, 0.120, 0.140, 0.080, 0.220, 0.120, 0.080, 0.260, 0.240, 0.040, 0.140, 0.160, 0.080, 0.260, 0.320, 0.280, 0.140, 0.160, 0.240, 0.220, 0.120, 0.180, 0.240, 0.320, 0.160, 0.140, 0.080, 0.160, 0.240, 0.160, 0.320, 0.180, 0.240, 0.220, 0.160, 0.120, 0.240, 0.060, 0.020, 0.180, 0.220, 0.140, 0.060, 0.040, 0.140, 0.260, 0.180, and 0.160. Data Set 2: Drilling Data 2: The second data set is based on 50 observations of holes having diameter 9 mm and thickness of sheet 2 mm. The data set is also used by [32]. The following are the data observation: 0.060, 0.120, 0.140, 0.040, 0.140, 0.160, 0.080, 0.260, 0.320, 0.220, 0.160, 0.120, 0.240, 0.060, 0.020, 0.180, 0.220, 0.140, 0.220, 0.160, 0.120, 0.240, 0.060, 0.020, 0.180, 0.220, 0.140, 0.020, 0.180, 0.220, 0.140, 0.060, 0.040, 0.140, 0.220, 0.140, 0.060, 0.040, 0.160, 0.240, 0.160, 0.320, 0.180, 0.240, 0.220, 0.040, 0.140, 0.260, 0.180, and 0.160.

Data Set 3: Milk Production Data: The third data revealed the overall yield production of 107 cows at first birth of SINDI race. The data set is also used by [33]. The following are the data observation: 0.4365, 0.4260, 0.5140, 0.6907, 0.7471, 0.2605, 0.6196, 0.8781, 0.4990, 0.6058, 0.6891, 0.5770, 0.5394, 0.1479, 0.2356, 0.6012, 0.1525, 0.5483, 0.6927, 0.7261, 0.3323, 0.0671, 0.2361, 0.4800, 0.5707, 0.7131, 0.5853, 0.6768, 0.5350, 0.4151, 0.6789, 0.4576, 0.3259, 0.2303, 0.7687, 0.4371, 0.3383, 0.6114, 0.3480, 0.4564, 0.7804, 0.3406, 0.4823, 0.5912, 0.5744, 0.5481, 0.1131, 0.7290, 0.0168, 0.5529, 0.4530, 0.3891, 0.4752, 0.3134, 0.3175, 0.1167, 0.6750, 0.5113, 0.5447, 0.4143, 0.5627, 0.5150, 0.0776, 0.3945, 0.4553, 0.4470, 0.5285, 0.5232, 0.6465, 0.0650, 0.8492, 0.8147, 0.3627, 0.3906, 0.4438, 0.4612, 0.3188, 0.2160, 0.6707, 0.6220, 0.5629, 0.4675, 0.6844, 0.3413, 0.4332, 0.0854, 0.3821, 0.4694, 0.3635, 0.4111, 0.5349, 0.3751, 0.1546, 0.4517, 0.2681, 0.4049, 0.5553, 0.5878, 0.4741, 0.3598, 0.7629, 0.5941, 0.6174, 0.6860, 0.0609, 0.6488, and 0.2747.

Data Set 4: Unemployment Claim Data 1: The usefulness of the proposed MKw model is determined by taking into the account a heavy tailed real data sets from insurance field. The given data was used by [5] and consisted of 58 values related to the monthly metrics on the unemployment insurance: 0.188, 0.202, 0.195, 0.385, 0.489, 0.545, 0.541, 0.535, 0.521, 0.508, 0.512, 0.507, 0.519, 0.493, 0.487, 0.460, 0.490, 0.460, 0.490, 0.500, 0.400, 0.350, 0.370, 0.410, 0.400, 0.400, 0.410, 0.400, 0.420, 0.450, 0.450, 0.420, 0.390, 0.340, 0.360, 0.400, 0.440, 0.390, 0.410, 0.450, 0.460, 0.470, 0.490, 0.460, 0.410, 0.390, 0.400, 0.440, 0.420, 0.420, 0.450, 0.470, 0.530, 0.420, 0.490, 0.440, 0.420, and 0.400.

Data Set 5: Unemployment Claim Data 2: 0.823, 0.864, 0.816, 0.841, 0.831, 0.833, 0.894, 0.869, 0.866, 0.860, 0.837, 0.826, 0.804, 0.809, 0.758, 0.770, 0.778, 0.707, 0.814, 0.825, 0.906, 0.924, 0.927, 0.920, 0.770, 0.544, 0.550, 0.608, 0.630, 0.650, 0.820, 0.873, 0.900, 0.916, 0.899, 0.862, 0.695, 0.650, 0.751, 0.862, 0.702, 0.530, 0.764, 0.898, 0.897, 0.908, 0.902, 0.879, 0.645, 0.739, 0.765, 0.803, 0.708, 0.669, 0.561, 0.579, 0.701, and 0.839.

We used the MLE method in order to find the unknown values of the MKw parameters. Several goodness-of-fit (GoF), namely, Akaike information criterion (AIC), corrected AIC (CAIC), Bayesian information criterion (BIC), Hannan-Quinn information criterion (HQIC), Cramér-von Mises (CvM), Anderson–Darling (AD) and Kolmogrov–Smirnov (KS) measures were used to elect an adequate model.

Tables711 list the MLEs and standard error for the MKw model and other competitive distributions such as, Gamma Kumaraswamy(GaKw) [30], Size-Biased Kumaraswamy (SBKw) [31], Kumaraswamy (Kw), and Beta. While the AIC, CAIC, BIC, HQIC, and other GoFs for the MKw model and other competitive models (GaKw, SBKw, Kw, and Beta) for data sets 1, 2, 3, 4, and 5, respectively. The values of the GoFs in Tables 1216 indicate that the MKw model shows small values of the GoFs, and thus provides the best fit as compared to the other models. The plots in Figures 711 also support our claim.

7.1. Numerical Illustration of VaR and ES

Here we demonstrate the numerical as well as graphical presentation of the two important risk measures, VaR and ES. The comparative study of VaR and ES of the proposed MKw model with its counterparts (Kw, SBKw, and Beta models) is performed by taking MLEs estimates of the parameters for the models in both data sets. It is worth-emphasizing that a model with higher values of the risk measures is said to have a heavier tail. Tables 17 and 18 provide the numerical illustration of the VaR and ES for four models of data 4 and 5 and yield that the MKw model has higher values of both the risk measures as compared to their counterparts (Kw, SBKw, and Beta models). The graphical demonstration of the models from Figures 12 and 13, also revealed that the proposed model has heavier tail than Kw, SBKw, and Beta model. The readers are referred to [34] for detailed discussion of VaR and ES and their computation by using an R package.

It is clear that, the MKw model provides a better fit than the other tested models, because it has the smallest value among AIC, CAIC, BIC, HQIC, CvM, AD, and K-S.

8. Concluding Remarks

We proposed a modified Kumaraswamy distribution by modification for (0, 1). We reported some mathematical properties of the modified Kumaraswamy distribution. We solved the quantile function, which helped in the simulation study. We simulated some parameter values, which showed that the model’s behavior was good. We also analyze this distribution with well-known models such as Gamma Kumaraswamy, Size-Biased Kumaraswamy, Kumaraswamy, and Beta using well-established GoF test-statistics for five real-life data sets including insurance claim data. We observed that our model performed better than the other comparative models on the basis of numerical results, GoFs, and graphical measures. We hope that the proposed modified distribution will get great attention from researchers.

Data Availability

The data used in the article are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.