Abstract

Statistical distributions play a prominent role for modeling data in applied fields, particularly in actuarial, financial sciences, and risk management fields. Among the statistical distributions, the heavy-tailed distributions have proven the best choice to use for modeling heavy-tailed financial data. The actuaries are often in search of such types of distributions to provide the best description of the actuarial and financial data. This study presents a new power transformation to introduce a new family of heavy-tailed distributions useful for modeling heavy-tailed financial data. A submodel, namely, heavy-tailed beta-power transformed Weibull model is considered to demonstrate the adequacy of the proposed method. Some actuarial measures such as value at risk, tail value at risk, tail variance, and tail variance premium are calculated. A brief simulation study based on these measures is provided. Finally, an application to the insurance loss dataset is analyzed, which revealed that the proposed distribution is a superior model among the competitors and could potentially be very adequate in describing and modeling actuarial and financial data.

1. Introduction

Heavy-tailed distributions have been studied for decades by actuaries to investigate various aspects of financial portfolio theory and risk management problems. As always, most insurance loss datasets share the same properties such as skewed to the right, unimodal shaped, and have a very thick right tail, refer the studies of Kazemi and Noorizadeh [1], Adcock et al. [2], Bhati and Ravi [3], Ahmad et al. [4], Ahmad et al. [5], Ahmad et al. [6], and Ahmad et al. [7]. A distribution possess these characteristics is quite appropriate for modeling insurance loss data and can also be used for estimating the business risk level. These distributions include Pareto (Coorey and Ananada [8]), Lomax (Scollnik [9]), beta (Hogg and Klugman [10], lognormal (Klugman et al. [11]), Burr (Nadarajah and Bakar [12]), and Weibull (Bakar et al. [13]). These are the most appropriate models for modeling of insurance losses. For more details, refer to the studies by Shushi [14], Punzo et al. [15], Punzo et al. [16], and Punzo [17].

However, despite these classical distributions having many merits, there are still some deficiencies in these models as they are not flexible enough to provide the best fit to the heavy-tailed datasets. As an example, (i) the Pareto model, which is one of the prominent models used for modeling financial data, sometimes provides poor fitting to many financial applications, (ii) on the other hand, the Weibull model can only cover the behavior of small losses adequately but is not a reasonable candidate model to deal with the behavior of large losses, and (iii) the distribution functions of both the lognormal and the beta distributions have no closed form expressions causing difficulties in the derivation of many mathematical properties and make them less popular to use for analyzing financial datasets (Bhati and Ravi [18]).

Due to the importance of the heavy-tailed distributions in the field of actuarial sciences, the actuaries are always in search for introducing new statistical distributions to cover the deficiencies of the traditional distributions. This motivated the researchers to search for new flexible heavy-tailed distributions.

In this context, serious efforts have been performed to propose new approaches for developing new flexible distributions such as (i) composition of two or more distributions, (ii) transformation of variables, (iii) finite mixture of distributions, and (iv) compounding of distributions. However, still these methods are suffering from some sort of deficiencies (Ahmad [19] and Ahmad et al. [20]). Thus, it is imperative to develop new statistical models having a closed form of distribution functions, heavy-tailed than the exponential class of distributions, and are capable of adequately modeling the financial risk management problems.

In the premises of the above, we further carry this area of distribution theory and propose a new method to introduce new flexible heavy-tailed distributions. The proposed family may be called a new heavy-tailed beta-power transformed (HTBPT) family of distributions. The proposed method may be an interesting alternative to the existing literature for fitting heavy-tailed data. We study a new model called the heavy-tailed beta-power transformed Weibull (HTBPT–Weibull) distribution as a special case of the HTBPT distributions. The proposed model is very flexible and could be chosen as a good candidate model for modeling heavy-tailed data. Some mathematical properties along with the actuarial measures are derived. Based on the actuarial measures, a comparative simulation study of the proposed model and Weibull (also known as superexponential) distributions is conducted. The simulation results show that the HTBPT–Weibull distribution has heavier tail than the superexponential distribution.

The study is sectioned as follows. In Section 2, we provided the steps followed in the derivation of the proposed method. In Section 3, we introduced a special case of the proposed family. In Section 4, we introduced some mathematical properties of the proposed family. In Section 5, the maximum likelihood estimation method is used in order to estimate the parameters of the model. A brief Monte Carlo simulation study is provided in Section 6. Actuarial measures of the proposed model are discussed in Section 7. A real-life application of the proposed model to insurance loss data is provided in Section 8. Finally, Section 9 contains the conclusions and the major findings of this research study.

2. Steps Followed for Developing the Proposed Family

Suppose that a random variable for , with density function , and let the random variable has the cumulative distribution function (cdf) , satisfying(1)(2) must be monotonically increasing and can be differentiated(3) as and as

The family method was originally proposed by Alzaatreh et al. [21] as follows:where satisfies the above conditions. Corresponding to (1), the probability density function (pdf) is

More information about the technique for introducing new families of distributions is given by Ahmad et al. [19]. By applying the technique, let us assume that T follows the exponential distribution with the scale parameter, say ; then, its cdf has the form given by

By differentiating equation (3) with respect to , we have

Assume that is as represented in equation (4) and use in (1). Then, we havewhere is the cdf of the baseline model which depends on the vector parameter . From expression (5), we can see that the proposed cdf reduces to the baseline cdf for . The density function of equation (5) is as follows:

The expression (6) can also be written as

The survival function (sf) and hazard rate function (hrf) of the proposed family are given byrespectively.

The random variable X with pdf (5) will be denoted by . For the sake of simplicity, we can omit the dependence on the parameters and simply write .

3. Description of the Proposed Model

This section deals with the introduction of the new HTBPT–Weibull distribution and its special cases.

3.1. Heavy-Tailed Beta-Power Transformed Weibull Distribution

Referring to the cdf of the Weibull distribution with shape parameter and scale parameter , , and pdf , where . A random variable say X has the HTBPT–Weibull distribution, if its cdf is

The pdf, sf, and hrf of the HTBPT–Weibull distributions are given byrespectively.

In order to see the effect of adding to the proposed distribution, we fixed the distribution’s parameters ( and ) and changed the values of and sketched the plots. From the plots provided in Figure 1, it is clear that as the value of the additional parameter increases, the proposed model tends to a heavy-tailed distribution.

Plots for the cdf and sf of the proposed model are provided in Figure 2. Whereas, the plots of the hrf are shown in Figure 3.

3.2. Special Cases of the HTBPT–Weibull Distribution

This subsection deals with the subcases of the proposed distribution. Let X has the HTBPT–Weibull distribution with parameters . Then, we obtainThe Weibull distribution with parameters and , using The one parameter Weibull distribution with parameter , using The exponential distribution with parameter , using The Rayleigh distribution with parameter , if and The one parameter HTBPT–Weibull distribution with parameters and , if (new)The heavy-tailed beta-power transformed exponential distribution with parameters and , if (new)The HTBPTR–Rayleigh distribution with parameters and , if (new)

4. Mathematical Properties

This section is devoted to some basic mathematical properties of the HTBPT family.

4.1. Shape of Density and Hazard Functions of the HTBPT Distributions

The shapes of the density function and the hrf of the HTBPT distributions can be expressed by an analytical formula. We can also find the critical points of the density function by differentiating it with respect to as follows:

The above equation may have more than one root. If is a root of the above equation, then it corresponds to (i) a local maximum, if , (ii) a local minimum, if , and (iii) a point of inflection, if .

The parameter space is the same as in the HTBPT family. In a similar way, the critical points of the hazard function can be obtained by differentiation as follows:

The above equation may have more than one root. If is a root of the above equation, then it corresponds to (i) a local maximum, if , (ii) a local minimum, if , and (iii) a point of inflection, if .

4.2. Asymptotic Behavior of HTBPT Distributions

This subsection is devoted to the asymptotic behavior of the HTBPT distributions.As , we haveAs , we have

4.3. Mixture Representation of the Density of HTBPT Distributions

This subsection deals with the mixture representation of the density of HTBPT distributions. We know that ; therefore, the expression (6) can be formulated aswhere , and .where is defined as the density function of exponentiated generated distributions with parameter in its power.

4.4. Quantile Function

The quantile functions are in extensive use in general statistics and frequently used to generate random samples from probability distributions. Let X denotes the HTBPT random variable; then, the quantile function of X can be obtained as follows:where t is the solution of the equation , and u is the uniform distribution over (0, 1) interval.

4.5. Moments

In this part of the study, we derived an important property of the HTBPT distributions called moments, and it has a specific and important role in statistical analysis. It is used in finding the major properties and characteristics of the distribution (e.g., central tendency, dispersion, skewness, and kurtosis). The moment of the HTBPT distributions is derived as follows:where

The effects of different values of the parameters and on the mean, variance, skewness, and kurtosis of the HTBPT–Weibull distribution with are illustrated in Figures 4 and 5.

The moment generating function (mgf) is another useful tool to calculate the moments of the statistical distributions. The mgf of the HTBPT distributions is given by

4.6. Incomplete Moments

The shape of many statistical distributions can be conveniently described by the incomplete moments. It plays a very important role in measuring inequalities (e.g., income quantiles as well as Lorenz and Bonferroni curves). Here, we drive the incomplete moment of HTBPT distributions aswhere , and . Furthermore, we can find the conditional measures of the HTBPT-distributed random variable X aswhere , and .

5. Estimation

This part of the study is concerned with the estimation of the unknown parameters of the HTBPT distributions. Let be a random sample taken from HTBPT distributions with parameters . Then, the corresponding log-likelihood function of this sample is

The function (23) can be maximized either by deploying Ox program (subroutine Max BFGS) or SAS (PROC NLMIXED) or by solving it manually. Obtaining the partial derivatives of (10), we have

Equating and to zero and simultaneously solving these expressions numerically yields the maximum likelihood estimators (MLEs) of .

6. Simulation Study

The section offers a brief Monte Carlo simulation study to evaluate the efficiency and consistency properties of the MLEs of the HTBPT–Weibull parameters (a particular case from the proposed family). The random samples are generated from the HTBPT–Weibull distribution using the inverse cdf method. The simulation results are obtained using optim R function with the argument method = “L-BFGS-B.” The steps taken for simulation are given as follows:(1) iterations are made, and different samples of size are generated from the HTBPT–Weibull distribution using the inversion cdf method.(2)The parameter values are estimated using the maximum likelihood estimation method(3)The biases and mean square errors (MSEs) are calculated using the formulasrespectively(4)For calculating the biases as well as the MSEs of , step (3) is repeated

The simulation results are presented in Tables 1 and 2. Whereas, the graphical display of the simulation results is shown in Figures 69. It is important to note that, in Figures 69, is used for , and is used for .

From the simulation results provided in Tables 1 and 2, it is clear that the estimates of the model parameters are very precise and close to the true value of the parameters as the samples’ size increases. Also, we can see that the values of the estimated biases and absolute biases tend to zero, when the sample size increases, which shows the accuracy of MLEs. Also, as increases, MSEs tend to zero. This fact leads to the consistency property of the MLEs of the HTBPT–Weibull distribution.

7. Actuarial Measures

One of the primary tasks of actuarial sciences institutions is to predict the market risks in a portfolio of instruments. Henceforth, estimating the risk measures is very important in selling and buying products. In this section of the study, we discuss some important risk measures such as the value at risk (VaR), tail value at risk (TVaR), tail variance (TV), and tail variance premium (TVP) of the proposed family.

7.1. Value at Risk

The VaR of a random variable X is the quantile of its cdf (Artzner [22]). If X follows the HTBPT distributions with parameters , then the VaR of X denoted by is given bywhere t is the solution of the equation , and u follows the uniform distribution on interval (0, 1).

7.2. Tail Value at Risk

Let X follows the HTBPT distributions with parameters and ; then, the TVaR of X is derived as

Using (6) in (27), we get

Using the pdf and cdf of Weibull distribution in (28), we get

On solving (29), we have

7.3. Tail Variance

Among the actuarial measures, TV is the most prominent one and deals with the tail variance beyond VaR. The TV of the HTBPT–Weibull distributed random variable is derived as follows:

Consider

By solving the above equations, we getBy referring to equations (30) and (33) and substituting in (31), we get the TV of HTBPT–Weibull distribution.

7.4. Tail Variance Premium

In this subsection, we discuss another most important risk measure called TVP. The TVP of the HTBPT–Weibull distribution is derived as follows:where . By substituting (30) and (31) in (34), we can get the TVP of the proposed distribution.

7.5. Numerical Study of the Risk Measures

The importance of this subsection is due to the numerical study of VaR and TVaR for the Weibull and HTBPT–Weibull distributions for different sets of parameters. The followed steps are given by(1)A random sample of size is generated from the Weibull and HTBPT–Weibull distributions, and the maximum likelihood method is adopted to estimate the model parameters.(2)Repetitions of 1000 iterations are made to find the values of VaR and TVaR for these distributions

Tables 35 contain the simulation results of the risk measures. The simulation results are also displayed graphically corresponding to each table in Figures 1012, respectively.

After performing the simulation study based on VaR and TVaR, we observed that the proposed model is better than the other compared models because it has higher values of the risk measures (VaR and TVaR).

8. Practical Illustration of the Proposed Model

This section deals with the practical illustration of the proposed model using a real-life insurance dataset. Additionally, we use the same data to calculate the actuarial measures of the Weibull and HTBPT–Weibull distributions.

8.1. An Application to Vehicle Insurance Loss Data

In this part of the study, we consider a dataset from insurance sciences to show the practicability of the proposed distribution. These data have also been studied by Zhao et al. [23] and can be found at http://www.businessandeconomics.mq.edu.au. The comparison of the HTBPT–Weibull distribution is made with the other well-known distributions including the two-parameter Weibull, Lomax, Burr-XII (B-XII), three-parameter new beta-power transformed Weibull (NBPTW), four-parameter Kumaraswamy–Weibull (Ku–W), and new Weibull Burr-XII (NWB-XI) distributions. The cdfs of the competing distributions are(1)Weibull distribution(2)W-Loss distribution(3)NBPTW distribution(4)Kumaraswamy–Weibull(5)Lomax(6)NWB-XII(7)B-XII

The comparison is made using some certain analytical measures called the Akaike information criterion (AIC) (Akaike [24]), Bayesian information criterion (BIC) (Schwarz [25]), Hannan–Quinn information criterion (HQIC) (Hannan and Quinn [26]), and consistent Akaike information criterion (CAIC) (Bozdogan [27]). These measures are as follows:The AIC is given byThe CAIC isThe BIC is calculated as follows:The HQIC iswhere is the number of model parameters, is the sample size, and refers to the log-likelihood function evaluated at MLEs.

For the calculation of the analytical measures, we use the optim R function with the argument method = “SANN.” Smaller values of these analytical measures indicate the best fit. The MLEs of the model parameters along with the analytical measures are reported in Tables 6 and 7, respectively. Referring to the results provided in Table 7, we found that the HTBPT–Weibull distribution is a superior model and provides the best fit among the other considered competitors. In support of Table 7, the estimated pdf and cdf of the proposed distribution are shown in Figure 13, which closely fit the proposed model. The probability-probability (PP) plot and Kaplan–Meier survival plot of the HTBPT–Weibull model are shown in Figure 14. From the PP and Kaplan–Meier survival plots, it is clear that the proposed is a good candidate model for modeling heavy-tailed insurance datasets. Furthermore, the quantile-quantile (QQ) plot and box plot of the data are shown in Figure 15.

8.2. Computation of the Actuarial Measures Using Real Data

In this subsection, we calculate and compare the values of the VaR and TVaR measures of the Weibull and HTBPT–Weibull distributions using the data discussed in subsection 8.1. The numerical results are reported in Table 8.

9. Concluded Remarks

The statistical distributions are very useful for modeling financial and actuarial datasets. Among them, the heavy-tailed distributions are very prominent for modeling financial and other related datasets. A number of methods to define heavy-tailed distributions have been studied in the literature. However, unfortunately, these methods still have certain limitations. To address the issues associated with the former methods, we introduced a new heavy-tailed family of distributions. By using the proposed method, a new three-parameter special model, called the heavy-tailed beta power transformed Weibull distribution is studied in detail. Actuarial measures of the proposed model are calculated, and a simulation study is conducted to show the usefulness of the proposed method in actuarial sciences. The applicability of the family of distributions has been illustrated using a real-life heavy-tailed dataset from insurance and shown that the proposed model performs reasonably well compared to other well-known heavy-tailed distributions.

Appendix

R Code for Analysis

Note: in the following R-code, pm is used for the proposed model.data<-read.csv(file.choose(), header=TRUE)data=data[,1]data=data[!is.na(data)]data\paraindent #################################################################### PDF of the proposed model###################################################pdf_pm <−function(par,x){alpha=par [1]gamma=par [2]beta=par [3]alphagamma(x^(alpha-1))exp(-gammax^alpha)<v:shape xmlns:wpc=http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas”xmlns:mc=“http://schemas.openxmlformats.org/markup-compatibility/2006” xml-ns:w14=“http://schemas.microsoft.com/office/word/20-10/wordml” xmlns:wpg=“http://schemas.microsoft.com/office/word/2010/wordprocessingGroup” xmlns:wpi=“http://schemas.microsoft.com/office/word/2010/wordprocessingInk” xmlns:wps=“http://schemas.micr-osoft.com/office/word/2010/wordprocessingShape” id=“_x0000_i1910” type=“#_x0000_t75” style=“width:6.8pt;height:14.75pt” o:ole=””><v:imagedata r:id=“rId554” o:title=””/><o:OLEObject xmlns:wpc=“http://schemas.microsoft.com/office/word/2010/word-processingCanvas” xmlns:mc=“http://schemas.openxmlformats.org/markup-compatibility/2006” xmlns:w14=“http://schemas.microsoft.com/office/word/2010/wordml” xmlns:wpg=“http://schemas.microsoft.com/office/word/2010/wordprocessingGroup” xmlns:wpi=“http://schemas.microsoft.com/office/word/2010/wordprocessingInk” xmlns:wps=“http://schemas.microsoft.com/office/word/2010/wordprocessingShape” Type=“Embed” ProgID=“Equation.DSMT4” ShapeID=“_x0000_i1910” DrawAspect=“Content” ObjectID=“_1677760135”r:id=rId555/>(beta-(log(beta))beta^(exp(-gammax^alpha)))}#################################################################### CDF of the proposed model###################################################cdf_pm <- function(par,x){alpha=par [1]gamma=par [2]beta=par [3]beta^(exp(-gamma x^alpha))-beta(exp(-gamma x^alpha))}set.seed(0)goodness.fit(pdf=pdf_pm, cdf=cdf_pm,starts = c(0.5,0.5,0.5), data = data,method=BFGS, domain=c(0,Inf),mle=NULL)

Data Availability

The link of the dataset used in this study is included within the article, and the data are available from Zubair Ahmad upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.