Bayesian Estimation of Inequality and Poverty Indices in Case of Pareto Distribution Using Different Priors under LINEX Loss Function

Kaur, Kamaljit; Arora, Sangeeta; Mahajan, Kalpana K.

doi:https://doi.org/10.1155/2015/964824

Advances in Statistics

Research Article | Open Access

Volume 2015 | Article ID 964824 | https://doi.org/10.1155/2015/964824

Kamaljit Kaur,¹ Sangeeta Arora,¹ and Kalpana K. Mahajan¹

Academic Editor: Karthik Devarajan

Received 29 Aug 2014

Accepted 07 Jan 2015

Published 29 Jan 2015

Abstract

Bayesian estimators of the Gini index and a poverty measure are obtained for the Pareto distribution under censored and complete setups. The estimators are obtained using two noninformative priors, namely, the uniform prior and Jeffreys’ prior, and one conjugate prior, under the assumption of the Linear Exponential (LINEX) loss function. Using simulation techniques, the relative efficiency of the proposed estimators under different priors and loss functions is obtained. The performance of the proposed estimators is compared on the basis of their simulated risks under the LINEX loss function.

1. Introduction

The Pareto distribution is a skewed, heavy-tailed distribution that is used to model the distribution of incomes and other financial variables. It was introduced by Pareto [1] and has probability density function
$$f(x) = \frac{\alpha m^{\alpha}}{x^{\alpha+1}}, \quad x \ge m > 0,\ \alpha > 0, \tag{1}$$
and cumulative distribution function
$$F(x) = 1 - \left(\frac{m}{x}\right)^{\alpha}, \quad x \ge m. \tag{2}$$
The parameter $m$ in (2) represents the minimum income in the population under study and is assumed to be known, while the other parameter α (the shape parameter) is assumed to be unknown.

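The average income for the Pareto distribution with minimum income $m$ and shape parameter α is
$$\mu = \frac{\alpha m}{\alpha - 1}, \quad \alpha > 1. \tag{3}$$
In the context of income inequality and poverty, the Gini index and the head count ratio are the two most popular indices [2, 3]. The Gini index is generally defined as
$$G = 1 - 2\int_{0}^{1} L(p)\,dp, \tag{4}$$
where $L(p) = \frac{1}{\mu}\int_{0}^{p} F^{-1}(t)\,dt$ is the equation of the Lorenz curve and $\mu$ is the mean of the distribution.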

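Equivalently, the Gini index can also be defined as
$$G = \frac{\Delta_G}{2\mu}, \tag{5}$$
where $\Delta_G = E|X_1 - X_2|$ is the population Gini mean difference ($X_1$ and $X_2$ being independent incomes from the population) and $\mu$ is the mean of the distribution.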

The head count ratio is simply the number of households whose incomes are below the poverty line divided by the total population. For a continuous distribution with density $f$ and distribution function $F$,
$$P_0 = F(z) = \int_{0}^{z} f(x)\,dx, \tag{6}$$
where $z$ is called the poverty line.

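For the Pareto distribution with shape parameter α and minimum income $m$, the Gini index [4, 5] is given by
$$G = \frac{1}{2\alpha - 1}, \tag{7}$$
and the poverty measure is
$$P_0 = 1 - \left(\frac{m}{z}\right)^{\alpha}, \tag{8}$$
where $z \ge m$ and $\alpha > 1$.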

Thus, $z$ is the per capita annual income representing a minimum acceptable standard of living, and $P_0$ represents the proportion of the population having income equal to or less than $z$.
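The closed forms for the Pareto case can be checked numerically. The following is a minimal sketch, assuming the standard expressions $\mu = \alpha m/(\alpha-1)$, $G = 1/(2\alpha-1)$, and $P_0 = 1-(m/z)^{\alpha}$; the function names and the numerical values (shape 2, minimum income 100, poverty line 200) are illustrative assumptions and are not taken from the paper.

```python
def pareto_mean(alpha: float, m: float) -> float:
    """Mean income alpha*m/(alpha - 1); finite only for alpha > 1."""
    if alpha <= 1.0:
        raise ValueError("the mean is finite only for alpha > 1")
    return alpha * m / (alpha - 1.0)


def pareto_gini(alpha: float) -> float:
    """Gini index 1/(2*alpha - 1); this sketch requires alpha > 1."""
    if alpha <= 1.0:
        raise ValueError("this sketch requires alpha > 1 so that the mean exists")
    return 1.0 / (2.0 * alpha - 1.0)


def pareto_head_count(alpha: float, m: float, z: float) -> float:
    """Head count ratio P0 = F(z) = 1 - (m/z)**alpha; zero when z < m."""
    if z < m:
        return 0.0
    return 1.0 - (m / z) ** alpha


# Illustrative values only (assumptions, not from the paper).
alpha, m, z = 2.0, 100.0, 200.0
print(pareto_mean(alpha, m))            # 200.0
print(pareto_gini(alpha))               # 0.333...
print(pareto_head_count(alpha, m, z))   # 0.75
```

A larger shape parameter gives a smaller Gini index, which is why overestimating α understates inequality.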

The estimation of the Gini index and the poverty measure ($P_0$), and the associated inference using the classical approach (parametric and nonparametric), is available in the literature [5–8]. However, in the Bayesian setup, this problem has not evoked the interest of many researchers [9, 10]. In the present paper, our focus is on the estimation of inequality and poverty indices in the Bayesian setup.

When the Bayesian method is used, the choice of appropriate prior distribution plays an important role, which may be categorized as informative, noninformative, and conjugate priors [11, 12]. In the present paper, three priors (two noninformative priors and one conjugate prior) are used to estimate shape parameter, Gini index, Average income, and Poverty measure. The two noninformative priors are Uniform prior and Jeffreys’ prior, while conjugate prior is chosen as Truncated Erlang distribution.

In Bayesian estimation, the quality of an estimator of the parameter of interest depends on the choice of an appropriate loss function. Two loss functions in common use are the squared error loss function (SELF) and the linear exponential (LINEX) loss function. The simplest loss function is the squared error loss, also referred to as quadratic loss, given by
$$L(\hat{\theta}, \theta) = (\hat{\theta} - \theta)^2, \tag{9}$$
where $\hat{\theta}$ is the estimator of θ.

The usual squared error loss function is symmetric and assigns equal importance to losses due to overestimation and underestimation of equal magnitude. However, such a restriction may be impractical. For example, in estimating the shape parameter of the classical Pareto distribution, overestimation and underestimation may not be of equal importance: an overestimate of the shape parameter gives an underestimate of the inequality index, which seems more serious than an underestimate of the shape parameter because we are often interested in reducing income inequality. This suggests using an asymmetric loss function that assigns greater importance to overestimation when estimating the shape parameter. A number of asymmetric loss functions have been proposed in the statistical literature [13–16]. Varian [16] proposed a useful asymmetric loss function known as the linear exponential (LINEX) loss function, given by
$$L(\Delta) = e^{b\Delta} - b\Delta - 1, \quad b \neq 0, \tag{10}$$
where $\Delta = \hat{\theta} - \theta$ is the estimation error. The posterior expectation of the LINEX loss function (10) is
$$E_{\theta}\left[L(\hat{\theta} - \theta)\right] = e^{b\hat{\theta}}\,E_{\theta}\left[e^{-b\theta}\right] - b\left(\hat{\theta} - E_{\theta}[\theta]\right) - 1, \tag{11}$$
where $E_{\theta}$ denotes posterior expectation with respect to the posterior density of θ.

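By a result of Zellner [17], the Bayes estimator of θ under the LINEX loss function, denoted by $\hat{\theta}_L$, is the value of $\hat{\theta}$ that minimizes the posterior expectation (11) and is given by
$$\hat{\theta}_L = -\frac{1}{b}\ln E_{\theta}\left[e^{-b\theta}\right], \tag{12}$$
provided that the expectation $E_{\theta}\left[e^{-b\theta}\right]$ exists and is finite [18].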
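The Zellner result can be illustrated with a Monte Carlo approximation. The sketch below assumes the estimator has the form $-\frac{1}{b}\ln E[e^{-b\theta}]$ and replaces the posterior expectation by an average over simulated draws. The gamma "posterior" (shape 5, scale 0.5) and the values of $b$ are hypothetical choices made only for illustration.

```python
import math
import random


def linex_bayes_estimate(draws, b: float) -> float:
    """Monte Carlo version of -(1/b) * ln E[exp(-b * theta)] from posterior draws."""
    mean_exp = sum(math.exp(-b * t) for t in draws) / len(draws)
    return -math.log(mean_exp) / b


rng = random.Random(0)
# Hypothetical posterior: gamma with shape 5 and scale 0.5 (posterior mean 2.5).
draws = [rng.gammavariate(5.0, 0.5) for _ in range(20000)]
posterior_mean = sum(draws) / len(draws)

est_over = linex_bayes_estimate(draws, b=1.0)    # b > 0 penalises overestimation
est_under = linex_bayes_estimate(draws, b=-0.5)  # b < 0 penalises underestimation
print(est_over, posterior_mean, est_under)
```

By Jensen's inequality the LINEX estimate lies below the posterior mean (the Bayes estimate under squared error loss) when $b > 0$ and above it when $b < 0$, so the sign of $b$ controls the direction in which the estimate is shifted.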

In Figures 1(a) and 1(b), values of the LINEX loss are plotted against selected values of the estimation error for a positive and a negative value of $b$. It is seen that, for $b > 0$, the function is quite asymmetric, with a value exceeding the target being more serious than a value below the target. For $b < 0$, the function is also quite asymmetric, but with a value below the target being more serious than a value exceeding the target.

Figure 1: The LINEX loss function for (a) positive b, (b) negative b, and (c) small values of b.

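For small values of $|b|$, the LINEX loss function can be expanded in a Taylor series as
$$L(\Delta) = e^{b\Delta} - b\Delta - 1 = \sum_{k=2}^{\infty}\frac{(b\Delta)^{k}}{k!} \approx \frac{b^{2}\Delta^{2}}{2}, \tag{13}$$
where $\Delta = \hat{\theta} - \theta$. Thus, the LINEX loss function is approximately proportional to the squared error loss function for small values of b (see Figure 1(c)).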
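The asymmetry and the small-$b$ behaviour can be verified directly. This is a minimal sketch, assuming the LINEX loss is written as $e^{b\Delta} - b\Delta - 1$ with $\Delta$ the estimation error; the function names and the chosen values of $b$ and $\Delta$ are illustrative assumptions.

```python
import math


def linex_loss(delta: float, b: float) -> float:
    """LINEX loss exp(b*delta) - b*delta - 1 (expm1 avoids cancellation for small b)."""
    return math.expm1(b * delta) - b * delta


def scaled_squared_error(delta: float, b: float) -> float:
    """Leading Taylor term (b*delta)**2 / 2 of the LINEX loss."""
    return 0.5 * (b * delta) ** 2


for b in (1.0, 0.01):
    for delta in (-1.0, 1.0):
        print(b, delta, linex_loss(delta, b), scaled_squared_error(delta, b))
```

With $b = 1$ an overestimate of one unit costs roughly twice as much as an underestimate of one unit, while with $b = 0.01$ the two losses are nearly equal and close to the quadratic term.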

This loss function has been considered by Zellner [17], Basu and Ebrahimi [19], and Afify [20] for different distributions.

In the present study, LINEX loss function is used for estimating the shape parameter, Gini index, Mean income, and a Poverty measure in the context of Pareto distribution using noninformative priors (Uniform prior and Jeffreys’ prior) and one conjugate prior (Truncated Erlang distribution) along with some assumptions regarding the sampled population. Bayesian approach with prior and posterior distributions along with sampling schemes in the context of Pareto distribution is given in Section 2. In Section 3, Bayesian estimators of shape parameter, Gini index, Mean income, and Poverty measure using different priors under the assumption of LINEX loss function are obtained. Finally, in Section 4, simulation is done to compare the efficiency of three different approaches using three priors and loss functions. The robustness of the hyperparameters is given in Section 4.1 through simulation study. Section 5 presents the conclusion of the study.

2. Preliminary about Sampling Scheme, Priors, and Posterior Densities

The Bayesian analysis of the Pareto distribution (2) is based on the following censored sampling scheme for personal income data. It is assumed that the annual incomes of $n$ persons are under study, but exact figures are available only for those $r$ individuals whose annual income does not exceed a prescribed annual income $w$; for the remaining $n - r$ individuals, the exact income figures are unknown, but we do know that their annual incomes exceed the prescribed figure $w$. Before the arrival of the sample data on personal incomes, $n$ is predetermined but $r$ is not; $r$ is a random variable. This censoring scheme is referred to as a right censored sampling scheme.

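The likelihood function for a complete sample $x_1, \ldots, x_n$ from the Pareto distribution with minimum income $m$ and shape parameter α [4] is
$$L(\alpha \mid x) = \prod_{i=1}^{n}\frac{\alpha m^{\alpha}}{x_i^{\alpha+1}} = \frac{\alpha^{n} m^{n\alpha}}{\left(\prod_{i=1}^{n} x_i\right)^{\alpha+1}}. \tag{14}$$
In the case of censored data, with $r$ ordered incomes $x_1 \le \cdots \le x_r \le w$ observed exactly and $n - r$ incomes known only to exceed $w$, the likelihood function for any distribution [21] is
$$L = \frac{n!}{(n-r)!}\left[\prod_{i=1}^{r} f(x_i)\right]\left[1 - F(w)\right]^{n-r}. \tag{15}$$
The likelihood function for the Pareto distribution in a censored sample is therefore
$$L(\alpha \mid x) \propto \alpha^{r} e^{-\alpha T}, \tag{16}$$
where $P = w^{\,n-r}\prod_{i=1}^{r} x_i$ is the product income statistic [22] and $T = \ln\left(P/m^{n}\right)$.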
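The censored likelihood depends on the data only through the number of exactly observed incomes and a single log-sum statistic. The sketch below assumes the likelihood kernel $\alpha^{r}e^{-\alpha T}$ with $T = \sum_{i=1}^{r}\ln(x_i/m) + (n-r)\ln(w/m)$; the names `m` (minimum income), `w` (censoring income), and the toy data are hypothetical. In real data the censored incomes are unknown; here any value above `w` is simply treated as censored.

```python
import math


def censored_statistics(incomes, m: float, w: float):
    """Return (r, T) for right censoring at w.

    r is the number of incomes observed exactly (x <= w) and
    T = sum(log(x_i / m)) over observed incomes + (n - r) * log(w / m).
    """
    observed = [x for x in incomes if x <= w]
    r = len(observed)
    n = len(incomes)
    t = sum(math.log(x / m) for x in observed) + (n - r) * math.log(w / m)
    return r, t


def log_likelihood_kernel(alpha: float, r: int, t: float) -> float:
    """Log of the kernel alpha**r * exp(-alpha * T), constants dropped."""
    return r * math.log(alpha) - alpha * t


# Toy data (assumption): three incomes with m = 1, censored at w = e**3.
incomes = [math.e, math.e ** 2, math.e ** 5]
r, t = censored_statistics(incomes, m=1.0, w=math.e ** 3)
print(r, t, r / t)  # r = 2, T is about 6, and r / T maximises the kernel
```

Setting $r = n$ recovers the complete-sample case, in which $T$ reduces to $\sum_{i=1}^{n}\ln(x_i/m)$.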

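The Bayes estimators of the Gini index and the average income do not converge for $\alpha \le 1/2$ and $\alpha \le 1$, respectively, because $G = 1/(2\alpha - 1)$ is singular at $\alpha = 1/2$ and $\mu = \alpha m/(\alpha - 1)$ is singular at $\alpha = 1$; the method then fails to work. This difficulty is removed by assuming $\alpha > \alpha_0$, for a known constant $\alpha_0 > 1$, when obtaining the different Bayes estimators.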

The prior and posterior densities for noninformative priors (Uniform prior and Jeffreys’ prior) and conjugate prior are explained below.

(i) Uniform Prior. In practice, informative priors are not always available; in such situations, the use of noninformative priors is recommended. One of the most widely used noninformative priors, due to Laplace [23], is the uniform prior. Therefore, the uniform prior has been assumed for the estimation of the shape parameter of the Pareto distribution.

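The uniform prior for α is
$$g_1(\alpha) \propto 1, \quad \alpha > \alpha_0, \tag{17}$$
where $\alpha_0$ is the known lower bound assumed for α. Combining the likelihood function (16) with the prior density (17) by Bayes' theorem gives the posterior density
$$\pi_1(\alpha \mid x) = \frac{T^{\,r+1}\alpha^{r} e^{-\alpha T}}{\Gamma(r+1, \alpha_0 T)}, \quad \alpha > \alpha_0, \tag{18}$$
where $r$ is the number of exactly observed incomes, $T$ is the statistic in the exponent of the likelihood (16), and $\Gamma(a, y) = \int_{y}^{\infty} t^{a-1}e^{-t}\,dt$ is the upper incomplete gamma function. The posterior density is a left-truncated gamma distribution.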

(ii) Jeffreys’ Prior. Another noninformative prior was suggested by Jeffreys [24] and is frequently used in situations where one does not have much information about the parameters. It is defined as the distribution of the parameters proportional to the square root of the determinant of the Fisher information matrix, that is, $g(\alpha) \propto \sqrt{I(\alpha)}$, where $I(\alpha)$ is Fisher’s information of the given distribution. In the case of the Pareto distribution, $I(\alpha) \propto 1/\alpha^{2}$, so that
$$g_2(\alpha) \propto \frac{1}{\alpha}, \quad \alpha > \alpha_0, \tag{19}$$
where $\alpha_0$ is the known lower bound assumed for α. A motivation for Jeffreys’ prior is that Fisher’s information is an indicator of the amount of information brought by the model (observations) about α.

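The posterior density is obtained as
$$\pi_2(\alpha \mid x) = \frac{T^{\,r}\alpha^{r-1} e^{-\alpha T}}{\Gamma(r, \alpha_0 T)}, \quad \alpha > \alpha_0, \tag{20}$$
which is a left-truncated gamma distribution. Here $r$ is the number of exactly observed incomes, $T$ is the statistic in the exponent of the likelihood (16), and $\Gamma(\cdot, \cdot)$ is the upper incomplete gamma function.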

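Note: Extension of Jeffreys’ Prior. Jeffreys’ prior is a particular case of the extension of Jeffreys’ prior proposed by Al-Kutubi and Ibrahim [25], defined as
$$g(\alpha) \propto \left[I(\alpha)\right]^{c}, \tag{21}$$
where $I(\alpha)$ is Fisher’s information and $c$ is a positive constant. For $c = 1/2$, it reduces to Jeffreys’ prior.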

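In the case of the Pareto distribution, this prior is
$$g_3(\alpha) \propto \frac{1}{\alpha^{2c}}, \quad \alpha > \alpha_0. \tag{22}$$
The posterior distribution using the extension of Jeffreys’ prior is obtained as
$$\pi_3(\alpha \mid x) = \frac{T^{\,r-2c+1}\alpha^{r-2c} e^{-\alpha T}}{\Gamma(r-2c+1, \alpha_0 T)}, \quad \alpha > \alpha_0, \tag{23}$$
where $r$ is the number of exactly observed incomes, $T$ is the statistic in the exponent of the likelihood (16), $\alpha_0$ is the known lower bound assumed for α, and $\Gamma(\cdot, \cdot)$ is the upper incomplete gamma function.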

(iii) Conjugate Prior. The conjugate prior was introduced by Raiffa and Schlaifer [26]: the prior and posterior distributions are from the same family, that is, the posterior density has the same distributional form as the prior distribution. For the Gini index and the mean income of the Pareto distribution to exist, we must use a truncated prior distribution, since the random variable α is defined on $(\alpha_0, \infty)$, where the constant $\alpha_0$ is assumed to be known.

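Let α have the truncated Erlang distribution [22]
$$g_4(\alpha) = \frac{q^{p}\alpha^{p-1} e^{-q\alpha}}{\Gamma(p, q\alpha_0)}, \quad \alpha > \alpha_0, \tag{24}$$
where $p$ and $q$ are the hyperparameters, $\alpha_0$ is the known lower bound assumed for α, and $\Gamma(\cdot, \cdot)$ is the upper incomplete gamma function.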

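The posterior density for α is
$$\pi_4(\alpha \mid x) = \frac{(T+q)^{\,r+p}\alpha^{r+p-1} e^{-(T+q)\alpha}}{\Gamma\left(r+p, (T+q)\alpha_0\right)}, \quad \alpha > \alpha_0, \tag{25}$$
where $r$ is the number of exactly observed incomes, $T$ is the statistic in the exponent of the likelihood (16), and $p$ and $q$ are the hyperparameters of the truncated Erlang prior. The posterior density follows a truncated Erlang distribution with parameters $r + p$ and $T + q$.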
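All of the posteriors in this section share one left-truncated gamma form, which can be summarised by a shape and a rate. The sketch below assumes a likelihood kernel $\alpha^{r}e^{-\alpha T}$ and prior kernels $1$, $1/\alpha$, $\alpha^{-2c}$, and $\alpha^{p-1}e^{-q\alpha}$ on $\alpha > \alpha_0$; the function name, the prior labels, and the hyperparameter names `p` and `q` are hypothetical and are not taken from the paper.

```python
def posterior_shape_rate(prior: str, r: int, t: float,
                         c: float = 0.5, p: float = 1.0, q: float = 0.0):
    """Return (shape, rate) such that the posterior is proportional to
    alpha**(shape - 1) * exp(-rate * alpha) on alpha > alpha0.

    Assumed likelihood kernel: alpha**r * exp(-alpha * t).
    """
    if prior == "uniform":        # prior kernel: 1
        return r + 1.0, t
    if prior == "jeffreys":       # prior kernel: 1 / alpha
        return float(r), t
    if prior == "jeffreys_ext":   # prior kernel: alpha**(-2c)
        return r - 2.0 * c + 1.0, t
    if prior == "erlang":         # prior kernel: alpha**(p - 1) * exp(-q * alpha)
        return r + p, t + q
    raise ValueError(f"unknown prior: {prior}")


# Hypothetical data summary: r = 30 observed incomes, T = 12.
for name in ("uniform", "jeffreys", "jeffreys_ext", "erlang"):
    print(name, posterior_shape_rate(name, r=30, t=12.0, c=0.5, p=2.0, q=1.0))
```

Only the shape and rate change from prior to prior; the truncation point $\alpha_0$ enters through the normalising constant, an upper incomplete gamma function.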

3. Bayesian Estimation under Linear Exponential (LINEX) Loss Function Using Different Priors

3.1. Bayesian Estimators Using Uniform Prior

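The Bayesian estimator of α using the uniform prior (17) and the posterior density (18), under the assumption of the LINEX loss function (ref. (12)), is obtained from
$$E_{\alpha}\left[e^{-b\alpha}\right] = \int_{\alpha_0}^{\infty} e^{-b\alpha}\,\frac{T^{\,r+1}\alpha^{r}e^{-\alpha T}}{\Gamma(r+1, \alpha_0 T)}\,d\alpha = \left(\frac{T}{T+b}\right)^{r+1}\frac{\Gamma\left(r+1, \alpha_0 (T+b)\right)}{\Gamma(r+1, \alpha_0 T)}, \quad T + b > 0.$$
Therefore,
$$\hat{\alpha}_U = -\frac{1}{b}\ln\left[\left(\frac{T}{T+b}\right)^{r+1}\frac{\Gamma\left(r+1, \alpha_0 (T+b)\right)}{\Gamma(r+1, \alpha_0 T)}\right].$$
Here $r$ is the number of exactly observed incomes, $T$ is the statistic in the exponent of the likelihood (16), $\alpha_0$ is the known lower bound assumed for α, and $\Gamma(\cdot, \cdot)$ is the upper incomplete gamma function.

The Bayes estimator of the Gini index $G = 1/(2\alpha - 1)$ using the uniform prior is
$$\hat{G}_U = -\frac{1}{b}\ln E_{\alpha}\left[e^{-bG}\right], \quad E_{\alpha}\left[e^{-bG}\right] = \int_{\alpha_0}^{\infty}\exp\left(-\frac{b}{2\alpha - 1}\right)\pi(\alpha \mid x)\,d\alpha,$$
where $\pi(\alpha \mid x)$ is the posterior density (18) and the integral is evaluated using formula 3.471 (page 368) of Gradshteyn and Ryzhik [27], in which $K_{\nu}(\cdot)$ denotes the modified Bessel function of the third kind.

The Bayes estimator of the mean income $\mu = \alpha m/(\alpha - 1)$ using the uniform prior is
$$\hat{\mu}_U = -\frac{1}{b}\ln E_{\alpha}\left[e^{-b\mu}\right], \quad E_{\alpha}\left[e^{-b\mu}\right] = \int_{\alpha_0}^{\infty}\exp\left(-\frac{b\,\alpha m}{\alpha - 1}\right)\pi(\alpha \mid x)\,d\alpha,$$
again evaluated using formula 3.471 (page 368) of Gradshteyn and Ryzhik [27].

The Bayes estimator of the poverty measure $P_0 = 1 - (m/z)^{\alpha}$ using the uniform prior is
$$\hat{P}_{0U} = -\frac{1}{b}\ln E_{\alpha}\left[e^{-bP_0}\right], \quad E_{\alpha}\left[e^{-bP_0}\right] = \int_{\alpha_0}^{\infty}\exp\left\{-b\left[1 - \left(\frac{m}{z}\right)^{\alpha}\right]\right\}\pi(\alpha \mid x)\,d\alpha.$$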

3.2. Bayesian Estimators Using Jeffreys’ Prior

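In the case of Jeffreys’ prior (19), using the posterior density (20), the Bayesian estimators of the shape parameter α, the Gini index $G$, the mean income μ, and the poverty measure $P_0$ under the assumption of the LINEX loss function are obtained from (12) in the same manner, with the posterior expectation taken with respect to (20).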

Note. The corresponding expressions under the extension of Jeffreys’ prior are obtained from those for Jeffreys’ prior by replacing the posterior density (20) with (23).

3.3. Bayesian Estimators Using Conjugate Prior

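Using the posterior density (25), the Bayes estimators of the shape parameter α, the Gini index $G$, the mean income μ, and the poverty measure $P_0$ under the assumption of the LINEX loss function are obtained from (12) in the same manner, with the posterior expectation taken with respect to (25).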

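Note: Case of Complete Sample. The Bayesian estimators for a complete sample can be obtained, for the noninformative priors and the conjugate prior alike, by simply substituting $r = n$ in the above estimators, where $r$ is the number of exactly observed incomes and $n$ is the sample size.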
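When closed forms are inconvenient, the LINEX estimators of this section can be approximated numerically. The sketch below is not the paper's derivation: it assumes a posterior proportional to $\alpha^{\text{shape}-1}e^{-\text{rate}\,\alpha}$ on $\alpha > \alpha_0$ and replaces the Bessel-function formulas by Simpson's rule. All names and input values (`r`, `t`, `alpha0`, `m`, `z`, `b`) are hypothetical.

```python
import math


def posterior_expectation(g, shape, rate, alpha0, n_steps=4000):
    """E[g(alpha)] under the density proportional to
    alpha**(shape - 1) * exp(-rate * alpha) on alpha > alpha0 (Simpson's rule).
    n_steps must be even; the far tail beyond `upper` is neglected."""
    upper = max(alpha0, shape / rate) + 20.0 * math.sqrt(shape) / rate
    h = (upper - alpha0) / n_steps
    mode = max(alpha0, (shape - 1.0) / rate)
    log_peak = (shape - 1.0) * math.log(mode) - rate * mode  # for numerical stability
    num = den = 0.0
    for i in range(n_steps + 1):
        a = alpha0 + i * h
        weight = 1.0 if i in (0, n_steps) else (4.0 if i % 2 else 2.0)
        kernel = math.exp((shape - 1.0) * math.log(a) - rate * a - log_peak)
        num += weight * kernel * g(a)
        den += weight * kernel
    return num / den


def linex_estimate(g, b, shape, rate, alpha0):
    """Bayes estimate of g(alpha) under LINEX loss: -(1/b) * ln E[exp(-b * g(alpha))]."""
    expectation = posterior_expectation(lambda a: math.exp(-b * g(a)), shape, rate, alpha0)
    return -math.log(expectation) / b


# Hypothetical inputs: r = 30 observed incomes, T = 12, uniform prior (shape r + 1).
r, t = 30, 12.0
alpha0, m, z, b = 1.2, 1.0, 2.0, 1.0
shape, rate = r + 1.0, t

alpha_hat = linex_estimate(lambda a: a, b, shape, rate, alpha0)
gini_hat = linex_estimate(lambda a: 1.0 / (2.0 * a - 1.0), b, shape, rate, alpha0)
mean_hat = linex_estimate(lambda a: a * m / (a - 1.0), b, shape, rate, alpha0)
p0_hat = linex_estimate(lambda a: 1.0 - (m / z) ** a, b, shape, rate, alpha0)
print(alpha_hat, gini_hat, mean_hat, p0_hat)
```

Swapping in the shape and rate of another posterior gives the estimators for Jeffreys' prior or the conjugate prior without changing the rest of the code. The truncation point must exceed 1 here so that the mean income is finite over the whole support.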

4. Simulation Study

In order to assess the statistical performance of these estimators of the shape parameter, Gini index, mean income, and poverty measure under the LINEX loss function, a simulation study is conducted. The estimated losses are computed using generated random samples of different sizes from the Pareto distribution. These estimated losses are computed for sample sizes (20) 100, (1) 4.5, , , and . The value of the poverty line $z$ is taken from the poverty line given by the Government of India in 2009-10 for urban people. For the conjugate prior, the values of hyperparameter are taken as , ; , and . The estimated losses of the shape parameter α, the Gini index $G$, the mean income μ, and the poverty measure $P_0$ with the LINEX loss function, using the noninformative (uniform and Jeffreys’) priors and the conjugate prior, are tabulated in Tables 1, 2, 3, and 4, respectively.
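The structure of such a simulation can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the settings (α = 2, minimum income 1, censoring income 5, b = 1, 2000 replications) are hypothetical, and the estimator used is a simplified LINEX estimate of α for a gamma posterior with shape $r$ and rate $T$, that is, Jeffreys' prior with the truncation at $\alpha_0$ ignored.

```python
import math
import random


def simulate_censored_sample(alpha, m, w, n, rng):
    """Draw n Pareto(alpha, m) incomes and return (r, T) for right censoring at w."""
    r, t = 0, 0.0
    for _ in range(n):
        x = m * rng.paretovariate(alpha)  # Pareto income with minimum m
        if x <= w:
            r += 1
            t += math.log(x / m)
        else:
            t += math.log(w / m)          # censored: only x > w is known
    return r, t


def jeffreys_untruncated(r, t, b):
    """-(1/b) * ln E[exp(-b*alpha)] for a gamma(shape=r, rate=t) posterior.
    Simplifying assumption: the truncation at alpha0 is ignored."""
    return (r / b) * math.log1p(b / t)


def estimated_linex_loss(estimator, alpha, m, w, n, b, reps, seed=0):
    """Average LINEX loss of estimator(r, T, b) over simulated censored samples."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        r, t = simulate_censored_sample(alpha, m, w, n, rng)
        delta = estimator(r, t, b) - alpha
        total += math.expm1(b * delta) - b * delta
    return total / reps


for n in (20, 100):
    print(n, estimated_linex_loss(jeffreys_untruncated, 2.0, 1.0, 5.0, n, 1.0, 2000))
```

Passing a different `estimator` callable allows other priors or the squared error Bayes estimate to be compared on the same simulated samples.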

It is observed from the above simulation study (ref. Tables 1, 2, 3, and 4) that

(i) Bayesian estimators with the conjugate prior (hyperparameter , ) perform better than those with the noninformative priors, as they have smaller estimated loss for the shape parameter α, the Gini index $G$, the mean income μ, and the poverty measure $P_0$;

(ii) among the noninformative priors, Jeffreys’ prior has less estimated loss than the uniform prior, which implies that Bayesian methods with Jeffreys’ prior are better;

(iii) a change in the value of on higher side does result in an increase in the loss; the loss remains unaffected by the change in the value of .

In Table 5, a simulation study is undertaken to find the estimated loss for the shape parameter α, the Gini index $G$, the mean income μ, and the poverty measure $P_0$ under the assumption of SELF using different priors, considering small as well as large samples, for comparison with the LINEX loss function.

From Table 5 and its comparison with LINEX loss function (ref. Tables 1, 2, 3, and 4), it is observed that LINEX loss function gives smaller loss in comparison with SELF for noninformative priors and conjugate prior for small as well as large sample sizes. When sample size increases estimated loss decreases in all cases.

4.1. Choice of Hyperparameters

Sinha and Howlader [28] suggested that a Bayes estimate is robust with respect to its hyperparameters if it leads to a high index of the estimate for varying values of those hyperparameters. To check this, simulations are done by taking different values of hyperparameter and keeping and fixed (ref. Tables 6 and 7).

The ratio in case of both Gini index and Poverty measure is close to 1 for different combinations of and indicating thereby the Bayes estimates are robust with respect to hyperparameters, which justifies the use of hyperparameters in simulation study.

5. Conclusion

The simulation study carried out in Section 4 suggests that Bayesian estimators using the conjugate prior (hyperparameter , ) perform better, in general, than those using the two noninformative priors (uniform prior and Jeffreys’ prior). It is also observed that the LINEX loss function results in smaller loss than the SELF for both small and large samples, irrespective of the prior chosen for the Bayesian estimators. Hence, the combination of the conjugate prior and the LINEX loss function results in smaller loss than the other two priors and the squared error loss function. One can further infer that the estimated loss decreases as the sample size increases in all cases.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors are thankful to the anonymous referees and the editor for their valuable suggestions and comments.

References

[1] V. Pareto, Cours d’Économie Politique, Rouge and Cie, Paris, 1897.
[2] C. Gini, Variability and Mutability, C. Cuppini, Bologna, Italy, 1912.
[3] J. Foster, J. Greer, and E. Thorbecke, “A class of decomposable poverty measures,” Econometrica, vol. 52, no. 3, pp. 761–766, 1984.
[4] B. C. Arnold and S. J. Press, “Bayesian inference for Pareto populations,” Journal of Econometrics, vol. 21, no. 3, pp. 287–306, 1983.
[5] T. S. Moothathu, “Sampling distributions of Lorenz curve and Gini index of the Pareto distribution,” Sankhya (Statistics), Series B, vol. 47, no. 2, pp. 247–258, 1985.
[6] P. K. Sen, “The harmonic Gini coefficient and affluence indexes,” Mathematical Social Sciences, vol. 16, no. 1, pp. 65–76, 1988.
[7] P. M. Dixon, J. Weiner, T. Mitchell-Olds, and R. Woodley, “Bootstrapping the Gini coefficient of inequality,” Ecology, vol. 68, no. 5, pp. 1548–1561, 1987.
[8] P. Bansal, S. Arora, and K. K. Mahajan, “Testing homogeneity of Gini indices against simple-ordered alternative,” Communications in Statistics: Simulation and Computation, vol. 40, no. 2, pp. 185–198, 2011.
[9] E. I. Abdul-Sathar, E. S. Jeevanand, and K. R. M. Nair, “Bayes estimation of Lorenz curve and Gini-index for classical Pareto distribution in some real data situation,” Journal of Applied Statistical Science, vol. 17, no. 2, pp. 315–329, 2009.
[10] S. K. Bhattacharya, A. Chaturvedi, and N. K. Singh, “Bayesian estimation for the Pareto income distribution,” Statistical Papers, vol. 40, no. 3, pp. 247–262, 1999.
[11] R. Kass and L. Wasserman, “The selection of prior distributions by formal rules,” Journal of the American Statistical Association, vol. 91, no. 431, pp. 1343–1370, 1996.
[12] J. Berger, “The case for objective Bayesian analysis,” Bayesian Analysis, vol. 1, no. 3, pp. 385–402, 2006.
[13] J. Aitchison and I. R. Dunsmore, Statistical Prediction Analysis, Cambridge University Press, London, UK, 1975.
[14] J. O. Berger, Statistical Decision Theory: Foundations, Concepts and Methods, Springer, New York, NY, USA, 1980.
[15] R. V. Canfield, “A Bayesian approach to reliability estimation using a loss function,” IEEE Transactions on Reliability, vol. R-19, no. 1, pp. 13–16, 1970.
[16] H. R. Varian, “A Bayesian approach to real estate assessment,” in Studies in Bayesian Econometrics and Statistics in Honor of Leonard J. Savage, S. E. Fienberg and A. Zellner, Eds., pp. 195–208, North-Holland, Amsterdam, The Netherlands, 1975.
[17] A. Zellner, “Bayesian estimation and prediction using asymmetric loss functions,” Journal of the American Statistical Association, vol. 81, no. 394, pp. 446–451, 1986.
[18] R. Calabria and G. Pulcini, “An engineering approach to Bayes estimation for the Weibull distribution,” Microelectronics Reliability, vol. 34, no. 5, pp. 789–802, 1994.
[19] A. P. Basu and N. Ebrahimi, “Bayesian approach to life testing and reliability estimation using asymmetric loss function,” Journal of Statistical Planning and Inference, vol. 29, no. 1-2, pp. 21–31, 1991.
[20] W. M. Afify, “On estimation of the exponentiated Pareto distribution under different sample schemes,” Applied Mathematical Sciences, vol. 4, no. 8, pp. 393–402, 2010.
[21] A. C. Cohen, “Maximum likelihood estimation in the Weibull distribution based on complete and on censored samples,” Technometrics, vol. 7, pp. 579–588, 1965.
[22] A. Ganguly, N. K. Singh, H. Choudhuri, and S. K. Bhattacharya, “Bayesian estimation of the Gini index for the PID,” Test, vol. 1, no. 1, pp. 93–104, 1992.
[23] P. S. Laplace, Théorie Analytique des Probabilités, Veuve Courcier, Paris, France, 1812.
[24] H. Jeffreys, “An invariant form for the prior probability in estimation problems,” Proceedings of the Royal Society. London, Series A: Mathematical, Physical and Engineering Sciences, vol. 186, pp. 453–461, 1946.
[25] H. S. Al-Kutubi and N. A. Ibrahim, “Bayes estimator for exponential distribution with extension of Jeffery prior information,” Malaysian Journal of Mathematical Sciences, vol. 3, no. 2, pp. 297–313, 2009.
[26] H. Raiffa and R. Schlaifer, Applied Statistical Decision Theory, Division of Research, Graduate School of Business Administration, Harvard University, 1961.
[27] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series, and Products, Academic Press, USA, 7th edition, 2007.
[28] S. K. Sinha and H. A. Howlader, “On the sampling distributions of Bayesian estimators of the Pareto Parameter with proper and improper priors and associated goodness of fit,” Tech. Rep. #103, Department of Statistics, University of Manitoba, Winnipeg, Canada, 1980.

Copyright

Copyright © 2015 Kamaljit Kaur et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
