Bayesian Estimation of Gumbel Type-II Distribution under Type-II Censoring with Medical Applications

Abbas, Kamran; Hussain, Zamir; Rashid, Noreen; Ali, Amjad; Taj, Muhammad; Khan, Sajjad Ahmad; Manzoor, Sadaf; Khalil, Umair; Khan, Dost Muhammad

doi:https://doi.org/10.1155/2020/1876073

Computational and Mathematical Methods in Medicine


Special Issue

Computational Intelligence Methods for Brain-Machine Interfacing or Brain-Computer Interfacing


Research Article | Open Access

Volume 2020 | Article ID 1876073 | https://doi.org/10.1155/2020/1876073


Guest Editor: Pritee Khanna

Received 11 Dec 2019

Revised 09 Feb 2020

Accepted 10 Feb 2020

Published 26 Mar 2020

Abstract

The time to event or survival time usually follows certain skewed probability distributions. These distributions play a vital role in the Bayesian framework for analyzing and projecting the maximum life expectancy in order to inform decision-making. The Bayesian method provides a flexible framework for monitoring randomized clinical trials and for updating what is already known, using prior information about specific phenomena under uncertainty. Additionally, medical practitioners can use the Bayesian estimators to measure the probability of time until tumor recurrence, time until cardiovascular death, and time until AIDS for HIV patients by considering the prior information. However, in clinical trials and medical studies, censoring is present when the exact time of an event is not known. The present study aims to estimate the parameters of the Gumbel type-II distribution based on type-II censored data using the Bayesian framework. The Bayesian estimators cannot be obtained in explicit form, and therefore we use Lindley’s approximation based on a noninformative prior and various loss functions, namely, the squared error loss function, the general entropy loss function, and the LINEX (linear exponential) loss function. The maximum likelihood and Bayesian estimators are compared in terms of mean squared error in a simulation study. Furthermore, two data sets, on remission times (in months) of bladder cancer patients and on survival times (in weeks) of 61 patients with inoperable adenocarcinoma of the lung, are analyzed for illustration purposes.

1. Introduction

In medical research, data on the time until the occurrence of a particular event, such as the death of a patient, are frequently encountered. Such data are referred to as survival time data and generally have a right-skewed distribution, for which the Gumbel type-II distribution can be used. It was introduced by the German mathematician Gumbel [1] and is useful for modeling “extreme values” such as floods, earthquakes, and other natural disasters; it is also used in life expectancy tables, hydrology, and rainfall studies. The probability density function (PDF) of the Gumbel type-II distribution is

$$f(x; \alpha, \beta) = \alpha \beta x^{-(\alpha + 1)} \exp\left(-\beta x^{-\alpha}\right), \quad x > 0,\ \alpha, \beta > 0, \tag{1}$$

where “α” is the shape and “β” is the scale parameter of the distribution. The corresponding cumulative distribution function (CDF) is

$$F(x; \alpha, \beta) = \exp\left(-\beta x^{-\alpha}\right), \quad x > 0. \tag{2}$$
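As a minimal sketch, the density, distribution function, and quantile function can be coded directly, assuming the standard Gumbel type-II form f(x) = αβx^(−(α+1)) exp(−βx^(−α)). The function names are illustrative and do not come from the paper.

```python
import math

def gumbel2_pdf(x, alpha, beta):
    """Density alpha*beta*x^-(alpha+1)*exp(-beta*x^-alpha) for x > 0, else 0."""
    if x <= 0:
        return 0.0
    return alpha * beta * x ** (-(alpha + 1)) * math.exp(-beta * x ** (-alpha))

def gumbel2_cdf(x, alpha, beta):
    """Distribution function exp(-beta * x^-alpha) for x > 0, else 0."""
    if x <= 0:
        return 0.0
    return math.exp(-beta * x ** (-alpha))

def gumbel2_quantile(u, alpha, beta):
    """Inverse CDF: solves exp(-beta * x^-alpha) = u for x, with 0 < u < 1."""
    return (-math.log(u) / beta) ** (-1.0 / alpha)

if __name__ == "__main__":
    alpha, beta = 2.0, 1.5          # illustrative parameter values
    median = gumbel2_quantile(0.5, alpha, beta)
    print(median, gumbel2_cdf(median, alpha, beta))
```

The quantile function is the piece that a simulation study needs, since it turns uniform random numbers into Gumbel type-II variates.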

A common feature of lifetime data is that some data points may be censored. In many reliability and life-testing studies, experiments are terminated before the failure times of all items are observed. Therefore, complete information on the failure times of all items cannot be obtained. During experimentation, such situations arise when items are lost or removed before they fail. Such experiments are generally planned in advance so as to save the time and cost of testing. Data obtained from such experiments are called censored. Type-I and type-II censoring are two well-known censoring schemes. In the type-II censoring scheme, the number of failed units is fixed in advance. For example, the investigator may decide to terminate the study after four of the six rats have developed tumors. There is an extensive literature on the estimation of distribution parameters under type-II censoring. For example, Abbas and Tang [2] considered ML and least squares estimators of the Frechet distribution using type-II censored samples. Okasha [3] estimated the unknown parameters, reliability, and hazard functions of the Lomax distribution under type-II censoring using Bayesian and E-Bayesian estimation. Abu-Zinadah [4] studied the exponentiated Gompertz distribution based on type-II censored and complete data. El-Sagheer [5] studied the generalized Pareto distribution under different censoring schemes.
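The type-II scheme described above can be sketched in a few lines: the experiment stops at the r-th failure, so only the r smallest lifetimes are observed and the remaining n − r are known only to exceed the largest observed value. The helper name and the lifetimes below are hypothetical and serve only to mirror the rat example.

```python
def type2_censor(lifetimes, r):
    """Return the r smallest lifetimes (sorted) and the number of censored units."""
    n = len(lifetimes)
    if not 1 <= r <= n:
        raise ValueError("r must satisfy 1 <= r <= n")
    ordered = sorted(lifetimes)
    return ordered[:r], n - r

# Hypothetical tumour-onset times for six rats; the study stops at the 4th event.
times = [14.2, 9.1, 30.5, 11.7, 22.3, 18.0]
observed, n_censored = type2_censor(times, r=4)
print(observed, n_censored)
```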

Recently, many authors have worked on the Gumbel type-II distribution and on Bayesian estimation under different loss functions. Abbas et al. [6] obtained Bayes estimators of the Gumbel type-II distribution under different loss functions. Feroze and Aslam [7] obtained Bayes estimators for a two-component mixture of Gumbel type-II distributions. Malinowska and Szynal [8] derived Bayes estimators for the Gumbel type-II distribution based on kth lower record values. Sultana et al. [9] studied a three-component mixture of Gumbel type-II distributions using Bayesian estimation under informative and noninformative priors. Moreover, Metiri et al. [10] worked on the properties of the Lindley distribution. Reyad and Ahmed [11] derived Bayes estimates under the LINEX loss function using informative and noninformative priors. Preda et al. [12] developed Bayes estimators of the modified Weibull distribution under the squared error loss function (SELF) and the LINEX loss function.

However, Bayesian estimation of the Gumbel type-II distribution based on type-II censoring is not frequently discussed; therefore, we are interested in estimating the unknown parameters of the Gumbel type-II distribution from type-II censored data. The rest of the paper is arranged as follows: in Section 2, maximum likelihood estimators (MLEs) for the parameters are obtained. In Section 3, Bayesian estimators based on different loss functions are derived under noninformative and gamma priors. The proposed estimators are compared in terms of their mean squared error (MSE) in Section 4. Section 5 illustrates the application of the proposed estimators with two examples, namely, a data set of remission times for bladder cancer and a data set of survival times for inoperable adenocarcinoma of the lung. Finally, conclusions and recommendations are presented in Section 6.

2. Maximum Likelihood Estimation

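Suppose that X₁ < X₂ < … < X_r is a type-II censored sample of size “r” obtained from a life test on “n” items whose lifetimes have the Gumbel type-II distribution with parameters “α” and “β.” With f and F denoting the Gumbel type-II PDF and CDF, the likelihood function of “r” failures and (n − r) censored values may be written as

$$L(\alpha, \beta \mid x) = \frac{n!}{(n-r)!} \prod_{i=1}^{r} f(x_i; \alpha, \beta)\, \left[1 - F(x_r; \alpha, \beta)\right]^{n-r}, \tag{3}$$

that is,

$$L(\alpha, \beta \mid x) = \frac{n!}{(n-r)!}\, \alpha^{r} \beta^{r} \prod_{i=1}^{r} x_i^{-(\alpha+1)} \exp\left(-\beta \sum_{i=1}^{r} x_i^{-\alpha}\right) \left[1 - \exp\left(-\beta x_r^{-\alpha}\right)\right]^{n-r}. \tag{4}$$

It is more convenient to work with the log-likelihood. The log-likelihood function is

$$\ell(\alpha, \beta) = \ln\frac{n!}{(n-r)!} + r \ln\alpha + r \ln\beta - (\alpha+1) \sum_{i=1}^{r} \ln x_i - \beta \sum_{i=1}^{r} x_i^{-\alpha} + (n-r) \ln\left[1 - \exp\left(-\beta x_r^{-\alpha}\right)\right]. \tag{5}$$

To get the ML estimators of α and β, differentiate equation (5) with respect to α and β and set the derivatives to zero. The resulting equations are

$$\frac{\partial \ell}{\partial \alpha} = \frac{r}{\alpha} - \sum_{i=1}^{r} \ln x_i + \beta \sum_{i=1}^{r} x_i^{-\alpha} \ln x_i - \frac{(n-r)\, \beta\, x_r^{-\alpha} \ln x_r\, \exp\left(-\beta x_r^{-\alpha}\right)}{1 - \exp\left(-\beta x_r^{-\alpha}\right)} = 0, \tag{6}$$

$$\frac{\partial \ell}{\partial \beta} = \frac{r}{\beta} - \sum_{i=1}^{r} x_i^{-\alpha} + \frac{(n-r)\, x_r^{-\alpha} \exp\left(-\beta x_r^{-\alpha}\right)}{1 - \exp\left(-\beta x_r^{-\alpha}\right)} = 0. \tag{7}$$

Equations (6) and (7) cannot be solved in closed form. Therefore, we use the Laplace approximation to get the point estimates of α and β.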
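The censored log-likelihood and its two score functions can be evaluated numerically as sketched below. This is an illustrative sketch that assumes the standard Gumbel type-II density αβx^(−(α+1)) exp(−βx^(−α)); the function names and the small data vector are hypothetical. A numerical root-finder or optimizer would then be applied to these functions to obtain the estimates.

```python
import math

def loglik(alpha, beta, x, n):
    """Type-II censored log-likelihood; x holds the r smallest values, sorted."""
    r = len(x)
    w = beta * x[-1] ** (-alpha)                      # beta * x_r^(-alpha)
    ll = (math.lgamma(n + 1) - math.lgamma(n - r + 1)  # ln(n! / (n - r)!)
          + r * math.log(alpha) + r * math.log(beta)
          - (alpha + 1) * sum(math.log(xi) for xi in x)
          - beta * sum(xi ** (-alpha) for xi in x))
    if n > r:
        ll += (n - r) * math.log(-math.expm1(-w))     # ln(1 - exp(-w))
    return ll

def score(alpha, beta, x, n):
    """Partial derivatives of loglik with respect to alpha and beta."""
    r = len(x)
    xr = x[-1]
    tr = xr ** (-alpha)
    # exp(-w) / (1 - exp(-w)) equals 1 / (exp(w) - 1)
    ratio = 1.0 / math.expm1(beta * tr) if n > r else 0.0
    d_alpha = (r / alpha - sum(math.log(xi) for xi in x)
               + beta * sum(xi ** (-alpha) * math.log(xi) for xi in x)
               - (n - r) * beta * tr * math.log(xr) * ratio)
    d_beta = r / beta - sum(xi ** (-alpha) for xi in x) + (n - r) * tr * ratio
    return d_alpha, d_beta

if __name__ == "__main__":
    x = [0.6, 0.8, 1.1, 1.5, 2.0]   # hypothetical r = 5 observed failures
    print(loglik(1.7, 1.2, x, n=8), score(1.7, 1.2, x, n=8))
```

Using `math.expm1` avoids loss of precision when β x_r^(−α) is small, which is the typical case when x_r is large.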

3. Bayesian Estimation

In Bayesian estimation, we consider different loss functions, namely, the squared error loss function (SELF) proposed by Legendre [13] and Gauss [14], the LINEX loss function (Varian [15]), and the general entropy loss function (GELF) introduced by Calabria and Pulcini [16]. As both parameters are unknown, independent priors can be used. Suppose that α and β have independent Gamma (a, b) and Gamma (c, d) priors, respectively, with a, b, c, d > 0.

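The joint prior distribution of the parameters is, by independence, the product of the two marginal priors,

$$\pi(\alpha, \beta) = \pi_1(\alpha)\, \pi_2(\beta). \tag{10}$$

Let K be the normalizing constant that makes the joint posterior Φ′(α, β | x) a proper PDF. Thus,

$$K^{-1} = \int_0^\infty \int_0^\infty L(\alpha, \beta \mid x)\, \pi(\alpha, \beta)\, d\alpha\, d\beta, \tag{11}$$

where L(α, β | x) is the likelihood function of Section 2.

Therefore, the joint posterior density, which is the same under every loss function, is

$$\Phi'(\alpha, \beta \mid x) = K\, L(\alpha, \beta \mid x)\, \pi(\alpha, \beta). \tag{12}$$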
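To make the posterior concrete, the gamma priors and the resulting posterior kernel can be written out as follows. This is a sketch under two assumptions that the text does not confirm: the gamma priors use the shape-rate parametrization, and the density is the standard Gumbel type-II form αβx^(−(α+1)) exp(−βx^(−α)).

```latex
\pi_1(\alpha) = \frac{b^{a}}{\Gamma(a)}\, \alpha^{a-1} e^{-b\alpha}, \qquad
\pi_2(\beta) = \frac{d^{c}}{\Gamma(c)}\, \beta^{c-1} e^{-d\beta}, \qquad \alpha, \beta > 0,

\Phi'(\alpha, \beta \mid x) \;\propto\;
\alpha^{r+a-1}\, \beta^{r+c-1}\, e^{-b\alpha - d\beta}
\prod_{i=1}^{r} x_i^{-(\alpha+1)}
\exp\!\Big(-\beta \sum_{i=1}^{r} x_i^{-\alpha}\Big)
\Big[1 - \exp\!\big(-\beta x_r^{-\alpha}\big)\Big]^{n-r}.
```

The censoring factor in square brackets is what prevents the normalizing constant from being evaluated in closed form.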

Posterior distribution (12) takes a ratio form that cannot be reduced to a closed form. Therefore, we use the Lindley approximation [17], given by equation (13), to get the Bayesian estimates; among the quantities entering it are C_ij = l_iS_ii + l_jS_ji, i, j = 1, 2. The details of equation (13) are provided in the Appendix. Under SELF, the Bayes estimator is the posterior mean, so the approximate Bayesian estimators of “α” and “β” are obtained by applying equation (13) to α and β themselves.
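For reference, the general two-parameter form of Lindley's approximation, as it is usually stated, is sketched below. The notation is generic and is an assumption of this sketch, so it may differ from the notation of equation (13): θ = (θ₁, θ₂) = (α, β), u is the function whose posterior expectation is wanted, ρ is the log of the joint prior, ℓ is the log-likelihood, subscripts denote partial derivatives, and σ_ij is the (i, j) element of the inverse of the negative Hessian of ℓ. Everything on the right-hand side is evaluated at the ML estimates.

```latex
E\left[u(\theta) \mid x\right] \;\approx\;
u
+ \frac{1}{2} \sum_{i=1}^{2} \sum_{j=1}^{2} \left(u_{ij} + 2\, u_i\, \rho_j\right) \sigma_{ij}
+ \frac{1}{2} \sum_{i=1}^{2} \sum_{j=1}^{2} \sum_{k=1}^{2} \sum_{l=1}^{2}
  \ell_{ijk}\, \sigma_{ij}\, \sigma_{kl}\, u_l .
```

With u(θ) = α, all second derivatives u_ij vanish and u₁ = 1, u₂ = 0, which is why the SELF estimator has the simplest expansion.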

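Similarly, the Bayesian estimators of “α” and “β” under the LINEX loss function are obtained by applying equation (13) to exp(−kα) and exp(−kβ), where k is the loss function parameter, and then taking −(1/k) times the logarithm of the result.

The Bayesian estimators of “α” and “β” under GELF are obtained by applying equation (13) to α^(−k) and β^(−k) and then raising the result to the power −1/k. In every case, the approximation is evaluated at α̂ and β̂, the ML estimators of α and β, which can be obtained from equations (6) and (7), respectively.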
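The three Bayes estimators can be illustrated without Lindley's approximation by replacing each posterior expectation with an average over posterior draws. This is a sketch and not the method used in the paper: the draws are a hypothetical input (for example from a sampler), the function names are illustrative, and the loss conventions are assumed to be the usual ones, with LINEX loss proportional to exp(kΔ) − kΔ − 1 for Δ = estimate − parameter and GELF proportional to (estimate/parameter)^k − k ln(estimate/parameter) − 1.

```python
import math

def bayes_self(draws):
    """SELF: the Bayes estimator is the posterior mean."""
    return sum(draws) / len(draws)

def bayes_linex(draws, k):
    """LINEX: -(1/k) * ln E[exp(-k * theta)], for k != 0."""
    m = sum(math.exp(-k * t) for t in draws) / len(draws)
    return -math.log(m) / k

def bayes_gelf(draws, k):
    """GELF: (E[theta^(-k)])^(-1/k), for k != 0 and positive draws."""
    m = sum(t ** (-k) for t in draws) / len(draws)
    return m ** (-1.0 / k)

if __name__ == "__main__":
    draws = [1.0, 2.0, 3.0]          # hypothetical posterior draws of a parameter
    print(bayes_self(draws), bayes_linex(draws, 1.0), bayes_gelf(draws, 1.0))
```

For k > 0, Jensen's inequality implies that both asymmetric-loss estimators are no larger than the posterior mean, which reflects the heavier penalty these losses place on overestimation.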

4. Simulation Study

The performance of the proposed Bayesian estimators is compared with that of their ML counterparts in terms of MSE for different sample sizes and different parameter values, using Monte Carlo simulation with prespecified percentages of failures, i.e., 40%, 60%, and 80%. The Monte Carlo simulation is conducted as follows:
(i) Fix the initial values of α and β and generate samples from the Gumbel type-II distribution using the inverse transformation technique, i.e., x = [−ln(U)/β]^(−1/α), where U ∼ uniform (0, 1).
(ii) Calculate the MLEs using the Laplace approximation, and then obtain the Bayesian estimates under noninformative priors via the Lindley approximation.
(iii) Replicate the process 5000 times for each sample size and calculate the averages of the estimates and the corresponding MSEs (within parentheses) for each method. The computations were carried out in R (version 3.6.1, i386) and took approximately half an hour.
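The simulation steps above can be sketched for the ML estimator alone as follows. This is not the authors' R code. It assumes the standard Gumbel type-II form, and the optimizer is a substitute chosen for simplicity: for fixed α the β score equation is solved by bisection, and the resulting profile log-likelihood is maximized over α by golden-section search, which assumes the profile is unimodal on the search interval. All names and default values are illustrative.

```python
import math
import random

def sample_gumbel2(n, alpha, beta, rng):
    """Inverse-transform sampling: x = (-ln(U) / beta) ** (-1 / alpha)."""
    out = []
    while len(out) < n:
        u = rng.random()
        if 0.0 < u < 1.0:                       # skip 0 to avoid log(0)
            out.append((-math.log(u) / beta) ** (-1.0 / alpha))
    return out

def beta_given_alpha(alpha, x, n):
    """Solve the beta score equation by bisection for a fixed alpha."""
    r = len(x)
    s = sum(xi ** (-alpha) for xi in x)
    tr = x[-1] ** (-alpha)
    def g(b):                                   # decreasing in b
        w = b * tr
        tail = (n - r) * tr / math.expm1(w) if (n > r and 0.0 < w < 700.0) else 0.0
        return r / b - s + tail
    lo, hi = r / s, n / s                       # g(lo) >= 0 >= g(hi)
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def profile_loglik(alpha, x, n):
    """Log-likelihood at (alpha, beta_hat(alpha)), dropping the constant term."""
    r = len(x)
    b = beta_given_alpha(alpha, x, n)
    ll = (r * math.log(alpha) + r * math.log(b)
          - (alpha + 1) * sum(math.log(xi) for xi in x)
          - b * sum(xi ** (-alpha) for xi in x))
    if n > r:
        ll += (n - r) * math.log(-math.expm1(-b * x[-1] ** (-alpha)))
    return ll

def mle(x, n, lo=0.05, hi=20.0, tol=1e-6):
    """Golden-section search for alpha, then beta from the score equation."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    fc, fd = profile_loglik(c, x, n), profile_loglik(d, x, n)
    while hi - lo > tol:
        if fc > fd:                             # maximum lies in [lo, d]
            hi, d, fd = d, c, fc
            c = hi - g * (hi - lo)
            fc = profile_loglik(c, x, n)
        else:                                   # maximum lies in [c, hi]
            lo, c, fc = c, d, fd
            d = lo + g * (hi - lo)
            fd = profile_loglik(d, x, n)
    alpha = 0.5 * (lo + hi)
    return alpha, beta_given_alpha(alpha, x, n)

def simulate(alpha, beta, n, frac, reps, seed=1):
    """Return ((mean, MSE) for alpha, (mean, MSE) for beta) over reps samples."""
    rng = random.Random(seed)
    r = int(round(frac * n))                    # number of observed failures
    est = [mle(sorted(sample_gumbel2(n, alpha, beta, rng))[:r], n)
           for _ in range(reps)]
    mean_a = sum(a for a, _ in est) / reps
    mean_b = sum(b for _, b in est) / reps
    mse_a = sum((a - alpha) ** 2 for a, _ in est) / reps
    mse_b = sum((b - beta) ** 2 for _, b in est) / reps
    return (mean_a, mse_a), (mean_b, mse_b)

if __name__ == "__main__":
    print(simulate(alpha=2.0, beta=1.5, n=50, frac=0.8, reps=200))
```

The number of replications here is far smaller than the 5000 used in the study, so the printed MSEs are only rough; increasing `reps` trades run time for precision.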

The results are reported in Tables 1–4 for comparison purposes. Tables 1 and 2 contain simulation results for the case where the loss function parameter is k = 1 and the hyperparameters are a = b = c = d = 2, whereas Tables 3 and 4 contain the results for the case where k = 1.5 and the hyperparameters are a = 1, b = 2, c = 2.25, and d = 1.5. The conclusions drawn from the simulation study regarding the behavior of the estimators are summarized as follows:
(i) In terms of MSE, the ML and Bayesian estimators become closer as the sample size increases.
(ii) For a fixed percentage of failures, the MSEs of all the estimators decrease as the sample size increases because, for large sample sizes, the prior has minimal effect on the posterior.
(iii) For fixed values of α and β, the MSEs of the ML and Bayesian estimators decrease when both the sample size and the percentage of failures increase.
(iv) When k = 1 and a = b = c = d = 2, the Bayesian estimators based on GELF and the LINEX loss function have smaller MSEs than the ML estimators. Therefore, the Bayes estimators are more stable than the ML estimators.
(v) Generally, the ML and Bayesian estimators are close in terms of MSE for large samples.

5. Data Analysis

In this section, we consider two examples for illustration purposes.

5.1. Example 1

The real data on remission times (in months) of a random sample of 128 bladder cancer patients, presented in Table 5, were reported by Lee and Wang [20]. For the 128 patients, different prespecified percentages of events, i.e., 40%, 50%, 60%, and 80%, are taken to represent patients whose treatment was terminated, and the remaining observations are treated as censored. Figure 1 confirms that the histogram is slightly skewed to the right and is leptokurtic. The ML and Bayesian estimates can also be visualized in Figure 1, in which the x-axis represents the remission times (in months) of bladder cancer patients and the y-axis represents the Gumbel type-II density function. Therefore, it is appropriate to select a positively skewed distribution for describing the behavior of the remission times of bladder cancer patients. Amongst the skewed distributions, the Gumbel type-II distribution is fitted, and the parameter estimates obtained by the ML and Bayesian methods are presented in Table 6 for comparison purposes. It is concluded that the Gumbel type-II distribution with the proposed estimators fits the data well. Therefore, the Bayesian estimators are recommended as a useful way to address uncertainty in censored medical data.

5.2. Example 2

The survival times, in weeks, of 61 patients with inoperable lung cancer treated with cyclophosphamide, considered by Lagakos and Williams [18] and by Lee and Wolfe [19], are presented in Table 7. There are 33 uncensored observations and 28 censored observations, the latter representing patients whose treatment was terminated because of a deteriorating condition. The point estimates of α and β obtained by all the methods are summarized in Table 8. Figure 2 shows the results of the different estimation methods and indicates that the Gumbel type-II distribution fits the data well; its x-axis represents the survival times in weeks of the 61 patients with inoperable adenocarcinoma of the lung, and its y-axis represents the Gumbel type-II density function.

6. Conclusion and Recommendations

In medical decision-making, Bayesian tools incorporate the state of uncertainty and provide a rational framework for studying such problems. Medical data are usually skewed to the right, and positively skewed distributions are therefore well suited to describing unimodal medical data. In this study, Bayesian estimators are developed for the Gumbel type-II distribution based on type-II censored data using the squared error, GELF, and LINEX loss functions via Lindley’s approximation. It is concluded that the ML and Bayesian estimators become closer as the sample size and the prespecified percentage of failures increase. Based on the outcomes of this study, we suggest that the work can be extended to other skewed distributions within the Bayesian framework, with other loss functions, using medical data.

Appendix

Observed Fisher Information Matrix

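The observed Fisher information matrix (FIM) is computed by taking the second partial derivatives of the log-likelihood (5) with respect to “α” and “β.” Therefore, the matrix may be defined as

$$I_{(\alpha,\beta)} = -\begin{pmatrix} \dfrac{\partial^2 \ell}{\partial \alpha^2} & \dfrac{\partial^2 \ell}{\partial \alpha\, \partial \beta} \\[2ex] \dfrac{\partial^2 \ell}{\partial \beta\, \partial \alpha} & \dfrac{\partial^2 \ell}{\partial \beta^2} \end{pmatrix}.$$

Writing $w = \beta x_r^{-\alpha}$ and $\psi(w) = e^{-w}\left(1 - e^{-w} - w\right) / \left(1 - e^{-w}\right)^2$, the components of the observed FIM follow from

$$\frac{\partial^2 \ell}{\partial \alpha^2} = -\frac{r}{\alpha^2} - \beta \sum_{i=1}^{r} x_i^{-\alpha} (\ln x_i)^2 + (n-r)\, \beta\, x_r^{-\alpha} (\ln x_r)^2\, \psi(w),$$

$$\frac{\partial^2 \ell}{\partial \alpha\, \partial \beta} = \sum_{i=1}^{r} x_i^{-\alpha} \ln x_i - (n-r)\, x_r^{-\alpha} \ln x_r\, \psi(w),$$

$$\frac{\partial^2 \ell}{\partial \beta^2} = -\frac{r}{\beta^2} - (n-r)\, x_r^{-2\alpha}\, \frac{e^{-w}}{\left(1 - e^{-w}\right)^2}.$$

The observed FIM is the symmetric 2 × 2 matrix formed from these components with their signs reversed.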

The inverse of I_(α,β) is the 2 × 2 matrix whose elements are denoted S_ij, i, j = 1, 2. The remaining terms of equation (13), some of which are indexed by i, j = 0, 1, 2, 3, include the following, for i, j = 1, 2:
A_ij = (l_iS_ii + l_jS_ij)S_ii,
C_ij = l_iS_ii + l_jS_ji.
In particular,
A₁₂ = (l₁S₁₁ + l₂S₁₂)S₁₁,
A₂₁ = (l₂S₂₂ + l₁S₂₁)S₂₂, which reduces to S₂₁S₂₂ when l₁ = 1 and l₂ = 0,
B₁₂ = 3S₁₁S₁₂,
C₁₂ = l₁S₁₁ + l₂S₂₁, which reduces to S₁₁ when l₁ = 1 and l₂ = 0,
C₂₁ = l₂S₂₂ + l₁S₁₂, which reduces to S₁₂ when l₁ = 1 and l₂ = 0.
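When the analytic second derivatives are inconvenient, the observed information matrix and its inverse can be approximated numerically. The sketch below uses central finite differences on any two-argument log-likelihood function. The function names and the step size are illustrative assumptions, and in practice the log-likelihood would be evaluated at the ML estimates.

```python
def observed_information(loglik, alpha, beta, h=1e-3):
    """Negative Hessian of loglik(alpha, beta) by central finite differences."""
    f0 = loglik(alpha, beta)
    d_aa = (loglik(alpha + h, beta) - 2.0 * f0 + loglik(alpha - h, beta)) / h ** 2
    d_bb = (loglik(alpha, beta + h) - 2.0 * f0 + loglik(alpha, beta - h)) / h ** 2
    d_ab = (loglik(alpha + h, beta + h) - loglik(alpha + h, beta - h)
            - loglik(alpha - h, beta + h) + loglik(alpha - h, beta - h)) / (4.0 * h ** 2)
    return [[-d_aa, -d_ab], [-d_ab, -d_bb]]

def invert_2x2(m):
    """Inverse of a 2 x 2 matrix given as nested lists."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    if det == 0.0:
        raise ZeroDivisionError("singular information matrix")
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

if __name__ == "__main__":
    # Toy concave function standing in for a log-likelihood (illustration only).
    toy = lambda a, b: -(2.0 * a * a + a * b + 3.0 * b * b)
    info = observed_information(toy, 1.0, 1.0)
    print(info, invert_2x2(info))
```

For the toy quadratic the exact information matrix is [[4, 1], [1, 6]], which gives a quick way to check the finite-difference step before applying the routine to a real log-likelihood.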

Data Availability

This work is mainly a methodological development and has been applied to secondary data; the data will be provided if required.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

References

[1] E. Gumbel, Statistics of Extremes, Columbia University Press, New York, USA, 1958.
[2] K. Abbas and Y. Tang, “Estimation of parameters for Frechet distribution based on type-II censored samples,” Caspian Journal of Applied Sciences Research, vol. 2, no. 7, pp. 36–43, 2013.
[3] H. M. Okasha, “E-Bayesian estimation for the Lomax distribution based on type-II censored data,” Journal of the Egyptian Mathematical Society, vol. 22, no. 3, pp. 489–495, 2014.
[4] H. H. Abu-Zinadah, “Bayesian estimation on the exponentiated Gompertz distribution under type II censoring,” International Journal of Contemporary Mathematical Sciences, vol. 9, no. 11, pp. 497–505, 2014.
[5] R. M. El-Sagheer, “Bayesian prediction based on general progressive censored data from generalized Pareto distribution,” Journal of Statistics Applications & Probability, vol. 5, no. 1, pp. 43–51, 2016.
[6] K. Abbas, J. Fu, and Y. Tang, “Bayesian estimation of Gumbel type-II distribution,” Data Science Journal, vol. 12, pp. 33–46, 2013.
[7] N. Feroze and M. Aslam, “Bayesian estimation of two-component mixture of Gumbel type II distribution under informative priors,” International Journal of Advanced Science and Technology, vol. 53, pp. 11–30, 2013.
[8] I. Malinowska and D. Szynal, “On characterization of certain distributions of kth lower (upper) record values,” Applied Mathematics and Computation, vol. 202, no. 1, pp. 338–347, 2008.
[9] T. Sultana, M. Aslam, and M. Raftab, “Bayesian estimation of 3-component mixture of Gumbel type-II distributions under non-informative and informative priors,” Journal of the National Science Foundation of Sri Lanka, vol. 45, no. 3, pp. 287–306, 2017.
[10] F. Metiri, H. Zeghdoudi, and M. R. Remita, “On Bayes estimates of Lindley distribution under LINEX loss function: informative and non informative priors,” Global Journal of Pure and Applied Mathematics, vol. 12, no. 1, pp. 391–400, 2016.
[11] H. Reyad and S. O. Ahmed, “E-Bayesian analysis of the Gumbel type-II distribution under type-II censored scheme,” International Journal of Advanced Mathematical Sciences, vol. 3, no. 2, pp. 108–120, 2015.
[12] V. Preda, E. Panaitescu, and A. Constantinescu, “Bayes estimators of modified-Weibull distribution parameters using Lindley's approximation,” WSEAS Transactions on Mathematics, vol. 9, no. 7, pp. 539–549, 2010.
[13] A. Legendre, New Method for the Determination of Orbits of Comets, Courcier, Paris, France, 1805.
[14] C. F. Gauss, Least Squares Method for the Combinations of Observation, (Translated by J. Bertrand 1955), Mallet-Bachelier, Paris, France, 1810.
[15] H. R. Varian, A Bayesian Approach to Real Estate Assessment, North Holland, Amsterdam, Netherlands, 1975.
[16] R. Calabria and G. Pulcini, “Point estimation under asymmetric loss functions for left-truncated exponential samples,” Communications in Statistics - Theory and Methods, vol. 25, no. 3, pp. 585–600, 1996.
[17] D. V. Lindley, “Approximate Bayesian methods,” Trabajos de Estadistica Y de Investigacion Operativa, vol. 31, no. 1, pp. 223–245, 1980.
[18] S. W. Lagakos and J. S. Williams, “Models for censored survival analysis: a cone class of variable-sum models,” Biometrika, vol. 65, pp. 181–189, 1978.
[19] S. Lee and R. A. Wolfe, “A simple test for independent censoring under the proportional hazards model,” Biometrics, vol. 54, pp. 1176–1182, 1998.
[20] E. T. Lee and J. W. Wang, Eds., Statistical Methods for Survival Data Analysis, John Wiley and Sons, New York, NY, USA, 3rd edition, 2003.

Copyright

Copyright © 2020 Kamran Abbas et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
