On the Maximum Likelihood Estimation of the Extreme Value Index Based on $k$-Record Values
In this paper, we study the existence and consistency of the maximum likelihood estimator of the extreme value index based on $k$-record values. Following the method used by Drees et al. (2004) and Zhou (2009), we prove that the likelihood equations, in terms of $k$-record values, eventually admit a strongly consistent solution without any restriction on the extreme value index, which is not the case in the aforementioned studies.
Let $X_1, X_2, \ldots$ be a sequence of independent and identically distributed (i.i.d.) random variables having a continuous distribution function $F$. For each $n \ge 1$, denote by $X_{1,n} \le \cdots \le X_{n,n}$ the order statistics of the $n$-sample $(X_1, \ldots, X_n)$. We first recall some basic notions of univariate extreme value theory. Assume that $F$ belongs to the max-domain of attraction of an extreme value distribution $G_\gamma$ with $\gamma \in \mathbb{R}$, denoted by $F \in D(G_\gamma)$, i.e., there exist sequences $a_n > 0$ and $b_n \in \mathbb{R}$ such that
$$\lim_{n \to \infty} F^n(a_n x + b_n) = G_\gamma(x) := \exp\{-(1 + \gamma x)^{-1/\gamma}\} \quad (1)$$
for all $x$ with $1 + \gamma x > 0$. The parameter $\gamma$ is called the extreme value index. The first-order condition (1) is equivalent to the existence of a positive auxiliary function $f$ such that
$$\lim_{t \to x^*} \frac{1 - F(t + x f(t))}{1 - F(t)} = (1 + \gamma x)^{-1/\gamma} \quad (2)$$
for all $x$ with $1 + \gamma x > 0$, where $x^* := \sup\{x : F(x) < 1\}$ is the right endpoint of $F$. For more details on the max-domain of attraction, see De Haan and Ferreira and references therein.
The estimation of the extreme value index $\gamma$ plays an important role in classical extreme value theory, and many estimators have been proposed in the literature, such as the Hill estimator, the Pickands estimator, and the moment estimator suggested by Dekkers et al. The books by Beirlant et al. and De Haan and Ferreira provide good reviews of this estimation problem.
Alternatively, condition (1) is equivalent to
$$\lim_{t \to x^*} P\!\left(\frac{X_1 - t}{\sigma(t)} \le x \;\middle|\; X_1 > t\right) = H_\gamma(x) := 1 - (1 + \gamma x)^{-1/\gamma} \quad (3)$$
for all $x > 0$ with $1 + \gamma x > 0$, where $\sigma$ is a positive function and $x^*$ is the right endpoint of $F$, i.e., $x^* = \sup\{x : F(x) < 1\}$. $H_\gamma$ is the so-called generalized Pareto distribution (GPD) function.
Based on (3), Smith constructed the maximum likelihood (ML) estimator of $(\gamma, \sigma)$ by solving two estimating equations; Drees et al. derived its asymptotic normality for $\gamma > -1/2$ when the threshold is chosen as an upper order statistic, while Zhou studied in detail its existence and consistency for $\gamma > -1$. On the other hand, the theory of record values is connected very closely to extreme value theory through, for example, Resnick's duality theorem (see Theorem 2.3.3) or the characterization of tail distributions. Only a few publications are devoted to the estimation of the extreme value index based on record values; see, for example, Berred, Khaled et al., and El Arrouchi and Imlahi. We investigate this problem in the present paper by proposing an alternative to the above ML estimation based on $k$-record values.
This paper is organized as follows. In Section 2, we give the likelihood equations based on $k$-record values. Section 3 is devoted to the existence and consistency of the solutions of these equations, whose proofs are given in Section 4.
2. Likelihood Equations Based on $k$-Record Values
Record values are of importance in many situations of real life as well as in many statistical applications involving data relating to natural phenomena, sports, economics, reliability, and life tests. Chandler  was the first to introduce the concept of record values, record times, and inter-record times in order to analyze weather data. We refer to Arnold et al.  and Nevzorov  and the references therein for a review of the general theory of records.
Let $k \ge 1$ be an integer. Define the sequences of $k$-record times $(T_n^{(k)})_{n \ge 1}$ and $k$-record values $(R_n^{(k)})_{n \ge 1}$ (see ) by
$$T_1^{(k)} = k, \qquad T_{n+1}^{(k)} = \min\{j > T_n^{(k)} : X_j > X_{j-k,\, j-1}\}, \qquad R_n^{(k)} = X_{T_n^{(k)} - k + 1,\, T_n^{(k)}}.$$
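As a concrete illustration of this definition, the following sketch extracts upper $k$-record values from a finite sample by tracking the $k$-th largest observation seen so far (the function name and interface are illustrative, not from the paper; a new $k$-record occurs each time an observation strictly exceeds the current $k$-th largest):

```python
import heapq

def k_record_values(xs, k):
    """Illustrative extraction of upper k-record values from a sample.

    A min-heap holds the k largest observations seen so far, so its root
    is the current k-th largest.  A new k-record occurs each time an
    observation strictly exceeds that root, and the new k-record value
    is the updated k-th largest.
    """
    if len(xs) < k:
        return []
    heap = list(xs[:k])
    heapq.heapify(heap)            # the k largest among the first k obs
    records = [heap[0]]            # R_1^(k): k-th largest of X_1..X_k
    for x in xs[k:]:
        if x > heap[0]:            # x exceeds the current k-th largest
            heapq.heapreplace(heap, x)
            records.append(heap[0])
    return records

print(k_record_values([3, 1, 4, 1, 5, 9, 2, 6], 1))  # [3, 4, 5, 9]
print(k_record_values([3, 1, 4, 1, 5, 9, 2, 6], 2))  # [1, 3, 4, 5, 6]
```

For $k = 1$ this reduces to the ordinary (upper) record values, i.e., the strictly increasing running maxima.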
Similar to the conditional approach used for order statistics, our likelihood equations may be derived by using the following lemma, which will be proved at the end of Section 4.
Lemma 1. For all integers $n > m \ge 1$, the conditional distribution of $(R_{m+1}^{(k)}, \ldots, R_n^{(k)})$, given $R_m^{(k)} = r$, is the same as the unconditional distribution of the $k$-record values arising from i.i.d. random variables with the left-truncated distribution
$$F_r(x) = \frac{F(x) - F(r)}{1 - F(r)}, \qquad x \ge r.$$
Let $(m_n)$ be an intermediate sequence of integers satisfying $m_n \to \infty$ and $m_n/n \to 0$ as $n \to \infty$.
From Lemma 1, the conditional distribution of $(R_{m_n+1}^{(k)}, \ldots, R_n^{(k)})$, given $R_{m_n}^{(k)}$, equals the unconditional distribution of the $k$-record values arising from i.i.d. random variables with distribution $F_{R_{m_n}^{(k)}}$ which, in view of (3), can be approximated by the generalized Pareto distribution (see ). Using this information, one can estimate the unknown parameters $\gamma$ and $\sigma$ by maximum likelihood; that is, given the $k$-record values $R_{m_n}^{(k)}, \ldots, R_n^{(k)}$, we maximize the likelihood function $L_n(\gamma, \sigma)$ with $\gamma \in \mathbb{R}$, $\sigma > 0$, and $1 + \gamma (R_i^{(k)} - R_{m_n}^{(k)})/\sigma > 0$ for $m_n \le i \le n$.
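For orientation only: in the classical threshold-excess setting (Smith; Zhou), the GPD log-likelihood underlying this kind of maximization can be sketched as below, with illustrative stand-ins $Y_i$ for the record excesses and $m$ for their number; the exact $k$-record likelihood involves additional factors coming from the joint density of records and is not reproduced here.

```latex
% Sketch: GPD log-likelihood for excesses Y_1, ..., Y_m (illustrative
% notation, e.g. Y_i = R_{m_n+i}^{(k)} - R_{m_n}^{(k)} and m = n - m_n):
\ell_m(\gamma, \sigma)
  = -\, m \log \sigma
    - \Bigl( \tfrac{1}{\gamma} + 1 \Bigr)
      \sum_{i=1}^{m} \log\!\Bigl( 1 + \frac{\gamma Y_i}{\sigma} \Bigr),
\qquad \sigma > 0, \quad 1 + \frac{\gamma Y_i}{\sigma} > 0 .
```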
Remark 1. Observe that if $m_n$ remains bounded, the maximum of $L_n$ may fail to exist when $n \to \infty$. However, this case will be disregarded since $m_n$ has been taken as a sequence tending to infinity.
The likelihood equations are then given in terms of the partial derivatives of $\log L_n$ with respect to $\gamma$ and $\sigma$. The maximum likelihood estimators of the extreme value index and the scale, $\hat{\gamma}_n$ and $\hat{\sigma}_n$, are obtained by solving the following likelihood equations:

The equations for $\gamma = 0$ are defined by continuity. If $\gamma \neq 0$, they can be simplified to

It follows that

Put

In view of (11), any root of (10) satisfies $\psi_n(\hat{\gamma}) = 0$. Conversely, if $\hat{\gamma}$ is a nonzero root of $\psi_n$, we obtain $\hat{\sigma}$ as the solution of (10). We can readily check that $\psi_n$ has a trivial root at $\gamma = 0$, which must be omitted even if, in reality, $\gamma = 0$.
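The reduction of the two likelihood equations to a single equation in $\gamma$ can be illustrated on the classical GPD fit: fixing $\theta = \gamma/\sigma$, the equations give $\gamma(\theta)$ as the mean of $\log(1 + \theta Y_i)$, leaving a one-dimensional profile to maximize. The following sketch implements that standard reduction on simulated excesses; it is not the paper's exact equations, and all names and the grid are illustrative.

```python
import math
import random

def gpd_profile_fit(ys, theta_grid):
    """Sketch: fit a GPD to positive excesses by profile likelihood.

    For fixed theta = gamma/sigma, the likelihood equations force
    gamma(theta) = mean of log(1 + theta*y); substituting back gives the
    one-dimensional profile log-likelihood  -m*log(sigma) - m*(1+gamma),
    which we maximize over a grid of theta values.
    """
    m = len(ys)
    best = None
    for theta in theta_grid:
        if min(1.0 + theta * y for y in ys) <= 0:
            continue                     # outside the parameter space
        gamma = sum(math.log1p(theta * y) for y in ys) / m
        if gamma == 0.0:
            continue
        sigma = gamma / theta
        ll = -m * math.log(sigma) - m * (1.0 + gamma)
        if best is None or ll > best[0]:
            best = (ll, gamma, sigma)
    return best[1], best[2]

# sanity check on simulated GPD excesses with gamma = 0.5, sigma = 1
rng = random.Random(42)
gamma0 = 0.5
ys = [((1.0 - rng.random()) ** (-gamma0) - 1.0) / gamma0
      for _ in range(5000)]
g, s = gpd_profile_fit(ys, [0.01 * j for j in range(1, 200)])
```

On such a sample the estimates should land close to the true values $0.5$ and $1$; the positive grid restricts this sketch to the case $\gamma > 0$.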
3. Existence and Consistency
Our main results are the following theorems, stating the existence and consistency of the ML estimators.
Theorem 1. Suppose (1) holds for $\gamma \neq 0$, and assume that, as $n \to \infty$,

Then, there exists a sequence of estimators $(\hat{\gamma}_n, \hat{\sigma}_n)$ and a random integer $N$ such that

and, as $n \to \infty$,

Moreover, if, additionally, as $n \to \infty$,

then

as $n \to \infty$, where $f$ is the auxiliary function in (2).
Theorem 2. Suppose (1) holds for $\gamma = 0$. Assume that, as $n \to \infty$,

and that, with probability 1, the following relation does not hold for sufficiently large $n$:

Then, there exists a sequence of estimators $(\hat{\gamma}_n, \hat{\sigma}_n)$ and a random integer $N$ such that

and, as $n \to \infty$,

Moreover, if, additionally, as $n \to \infty$, then
Remark 2. The extra condition (18) ensures the existence of a nonzero solution of the likelihood equations for $\gamma = 0$. Hence, the solution of the likelihood equations will almost surely not be equal to 0 if, for example, $F$ possesses a density.
We first recall the following representation of the $k$-record values. Let $(E_i)_{i \ge 1}$ be an i.i.d. sequence of standard exponential random variables, and denote by $S_n = E_1 + \cdots + E_n$, $n \ge 1$, their partial sums. Let $H = -\log(1 - F)$ be the hazard function of $F$. It is easy to see that $1 - F(x) = e^{-H(x)}$ for $x < x^*$. Since $F$ is continuous, the function $H$ is strictly increasing, and hence we have the following representation (see relation (4.7), p. 167 in ):
$$\{R_n^{(k)}\}_{n \ge 1} \stackrel{d}{=} \{H^{\leftarrow}(S_n/k)\}_{n \ge 1}.$$
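For the standard exponential distribution the hazard function is the identity ($H(x) = x$ when $F(x) = 1 - e^{-x}$), so the representation reduces to $R_n^{(k)} \stackrel{d}{=} S_n/k$. A quick Monte Carlo sanity check of this special case (sample sizes and names are illustrative):

```python
import random

# Monte Carlo check of R_n^(k) =_d H^{-1}(S_n/k) in the standard
# exponential case, where H is the identity: the n-th k-record is then
# distributed as a Gamma(n, 1) variable divided by k, with mean n/k.
rng = random.Random(7)
n, k, reps = 5, 2, 20000
draws = [sum(rng.expovariate(1.0) for _ in range(n)) / k
         for _ in range(reps)]
mean = sum(draws) / reps
print(round(mean, 2))   # close to n/k = 2.5
```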
So, from now on, we shall assume, without loss of generality, that $R_n^{(k)} = H^{\leftarrow}(S_n/k)$ for all $n \ge 1$ and $k \ge 1$.
Before proving the above theorems, we need the following lemmas.
Lemma 2. For a sequence of integers $m_n \to \infty$, we have, as $n \to \infty$,

uniformly, where $\lfloor x \rfloor$ denotes the largest integer not exceeding $x$.
Proof. First, we write, for $n \ge 1$,

It follows that

Since, for all $n$,

we have

By using the Komlós–Major–Tusnády approximation [18, 19], we can define Wiener processes such that

Next, observe that

Note that, for the first terms,

For the last term, we use Theorem 3.2B in Hanson and Russo. It implies that

Combining this with (25), (28), (29), and the above conditions on $m_n$, we get

which completes the proof of the lemma.
Lemma 3. Suppose (1) holds for $\gamma > 0$ and $m_n \to \infty$ as $n \to \infty$. Then, for any $\varepsilon > 0$, we have, as $n \to \infty$,

In addition, if $\varepsilon$ is close to 0 and $n$ is large enough, we have
Proof. Write $U = (1/(1-F))^{\leftarrow}$, and note that when (1) is satisfied for $\gamma > 0$, it is well known that $U$ is regularly varying at infinity with index $\gamma$, so that $U(tx)/U(t) \to x^{\gamma}$ locally uniformly in $x > 0$. Next, from Lemma 2, $S_{m_n}/m_n \to 1$ almost surely as $n \to \infty$; it follows readily that, as $n \to \infty$,

and so, as $n \to \infty$,
Similarly, we observe that

From Lemma 2 and the fact that $U$ is an increasing function, we have, for all $t \ge 1$ and $n$ large enough,

Hence, the dominated convergence theorem ensures that

By the same arguments, the analogous convergence holds as $n \to \infty$.
Next, again by using the dominated convergence theorem and after straightforward calculations, we obtain

Consequently, there exists $\varepsilon_0 > 0$ such that, for any $0 < \varepsilon < \varepsilon_0$, when $n$ is large enough,

The same arguments yield the second inequality.
Remark 3. This lemma can also be proved by using Potter's inequality (see Proposition 0.8(ii) in ): for any $\varepsilon > 0$, there exists $t_0$ such that, for $t \ge t_0$ and $x \ge 1$,
$$(1 - \varepsilon)\, x^{\gamma - \varepsilon} \le \frac{U(tx)}{U(t)} \le (1 + \varepsilon)\, x^{\gamma + \varepsilon}.$$
Combining these bounds with Lemma 2 leads, for $n$ large enough, to the inequalities of the lemma.
Lemma 4. Suppose (1) holds for $\gamma < 0$ and $m_n \to \infty$ as $n \to \infty$. Then, for any $\varepsilon > 0$, we have, as $n \to \infty$,

In addition, if $\varepsilon$ is close to 0 and $n$ is large enough, we have
Proof. This proof is similar to the previous one, with straightforward modifications. When (1) is satisfied for $\gamma < 0$, it is well known that $x^* < \infty$ and that $x^* - U$ is regularly varying at infinity with index $\gamma$. Write

Again from Lemma 2, $S_{m_n}/m_n \to 1$ almost surely as $n \to \infty$, and the first convergence follows readily.
Similarly, we write

From Lemma 2 and the bound, valid for all $t \ge 1$ and $n$ large enough,

the dominated convergence theorem implies that

By the same arguments, the analogous convergence holds as $n \to \infty$.
Next, again by using the dominated convergence theorem and after straightforward calculations, we have

Consequently, there exists $\varepsilon_0 > 0$ such that, for any $0 < \varepsilon < \varepsilon_0$, when $n$ is large enough,

The same arguments show the second inequality.
Lemma 5. Suppose (1) holds for $\gamma = 0$ and $m_n \to \infty$ as $n \to \infty$. Then, for any $\varepsilon > 0$, as $n \to \infty$,

Furthermore, for sufficiently large $n$, we have
Proof. For arbitrary $\varepsilon > 0$, let

First, we have

Suppose now that (2) holds for $\gamma = 0$, i.e.,
$$\lim_{t \to x^*} \frac{1 - F(t + x f(t))}{1 - F(t)} = e^{-x},$$
and

Since $1 - F$ is monotone, this limit holds locally uniformly in $x$.
Next, observe that

By using Lemma 2, it follows readily that, for all $n$ large enough,

Since, for any $x > 0$,

it follows by using the dominated convergence theorem that, as $n \to \infty$,

Similarly, for any $\varepsilon > 0$ and all $n$ large enough,

and so, as $n \to \infty$,

Hence, as $n \to \infty$,

The remaining bounds of the lemma follow in the same way.
Consequently, for sufficiently large $n$, we have almost surely
Proof of Theorem 1. We present the proof only for $\gamma > 0$; for $\gamma < 0$, the proof is essentially the same.
By choosing a suitable positive sequence $\varepsilon_n \to 0$ as $n \to \infty$, there exists, from Lemma 3, a random integer $N$ such that, for any $n \ge N$, $\psi_n$ takes opposite signs at the endpoints of the interval $[\gamma - \varepsilon_n, \gamma + \varepsilon_n]$. This ensures, by the mean value theorem, the existence of a random variable $\hat{\gamma}_n \in [\gamma - \varepsilon_n, \gamma + \varepsilon_n]$ a.s. such that $\psi_n(\hat{\gamma}_n) = 0$ a.s. when $n \ge N$.
Since $\psi_n$ is an increasing function, we have almost surely

From Lemma 3, both bounds converge to $\gamma$; this implies that $\hat{\gamma}_n \to \gamma$ almost surely, i.e., $\hat{\gamma}_n$ is strongly consistent.
To prove the almost sure convergence of $\hat{\sigma}_n$, we use the fact that, as $t \to \infty$, $f(t)/U(t) \to \gamma$ (see Lemma 1.2.9, p. 22, in De Haan and Ferreira).
So, as $n \to \infty$,

Since, for sufficiently large $n$, $\hat{\gamma}_n > 0$ a.s., we have eventually

Hence, as $n \to \infty$,

By applying the law of the iterated logarithm, we have almost surely

Combining this with the fact that the function $U$ is regularly varying at infinity with index $\gamma$, the consistency of $\hat{\sigma}_n$ is proved in the positive case.
Proof of Theorem 2. First, we choose a suitable positive sequence $\varepsilon_n \to 0$ as $n \to \infty$. It follows from Lemma 4 that there exists a random integer $N$ such that, for any $n \ge N$, after straightforward calculations, we have almost surely

This ensures that, when $n$ is large enough, $\psi_n$ changes sign in a neighborhood of 0. Combining this with the fact that the quantities involved have the same sign, it follows that, almost surely, for sufficiently large $n$, there exists a nonzero root of $\psi_n$ in that neighborhood.
Recall that $\psi_n$ is an increasing function. This implies almost surely

Since $\varepsilon_n \to 0$ as $n \to \infty$, $\hat{\gamma}_n \to 0$ almost surely, and the consistency is proved.
Now, we prove the almost sure convergence of $\hat{\sigma}_n$. For this, we write

Since, for sufficiently large $n$, $\hat{\gamma}_n \neq 0$ a.s., we have eventually

Hence, as $n \to \infty$,

Under (2), Lemma 2 ensures that

Therefore,

Finally, by applying the law of the iterated logarithm, we obtain the corresponding almost sure limit. Combining this with the fact that the auxiliary function $f$ is slowly varying at infinity, i.e., $f(tx)/f(t) \to 1$ for all $x > 0$ (see Lemma 1.2.9, p. 22, in De Haan and Ferreira), the consistency of $\hat{\sigma}_n$ is then proved for $\gamma = 0$.
Proof of Lemma 1. Recall the following representation:
$$\{R_n^{(k)}\}_{n \ge 1} \stackrel{d}{=} \{H^{\leftarrow}(S_n/k)\}_{n \ge 1},$$
where $H$ is the continuous hazard function of the distribution function $F$ and $S_n = E_1 + \cdots + E_n$, with $E_1, E_2, \ldots$ independent random variables having the standard exponential distribution.
It follows, without loss of generality, that

We know that $S_n - S_m$ is independent of $(S_1, \ldots, S_m)$. Then,

which gives, by independence,