ARL Estimation of the Control Chart of Log Likelihood Ratios’ Sum for Markov Sequence

Guo, Yi; Gao, Lei; Zhu, Yan

doi:https://doi.org/10.1155/2021/6649949

Journal of Mathematics

Research Article | Open Access

Volume 2021 | Article ID 6649949 | https://doi.org/10.1155/2021/6649949

Yi Guo,¹ Lei Gao,¹ and Yan Zhu²

Academic Editor: Niansheng Tang

Received 10 Dec 2020

Revised 22 Apr 2021

Accepted 28 Apr 2021

Published 10 May 2021

Abstract

To evaluate the surveillance performance of a control chart whose charting statistic is the sum of log likelihood ratios in statistical process control (SPC), we give a Markov-chain-based proof of the asymptotic estimate of the average run length (ARL) for this kind of chart. The out-of-control ARL is approximately equal to 1 for any fixed in-control ARL with a negative control limit. Using the equivalence between the limit distribution of a sum and that of a suprema sum of a Markov chain, we derive the estimate of the out-of-control ARL for a sufficiently large positive control limit. Numerical experiments confirm our results.

1. Introduction

The main aim of SPC is to detect an abrupt change in an observed time series as soon as possible after the change has occurred. Shewhart [1] was the first to design a control chart for quickly detecting possible changes in the underlying process. Subsequently, a great number of control charts have been proposed and have come into wide use in many fields, such as environmental control [2, 3], biostatistics [4, 5], clinical medicine [6, 7], economics and finance [8], industrial quality and process control [9, 10], process monitoring [11–16], public health [7, 14, 16], and social networks [17].

Research on constructing control charts for surveillance has continued without interruption since Shewhart presented this method. The existing theoretical studies in statistical monitoring can be broadly classified into three categories. The first is the optimal economic or economic-statistical design, where, in most cases, the net sum of all costs is minimized [18–21]. The second is the optimal statistical design, in which the statistical parameters of a control chart are chosen to minimize the out-of-control ARL for a given in-control ARL or a given probability of false alarm [16, 22–25]. Recently, a theoretical approximation to the ARL was proposed in [26]. The last category is the detection of changes in distribution by optimal control charts, which can be regarded as a kind of optimal stopping time [27–29]. Here, a chart is optimal if it has the smallest out-of-control ARL among all control charts with either a probability of false alarm no greater than a preset level or a false alarm rate no less than a given value. This more mathematical area is thus related to statistical design. In what follows, we focus on the third category.

Consider the Gaussian observation sequence , whose distribution may change at time τ. Let and be the prechange probability density function and the postchange probability density function of , respectively. Denote the postchange joint probability distribution, expectation, and variance by , respectively. In particular, when the change time τ = ∞, we suppose that a change never occurs. Frisén [30] showed that there exists a positive value such that the control chart with the charting statistic of the sum of log likelihood ratios (SLR), is optimal in the sense, when the change is a shift in the mean of , where denotes the control chart test, τ is the change-point time, and is the constant control limit such that . Although the optimality of has been proved in [30], few studies have considered the performance of this control chart for monitoring the change point in the process.
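As a concrete illustration of a chart built on summed log likelihood ratios, the following Python sketch treats the simplest Gaussian case. It assumes a known mean shift from mu0 to mu1 with a common standard deviation sigma, a change at the first observation, and an alarm as soon as the running sum reaches a limit c. The function names and this exact form of the statistic are illustrative assumptions, not the definition used in [30].

```python
import numpy as np


def gaussian_llr(x, mu0, mu1, sigma=1.0):
    """Log likelihood ratio log(f1(x) / f0(x)) for N(mu1, sigma^2) against N(mu0, sigma^2)."""
    x = np.asarray(x, dtype=float)
    # Expanding the two Gaussian exponents gives a linear function of x.
    return (mu1 - mu0) * (x - 0.5 * (mu0 + mu1)) / sigma**2


def first_alarm(x, mu0, mu1, c, sigma=1.0):
    """First 1-based index n at which the running sum of log likelihood ratios is >= c.

    Returns None if the limit is never reached on the given sample.
    """
    running_sum = np.cumsum(gaussian_llr(x, mu0, mu1, sigma))
    hits = np.nonzero(running_sum >= c)[0]
    return int(hits[0]) + 1 if hits.size else None


# Hypothetical sample: the running sum is roughly -0.3, -0.9, -0.1, 0.3, 0.9.
sample = [0.2, -0.1, 1.3, 0.9, 1.1]
alarm_time = first_alarm(sample, mu0=0.0, mu1=1.0, c=0.5)
```

With this sample and c = 0.5, the running sum first reaches the limit at the fifth observation, so alarm_time is 5. A larger limit such as c = 2 is never reached on this short sample, and the function returns None.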

Therefore, in this paper, we regard the ARL as the criterion of optimality of a control chart, because the ARL links the control limit to the statistical properties of the observation sequence . In Section 2, we prove that the out-of-control ARL is approximately equal to 1 for a given in-control ARL when the control limit is negative. In Section 3, we apply the equivalence between the limit distribution of a sum and that of a suprema sum of a Markov chain to estimate the out-of-control ARL when the control limit is a sufficiently large positive constant. Finally, in Section 4, we conduct numerical experiments to verify our theoretical analysis.

2. Estimation of ARL with a Negative Control Limit

In this section, we consider a constant control limit to construct a control chart and estimate the corresponding in-control and out-of-control ARLs. Let the observations be a time-homogeneous Markov chain with the discrete state space . Here, we consider only detecting a change at the initial time τ = 1, and the Markov chain is positive recurrent with the prechange transition probability and the postchange transition probability . For convenience, we use instead of for . Similar to the notation defined in Section 1, we use , and to denote, respectively, the prechange joint probability distribution, expectation, and variance of .

To ease the subsequent analysis, we choose a large enough positive number to cut out the first terms of the sequence . In other words, the sequence tends to when goes to infinity. The optimality of the control chart for the Markov chain was established in [31]. Now, we define the charting statistics as

Let for ; then, for . For any state , let be the time of the th visit to state and be the number of visits to from time 0 to time . The cumulative sums of and on the th block of time are, respectively, defined by for and . Then, both and are sequences of independent and identically distributed (i.i.d.) random variables.
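The block construction above can be sketched in Python as follows. The sketch assumes that a block runs from just after one visit to the reference state up to and including the next visit; this indexing convention, like the function names, is an assumption made for illustration, since the defining display is not reproduced here.

```python
def visit_times(path, state):
    """Indices n at which the chain path equals the given state, in increasing order."""
    return [n for n, x in enumerate(path) if x == state]


def block_sums(values, path, state):
    """Sum `values` over each block between successive visits of `path` to `state`.

    Block j covers the indices t_j + 1, ..., t_{j+1}, where t_j is the j-th visit time.
    For a positive recurrent Markov chain, such block sums are i.i.d. by the
    strong Markov property.
    """
    t = visit_times(path, state)
    return [float(sum(values[a + 1 : b + 1])) for a, b in zip(t[:-1], t[1:])]


# Hypothetical path visiting state 0 at times 0, 2, and 5.
path = [0, 1, 0, 2, 1, 0]
values = [1, 2, 3, 4, 5, 6]
sums = block_sums(values, path, state=0)
```

Here the two blocks cover indices 1 to 2 and 3 to 5, so sums is [5.0, 15.0].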

Throughout this paper, we assume that (i) if and only if , and has no atom with respect to for (here, we define 0/0 = 1), and (ii) .

The sum of is written by . Set , , and and . Moreover, for , let

Then, for large , we have

Now, we define a control chart with the charting statistic of the sum of log likelihood ratios for detecting the change in distribution of the Markov sequence : for some constant .

Now, we are ready to estimate . According to Section 3 of [30], is the average run length until an alarm is signaled, so and are equivalent. Consequently, we use the estimator of in place of in the subsequent analysis. The main result is stated in the following theorem.
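Before turning to the asymptotic result, the average run length of such a chart can be approximated by simulation. The following Python sketch is a minimal Monte Carlo estimate under stated assumptions: a finite state space, a change at the first step, increments equal to the log ratio of postchange to prechange transition probabilities, and an alarm when their running sum reaches c. The two-state matrices P0 and P1 are hypothetical and are not taken from the paper.

```python
import numpy as np

# Hypothetical prechange and postchange transition matrices on states {0, 1}.
P0 = np.array([[0.9, 0.1], [0.5, 0.5]])
P1 = np.array([[0.5, 0.5], [0.1, 0.9]])


def simulate_run_length(P_sim, P0, P1, c, x0, rng, max_steps=10_000):
    """Run length of the chart when the chain is driven by transition matrix P_sim.

    The chart signals at the first n with
    sum_{k <= n} log(P1[x_{k-1}, x_k] / P0[x_{k-1}, x_k]) >= c.
    Runs that never signal are truncated at max_steps.
    """
    x, s = x0, 0.0
    for n in range(1, max_steps + 1):
        y = rng.choice(len(P_sim), p=P_sim[x])  # next state of the chain
        s += np.log(P1[x, y] / P0[x, y])        # log likelihood ratio increment
        if s >= c:
            return n
        x = y
    return max_steps


def estimate_arl(P_sim, P0, P1, c, x0=0, reps=2000, seed=0):
    """Monte Carlo average of the run length over `reps` independent paths."""
    rng = np.random.default_rng(seed)
    runs = [simulate_run_length(P_sim, P0, P1, c, x0, rng) for _ in range(reps)]
    return float(np.mean(runs))
```

Passing P_sim=P1 gives an out-of-control estimate and P_sim=P0 an in-control estimate. In-control runs can be long because the running sum drifts downward, so max_steps and reps should be chosen with the available computing time in mind.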

Theorem 1. Let be a time-homogeneous Markov chain satisfying conditions (i) and (ii). Then, for a given and , there exists a negative number such that as .

Proof. Choose the control limit and note that where . Let . Using the law of total expectation and the Markov property, we obtain Since is a concave function, by Jensen’s inequality, we have Substituting (13) into (12) and combining with (11), we get that for any . Similarly, we can prove for any . Hence, and ; then, . Let for some , where denotes the integer part of . Then, The total probability formula tells us that Equations (4) and (7) yield that Note that and converges to 0 almost everywhere as ; therefore, Notice that, for a large , the relationship between the probability distribution function and the density function of the standard normal distribution is Combining (16)–(18) and the central limit theorem for Markov chains (see [32]), we get that Put (19) into (15) and combine to yield that For , by the Markov inequality and , we have Inequality (21) implies that the value of tends to 0 as . Put (20) and (21) into (14); then, there exists some constant such that for a large .
Using Theorem 5.1.7 in [33], there exists a nonpositive number such that holds for a large enough . Thus, we can choose a negative number satisfying and .
Next, we prove (10). For any , let . Similar to (20), we get that Note that the second and third terms on the right-hand side of the last inequality in (24) tend to 0 as goes to infinity.
This completes the proof.
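The proof uses a large-argument relation between the standard normal distribution function and its density. One classical relation of this kind, assumed here for illustration rather than taken from the paper's display, is the Mills-ratio bound x/(1 + x²)·φ(x) ≤ 1 − Φ(x) ≤ φ(x)/x for x > 0, so that 1 − Φ(x) behaves like φ(x)/x for large x. The Python sketch below checks it numerically; the function names are hypothetical.

```python
import math


def phi(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def upper_tail(x):
    """Upper tail 1 - Phi(x), computed via erfc to keep precision for large x."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def mills_bounds(x):
    """Classical lower and upper bounds on 1 - Phi(x), valid for x > 0."""
    return x / (1.0 + x * x) * phi(x), phi(x) / x


# Ratio of the exact tail to the leading-order approximation phi(x) / x.
ratios = {x: upper_tail(x) / (phi(x) / x) for x in (1.0, 2.0, 4.0, 8.0)}
```

The ratios increase toward 1 as x grows, which is the sense in which the tail probability and the density are of the same order up to the factor 1/x.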

3. Estimation of ARL with a Positive Control Limit

In this section, we use the equivalence between the limit distribution of a sum and that of a suprema sum of a Markov chain to estimate the out-of-control ARL when the control limit is a sufficiently large positive constant. The main result is presented in the following theorem.

Theorem 2. Assume that is a homogeneous Markov chain. For any state , if then for large and , where and , are defined in (5) and (6), respectively. The symbol denotes an infinitesimal of higher order and .

Proof. It follows from (8) that, for , The estimation of is reduced to the estimation of . According to Theorem 2 in [34], we have the following conclusion: Then, we use (28) to calculate the value of . Let , where and denotes the real number field. When becomes sufficiently large, by (5), (27), and (28), we have Let , and we have for large enough and . When goes to infinity, we deal with the second term on the right-hand side of (30) as follows: Note that, for a large , we have We obtain the right-hand side of (30) as follows: as tends to infinity. Combining (30), (33), and , we obtain To prove the inequality on the left-hand side of (26), let , and let go to infinity; then, we use (27) to yield that where is the distribution function and . Similar to the derivation of (33), let tend to infinity; then, we obtain the estimate of the second term on the right-hand side of (35) as For the first term on the right-hand side of (35), because the function is monotonically decreasing in the interval with respect to and the distribution function is monotonically nondecreasing in , we have for and . By (35)–(37), we obtain for a large enough . Combining (34) and (38) yields (26). This completes the proof.

4. Numerical Experiment

In this section, we perform two numerical experiments to verify our theoretical results. In the first experiment, let be a sequence of i.i.d. Gaussian random variables with the prechange and postchange probability densities and . We consider the standard case of a shift in the mean of from to .

According to the proof of Theorem 1, the initial data are chosen as . Given different values of , we obtain the corresponding values of the control limit and , which are listed in Table 1.

These results suggest that the out-of-control ARL is approximately equal to 1 with a negative control limit and a given in-control ARL, as predicted by Theorem 1.

In the second experiment, we choose and . Let be a homogeneous Poisson process with the prechange and postchange parameters and ; then, the corresponding prechange and postchange transition probabilities are defined as where and are arbitrary states from the state space and for .

The results, presented in Table 2, show that the ratio of and is approximately 2.38 as . This implies that the out-of-control ARL and the control limit are infinities of the same order, as predicted by Theorem 2.
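The linear growth of the out-of-control run length in the control limit can be illustrated with a simplified simulation. The Python sketch below is not the paper's experiment: it assumes i.i.d. Poisson counts with hypothetical rates lam0 = 1 and lam1 = 2, a change at the first observation, and an alarm when the running sum of log likelihood ratios reaches c. For i.i.d. increments with positive mean, classical renewal theory gives a first-passage time of order c divided by that mean, which here is the Kullback-Leibler divergence between the two Poisson laws. This limiting constant is used only as a reference value and is not the constant of Theorem 2.

```python
import numpy as np


def poisson_llr(x, lam0, lam1):
    """Log likelihood ratio of Poisson(lam1) against Poisson(lam0) at count x."""
    return x * np.log(lam1 / lam0) - (lam1 - lam0)


def kl_rate(lam0, lam1):
    """Mean of the log likelihood ratio under Poisson(lam1) (Kullback-Leibler divergence)."""
    return lam1 * np.log(lam1 / lam0) - (lam1 - lam0)


def mc_arl1(c, lam0=1.0, lam1=2.0, reps=500, horizon=2000, seed=0):
    """Monte Carlo out-of-control run length for limit c, with the change at time 1."""
    rng = np.random.default_rng(seed)
    x = rng.poisson(lam1, size=(reps, horizon))
    s = np.cumsum(poisson_llr(x, lam0, lam1), axis=1)
    crossed = s >= c
    # argmax gives the first crossing in each row; rows that never cross are capped at horizon.
    t = np.where(crossed.any(axis=1), crossed.argmax(axis=1) + 1, horizon)
    return float(t.mean())


# Ratio of simulated run length to control limit, next to the renewal-theory reference value.
ratio_at_50 = mc_arl1(50.0) / 50.0
reference = 1.0 / kl_rate(1.0, 2.0)
```

The ratio sits slightly above the reference for finite c because the running sum overshoots the limit at the alarm time, and it approaches the reference as c grows. With a negative limit such as c = -5 the very first increment already exceeds the limit in this setting, so the simulated run length is exactly 1.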

Data Availability

The data used to support the findings of the study are generated by Matlab.

Conflicts of Interest

The authors declare no conflict of interest.

Acknowledgments

Yan Zhu was supported by NSFC, Grant no. 11801353.

References

[1] W. A. Shewhart, Economic Control of Quality of Manufactured Product, Van Nostrand, New York, NY, USA, 1931.
[2] V. Barnett and K. F. Turkman, Statistics for the Environment, John Wiley & Sons Inc, New Jersey, USA, 1993.
[3] M. Pettersson, “Monitoring a freshwater fish population: statistical surveillance of biodiversity,” Environmetrics, vol. 9, no. 2, pp. 139–150, 1998.
[4] R. D. Fricker, Introduction to Statistical Methods for Biosurveillance, Cambridge University Press, Cambridge, UK, 2012.
[5] D. Siegmund, “Change-points: from sequential detection to biology and back,” Sequential Analysis, vol. 32, no. 1, pp. 2–14, 2013.
[6] M. Frisén, “Evaluations of methods for statistical surveillance,” Statistics in Medicine, vol. 11, no. 11, pp. 1489–1502, 1992.
[7] M. Keshavarz, S. Asadzadeh, and S. T. A. Niaki, “Risk-adjusted frailty-based CUSUM control chart for phase I monitoring of patients’ lifetime,” Journal of Statistical Computation and Simulation, vol. 91, no. 2, pp. 334–352, 2021.
[8] M. Frisén, “Optimal sequential surveillance for finance, public health, and other areas,” Sequential Analysis, vol. 28, no. 3, pp. 310–337, 2009.
[9] D. C. Montgomery, Introduction to Statistical Quality Control, John Wiley & Sons Inc, New York, NY, USA, 6th edition, 2009.
[10] P. Qiu, Introduction to Statistical Process Control, Chapman & Hall/CRC, Boca Raton, FL, USA, 2014.
[11] N. A. Adegoke, A. N. H. Smith, M. J. Anderson, and M. D. M. Pawley, “MEWMA charts when parameters are estimated with applications in gene expression and bimetal thermostat monitoring,” Journal of Statistical Computation and Simulation, vol. 91, no. 1, pp. 37–57, 2021.
[12] M. Erfanian, B. Sadeghpour Gildeh, and M. Reza Azarpazhooh, “A new approach for monitoring healthcare performance using generalized additive profiles,” Journal of Statistical Computation and Simulation, vol. 91, no. 1, pp. 167–179, 2021.
[13] S. J. Mirkamali, “An improved exponentially weighted moving average chart for monitoring proportions using maxima nomination sampling,” Journal of Statistical Computation and Simulation, vol. 91, no. 2, pp. 282–299, 2021.
[14] C. Sonesson and D. Bock, “A review and discussion of prospective statistical surveillance in public health,” Journal of the Royal Statistical Society: Series A (Statistics in Society), vol. 166, no. 1, pp. 5–21, 2003.
[15] Z. Song, A. Mukherjee, and J. Zhang, “Some robust approaches based on copula for monitoring bivariate processes and component-wise assessment,” European Journal of Operational Research, vol. 289, no. 1, pp. 177–196, 2021.
[16] G. D. Williamson and G. Weatherby Hudson, “A monitoring system for detecting aberrations in public health surveillance reports,” Statistics in Medicine, vol. 18, no. 23, pp. 3283–3298, 1999.
[17] W. H. Woodall, M. J. Zhao, K. Paynabar, R. Sparks, and J. D. Wilson, “An overview and perspective on social network monitoring,” IISE Transactions, vol. 49, no. 3, pp. 354–365, 2017.
[18] P. Charongrattanasakul and A. Pongpullponsak, “Minimizing the cost of integrated systems approach to process control and maintenance model by EWMA control chart using genetic algorithm,” Expert Systems with Applications, vol. 38, no. 5, pp. 5178–5186, 2011.
[19] A. J. Duncan, “The economic design of X̄ charts used to maintain current control of a process,” Journal of the American Statistical Association, vol. 51, no. 274, pp. 228–242, 1956.
[20] E. M. Saniga, “Economic statistical control-chart designs with an application to X̄ and R charts,” Technometrics, vol. 31, no. 3, pp. 313–320, 1989.
[21] W. C. Yeong, M. Chong, L. M. Ha, and M. A. Rahim, “Economically optimal design of a multivariate synthetic T² chart,” Communications in Statistics - Simulation and Computation, vol. 43, no. 6, Article ID 731122, pp. 1333–1361, 2012.
[22] A. A. Aly, R. M. Hamed, and M. A. Mahmoud, “Optimal design of the adaptive exponentially weighted moving average control chart over a range of mean shifts,” Communications in Statistics - Simulation and Computation, vol. 46, no. 2, pp. 890–902, 2017.
[23] W. M. Carlyle, D. C. Montgomery, and G. C. Runger, “Optimization problems and methods in quality control and improvement,” Journal of Quality Technology, vol. 32, no. 1, pp. 1–17, 2000.
[24] R. Shokrizadeh, A. Saghaei, and V. Amirzadeh, “Optimal design of the variable sampling size and sampling interval variable dimension T² control chart for monitoring the mean vector of a multivariate normal process,” Communications in Statistics - Simulation and Computation, vol. 47, no. 2, pp. 329–337, 2018.
[25] Y. Wu, “Estimation of common change point and isolation of changed panels after sequential detection,” Sequential Analysis, vol. 39, no. 1, pp. 52–64, 2020.
[26] L. Xie, Y. Xie, and G. V. Moustakides, “Sequential subspace change point detection,” Sequential Analysis, vol. 39, no. 3, pp. 307–335, 2020.
[27] T. L. Lai, “Information bounds and quick detection of parameter changes in stochastic systems,” IEEE Transactions on Information Theory, vol. 44, no. 7, pp. 2917–2929, 1998.
[28] G. Lorden, “Procedures for reacting to a change in distribution,” The Annals of Mathematical Statistics, vol. 42, no. 6, pp. 1897–1908, 1971.
[29] A. S. Polunchenko and V. Raghavan, “Comparative performance analysis of the cumulative sum chart and the Shiryaev-Roberts procedure for detecting changes in autocorrelated data,” Applied Stochastic Models in Business and Industry, vol. 34, no. 6, pp. 922–948, 2018.
[30] M. Frisén, “Statistical surveillance optimality and methods,” International Statistical Review, vol. 71, no. 2, pp. 403–434, 2003.
[31] D. Han, F. Tsung, and J. Xian, “Optimal sequential tests for monitoring changes in the distribution of finite observation sequences,” 2019, https://arxiv.org/abs/1907.13421.
[32] R. N. Bhattacharya and E. C. Waymire, Stochastic Processes with Applications, Wiley, New York, NY, USA, 1990.
[33] G. F. Lawler and V. Limic, Random Walk: A Modern Introduction, Cambridge University Press, Cambridge, England, UK, 2010.
[34] L. Gao and D. Han, “Extreme value distributions for two kinds of path sums of Markov chain,” Methodology and Computing in Applied Probability, vol. 22, no. 1, pp. 279–294, 2020.
[35] W. L. Teoh, M. B. C. Khoo, P. Castagliola, and S. Chakraborti, “Optimal design of the double sampling chart with estimated parameters based on median run length,” Computers & Industrial Engineering, vol. 67, pp. 104–115, 2014.

Copyright

Copyright © 2021 Yi Guo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
