Abstract
We propose the degenerate-generalized likelihood ratio test (DGLRT) for one-sided composite hypotheses in cases of independent and dependent observations. The theoretical results show that the DGLRT has controlled error probabilities and stops sampling with probability 1 under some regularity conditions. Moreover, its stopping boundaries are constants and can be easily determined using the provided searching algorithm. According to the simulation studies, the DGLRT has less overall expected sample sizes and less relative mean index (RMI) values in comparison with the sequential probability ratio test (SPRT) and double sequential probability ratio test (2-SPRT). To illustrate the application of it, a real manufacturing data are analyzed.
1. Introduction
Consider the following hypotheses test problem: with the error constraints Here, , , and is the parameter space. Sequential tests for the problem (1.1) with independently and identically distributed (i.i.d.) observations have been widely studied. In cases of the one parameter exponential family with monotone likelihood ratio, the sequential probability ratio test (SPRT) proposed by Wald [1] provided an optimal solution to the problem (1.1), in the sense of minimizing the expected sample sizes (ESSs) at and , among all tests satisfying the constraints (1.2).
However, its ESSs at other parameter points are even larger than that of the test methods with fixed sample sizes. This led Weiss [2], Lai [3], and Lorden [4] to consider the problem (1.1) from the minimax perspective. Subsequently, Huffman [5] extended Lordenβs [4] results to show that the 2-SPRT provides an asymptotically optimal solution to the minimax sequential test problem (1.1). Instead of the minimax approach, Wang et al. [6] proposed a test minimizing weighted ESS based on mixture likelihood ratio (MLR). Since the ESSs over are hard to control and are usually focused on applications, Wang et al. [6] paid much attention to investigate the performance of the ESS over . Many tests for the problem (1.1) under independent observations are developed from other perspectives, including [7β11] and so forth.
It is true that in many practical cases the independence is justified, and hence these tests have been widely used. However, such tests may not be effective in cases when the observations are dependent, for example, Cauchy-class process for sea level (cf. [12]), fractional Gaussian noise with long-range dependence (cf. [13, 14]) and the power law type data in cyber-physical networking systems [15]. Especially for the power law data, the sequential tests for dependent observations are particularly desired. This need is not limited to these cases.
So far, many researchers studied sequential tests for various dependent scenarios. Phatarfod [16] extended the SPRT to test two simple hypotheses versus when observations constitute a Markov chain. Tartakovsky [17] showed that certain combinations of one-sided SPRT still own the asymptotical optimality in the ESS under fairly general conditions for a finite simple hypotheses. Novikov [18] proposed an optimal sequential test for a general problem of testing two simple hypotheses about the distribution of a discrete-time stochastic process. Niu and Varshney [19] proposed the optimal parametric SPRT with correlated data from a system design point of view. To our best knowledge, however, there are few references available for considering the problem (1.1) with dependent observations from the perspective of minimizing the ESS over . Similar to Wang et al. [6], one can extend the MLR to the dependent case. However, unlike the i.i.d. case, the MLR under the dependent case may not be available because of the complexity of its computation. Besides, its test needs to divide into two disjoint parts by inserting a point. In i.i.d. cases, this point can be selected following Huffmanβs [5] suggestion. But, in the dependent case, this suggestion may not be effective. One also can use the generalized likelihood ratio (GLR) instead of the MLR. Unfortunately, as opposite to the MLR, the GLR does not preserve the martingale properties which allow one to choose two constant stopping boundaries in a way to control two types of error. Moreover, the computation of the GLR is hard to be obtained in cases when the maximum likelihood estimator should be searched. This usually happens in the dependent case.
In this paper, we propose a test method for both dependent and independent observations. It has the following features: (1) it has good performances over in the sense of less overall expected sample sizes; (2) its computation is reasonably simple; (3) its stopping boundaries can be determined conveniently. The rest of the paper is organized as follows. In Section 2, we describe the construction of the proposed test in details and present its basic theoretical properties. Based on these theoretical results, we provide a searching algorithm to compute stopping boundaries for our proposed test. In Section 3, we conduct some simulation studies to show the performance of the proposed test. Some concluding remarks are given in Section 4. Some technical details are provided in the appendix.
2. The Proposed Test
Let , and suppose that the conditional probability distribution of each , has an explicit form. Here, and . Thus, likelihood ratio can be defined as Lai [20] introduced this model to construct a sequential test for many simple hypotheses when the observations are dependent. It is very general and also includes the i.i.d. cases.
Example 2.1. Consider, for instance, a simple nonlinear time series model:
In this case, , , and is the probability density function of the standard normal distribution.
To overcome the difficulty stated in Section 1, we propose a test statistic which minimizes the likelihood ratio with restriction to a finite parameter points in . First, we insert (β₯3) points into uniformly, denoted as with , . Next, we define the test statistic as . It can be checked that this test statistic not only preserves the martingale properties, but also inherits the merit of the GLR. As long as is not very large (e.g., ), its computation will be very simple. Thus, it has all the three features stated in Section 1. Since this maximization is restricted to some finite points, we refer to it as degenerate-generalized likelihood ratio (DGLR).
Based on the DGLR, we define a stopping rule for the problem (1.1) by
with the terminal decision rule
where , are two stopping boundaries. Hereafter, the sequential test method with (2.3) and (2.4) is called the degenerate-generalized likelihood ratio test (DGLRT). It has some theoretical properties which are stated as follows. These theoretical properties provide a guide to the design of the DGLRT, whose proofs are provided in the appendix.
Let
be the real error probabilities, where and represent the parameter subsets under and , respectively.
Proposition 2.2. Suppose for any positive integer and every triple . For the DGLRT defined by (2.3) and (2.4), one has for all and for all .
Remark 2.3. The assumption (2.6) given in Proposition 2.2 is not restrictive. This holds for the general one parameter exponential family and many others (cf. Robbins and Siegmund [21]).
Proposition 2.4. Suppose that there exists a constant such that for all and every triple . Under the assumptions stated in Proposition 2.2, one has for all .
Remark 2.5. For , we have
The last inequality follows from (2.6). is positive with probability 1 if . Heuristically, the requirement that the difference be greater than the constant for all amounts to assuming that the sequence of data cumulatively adds information about all the , which is generally true in sequential studies.
From Proposition 2.2, we conclude that the DGLRT satisfies the error constraints (1.2) if and . From Proposition 2.4, it is easy to find that we absolutely stop sampling after finite observations. These results imply that the DGLRT can be useful in a sequential study for testing the problem (1.1).
In the DGLRT (2.3) and (2.4), the value of the parameter should be large but finite. In practice, we suggest that (cf. Section 3). Regarding and , we can compute them by simulation. Proposition 2.2 shows and . Thus, we can search (, ) over with the real error probabilities being computed by simulations. One may consider a density grid searching on . But this is a time consuming job. To reduce the computation, we introduce an efficient approach as follows. In the first step, we can use bisection searching to find () such that . Then, fix to find () such that . Since and increase in and decrease in , we conclude that . Hence, we repeat the above step over . In this way, we generate a sequence of pairs . Following the above program, we have
It can be checked that these pairs converge to the exact stopping boundaries. In practice, we repeat the above process and stop at step if and . Here, % and . Computation involved in finding and is not difficult partly due to the rapid developments in information technology. For example, in the nonlinear time series model (2.2), setting , , , and , it requires 15 minutes to obtain the stopping boundaries and for the DGLRT based on 100,000 simulations, using Intel-Core i7-2.80βGHz CPU. Since this is a one-time computation before testing, it is convenient to accomplish.
3. Numerical Studies
In this section, we present some simulation results regarding the numerical performance of the proposed DGLRT. In the DGLRT, the parameter needs to be chosen. We first investigate the effect of on the performance of the DGLRT according to i.i.d. observations from the normal distribution . Setting and , we compare the DGLRTs with . The corresponding stopping boundaries are , , , and , respectively. The ESSs at (0.1) (i.e., takes values from β0.8 to 0.8 with step 0.1) are computed based on 100,000 simulated data and are provided in Table 1.
Because of the symmetry, we only include results for . Table 1 shows that the ESSs under a larger are smaller than those under a smaller if . Meanwhile, it can be seen that a smaller has a better performance outside . In order to assess the overall performance of the tests, we compute their relative mean index (RMI) values. The RMI is introduced by Han and Tsung [22] for comparing the performance of several control charts. It is defined as where is the total numbers of parameter points (i.e., βs) we considered, denotes the ESS at , and is the smallest one among all the three . So, can be considered as a relative difference of the given test, compared to the best test, at , and RMI is the average of all such difference values. By this index, a test with smaller RMI value is considered better in its overall performance. Since we focus on the performance over the parameter interval , , in this illustration. The resulting RMIs for the DGLRT under are 0.0116, 0.0042, 0.0017, and 0.0011, respectively, which shows that the DGLRT under a larger is more efficient than the one under a smaller . The improvement is minor when is large enough. Considering the complexity of computation, we select for practical purposes. From now on, the DGLRT is always the DGLRT under unless otherwise stated.
Next, we investigate the performance of the DGLRT in controlling the ESSs over . In the i.i.d. case, we know the 2-SPRT has a better performance in controlling the maximum ESS. For the ESSs over the neighborhoods of and , the SPRT provides a closely approximation. Based on extensive simulations, we conclude that these features still preserve in the dependent case. Therefore, the SPRT and the 2-SPRT are compared with the DGLRT in this paper. The following three cases are considered.
Case 1. Observations collected from normal distributions with mean and variance 1. Set and for the test problem (1.1).
Case 2. Observations collected from exponential distributions with mean . The problem (1.1) is set with , , and .
Case 3. Consider the test problem (1.1) for the simple nonlinear time series model (2.2) with , and .
In each case, the inserted point for the 2-SPRT is searched over . The stopping boundaries are also computed following the searching algorithm stated in Section 2. These stopping boundaries are listed in the order of the SPRT, 2-SPRT, and DGLRT: Case 1: , , and ; Case 2: , , and ; and Case 3: , , and . Figures 1β3 display the ESS curves over under the three tests for Cases 1β3 with the dashed line for the SPRT, the dotted line for the 2-SPRT, and the solid line for the DGLRT. Figure 1 shows that the DGLRT is comparable to the 2-SPRT in the middle of the parameter range and performs as well as the SPRT in the two tails. It implies that the DGLRT controls both the maximum ESS and the ESSs under and very well. The same conclusions can also be obtained from Figures 2 and 3. The RMIs for the SPRT, 2-SPRT, and DGLRT under the three cases are also computed. The results are listed in Table 2. It can be seen that the RMI for the DGLRT is the smallest one among the three tests under all three cases. Thus, the DGLRT performs the best, compared with the SPRT and the 2-SPRT over .
To illustrate the DGLRT, we apply it to a real manufacturing data (cf. Chou et al. [23]). A customer specifies an average breaking strength of a strapping tape as 200βpsi, and the standard deviation is 12βpsi. The data are the breaking strength of different strapping tapes, so the random errors mainly stem from the measurement errors. Thus, the observations can be assumed to be independent. The Shapiro and Wilk [24] test shows that the data are taken from a normal distribution. Consider the test problem (1.1) with and and standardize the observations by using a transformation , . Then the resulting test problem is equivalent to versus . Under , the corresponding stopping boundaries for the DGLRT are . Based on the first 20 real observations, we compute the test statistics of the DGLRT, which are displayed in Table 3. In Table 3, standardized indicates . Table 3 shows that increases in rapidly, while keeps constant for under the real data. Since crosses its stopping boundary at the 11th observation, we should accept the null hypothesis according to the terminal decision rule (2.4).
4. Concluding Remarks
In this paper, we have proposed the DGLRT test in cases where the conditional density function has an explicit form. It has been shown that the properties of the DGLRT can guarantee bounding two error probabilities. To make our method be more applicable, we further discuss the selection of the parameter and the searching algorithm for its stopping boundaries. From our numerical results, we conclude that the DGLRT has several merits: (1) in contrast to the SPRT, the DGLRT has much smaller ESS for in the middle of the parameter range and nearly has the same performance for outside the interval . It is not surprising that the 2-SPRT performs the best in minimizing the maximum ESS because it is designed to be optimal in the minimax sense. However, the relative difference of the maximum ESS between the DGLRT and the 2-SPRT is minor. Moreover, for outside , the ESSs of the DGLRT are much smaller than those of the 2-SPRT. That is to say, the DGLRT controls the maximum ESS and the ESSs under two hypotheses; (2) under the RMI criteria, the DGLRT performs more efficiently than the SPRT and the 2-SPRT over ; (3) its implementation is very simple.
While our focus in this paper is on methodological development, there are still some related questions unanswered yet. For instance, at this moment, we do not know how to determine the critical stopping boundaries for the DGLRT in an analytical way instead of the Monte Carlo method. Besides, our method controls the ESS in pointwise, so it can be used to construct control chart for detecting the small shifts. These questions will be addressed in our future research.
Appendix
Proof of Proposition 2.2. Let So, The last inequality follows from (2.6). Till now, we prove that the result for all . The other result can also be proven in a similar way.
Proof of Proposition 2.4. Since we insert (β₯3) points in , we can find a point which belongs to . Thus, there exists a such that . It implies that for . So,
Thus, we have the result that for all . In a similar way, we can obtain for all . Combining the two results, we complete this proof.
Acknowledgments
The authors cordially thank the editor and the anonymous referees for their valuable comments which lead to the improvement of this paper. This research was supported by grants from the National Natural Science Foundation of China (11101156 and 11001083).