Research Article | Open Access
Fuzzy Approach for Group Sequential Test
Buckley’s approach (Buckley (2004), (2005), (2006)) uses sets of confidence intervals by taking into consideration both of the uncertainty and impreciseness of concepts that produce triangular shaped fuzzy numbers for the estimator. This approach produces fuzzy test statistics and fuzzy critical values in hypothesis testing. In addition, the sample size is fixed for this test. When data comes sequentially, however, it is not suitable to study with a fixed sample size test. In such cases, sequential and group sequential tests are recommended. Unlike a sequential test, a group of sequential test provides substantial savings in sample and enables us to make decisions as early as possible. This intends paper to combine the benefits of group sequential test and Buckley's approach using -cuts. It attempts to show that using -cuts can be used within the group sequential tests. To illustrate the test more explicitly a numerical example is also given.
Estimation of unknown parameters of statistical models or testing of statistical hypothesis in fuzzy environments are interesting subjects for different approaches. So far Zadeh , Taheri and Behboodian [2, 3], Taheri , Torabi and Behboodian , and Taheri and Arefi  have worked on these issues. Using a different point of view, Buckley [7–9] developed an approach which uses a set of confidence intervals. In addition, many studies have been done to combine several statistical methods and fuzzy sets, called fuzzy statistics, such as regression analysis, time series analysis, design of experiments, probability theory, conjoint analysis, and control charts . Since statistical tests based on fuzzy test statistics are more flexible than classical tests, they seemed to be competitive tools in certain situations; for example, when the observed value of the test statistic is close to the quantile of the test statistic .
Fixed sample size test is not useful where subjects enter the study sequentially. And thus the accumulated data can be analyzed sequentially. Wald  introduced a sequential probability ratio test (SPRT) which requires substantially fewer observations than a fixed sample size test. Several authors such as Torabi and Behboodian  have proposed fuzzy sequential probability ratio test. Talukdar and Baruah  have fuzzified the SPRT. Torabi and Mirhosseini  introduced the SPRT for fuzzy hypotheses testing. Jamkhaneh and Gildeh  presented a new approach for SPRT based on fuzzy hypothesis.
SPRT has no finite maximum number of observation; it is generally considered inappropriate for clinical trials . Group sequential tests (GST) are generally more practical and they give more possible savings than SPRT when it is impractical to perform an interim analysis after each new observation . GST are widely used in clinical trials. For ethical, scientific, and economic reasons, clinical trials are often repeatedly monitored for evidence of treatment benefit or harm. To achieve this, statisticians conduct interm analysis periodically on accumulating data . Various group sequential testing procedures have been proposed to achieve the desired levels of type-I error [18–20]. Much of the development of GST are reviewed in detail by Jennison and Turnbull .
None of the studies mentioned above consider Buckley’s approach for group sequential tests [7–9]. In this study, our aim is to use a group sequential test using -cuts inspired by Buckley’s approach when response variable has a normal distribution with known variance.
This paper is organized as follows: some preliminaries for fuzzy numbers are presented in Section 2. Buckley’s approach using -cuts for hypothesis testing is briefly reviewed in Section 3. Then Pocock’s group sequential test and using Buckley’s approach within Pocock’s group sequential test for a normal response with known variance are given in Section 4. A numerical example is given in Section 5. Results and discussions are given in Section 6. Finally, concluding remarks and some possible future perspectives are presented in Section 7.
Definition 1. A fuzzy number is a fuzzy subset of the real line . Its membership function satisfies the following criteria : (i)cut set of is a closed interval,(ii) such that , and(iii)convexity such that for ,
where -cut set contains all elements that have a membership grade .
Definition 2. A triangular shaped fuzzy number () is a fuzzy number, whose membership function is defined by three values, , where the base of triangular is the interval and the vertex is .
Definition 3. The -cut of a fuzzy number is a nonfuzzy set defined as . Hence , where and .
Definition 4. Consider a random variable with probability density function , which is the normal probability density with unknown mean , a known variance . To estimate , a random sample from is obtained. Suppose that the mean of this random sample turns out to be , which is a crisp number. It is known that is ; therefore, is . So where is the value that the probability of a random variable exceeding it is ( is the type-I error). Then inequality is solved to produce that is given as follows: This leads directly to the 100% confidence interval for where and denotes the normal density with mean zero and unit variance. With putting these confidence intervals one on top of other, we obtain that is the fuzzy estimator of [7–9] whose -cuts are confidence intervals as for . Hence we obtain the fuzzy estimator of .
3. Hypothesis Testing Using -Cuts
Buckley’s approach [7–9] uses set of confidence intervals producing a triangular shaped fuzzy number for the estimator. Therefore this approach produces a fuzzy test statistic and fuzzy critical values in fuzzy hypothesis testing.
In this section the classical hypothesis test, based on fixed sample size of from mean and variance known, is given. The following hypothesis against is going to be tested at significance level .
From the random sample its mean is computed as and then the test statistic is determined as
Let , , denote the significance level of the test. Now under the null hypothesis , is and decision rule is reject if ; reject if , and do not reject when . In the above decision rule is the value so that the probability of a random variable having the probability density exceeding is ( is the type-I error rate) .
Each -cut is put one over the other in order to get a triangular fuzzy test statistic which is given in Figure 1.
Since the test statistics are fuzzy the critical values will also be fuzzy. There will be two fuzzy critical value sets: (1) let correspond to and (2) let go with . Set , . The end points of an -cut of are computed from the end points of the corresponding -cut of with the following equations: Hence under , is so By using the left end point of in (9), we have Hence -cuts of and are given by respectively [7, 8]. Both and are triangular shaped fuzzy numbers. Since the crisp test statistics has a normal distribution, because this density is symmetric with respect to zero [7–9]. The final decision depends on the relationship between and and reject (Figure 2(a)); reject (Figure 2(b)); do not reject Ho (Figure 2(c)); and or no decision (Figures 2(d) and 2(e)). These situations are explained in detail as follows. For example, if , draw to the right of , then find the height of the intersection as which measures how much is less than or equal to . Thus, if , where is some fixed fraction in (Figure 2(a)). Other situations are summarized in Figures 2(b), 2(c), 2(d), and 2(e). In this figure, the height of the intersection is and as Buckley [7–9] states. Now the results can be given as (1) if , then (Figure 2(a)) and (2) if then , (Figures 2(d) and 2(e)). Similar results hold for versus (Figures 2(b), 2(c), 2(d), and 2(e)).
(a) Reject if
(b) Reject if
(c) Accept if
(d) No decision if
(e) No decision if
It is interesting that after evaluating and , , if , (Figures 2(d) and 2(e)) then the final decision is “no decision" on . This is because of the fuzzy numbers that incorporate all uncertainty in confidence intervals [7–9]. Consequently, hypothesis testing based on fuzzy test statistic and fuzzy critical values that is described above is more realistic and provides more benefits when value of the test statistic is very near to the quantile of the test statistic.
4. Group Sequential Test Using -Cuts
In Pocock’s group sequential test, subject entry is divided into equally sized groups containing subject on each treatment and the data are analyzed after each new group . Consider the response variables to be normal with unknown means and with a common variance . Subjects are randomized sequentially into two treatment groups.
The paper is planned as a test of the null hypothesis against the two sided alternative . Let and be the observed mean responses in the th group of subjects, then the statistics, are normally distributed with mean and unit variance, where . Therefore test statistic is defined with and , where , under , , is a partial sum process of independent identically distributed (i.i.d.) standard normal random variables , . Under , is again partial sum of i.i.d. normal random variables, , , . Critical values of Pococks’s GST () are given in Table 1 for ; , and . Then GST process is as follows:(1)after group (2)after group
The value of noncentrality parameter, , can be determined to achieve a given value of . Corresponding to a specific under to be detected, the required sample size per treatment per stage, , at given and with a maximum of stages, is obtained as The maximum sample size, , to find for maximum of stages is simply
Group sequential tests can also be used in an experiment with only one treatment in which the response results are compared with a known standard. For a normal response with a known variance and hypothesis mean , the critical values and become [17, 22]. When , that is, for fixed sample size test, (17) becomes the classical hypothesis test for normal response.
The test statistic defined with (14) can also be given as follows: Suppose that and , where . Therefore the mean of group of subject can be defined as . Now proceed to the fuzzy situation of that is explained for each group of subject in (5) as triangular shaped fuzzy number is defined as
Substituting the -cuts of into (19) and using interval arithmetic -cuts of the fuzzy test statistics are obtained as follows:
In the sense of this fuzzification fuzzy group sequential test statistic, can be defined as
The defined test statistic sample size fuzzy test statistic for which is given with (7). Afterwards, -cuts of Pococks’s fuzzy critical value (, ) can be calculated. The following equations include calculations for and , respectively: Hence By using the left end point of given with (21) it is possible to have As a result we obtain -cuts of with (27) In the above equation for , is fixed and ranges in the interval . Now so Both and are triangular shaped fuzzy numbers. The final decision depends on the relationship between and and . Therefore fuzzy group sequential test process is as follows:(1)after group (2)after group
These situations are explained in detail in Figure 3. The final decision depends on the relationship between and and reject (Figure 3(a)); reject (Figure 3(b)); do not reject (Figure 3(c)); and or no decision (Figures 3(d) and 3(e)). We take the height of the intersection , which measures how much , is less than, bigger than, or equal to . The advantage of -cuts approach to GST is that, instead of generating and processing a single confidence interval, all the confidence intervals at the same time are calculated in the process of corresponding fuzzy test statistic. Therefore in this study we showed that this advantage is also valid for the process of group sequential tests using -cuts.
(a) Reject if
(b) Reject if
(c) Accept if
(d) No decision if
(e) No decision if
5. Numerical Example
As an illustration we consider a real data in McCleve and Sincich  on page 381 given with Table 2. The content of data is as follows: a new developed diet, that is, low in fats, carbonhydrates, and cholesterol, is intended to be used by people with heart disease. Furthermore the dietitian wishes to examine the effect that this diet has on the weights of obese people. Hence this data set concerns two groups which are called low fat diet (A) and regular diet (B), respectively.
We want to test with , , and at ; sample size which is calculated from (17) is per treatment per stage with maximum sample size . The fuzzy group sequential test statistics , , and and fuzzy Pocock’s critical values , are given in Table 3.
The test statistic and critical value that is calculated in each stage is recalculated using Buckley’s approach. Hence each -cut value is obtained for test statistic and critical value (for , and ). -cuts, base and peak values of , , and , and critical values that are calculated in each stage are detailed in Table 4. Thus, as suggested in Buckley’s approach, not only one value but also more than one confidence interval is used, in order to test hypothesis, so that more information is included in group sequential test process. The results are obtained using the Maple 9 .