Abstract

Classical statistics and many data mining methods rely on “statistical significance” as a sole criterion for evaluating alternative hypotheses. It is very useful to find out the significant difference existing between the samples as well as the population or between two samples. But in this paper, the researchers try to apply the concepts of fuzzy group testing of hypothesis problem between multi group of samples of same size or different, through comparing the parameters like mean, standard deviation, and so forth. Hence we can compare multigroups such that they have the significant difference in their mean or standard deviation or other parameters through the fuzzy group testing of multihypotheses. The authors introduced and investigated the concepts very first time through fuzzy analysis that can decide which group(s) or samples can be taken for further investigation and either is rejected or accepted and hence the next discussion provides the properties of group of samples which may result in the optimized solution for the problem.

1. Introduction

The fuzzy technique can be used to generate solutions to the problems based on “vague, ambiguous, qualitative, incomplete, or imprecise information.” Fuzzy logic is an extension of fuzzy set theory that has developed since five decades approximately. The problem of dealing with imprecision and uncertainty is a part of the wider human experiences before the introduction of fuzzy theory.

In the traditional approach for hypotheses testing, all of the concepts are assumed to be precise and well-defined. But sometimes we have to take decision in an unrealistic manner, because in realistic problems we may come across fuzzy data and fuzzy hypotheses. Sometimes the decisions on samples will not help us to proceed. Arnold [1] studied on fuzzy hypotheses testing with crisp data. The problem of testing fuzzy hypotheses when the observations were crisp was considered by Taheri and Behboodian [2]. Torabi et al. [3] used Neyman-Pearson Lemma for fuzzy hypotheses by testing with vague data. Chachi et al. [4] also studied on the same problem in the context of fuzzy decision problems. Viertl [5] studied testing fuzzy hypotheses using fuzzy data based on fuzzy test statistic.

We know that the analysis of variance (ANOVA) is a powerful statistical tool for test of significance between two or more samples. It is used in a situation where three or more samples have been considered at a time, towards the testing of hypothesis that all the samples are drawn from the same population; otherwise their parameters have the significant difference. The purpose of this analysis is to test the homogeneity of several means. This analysis found to test the null hypothesis is that all the means are equal against the alternative hypotheses that some of them are not equal. So the variance ratio which is used for the comparison in the following designs:(1)completely randomized design (CRD, ANOVA for one-way classification);(2)randomized block design (RBD, ANOVA for two-way classification);(3)latin square design (LSD, ANOVA for three-way classification).

But these designs suffer by the disadvantage of being inherently less informative than other more sophisticated layouts. This makes the design less efficient and results in less sensitivity in detecting significant effects. For example, if null hypothesis is rejected then it concludes that the treatments differ significantly but it is not clear that which samples could be considered for the further investigation also which are inconsiderable in the future analysis. So we take this for our consideration, and we assign a method which would be very useful to test the null hypothesis versus multialternative hypotheses:

In the classical theory of statistical inference there is a one-to-one relationship between the parameter values for which the null hypothesis is accepted and the structure of the confidence intervals. Namely, a family of -level acceptance regions for a statistical test concerning the parameter is equivalent to a certain family of confidence intervals for . The results of a statistical test, therefore, can alternatively be stated in terms of the corresponding confidence interval. By developing the concepts of fuzzy confidence intervals and fuzzy statistical tests and using the one-to-one relationship between tests and confidence intervals may be convenient to generalize a correspondence between fuzzy confidence interval and fuzzy statistical test. In such a case, we have, for example, the degree of membership of the null hypothesized fuzzy parameter in the fuzzy confidence interval as the degree of acceptability of the null hypothesis. In the following, we use such a relationship in fuzzy environment to test the hypothesis corresponding to the parameters of the multisamples simultaneously via fuzzy group testing of hypothesis by fixing the level of confidence values for different fuzzy hypotheses. In this test two or more numbers of samples can be taken for our consideration and simultaneously we tried here to check the hypothesis whether all the samples coincide with the population parameter or some of the samples have much deviated from the population.

2. Fuzzy Group Testing of Hypothesis with Confidence Intervals

2.1. The Fuzzy Group Environment

Definition 1 (see [6]). Let be any nonempty set. A fuzzy subset μ of is a function .

Definition 2 (see [7]). A fuzzy set μ on is called fuzzy subgroup of if for ,(i),(ii).

Definition 3 (see [8]). Let be a finite group. In , a nonempty set is called a HX group on , if is a group with respect to the algebraic operation defined by and , in which its unit element is denoted by .

Definition 4 (see [8]). Let be a fuzzy subset defined on . Let be a HX group on . A fuzzy set defined on is said to be a fuzzy subgroup induced by on or a fuzzy HX subgroup on , if, for any ,(i),(ii),where .

Assumptions. Let be a probability space (where is a set of all possible outcomes of an experiment), (collection of subsets of such that or is collection of ), and is a set of probability measure. is the collection of where is a fuzzy valued function and is a random sample having distribution with parameter , for all .

The fuzzy valued function is a fuzzy random variable if and are two real valued random variables for all where .

Fuzzy random variables and are identical if and are identical for all . If for all for some , then is known as identity element of the group . If all are elements of the HX group , then the comparison may vary between the different collection of and the identity element of be which consisting the identity element “” of . Also it is very clear that for all . The element “” is called reaching height element (RHT) of each group of samples.

Definition 5. Here let us consider , and be the two samples contain equal number of elements, considered for group testing of hypothesis. We say that it is a fuzzy random sample of size “” from , if ’s are independent and identically distributed fuzzy random variables from where . It is very clear that from the definition of independent and identically distributed (i.i.d) fuzzy random variable (FRV), for fuzzy random sample, for , we can see that and for all .

2.2. Group Testing of Hypotheses: Classical Approach

Consider the problem of testing the null hypothesis versus multialternative hypotheses: For each , let () denote the acceptance region of a level α-group testing of hypothesis and , where or 1 or 2 or .

If then and hence We can define the area of level of significance for different hypotheses as follows: The indicator function is defined as

So, a confidence set can be viewed as a statement about testing hypothesis , which exhibits the values for which the hypothesis is completely accepted if

The above said value of the test is a rule stating that the null hypothesis cannot be rejected if the interval contains the hypothesized value and can be rejected if the other intervals completely contain the hypothesized value of the given problem. So here directly we can have the chance for making the conclusion about the multihypotheses using the hypothesized value and the confidence interval of respective hypothesis.

Example 6. Let and be from normal distribution with the unknown means and , respectively. A confidence interval at confidence level , where or 1 or 2 or for of the form This can be derived easily, where is the -quartile of standard normal distribution.
Assume the two random samples with sizes , , and are observed and we want to test versus the other alternative hypotheses , , and at level . Then we can have where and , 1, 2, and .
Hence the graphical representations of ’s are given in Figure 1.

Consider the approximate test statistic where = population mean (parameter), = sample means, = population variance, = variances of samples, and = sample sizes.

Hence we have

In this case, based on the observed value of and , we accept the null hypotheses at level of but the rejection of null hypothesis does not mean as accepted.

2.3. The Proposal in Fuzzy Environment

In this section, we investigate a procedure to provide a fuzzy test for group testing of a fuzzy parameter for fuzzy data which is based on fuzzy confidence intervals. In order to derive the degrees of acceptability of the null and alternative hypotheses, we introduce the following method in this test. It is very clear that the data available are observations of a normal fuzzy random sample with the unknown fuzzy mean and known variance .

That is, and . We restrict our attention only for two sided alternative hypotheses.

(1) First we transform the original problem into a set of crisp group testing problems concerning -levels of the fuzzy parameter. For each level, based on the samples, The following classical group testing problems are solved at level :

We obtain the confidence intervals for the crisp parameters for each , denoted by , and , , respectively. The test functions are defined as follows:

Let us consider two brands of sizes 21 to be compared with the population such that their average life differs or coincides. Consider the average life time of the population to be around 20,000 working life time with the variance 4,00,000 represented by a triangular fuzzy number . We consider the physical problem of testing the hypothesis at level , using fuzzy group testing of hypotheses. We have Hence, the two sided 0.95 confidence intervals for the and are(i) and ;(ii) and .

Similarly, we can also calculate the membership values of the variables and . Then the membership value of fuzzy confidence interval is defined as , .

Here the values of , , , and are calculated by using the formula given in Appendix A.

The degree of membership of in two sided confidence interval is Therefore we have the confidence interval for accepting as (Figure 2).

Let , where and .(1)If (say) lies inside , then briefly the function of acceptance of is (2)Hence we can calculate the membership values of and by using where and .Consider and and the rejection of may result as (3)If lies within the interval of and which lies partially outside of , then the first group can be accepted and the second can be rejected with the help of fuzzy membership value of . Hence is accepted (Figure 3).(4)If lies outside the interval of and lies inside , then the first sample can be rejected and the second may be accepted. Hence is partially accepted (Figure 4).(5)If and both partially lie inside, then it is very clear that both the samples have significant difference by comparing the population parameter. Hence can be chosen (Figure 5).

3. Conclusion

This analysis can be extended for multi groups (more than two) and we can check the chosen groups are coinciding with the population parameter or within the groups. So in a single attempt, we can decide the suitable samples for our further investigation according to the necessity, instead of rejection of null hypothesis. This is an important advantage of this method, so we simultaneously can make the decision that would be better to improve the property analyze of the variables in future. Classical approach of a statistical problem will not be very precise sometimes and it will not help us to make better decisions about the properties of the variables. Many of the methods like this were already introduced by mathematicians to solve problems using concepts of fuzzy. So we try to find one of the solutions for the testing of hypotheses problem through fuzzy group testing of hypotheses which may help us in this aspect and also it may be very useful to make the broad decisions upon the groups of samples.

So through this discussion, we may get the following conclusions.(1)This methodology can be extended to compare more than two groups; meanwhile multialternative hypotheses can be discussed and further property analysis of the variables can also be very effective for future discussion.(2)This can be extended in future for collection of multi-subgroups of a special group called HX group. For example, number of groups (systems) can be developed by choosing different collection of elements of a single group which can be analyzed by using fuzzy group testing method to identify the better performing system(s) to complete the task in the successful manner as well as to consume time. This may be very useful to induce the performance level to the expected height. These are the expected advantages of this research work in future.

Appendices

A. Estimation of Primary Statistics

Consider

Note. We considered 2 groups to be compared; hence we need 20 formulae to conclude. Similarly if we consider “” groups for our test, then we need “” formulae to test the hypotheses.

B. Tentative Intervals of and

See Table 1.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.