Abstract

This paper proposes a distribution-free (or nonparametric) control scheme to monitor a process output that contains two special causes of variation called “block or batch” effects and “treatment or position” effects. The scheme properties (control limits, false alarm rate, and in-control average run length) stay the same under any assumed continuous probability distribution. For moderate sample sizes, these properties can be computed exactly from available tables without the need to estimate the mean or variance of the process. The proposed monitoring scheme requires ranking the observations within blocks and using the method of analysis of means by ranks. The paper includes an illustrative example concerning the grinding process of silicon wafers used in integrated circuits production.

1. Introduction

In statistical process control, there are instances in which the process output contains block effects component in addition to the treatment effects component that is to be controlled. In manufacturing integrated circuits on silicon wafers, Roes and Does ([1, Table 1]) reported data bearing two effects: the batch (or block) effect and the position (or treatment) effect of the wafer under the grinder. To take account of both effects, they proposed a Shewhart-type control chart that is based on a two-way analysis of variance (ANOVA) model for controlling the process treatment mean and for controlling certain linear contrasts of the wafer positions. In manufacturing paper or plastic films, Palm and DeAmico [2] reported that the raw manufacturing material was formed into a continuous sheet (web) which would then be wound up into rolls at the end of the production line. Because of nonuniformities in the product along the cross-direction perpendicular to the direction of the travel of the web, samples were taken from each roll at different cross-directional positions (e.g., the middle, the front or operator side, and the back or motor side.) Palm and DeAmico [2] modeled the data as a two-way ANOVA setup where rolls served as blocks and cross-directional positions served as treatments. To account for the block effects, they suggested performing periodic cross-directional studies that are based on the analysis of main effects (ANOME) when monitoring the silicon oxide on computer chips, Yashchin [3] reported that from each lot, a sample of wafers is selected and then several measurements are made on each of the selected wafers. Based on a nested random effects model, he developed a cumulative sum (CUSUM) control chart for monitoring the process variance components. Earlier, Woodall and Thomas [4] considered monitoring variance components in a nested random effects model situation.

All the above monitoring schemes are distribution based (or parametric) that assume a normal distribution for the process output. The normality assumption, however, may not be valid for some manufacturing data; citations are given in Bakir [5], Chakraborti and Graham [6], and Chakraborti et al. [7].

The purpose of this paper is to develop a distribution-free (or nonparametric) control scheme for monitoring a process output that is under the influence of two special causes of variation called “block or batch” effects and “treatment or position” effects. While accounting for the block component, the proposed scheme monitors and detects differences in the treatment components of the process output. Our approach involves ranking the raw measurements of the process output within blocks and then using the method of analysis of means by ranks (ANOMR) as developed in Bakir [8]. The proposed scheme has the following merits.(a)The scheme takes account of the variability in the data that is due to block effects.(b)The scheme is distribution-free because its properties (control limits, false alarm rate, and in-control average run length) stay the same under any continuous probability distribution. (c)For moderate sample sizes, the scheme properties can be computed exactly without the need to estimate the mean or the standard deviation of the process. For large sample sizes, large sample approximations are available.

2. A Nonparametric Monitoring Scheme

The purpose of the monitoring scheme is to monitor a process output that contains two special causes of variation called “block or batch” effects and “treatment or position” effects. Examples of such output were cited in the Introduction.

2.1. Sampling Plan

At each time instance we take a sample of blocks (e.g., rolls of plastic film or batches), then we make measurements (e.g., at fixed positions on the roll or the batch) on the process output within each of the blocks. Thus, a total of observations is gathered at each sampling time instance .

2.2. Model and Assumptions

For such data, an appropriate model is the following two-way ANOVA model: In model (1), at each sampling instance , we have that is the measurement on the jth treatment in the ith block,   is an unknown constant representing the process overall mean, is an unknown constant representing the effect of the th block nested within time, is an unknown constant representing the fixed effect of the th treatment, and   is a random variable having a continuous probability distribution with zero median; it represents common cause variation in the ()th observation. The are assumed mutually independent.

2.3. Quality Characteristic to Be Monitored

The purpose of the monitoring scheme is to detect differences among the treatment components while accounting for the block effects. This translates to the process being in control if the hypothesis, (called the in-control hypothesis), holds true for all The out-of-control hypothesis is

2.4. Charting Statistic

At each sampling instance and for , let denote the rank of within the th block . Formally, where is the indicator function.

Calculate the treatment rank totals It can be verified that under the in-control hypothesis in (2), the rank totals, , have mean and variance given by Define the treatment rank deviations We will use the ’s as charting statistics for a follow-up monitoring scheme of the process. Define the maximum absolute rank deviations We will use as a charting statistic for an initial monitoring scheme.

2.5. The In-Control Probability Distribution of the Charting Statistic,

At each sampling instance and under the in-control hypothesis in (2), the k! possible rank configurations are equally likely within each of the blocks. Since the blocks are independent, there are equally likely rank configurations at each sampling time instance. Using these facts, Bakir ([8, Table A.1]) tabulates the exact in-control distribution of for the cases when and ; and ; and when and . Bakir’s table gives the values for and satisfying is a nonparametric test statistic because its distribution (under ) is the same for any parent continuous process distribution. The large sample in-control distribution of is discussed in Bakir [8] and its upper percentile points are given in Bakir ([9, Table IV]).

2.6. Signaling Rule and Control Limits
2.6.1. Initial Monitoring

First, we start an initial monitoring scheme by plotting the maximum absolute rank deviation, , against time . For a desired prechosen probability of false signal, , the scheme signals at the first sampling time instance for which The initial monitoring scheme has one control limit given by where is defined in (9).

A signal by this initial monitoring only indicates an out-of-control condition; it does not, however, pinpoints the treatment components that triggered the signal. To achieve this objective, we need to perform a follow-up monitoring.

2.6.2. Follow-Up Monitoring

In the follow-up monitoring scheme we plot the individual rank deviations , in (7), across time. A signal is given at the first time for which The upper and lower control limits for this follow-up monitoring scheme, respectively, are The signaling rules in (10) and (12) are equivalent in the sense that one signals if and only if the other signals too. For large values of n, the value of may be replaced in all the formulae that appear in this section by , where are the upper percentile points of the large sample distribution of given in Bakir ([9, Table IV.])

3. The Average Run Length

Let be a discrete random variable denoting the time (number of sampling time instances) required until the monitoring scheme’s first signal; that is, when .   is called the run length of the scheme and its expected value, , is called the average run length (ARL). Since the random variables are independent, follows a geometric distribution with parameter (probability of a signal), : By properties of the geometric distribution, we get

The monitoring scheme is nonparametric because its in-control (under ) run length distribution is geometric with the same parameter for any parent continuous process distribution. When the process is in control, let ARL0 denote ARL and let FAR denote the false alarm rate (probability of a false signal). Then It is clear that and FAR do not depend on the functional form of the probability distribution of the process. Table 1 gives the exact values of FAR, , and the corresponding UCL for some values of and . Table 1 was generated by using results in Bakir ([8, Table A.1]). For large values of , one may use the large sample distribution of to compute .

4. Illustrative Example

We apply the proposed monitoring scheme to data relating to the grinding process of silicon wafers used in integrated circuits production. The wafers are positioned in a circular arrangement under a grindstone to achieve a certain desired thickness. Thickness measurements () are taken on wafers seated on five fixed positions (labeled P1, P2, P3, P4, and P5) under the grinder. The P1 and P2 positions fall on the outer circle, P3 and P4 in the middle, and P5 in the inner circle. Thickness data of 30 successive batches of wafers are given in Roes and Does ([1, Table 1]). To apply our scheme, we break the data into ten time (sampling) instances with three batches (blocks) at each time. Measurements are made at the five positions (treatments) within each block.

Formulas in (6) give the following values for the mean and variance of the position (treatment) rank totals, : Table 2 displays the thickness data, assignments of the appropriate ranks, and further computations of the rank totals for the five positions (P1, P2, …, P5). Using Table 1 with , and FAR of (equivalent to ), we read the upper control limit: . Figure 1 displays the resulting chart for the initial monitoring scheme and Figures 2 and 3 show the resulting charts for the follow-up monitoring scheme.

Figure 1 shows that, except for the fifth sampling time instance, the successive values of the charting statistic, , fall right on the upper control limit. Thus, the initial monitoring scheme triggers an out-of-control signal indicating significant variation among the wafers' positions. However, Figure 1 does not pinpoint which position or positions caused the signal. We proceed now to perform the follow-up monitoring scheme as shown in Figures 2 and 3.

Figure 2 is a multiline chart showing the individual position profiles across time. This follow-up scheme signals at each time except for the fifth when all positions become within the control limits. Moreover, Figure 2 pinpoints the positions that caused the signal; namely, P1 and P4. Positions 2, 3, and 5 are within the control limits at all sampling times.

The follow-up scheme in Figure 3 is a single-line chart showing the individual position profiles across time. Figure 3 leads to the same conclusions as in Figure 2. Charting the follow-up monitoring scheme by either Figure 2 or Figure 3 is a matter of choice by the practitioner. It is interesting to note that Figure 3 resembles the chart of the famous Red Bead Experiment of Deming ([10], Figure 38 page 347). In the Red Bead Experiment, the numbers of defectives produced by each one of six workers are plotted over four days. The workers correspond to the wafer positions in our current example.

5. Concluding Remarks

In this paper, we propose a distribution-free (or nonparametric) scheme for monitoring treatment components while accounting for block effects in a process output. The procedure is based on a nonparametric version of the analysis of means method. Being distribution-free, the FAR and the , of the procedure stay the same under any assumed continuous distribution for the process. For small sample sizes and small number of treatments, the control limits, FAR and , can be computed exactly from available tables without the need to estimate the mean or variance of the process. The paper includes an example concerning the grinding process of silicon wafers used in integrated circuits production.