About this Journal Submit a Manuscript Table of Contents
ISRN Computational Mathematics
Volume 2012 (2012), Article ID 340415, 12 pages
doi:10.5402/2012/340415
Research Article

On Estimating the Linear-by-Linear Parameter for Ordinal Log-Linear Models: A Computational Study

1School of Mathematical and Physical Sciences, University of Newcastle, Callaghan, NSW 2308, Australia
2School of Veterinary Medicine, University of California, Davis, Davis, CA 95616, USA

Received 17 January 2012; Accepted 5 March 2012

Academic Editors: T. Allahviranloo, H. J. Ruskin, P. B. Vasconcelos, and Q.-W. Wang

Copyright © 2012 Eric J. Beh and Thomas B. Farver. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Estimating linear-by-linear association has long been an important topic in the analysis of contingency tables. For ordinal variables, log-linear models may be used to detect the strength and magnitude of the association between such variables, and iterative procedures are traditionally used. Recently, studies have shown, by way of example, three non-iterative techniques can be used to quickly and accurately estimate the parameter. This paper provides a computational study of these procedures, and the results show that they are extremely accurate when compared with estimates obtained using Newton’s unidimensional method.

1. Introduction

During the 1970s and 1980s parameter estimation procedures for ordinal log-linear and ordinal association models gained considerable attention in the literature. For example, Haberman [1], Goodman [2, 3], Agresti [4], and Gilula and Haberman [5] all considered various approaches to obtain the maximum likelihood estimate (MLE) of the parameters of such models. Recently there has been a renewed interest paid to estimation methods for these ordinal models. Galindo-Garre and Vermunt [6] considered two adaptations of Goodman’s [2] approach while Ait-Sidi-Allal et al. [7] considered an iterative procedure based on Fisher’s scoring algorithm. These approaches can involve computationally intensive procedures and are therefore subject to a number of problems including the poor choice of starting values, or the slow (or even lack of) convergence of the algorithm.

A different approach to estimating parameters from ordinal log-linear (or association) models is to consider the direct, iterative-free, estimation procedure originally proposed by Beh and Davy [8] and recently extended by Beh and Farver [9]. The latter authors presented three noniterative estimation procedures for estimating the association parameter for such models and empirically demonstrated (by considering 14 commonly analysed contingency tables of varying dimension and sample size) that these noniterative procedures can provide estimates that are statistically indistinguishable when compared with the estimate obtained using Newton’s unidimensional method.

By performing a computational study, this paper will explore the accuracy and reliability of Newton’s method and four noniterative procedures on randomly generated contingency tables with a fixed association parameter. Such a study will be made by first briefly describing the ordinal log-linear model (Section 2) and Newton’s method (Section 3.1). A brief mathematical description of the non-iterative techniques of Beh and Farver [9] will be made in Section 3.2. The calculation of the standard error of these estimates is considered in Section 3.3. Section 4.1 describes the mechanism by which contingency tables of varying dimension and association parameter are generated, while the results of the study are discussed in Section 4.2. Section 5 provides an overview of these findings.

2. The Ordinal Log-Linear Model

Suppose we consider an 𝐼 × 𝐽 two-way contingency table, 𝑁 , where the ( 𝑖 , 𝑗 ) th cell entry is given by 𝑛 𝑖 𝑗 for 𝑖 = 1 , , 𝐼 and 𝑗 = 1 , , 𝐽 . Let the grand total of 𝑁 be 𝑛 , and let the matrix of joint relative frequencies be 𝑃 , so that the ( 𝑖 , 𝑗 ) th cell entry is 𝑝 𝑖 𝑗 = 𝑛 𝑖 𝑗 / 𝑛 and 𝐼 𝑖 = 1 𝐽 𝑗 = 1 𝑝 𝑖 𝑗 = 1 . Define the 𝑖 th row marginal proportion by 𝑝 𝑖 = 𝐽 𝑗 = 1 𝑝 𝑖 𝑗 , and define the 𝑗 th column marginal proportion by 𝑝 𝑗 = 𝐼 𝑖 = 1 𝑝 𝑖 𝑗 . When the row and column variables both consist of ordered categories, the logarithm of the ( 𝑖 , 𝑗 ) th cell may be reconstituted by considering the following ordinal log-linear model: l n 𝑚 𝑖 𝑗 = 𝜇 + 𝛼 𝑖 + 𝛽 𝑗 𝑢 + 𝜙 𝑖 𝑢 𝑣 𝑗 𝑣 , ( 1 ) where 𝑚 𝑖 𝑗 is the expected cell frequency of the ( 𝑖 , 𝑗 ) th cell. For this model 𝜇 is the grand mean and 𝛼 𝑖 and 𝛽 𝑗 are the main effects associated with the 𝑖 th row category and 𝑗 th column category, respectively, and are constrained so that 𝐼 𝑖 = 1 𝛼 𝑖 = 𝐽 𝑗 = 1 𝛽 𝑗 = 0 . These parameters may be estimated such that 1 𝜇 = l n ( 𝑛 ) + 𝐼 𝐼 𝑖 = 1 𝑝 l n 𝑖 + 1 𝐽 𝐽 𝑗 = 1 𝑝 l n 𝑗 , 𝛼 𝑖 𝑝 = l n 𝑖 1 𝐼 𝐼 𝑖 = 1 𝑝 l n 𝑖 , ̂ 𝛽 𝑗 𝑝 = l n 𝑗 1 𝐽 𝐽 𝑗 = 1 𝑝 l n 𝑗 , ( 2 )

see Beh and Davy [8]. Model (1) requires the specification of 𝑢 𝑖 which is the score associated with the 𝑖 th row category. When the row categories are ordered, these scores can be arranged in ascending, or descending, order depending on the structure of the row variable. Similarly, 𝑣 𝑗 is the score associated with the 𝑗 th column category for an ordinal column variable; the scores are arranged in ascending (or descending) order. For (1), 𝑢 and 𝑣 are the weighted means of 𝑢 𝑖 and 𝑣 𝑗 , respectively, where the weights are the marginal relative proportions.

The relationship between the two-ordered categorical variables may be determined by estimating the parameter 𝜙 . This parameter is the measure of linear-by-linear association, and its interpretation is aided by considering that the log-odds ratio measure is 𝑚 l n 𝑖 𝑗 𝑚 𝑖 + 1 , 𝑗 + 1 𝑚 𝑖 , 𝑗 + 1 𝑚 𝑖 + 1 , 𝑗 𝑢 = 𝜙 𝑖 + 1 𝑢 𝑖 𝑣 𝑗 + 1 𝑣 𝑗 . ( 3 )

Therefore, if equidistant scores are chosen to reflect the ordered structure of the variables so that 𝑢 2 𝑢 1 = = 𝑢 𝐼 𝑢 𝐼 1 = 1 and 𝑣 2 𝑣 1 = = 𝑣 𝐽 𝑣 𝐽 1 = 1 , then 𝜙 is the common log-odds ratio of the contingency table.

If one considers the log-linear model (1), then the ( 𝑖 , 𝑗 ) th cell can be reconstituted by considering the association model 𝑚 𝑖 𝑗 = 𝛼 𝑖 ̃ 𝛽 𝑗 𝜙 𝑢 e x p 𝑖 𝑢 𝑣 𝑗 𝑣 , ( 4 ) where 𝛼 𝑖 and ̃ 𝛽 𝑗 are subject to the constraint 𝐼 𝑖 = 1 𝛼 𝑖 = 𝐽 𝑗 = 1 ̃ 𝛽 𝑗 = 1 and are treated here as nuisance parameters. The estimation of 𝜙 can be carried out by using Newton’s unidimensional method considered by Goodman [2] if only one set of scores is fixed. One may also refer to Haberman [1], Agresti ([4], Appendix B), Nelder and Wedderburn [10], and Galindo-Garre and Vermunt [6] for iterative algorithms. Recently, non-iterative approaches have been proposed for estimating 𝜙 . Beh and Davy [8] originally considered such an approach for estimating 𝜙 . This work has since been expanded upon by Beh and Farver [9] who proposed another three non-iterative procedures for estimating this parameter and discussed the calculation of the corresponding standard errors. By performing an empirical study, they demonstrated that such non-iterative procedures estimate 𝜙 with as much precision as Newton’s method. Such an approach allows one to easily, and directly, estimate the linear-by-linear parameter without resorting to iterative techniques. This paper continues the study undertaken by Beh and Farver [9] by providing an evaluation of the performance and accuracy of Newton’s method, and the four non-iterative procedures, for estimating the parameter 𝜙 from model (1).

3. On Estimating 𝜙

3.1. Newton’s Unidimensional Method

There are a number of iterative procedures that may be used for estimating 𝜙 from the log-linear model (1). These include the iterative proportional fitting algorithm as described by Agresti ([4], pp. 64-66) and Nelder and Wedderburn’s [10] quasi-likelihood approach. In this paper we will consider Newton’s unidimensional method due to its relative simplicity and popularity. A brief description of Newton’s method for estimating 𝜙 will be provided here although Beh and Farver [9] described it in more detail.

Suppose that each of the cell frequencies of a contingency table 𝑁 , 𝑛 𝑖 𝑗 , are independent Poisson random variables. Also denote the expected value of 𝑛 𝑖 𝑗 (under some model) to be 𝑚 𝑖 𝑗 . In this paper the expected value of 𝑛 𝑖 𝑗 is determined by considering (4). The ( 𝑡 + 1 ) th iteration of Newton’s method involves updating the MLE of 𝜙 , denoted as 𝜙 𝑁 ( 𝑡 + 1 ) , using the estimate obtained from the 𝑡 th iteration such that 𝜙 𝑁 ( 𝑡 + 1 ) = 𝜙 𝑁 ( 𝑡 ) + 𝐼 𝑖 = 1 𝐽 𝑗 = 1 𝑢 𝑖 𝑢 𝑣 𝑗 𝑣 𝑛 𝑖 𝑗 𝑚 ( 𝑡 ) 𝑖 𝑗 𝐼 𝑖 = 1 𝐽 𝑗 = 1 𝑢 𝑖 𝑢 2 𝑣 𝑗 𝑣 2 𝑚 ( 𝑡 ) 𝑖 𝑗 ( 5 ) until convergence is reached. This is achieved when | 𝜙 𝑁 ( 𝑡 + 1 ) 𝜙 𝑁 ( 𝑡 ) | < 𝜀 , for some reasonably small value of 𝜀 . Here 𝑚 ( 𝑡 ) 𝑖 𝑗 is just (4) with 𝜙 𝑁 ( 𝑡 ) in place of 𝜙 . When this stopping criterion is satisfied, the MLE of 𝜙 using Newton’s method, 𝜙 𝑁 , is the final iterative estimate 𝜙 𝑁 ( 𝑡 + 1 ) . This method is adopted in this paper for obtaining the MLE of 𝜙 where 𝜀 = 0 . 0 0 0 1 . To initiate Newton’s method, a starting value of 𝜙 𝑁 ( 0 ) = 0 is used which coincides with independence between the two categorical variables. The standard error of the MLE of 𝜙 using the iterative procedure described by (5) is 𝜙 S E 𝑁 = 1 𝐼 𝑖 = 1 𝐽 𝑗 = 1 𝑢 𝑖 𝑢 2 𝑣 𝑗 𝑣 2 𝑚 𝑖 𝑗 , ( 6 ) where 𝑚 𝑖 𝑗 is the estimated value of 𝑛 𝑖 𝑗 given 𝜙 𝑁 . There are some adaptations of Newton’s unidimensional method that can be included to entice it to converge faster, be more accurate, or both. Beh and Farver [9] consider some possible ways of doing this.

3.2. Noniterative Procedures

The computational study considered in this paper evaluates four non-iterative procedures for estimating 𝜙 . The first approach is 𝜙 L o g N I = 1 𝜎 𝐼 𝜎 𝐽 𝐼 𝐽 𝑖 = 1 𝑗 = 1 𝑝 𝑖 𝑝 𝑗 𝑢 𝑖 𝑢 𝑣 𝑗 𝑣 𝑝 l n 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 , ( 7 ) where 𝜎 2 𝐼 = 𝐼 𝑖 = 1 𝑢 2 𝑖 𝑝 𝑖 ( 𝐼 𝑖 = 1 𝑢 𝑖 𝑝 𝑖 ) 2 and 𝜎 2 𝐽 = 𝐽 𝑗 = 1 𝑣 2 𝑗 𝑝 𝑗 ( 𝐽 𝑗 = 1 𝑣 𝑗 𝑝 𝑗 ) 2 and is termed the log-noniterative estimate or LogNI estimate of 𝜙 . The L o g N I estimate (7) is guaranteed to exist when 𝑝 𝑖 𝑗 > 0 , for all 𝑖 and 𝑗 , and so is restricted to the case where none of the cell frequencies observed in a contingency table are zero. When there exist zero cell frequencies in the contingency table, this problem may be simply overcome by substituting small values where the zeros are observed. Agresti ([11], p. 397) points out that in some situations, especially where a saturated model is considered, replacing zero cells with 0.5 is appropriate, although in other circumstances adding 0.5 may create problems.

An alternative approach to approximating the natural logarithm present in the L o g N I estimate (7) is to consider instead 𝑝 l n 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 2 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 ( 8 )

which takes into account that 𝑝 𝑖 𝑗 / ( 𝑝 𝑖 𝑝 𝑗 ) 0 . Alternatively, the second-order approximation 𝑝 l n 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 2 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 + 1 3 𝑝 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 3 ( 9 )

may also be considered. By adopting these two approximations of the logarithm function, two estimates of 𝜙 are obtained by considering 𝜙 L o g N I 1 = 2 𝜎 𝐼 𝜎 𝐽 𝐼 𝐽 𝑖 = 1 𝑗 = 1 𝑝 𝑖 𝑝 𝑗 𝑢 𝑖 𝑢 𝑣 𝑗 𝑣 𝑝 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 , 𝜙 L o g N I 2 = 2 𝜎 𝐼 𝜎 𝐽 𝐼 𝐽 𝑖 = 1 𝑗 = 1 𝑝 𝑖 𝑝 𝑗 𝑢 𝑖 𝑢 × 𝑣 𝑗 𝑣 𝑝 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 + 1 3 𝑝 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 + 𝑝 𝑖 𝑝 𝑗 3 , ( 1 0 ) which are referred to as the L o g N I 1 and L o g N I 2 estimates of 𝜙 , respectively. An alternative way to deal with the problem of zero cell frequencies is to use another approximation of the logarithm function that is present in the Log NI estimate (7). For example, by considering only the first term in the power series expansion of the function 𝑝 l n 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 𝑝 𝑖 𝑗 𝑝 𝑖 𝑝 𝑗 1 , ( 1 1 ) the L o g N I estimate (7) can be approximated to yield an alternative estimate of 𝜙 𝜙 B D N I = 1 𝜎 𝐼 𝜎 𝐽 𝐼 𝐽 𝑖 = 1 𝑗 = 1 𝑝 𝑖 𝑗 𝑢 𝑖 𝑢 𝑣 𝑗 𝑣 . ( 1 2 ) This estimate was originally proposed in Beh and Davy [8] and so is termed the BD-noniterative estimate or simply the B D N I estimate of 𝜙 and is the fourth non-iterative estimation approach evaluated in the present simulation study.

3.3. Standard Error of the Noniterative Estimates

Beh and Farver [9] show, by considering the B D N I estimate (12), the standard error of 𝜙 B D N I is 𝜙 S E B D N I = 𝜎 𝐼 𝜎 𝐽 𝑛 1 . ( 1 3 ) However, (13) is equivalent to the standard error (6) when the row and column variables are completely independent. To show this, consider the standard error (6). When the two categorical variables are independent so that 𝑚 𝑖 𝑗 = 𝑛 𝑝 𝑖 𝑝 𝑗 , then 𝜙 = 1 . Therefore (6) can be alternatively expressed as = S E 𝜙 = 1 𝑛 𝐼 𝐽 𝑖 = 1 𝑗 = 1 𝑢 𝑖 𝑢 2 𝑣 𝑗 𝑣 2 𝑝 𝑖 𝑝 𝑗 1 = 𝑛 𝐼 𝑖 = 1 𝑢 𝑖 𝑢 2 𝑝 𝑖 𝐽 𝑗 = 1 𝑣 𝑗 𝑣 2 𝑝 𝑗 1 = 𝑛 𝜎 2 𝐼 𝜎 2 𝐽 1 = 𝜎 𝐼 𝜎 𝐽 𝑛 1 . ( 1 4 )

This standard error provides an indication of the maximum standard error that one would obtain for a contingency table with sample size 𝑛 . For the computational study considered in this paper, the standard error of the non-iterative estimates is calculated using (6).

4. The Computational Study

4.1. Randomly Generating a Contingency Table

In order to study the accuracy and reliability of Newton’s method and the four non-iterative procedures described above, we consider a simple procedure for generating a two-way contingency table of dimension 𝐼 × 𝐽 given a value of 𝜙 and sample size 𝑛 . The procedure involves selecting an independent and random sample of 𝐼   pseudo-marginal row proportions, denoted by ̃ 𝑝 𝑖 , from a uniform distribution from the interval [ 0 , 1 ] . A sample of 𝐽 pseudo-marginal column proportions, ̃ 𝑝 𝑗 , is generated in a similar manner. The random marginal proportions of the 𝑖 th row and 𝑗 th column can therefore be obtained such that ̆ 𝑝 𝑖 = ̃ 𝑝 𝑖 𝐼 𝑖 = 1 ̃ 𝑝 𝑖 , ̆ 𝑝 𝑗 = ̃ 𝑝 𝑗 𝐽 𝑗 = 1 ̃ 𝑝 𝑗 , ( 1 5 )

respectively, ensuring that 𝐼 𝑖 = 1 ̆ 𝑝 𝑖 = 1 and 𝐽 𝑗 = 1 ̆ 𝑝 𝑗 = 1 . Given these randomly generated marginal proportions and the pre-defined scores used to reflect the ordered structure of the categories, the mean row and column scores are 𝑢 = 𝐼 𝑖 = 1 𝑠 𝐼 ( 𝑖 ) ̆ 𝑝 𝑖 , 𝑣 = 𝐽 𝑗 = 1 𝑠 𝐽 ( 𝑗 ) ̆ 𝑝 𝑗 , ( 1 6 )

respectively. For this computational study, natural row scores, { 𝑠 𝐼 ( 𝑖 ) = 𝑖 𝑖 = 1 , 2 , , 𝐼 } and natural column scores { 𝑠 𝐽 ( 𝑗 ) = 𝑗 𝑗 = 1 , 2 , , 𝐽 } were chosen to reflect the ordinal structure of the categories. However, any scoring procedure may be considered to reflect the ordinal structure of categorical variables such as midrank scores [12] and Nishisato scores [13]. The ( 𝑖 , 𝑗 ) th joint marginal cell frequency is then calculated based on the ordinal log-linear model (1) such that ̃ 𝑛 𝑖 𝑗 = e x p 𝜇 + 𝛼 𝑖 + 𝛽 𝑗 𝑢 + 𝜙 𝑖 𝑢 𝑣 𝑗 𝑣 , ( 1 7 )

where ̃ 𝑛 𝑖 𝑗 is the frequency observed in the ( 𝑖 , 𝑗 ) th cell, given the randomly generated marginal proportions, the pre-determined 𝜙 value, and 1 𝜇 = l n ( 𝑛 ) + 𝐼 𝐼 𝑖 = 1 l n ̆ 𝑝 𝑖 + 1 𝐽 𝐽 𝑗 = 1 l n ̆ 𝑝 𝑗 , 𝛼 𝑖 = l n ̆ 𝑝 𝑖 1 𝐼 𝐼 𝑖 = 1 l n ̆ 𝑝 𝑖 , 𝛽 𝑗 = l n ̆ 𝑝 𝑗 1 𝐽 𝐽 𝑗 = 1 l n ̆ 𝑝 𝑗 . ( 1 8 )

It is apparent that the ̃ 𝑛 𝑖 𝑗 values obtained will not be guaranteed to be integers. Therefore, each cell value is rounded to the nearest integer to obtain the randomly generated contingency table. Thus, the sample size of n specified in the algorithm is only an approximate sample size value.

4.2. Results

The computational study performed in this paper is based on random contingency tables (using the procedure described in Section 4.1) of different dimensions with a sample size of approximately 1000. Tables were generated with the following dimensions: 2 × 2, 2 × 3, 2 × 4, 2 × 5, 3 × 3, 3 × 4, 3 × 5, 4 × 4, 4 × 5, and 5 × 5. For each dimension, 200 random tables were generated each with a fixed value of 𝜙 ; the 𝜙 values considered in this paper were 0, 0.01, 0.05, and 0.1 to 1.5 at increments of 0.1. In terms of the exponent of 𝜙 (i.e., the common log-odds ratio of the table), these are equivalent to 1, 1.01, 1.05, and 1.11 to 4.48, respectively. In practice, the common log-odds ratio may be negative; however, for the purposes of exploring the behaviour of Newton’s unidimensional method and the four non-iterative procedures (the L o g N I , L o g N I 1 , L o g N I 2 , and B D N I estimates) against the true value of 𝜙 , we restricted our attention to positive 𝜙 values. A preliminary investigation revealed that randomly generated contingency tables with a negative 𝜙 behaved in a similar manner to tables generated with a positive 𝜙 value.

Tables 1, 2, 3, 4, and 5 provide a summary of the mean estimate and standard error (in parentheses) of 𝜙 from the 200 randomly generated contingency tables. Table 1 summarises the results obtained using Newton’s method, and Table 2 provides a summary of the accuracy of 𝜙 L o g N I (7) for estimating 𝜙 . Table 3 summarises the results obtained by considering 𝜙 L o g N I 1 (10) to estimate the parameter while Tables 4 and 5 summarise output when using 𝜙 L o g N I 2 (7) and 𝜙 B D N I (12), respectively. We also took into consideration the number of iterations needed for Newton’s method to converge to four decimal places. Thus, Table 6 summarises the median number of iterations needed for convergence to occur. Also summarised in this table is the range (maximum and minimum) of the number of iterations observed for the each set of 200 simulated tables.

tab1
Table 1: Mean estimate (and standard error) of 𝜙 using Newton’s unidimensional method, 𝜙 𝑁 , based on 200 simulated contingency tables with a specified value of 𝜙 and 𝑛 = 1 0 0 0 .
tab2
Table 2: Mean estimate (and standard error) of 𝜙 using 𝜙 L o g N I based on 200 simulated contingency tables with a specified value of 𝜙 and 𝑛 1 0 0 0 . The values in bold are where the mean of the 200 𝑃 values obtained from the Wald test (of the difference between the estimated and true parameter) is significant at the 1% level. The italicised values are significant at the 5% level.
tab3
Table 3: Mean estimate (and standard error) of 𝜙 using 𝜙 L o g N I 1 based on 200 simulated contingency tables with a specified value of 𝜙 and 𝑛 1 0 0 0 . The values in bold are where the mean of the 200 𝑃 values obtained from the Wald test (of the difference between the estimated and true parameter) is significant at the 1% level. The italicised values are significant at the 5% level.
tab4
Table 4: Mean estimate (and standard error) of 𝜙 using 𝜙 L o g N I 2 based on 200 simulated contingency tables with a specified value of 𝜙 and 𝑛 1 0 0 0 . The values in bold are where the mean of the 200 𝑃 values obtained from the Wald test (of the difference between the estimated and true parameter) is significant at the 1% level. The italicised values are significant at the 5% level.
tab5
Table 5: Mean estimate (and standard error) of 𝜙 using 𝜙 B D N I based on 200 simulated contingency tables with a specified value of 𝜙 and 𝑛 1 0 0 0 . The values in bold are where the mean of the 200 𝑃 values obtained from the Wald test (of the difference between the estimated and true parameter) is significant at the 1% level. The italicised values are significant at the 5% level.
tab6
Table 6: Median number of iterations needed for Newton’s unidimensional method to converge to four decimal places. In parenthesis is the range of iterations for the 200 simulated contingency tables.

Table 1 and Figure 1 show that Newton’s method is very accurate for estimating 𝜙 . The Wald statistic of the mean parameter estimate indicates that such a procedure gives estimates of 𝜙 that are statistically indistinguishable from the true estimate for all sized contingency tables and 𝜙 values. However, the number of iterations needed for Newton’s method to estimate the parameter varies greatly depending on the size of the contingency table and the magnitude of 𝜙 (Table 6). Contingency tables of small dimension and small 𝜙 required no more than about 10 iterations for convergence to four decimal places to occur. Due to the accuracy of Newton’s method, the legends attached to Figure 1 highlighting the varying table sizes have been omitted.

340415.fig.001
Figure 1: Graphical view of the performance of Newton’s unidimensional method for estimating 𝜙 . Mean estimate of parameter versus true parameter for the different values of 𝜙 and contingency tables of varying dimension.

Table 6 shows that for contingency tables of all dimensions considered, if 𝜙 is larger than 0.8, then the number of iterations for convergence was nearly always larger than 10 with a median number of iterations ranging from 20 (for a 2 × 2 contingency table) to 9769 (for a 5 × 5 table). For the 200 randomly generated 5 × 5 contingency tables, the number of iterations until convergence was achieved ranging from an economical 9 iterations to nearly 35000.

Table 2 and Figure 2 show that of the four non-iterative procedures, using 𝜙 L o g N I to estimate 𝜙 came closest to matching the performance of Newton’s method. There were 15 cases out of 180 total cases where the mean estimate of 𝜙 obtained using 𝜙 L o g N I was different from the true 𝜙 at the 1% level of significance. Ten of these 15 cases occurred when estimating a 𝜙 > 1 . 0 0 in a contingency table having a dimension of 4 × 5 or 5 × 5. Due to the accuracy of the 𝜙 L o g N I estimate, the legends attached to Figure 2 highlighting the varying table sizes have been omitted.

340415.fig.002
Figure 2: Graphical view of the performance of 𝜙 L o g N I (7) for estimating 𝜙 . Mean estimate of parameter versus true parameter for the different values of 𝜙 and contingency tables of varying dimension.

Tables 3, 4, and 5 and the corresponding figures show that the estimators 𝜙 L o g N I 1 , 𝜙 L o g N I 2 , and 𝜙 B D N I are stable and accurate for relatively small contingency tables and where 𝜙 is also small (not exceeding approximately 0.5). However they performed poorly for relatively large tables with a large parameter value, continually underestimating the parameter. Figures 3, 4 and 5 graphically verify these results. Using 𝜙 L o g N I 2 resulted in 30 cases (out of 180 total cases) where the mean estimate of 𝜙 was different from the true 𝜙 value (at the 1% level of significance) with 23 of these 30 cases occurring when estimating a 𝜙 1 . 0 0 in a contingency table having a dimension of 3 × 5, 4 × 4, 4 × 5, or 5 × 5. Using 𝜙 L o g N I 1 resulted in 35 cases where the mean estimate of 𝜙 was statistically different from the true 𝜙 value with 27 of these 35 cases occurring when estimating a 𝜙 0 . 9 0 in a contingency table having a dimension of 3 × 5, 4 × 4, 4 × 5 or 5 × 5. Using 𝜙 B D N I resulted in 43 cases where the mean estimate of 𝜙 was statistically different from the true 𝜙 value with 31 of these 43 cases occurring when estimating a 𝜙 0 . 8 0 in a contingency table having a dimension of 3 × 5, 4 × 4, 4 × 5 or 5 × 5.

340415.fig.003
Figure 3: Graphical view of the performance of 𝜙 L o g N I 1 (10) for estimating 𝜙 . Mean estimate of parameter versus true parameter for the different values of 𝜙 and contingency tables of varying dimension.
340415.fig.004
Figure 4: Graphical view of the performance of 𝜙 L o g N I 2 (7) for estimating 𝜙 . Mean estimate of parameter versus true parameter for the different values of 𝜙 and contingency tables of varying dimension.
340415.fig.005
Figure 5: Graphical view of the performance of 𝜙 B D N I (12) for estimating 𝜙 . Mean estimate of parameter versus true parameter for the different values of 𝜙 and contingency tables of varying dimension.

Two hundred (200) contingency tables of dimension 4 × 5 were randomly generated (using the algorithm described in Section 4.1) with 𝜙 = 0 . 4 and sample size approximately 1000. For these tables, Figure 6 illustrates the stability of each of the five estimation procedures and summarises the 𝑃 value obtained from the Wald test of the estimates 𝜙 𝑁 , 𝜙 L o g N I , 𝜙 L o g N I 1 , 𝜙 L o g N 2 , and 𝜙 B D N I when compared with 𝜙 = 0 . 4 . All five methods provide reliable and accurate estimates of the parameter and, for all the tables generated, are statistically indistinguishable when compared with the true value of the parameter. These figures show that Newton’s method (Figure 6(a)) provides the most reliable estimate since the 𝑃 value calculated from each of the 200 tables is consistently near 1.0. Of the non-iterative procedures Figure 6(b) shows that 𝜙 L o g N I is nearly as stable (although slightly more variable) when compared with Newton’s method.

fig6
Figure 6: The 𝑃 -values obtained from the Wald Test comparing the true 𝜙 value against (a) 𝜙 N (b) 𝜙 L o g N I , (c) 𝜙 L o g N I 2 , (d) 𝜙 L o g N I 1 and (e) 𝜙 B D N I . These 𝑃 -values are based on the estimates obtained from randomly generating 200 contingency tables of dimension 4 × 5 where 𝜙 = 0.4 and 𝑛 1 0 0 0 .

The three remaining non-iterative procedures also behave well, but provide poorer estimates of 𝜙 than Newton’s method or 𝜙 L o g N I . Figure 6(c) shows that 𝜙 L o g N I 2 provides estimates that are closer to 𝜙 than 𝜙 L o g N I 1 Figure 6(d) shows that the 𝑃 value of 𝜙 L o g N I 1 is centred around 0.7 while Figure 6(c) shows that the 𝑃 value of 𝜙 L o g N I 2 is centred around 0.9. This is to be expected since 𝜙 L o g N I 1 is a first-order approximation of the L o g N I estimate (7) while 𝜙 L o g N I 2 is its second-order approximation. For the 200 4 × 5 tables, these two non-iterative estimation procedures provide more accurate estimates of 𝜙 than 𝜙 B D N I . Figure 6(e) shows that the 𝑃 value from the Wald test of the estimate is centred around 0.5.

5. Discussion

Iterative procedures can be computationally very intensive. Certainly for those contingency tables that are of large dimension, or where 𝜙 is large (but unknown), the number of iterations needed for convergence to occur can be excessively large. This is not necessarily a major issue for those who are experienced in statistical programming. However, for those researchers that are not, a simple direct estimation procedure can prove to be advantageous. This paper has demonstrated the benefit of considering alternative, iteration-free, procedures for estimating the linear-by-linear parameter of an ordinal log-linear, and its related association model.

For small contingency tables (in terms of dimension and 𝜙 ) any of the non-iterative procedures considered in this paper will produce estimates that are statistically similar to their true value. For large contingency tables with moderately small values of 𝜙 (less than 0.4), any of the non-iterative procedures will work extremely well without the need for programming and executing iterative procedures.

Of the four non-iterative procedures considered in this paper, 𝜙 L o g N I (7) provides more accurate and reliable results than its counterparts. In fact there were only a few cases where the estimate produced was statistically different (according to the Wald test) from the true parameter value. However, an inspection of Table 2 reveals that these differences appear negligible and that such a result can be attributed to the very small standard error of the non-iterative estimate. Certainly Figure 2 shows the accurate performance of 𝜙 L o g N I .

The enhanced accuracy of 𝜙 L o g N I , when compared with the remaining non-iterative estimates, is not unexpected since it is easily obtained by rearranging (4) and aggregating across 𝑖 and 𝑗 . Since 𝜙 B D N I , 𝜙 L o g N I 1 , and 𝜙 L o g N I 2 are calculated using functional approximations of 𝜙 L o g N I , this latter procedure will always produce more accurate estimates. Such a conclusion is observed by considering the results in Section 4.

One can also observe the relative accuracy of the non-iterative methods. Since the right-hand side of (11) is less than its left-hand side, 𝜙 B D N I > 𝜙 L o g N I 𝜙 for all contingency tables. This is indeed the case when comparing the results summarised in Tables 2 and 5. This inequality becomes more severe for contingency tables of relatively large dimension. For similar reasons 𝜙 L o g N I 1 < 𝜙 L o g N I 2 , and this is clear when comparing Table 3 with Table 4. However, such a consideration is not a trivial exercise since it allows one to better understand how many terms (from the approximation of the logarithm function) will yield an accurate and reliable estimate of 𝜙 .

Beh and Davy [8], who originally considered the accuracy of the non-iterative estimation procedures for ordinal log-linear models, showed from a practical perspective that 𝜙 B D N I will produce accurate estimates of the true parameter. The more extensive study by Beh and Farver [9] confirmed these findings. However, the approximation (11) is valid only if 1 < 𝑝 𝑖 𝑗 / ( 𝑝 𝑖 𝑝 𝑗 ) 1 . If only the positive values of this ratio are considered, 𝜙 B D N I will provide reasonably good estimates of 𝜙 only when 𝑝 𝑖 𝑗 / ( 𝑝 𝑖 𝑝 𝑗 ) 1 for all 𝑖 and 𝑗 . Therefore, such a result suggests that the B D N I procedure will produce better estimates of 𝜙 for relatively small contingency tables than for larger tables.

The computations performed here have focused on generating contingency tables with a sample size of approximately 1000. Further investigations need to be undertaken to study the behaviour of the non-iterative estimation procedures for varying sample sizes. Preliminary investigations reveal that, for contingency tables with a sample size exceeding about 200, the estimate 𝜙 L o g N I is similar to the estimate obtained using Newton’s unidimensional method and the true parameter. Further work on the application of such non-iterative procedures needs investigating for other types of models of association for ordinal contingency tables.

References

  1. S. J. Haberman, “Log linear models for frequency tables with ordered classifications,” Biometrics, vol. 30, no. 4, pp. 589–600, 1974. View at Scopus
  2. L. A. Goodman, “Simple models of the analysis of association in cross-classifications having ordered categories,” Journal of the American Statistical Association, vol. 74, pp. 537–552, 1979.
  3. L. A. Goodman, “The analysis of cross-classified data having ordered and/or unordered categories: association models, correlation models, and asymmetry models for contingency tables with or without missing entries,” The Annals of Statistics, vol. 13, pp. 10–69, 1985.
  4. A. Agresti, Analysis of Ordinal Categorical Data, Wiley, New York, NY, USA, 1984.
  5. Z. Gilula and S. J. Haberman, “Canonical analysis of contingency tables by maximum likelihood,” Journal of the American Statistical Association, vol. 81, pp. 780–788, 1986.
  6. F. Galindo-Garre and J. K. Vermunt, “The order-restricted association model: two estimation algorithms and issues in testing,” Psychometrika, vol. 69, no. 4, pp. 641–654, 2004. View at Scopus
  7. M. L. Aït-Sidi-Allal, A. Baccini, and A. M. Mondot, “A new algorithm for estimating the parameters and their asymptotic covariance in correlation and association models,” Computational Statistics and Data Analysis, vol. 45, no. 3, pp. 389–421, 2004. View at Publisher · View at Google Scholar · View at Scopus
  8. E. J. Beh and P. J. Davy, “A non-iterative alternative to ordinal log-linear models,” Journal of Applied Mathematics and Decision Sciences, vol. 8, no. 2, pp. 67–86, 2004. View at Scopus
  9. E. J. Beh and T. B. Farver, “An evaluation of non-iterative methods for estimating the linear-by-linear parameter of ordinal log-linear models,” The Australian and New Zealand Journal of Statistics, vol. 51, no. 3, pp. 335–352, 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. J. A. Nelder and R. W. M. Wedderburn, “Generalized linear models,” Journal of the Royal Statistical Society A, vol. 135, pp. 370–384, 1972.
  11. A. Agresti, Categorical Data Analysis, Wiley, Hoboken, NJ, USA, 2nd edition, 2002.
  12. V. N. Nair, “Testing in industrial experiments with ordered categorical data,” Technometrics, vol. 28, no. 4, pp. 283–291, 1986. View at Scopus
  13. S. Nishisato and P. S. Arri, “Nonlinear programming approach to optimal scaling of partially ordered categories,” Psychometrika, vol. 40, no. 4, pp. 525–548, 1975. View at Publisher · View at Google Scholar · View at Scopus