The Scientific World Journal

Review Article

Misuse of Statistical Methods in 10 Leading Chinese Medical Journals in 1998 and 2008

Table 1

Errors/defects in statistical methods.


Types of errors	Incorrect use in 1998, n (%)	Incorrect use in 2008, n (%)	χ²	P value	OR* (1998/2008)	95% CI**

t-test	305 (62.0%)	253 (44.4%)	32.83	<0.001	2.04	(1.60,2.61)
(1) Using multiple t-tests for multiple group comparison	153 (31.1%)	129 (22.6%)	9.71	0.002	1.54	(1.17,2.03)
(2) Using paired t-test for unpaired data or vice versa	40 (8.1%)	34 (6.0%)	1.91	0.167	1.40	(0.87,2.24)
(3) Using t-test under nonparametric setting	89 (18.1%)	60 (10.5%)	12.52	<0.001	1.88	(1.32,2.67)
(4) Using t-test without considering the baseline	52 (10.6%)	33 (5.8%)	8.19	0.004	1.92	(1.22,3.03)
(5) Using t-test to analyse repeated-measures data	73 (14.8%)	60 (10.5%)	4.48	0.034	1.48	(1.03,2.13)
Others	28 (5.7%)	8 (1.4%)	14.82	<0.001	4.24	(1.91,9.39)

Contingency tables	154 (48.3%)	169 (32.3%)	21.35	<0.001	1.96	(1.47,2.60)
(1) No continuity correction or Fisher exact test if needed	52 (16.3%)	53 (10.1%)	6.90	0.009	1.73	(1.15,2.61)
(2) No significance level adjustment for multiple comparisons	82 (25.7%)	74 (14.2%)	17.53	<0.001	2.10	(1.48,2.98)
(3) Misusing Chi-square test for paired fourfold table	10 (3.1%)	12 (2.3%)	0.55	0.458	1.38	(0.59,3.23)
(4) Using Chi-square test for ranked data	29 (9.1%)	31 (5.9%)	3.00	0.083	1.59	(0.94,2.69)
(5) Ignoring stratification factors	12 (3.8%)	12 (2.3%)	1.54	0.215	1.66	(0.74,3.75)
(6) Using value of Chi-square test instead of contingency coefficient to describe the correlation of two variables	8 (2.5%)	4 (0.8%)	3.13	0.077	3.34	(0.98,11.18)
Others	21 (6.6%)	19 (3.6%)	3.81	0.051	1.87	(0.99,3.53)

ANOVA***	128 (63.4%)	263 (59.0%)	1.12	0.289	1.20	(0.85,1.70)
(1) Using one-factorial ANOVA to analyse data from multifactorial designs	10 (5.0%)	31 (7.0%)	0.94	0.333	0.70	(0.34,1.45)
(2) Ignoring the setting of ANOVA for completely random design data	25 (12.4%)	53 (11.9%)	0.03	0.858	1.05	(0.63,1.74)
(3) No multiple pair-wise comparison of ANOVA when needed	25 (12.4%)	28 (6.3%)	6.89	0.009	2.11	(1.20,3.72)
(4) Misusing the method of multiple pair-wise comparison of ANOVA	51 (25.3%)	132 (29.6%)	1.30	0.255	0.80	(0.55,1.17)
(5) Using ANOVA to analyse repeated-measures data	45 (22.3%)	63 (14.1%)	6.65	0.010	1.74	(1.14,2.67)
Others	16 (7.9%)	10 (2.2%)	11.64	0.001	3.75	(1.67,8.42)

Rank transformation nonparametric test	29 (43.3%)	33 (17.7%)	17.57	<0.001	3.56	(1.93,6.57)
(1) Using multiple pair-wise comparison for multiple group comparison	14 (20.9%)	20 (10.7%)	4.43	0.035	2.21	(1.04,4.47)
(2) Using wrong type of rank sum test for different study types	4 (6.0%)	3 (1.6%)	2.07	0.150	3.89	(0.85,17.88)
Others	20 (29.9%)	6 (3.2%)	38.11	<0.001	12.84	(4.88,33.77)

*OR: odds ratio; **CI: confidence interval; ***ANOVA: analysis of variance.
Incorrect use, n (%): for each statistical method, n is the number of articles using this statistical method incorrectly and the percentage = n/the number of papers using this statistical method × 100%; for each error under a given statistical method, n is the number of articles with this mistake and the percentage = n/the number of papers using this statistical method × 100%.
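As a rough check on how the columns relate, the sketch below recomputes the percentage, chi-square statistic, odds ratio, and 95% CI for the first data row (305 articles in 1998, 253 in 2008). Several things here are assumptions rather than details given in the table: the denominators 492 and 570 are back-calculated from the reported percentages (305/0.620 and 253/0.444), the chi-square is taken to be Pearson's statistic without continuity correction, the interval is taken to be the Woolf (log odds ratio) interval, and the function name `two_by_two_summary` is hypothetical.

```python
import math


def two_by_two_summary(err_a, total_a, err_b, total_b, z=1.96):
    """Compare error rates between two years using a 2x2 table.

    err_*   : number of articles using the method incorrectly
    total_* : number of articles using the method at all
    Returns percentages, Pearson chi-square (no continuity correction),
    the odds ratio (year a / year b), and a Woolf-type confidence interval.
    """
    a, b = err_a, total_a - err_a  # year a: incorrect, correct
    c, d = err_b, total_b - err_b  # year b: incorrect, correct
    n = a + b + c + d

    # Pearson chi-square for a fourfold table, 1 degree of freedom.
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

    # Odds ratio and its CI on the log scale.
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    ci_low = math.exp(math.log(odds_ratio) - z * se_log_or)
    ci_high = math.exp(math.log(odds_ratio) + z * se_log_or)

    return {
        "pct_a": 100 * err_a / total_a,
        "pct_b": 100 * err_b / total_b,
        "chi2": chi2,
        "odds_ratio": odds_ratio,
        "ci": (ci_low, ci_high),
    }


# Denominators 492 and 570 are inferred from the percentages, not reported.
row = two_by_two_summary(305, 492, 253, 570)
print(f"{row['pct_a']:.1f}% vs {row['pct_b']:.1f}%")
print(f"chi2 = {row['chi2']:.2f}, OR = {row['odds_ratio']:.2f}, "
      f"95% CI = ({row['ci'][0]:.2f}, {row['ci'][1]:.2f})")
```

Under these assumptions the sketch gives 62.0% vs 44.4%, a chi-square of 32.83, an OR of 2.04, and a 95% CI of (1.60, 2.61), which agrees with the tabulated row. An OR above 1 therefore indicates that the error was more common in 1998 than in 2008.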