BioMed Research International

Research Article

Multivariate Cluster-Based Multifactor Dimensionality Reduction to Identify Genetic Interactions for Multiple Quantitative Phenotypes

Pseudocode of multi-CMDR.

(01) perform fuzzy k-means clustering with noise cluster for phenotypes
(02) remove samples in noise cluster
(03) compute global ratio
(04) get all combinations of SNPs
(05) divide samples into N folds
(06) for k = 1 to N
(07) set samples in kth fold as test dataset and the other samples as training dataset
(08) for i = 1 to number of all combinations of SNPs
(09) get all combinations of genotypes
(10) for j = 1 to number of all combinations of genotypes
(11) compute local ratio
(12) classify each genotype combination as high or low by comparing its local ratio with the global ratio
(13) end j
(14) compute Hotelling’s T² statistics for training and test data
(15) end i
(16) select the best SNP combination at fold k by comparing Hotelling’s T² statistics for training data
(17) end k
(18) compute cross-validation consistency (CVC) and select SNP combination with highest CVC as the best SNP combination
(19) compute p-value by permutation test for the best SNP combination
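
The cross-validation core of the pseudocode (steps 03 to 18) can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the fuzzy k-means step has already been run, noise samples have been removed, and each remaining sample carries a hard cluster label of 0 or 1; a ratio is taken as the number of cluster-1 samples divided by the number of cluster-0 samples; a genotype combination is called high when its local ratio is at least the global ratio; and Hotelling's T² is the two-sample statistic with a pooled covariance matrix. The function names `hotelling_t2`, `high_cells`, and `multi_cmdr_sketch` are hypothetical. The test-data statistic and the permutation test (step 19) are omitted for brevity.

```python
from collections import Counter
from itertools import combinations

import numpy as np


def hotelling_t2(y_a, y_b):
    """Two-sample Hotelling's T^2 using a pooled covariance matrix."""
    n_a, n_b = len(y_a), len(y_b)
    if n_a < 2 or n_b < 2:
        return 0.0  # statistic undefined for tiny groups; treated as no signal
    diff = y_a.mean(axis=0) - y_b.mean(axis=0)
    pooled = ((n_a - 1) * np.cov(y_a, rowvar=False)
              + (n_b - 1) * np.cov(y_b, rowvar=False)) / (n_a + n_b - 2)
    pooled = np.atleast_2d(pooled)
    # Pseudo-inverse guards against a singular pooled covariance matrix.
    return float(n_a * n_b / (n_a + n_b) * diff @ np.linalg.pinv(pooled) @ diff)


def high_cells(geno, cluster, global_ratio):
    """Genotype combinations whose local ratio >= global ratio (assumed rule)."""
    high = set()
    for cell in {tuple(row) for row in geno}:
        in_cell = np.all(geno == cell, axis=1)
        n1 = np.sum(cluster[in_cell] == 1)
        n0 = np.sum(cluster[in_cell] == 0)
        local_ratio = n1 / n0 if n0 > 0 else np.inf
        if local_ratio >= global_ratio:
            high.add(cell)
    return high


def multi_cmdr_sketch(snps, pheno, cluster, order=2, n_folds=5, seed=0):
    """Return (best SNP combination, CVC) for SNP combinations of a given order."""
    n = len(snps)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n), n_folds)          # step 05
    global_ratio = np.sum(cluster == 1) / np.sum(cluster == 0)   # step 03
    combos = list(combinations(range(snps.shape[1]), order))     # step 04
    winners = []
    for test_idx in folds:                                       # step 06
        train = np.ones(n, dtype=bool)
        train[test_idx] = False                                  # step 07
        best_combo, best_t2 = None, -np.inf
        for combo in combos:                                     # step 08
            geno = snps[:, list(combo)]
            high = high_cells(geno[train], cluster[train], global_ratio)
            is_high = np.array([tuple(row) in high for row in geno])
            # Step 14, training data only: compare high vs. low samples.
            t2 = hotelling_t2(pheno[train & is_high], pheno[train & ~is_high])
            if t2 > best_t2:
                best_combo, best_t2 = combo, t2
        winners.append(best_combo)                               # step 16
    # Step 18: CVC = number of folds in which a combination was selected.
    return Counter(winners).most_common(1)[0]
```

Called with a genotype matrix `snps` (samples by SNPs, coded 0/1/2), a phenotype matrix `pheno` (samples by phenotypes), and a 0/1 label vector `cluster`, the function returns a tuple of SNP column indices and the number of folds in which that tuple won. Extending the sketch to step 19 would mean shuffling the phenotype rows, rerunning the procedure, and comparing the observed statistic with the permuted ones.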