Research Article

Jackknife Model Averaging Prediction Methods for Complex Phenotypes with Gene Expression Levels by Integrating External Pathway Information

Table 1

Sample sizes and the number of genes for each cancer in the TCGA dataset used in our analysis.

Phenotypes   Initial gene expression data   Initial clinical data   Final data after quality control
             N         G                    N                       N         G

BRCA         1,218     20,531               1,247                   1,083     17,675
COAD         329       20,531               551                     275       17,493
CRC          434       20,531               736                     367       17,510
PAAD         183       20,531               196                     178       17,675

Note. N is the sample size and G denotes the number of genes. The average number of genes incorporated in each pathway for the seven phenotypes was 65 (ranging from 1 to 1,139), and about 21% of the genes belonged to multiple pathways. BRCA: breast cancer; CRC: colon and rectal cancer; COAD: colon cancer; PAAD: pancreatic cancer.