Research Article

An Efficient Approach to Screening Epigenome-Wide Data

Table 6

Simulation results for selecting important variables among 2,000 candidates including the most and least important surrogate variables across 600 subjects.

Bonferroni                    FDR                           TT
.sv = 5   .sv = 10  .sv = 15  .sv = 5   .sv = 10  .sv = 15  .sv = 5   .sv = 10  .sv = 15

Most important surrogate variables included

# incorrect
0         0         0         6         6         6         5         4         3
2         3         2         19        19        19        7         5         5
6         6         5         26        29        29        6         6         4
12        11        11        40        38        39        7         7         7

Sensitivity
1         1         1         1         1         1         1         1         1
0.98      0.97      0.98      1         1         1         0.98      0.98      0.98
0.97      0.97      0.975     0.995     0.995     0.995     0.985     0.985     0.985
0.97      0.973     0.973     1         1         1         0.988     0.988     0.985

Specificity
1         1         1         0.997     0.997     0.997     0.997     0.998     0.998
1         1         1         0.99      0.99      0.99      0.997     0.998     0.998
1         1         1         0.986     0.984     0.984     0.998     0.998     0.999
1         1         1         0.975     0.976     0.976     0.999     0.999     0.999

Most important surrogate variables not included

# incorrect
10        10        10        10        10        10        10        10        10
100       100       100       100       100       100       100       100       100
200       200       200       200       200       200       200       200       200
400       400       400       400       400       400       400       400       400

Sensitivity
0         0         0         0         0         0         0         0         0
0         0         0         0         0         0         0         0         0
0         0         0         0         0         0         0         0         0
0         0         0         0         0         0         0         0         0

Specificity
1         1         1         1         1         1         1         1         1
1         1         1         1         1         1         1         1         1
1         1         1         1         1         1         1         1         1
1         1         1         1         1         1         1         1         1

FDR: false discovery rate; TT: training and testing; .sv: number of surrogate variables. Each row within a block corresponds to a different number of truly important CpG sites out of 2,000.
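
The three quantities reported in the table can be related to one another with a short calculation. The sketch below is illustrative only and is not the authors' code: the function name `screening_metrics` and its arguments are hypothetical, and it assumes that "# incorrect" counts both missed truly important sites (false negatives) and falsely selected sites (false positives). The inputs in the example (100 truly important sites, 2 missed, 5 falsely selected) are likewise assumed values chosen for illustration.

```python
def screening_metrics(n_candidates, n_true, false_negatives, false_positives):
    """Return (# incorrect, sensitivity, specificity) for one selection result.

    Assumption: "# incorrect" = false negatives + false positives.
    """
    n_null = n_candidates - n_true                 # sites that are truly unimportant
    true_positives = n_true - false_negatives      # important sites that were selected
    true_negatives = n_null - false_positives      # unimportant sites correctly left out
    incorrect = false_negatives + false_positives
    sensitivity = true_positives / n_true
    specificity = true_negatives / n_null
    return incorrect, sensitivity, specificity


# Hypothetical inputs: 2,000 candidates, 100 truly important sites,
# 2 of them missed and 5 unimportant sites falsely selected.
incorrect, sens, spec = screening_metrics(2000, 100, false_negatives=2, false_positives=5)
print(incorrect, round(sens, 3), round(spec, 3))  # prints: 7 0.98 0.997

# If every truly important site is missed and nothing is falsely selected,
# sensitivity is 0 and specificity is 1, as in the lower half of the table.
print(screening_metrics(2000, 10, false_negatives=10, false_positives=0))  # prints: (10, 0.0, 1.0)
```

Under the stated assumption, a method that selects nothing makes exactly as many errors as there are truly important sites, which is one way to read the "not included" block.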