Research Article

Applications of Different Weighting Schemes to Improve Pathway-Based Analysis

Table 1

A brief summary of the four proposed weighting schemes. The algorithms are expressed for the pathway of 𝑚 genes and 𝑛 samples. absT and Qdiff algorithms calculate the weight for the 𝑗 t h gene in a pathway, and RWV selects an optimal random weight vector 𝑤 minimizing 𝑃 -value or OOB error rate, and RWM selects an optimal weight matrix 𝑤 minimizing 𝑃 -value or OOB error rate.

NameAlgorithmNotes

absT 𝑊 | 𝑇 | ( 𝑗 ) = | 𝑇 ( 𝑗 ) | / 𝑚 𝑗 = 1 | 𝑇 ( 𝑗 ) | Based on two-sample 𝑡 statistics of a gene, apply the same weight across all samples of a gene
Qdiff 𝑊 | 𝑄 𝑑 𝑖 𝑓 𝑓 | ( 𝑗 ) = | 𝑄 𝑄 ( 𝑗 ) | / 𝑚 𝑗 = 1 | 𝑄 𝑄 ( 𝑗 ) | Based on the global test statistic 𝑄 for a pathway, apply the same weight across all samples for a gene
RWV (Random Weight Vector) 𝑤 𝑋 = 𝑤 1 𝑋 1 𝑤 2 𝑋 2 𝑤 𝑚 𝑋 𝑚 𝑚 number of random weights in predefined range that minimizes 𝑃 -value in the global test, or OOB error rates in the random forests for a pathway
RWM (Random Weight Matrix) 𝑤 𝑋 = 𝑤 1 , 1 𝑋 1 , 1 𝑤 1 , 2 𝑋 1 , 2 . . . 𝑤 1 , 𝑚 𝑋 1 , 𝑚 𝑤 2 , 1 𝑋 2 , 1 . . . . . . 𝑤 2 , 𝑚 𝑋 2 , 𝑚 𝑤 . . . . . . . . . . . . 𝑛 1 , 1 𝑋 𝑛 1 , 1 . . . . . . 𝑤 𝑛 1 , 𝑚 𝑋 𝑛 1 , 𝑚 𝑤 𝑛 , 1 𝑋 𝑛 , 1 𝑤 𝑛 , 2 𝑋 𝑛 , 2 . . . 𝑤 𝑛 , 𝑚 𝑋 𝑛 , 𝑚 𝑚 × 𝑛 number of random weights in predefined range that minimizes 𝑃 -value in the global test or error rates in the random forests for a pathway