Research Article

Correlating Information Contents of Gene Ontology Terms to Infer Semantic Similarity of Gene Products

Table 1

Fold changes of semantic similarity scores within protein families against those outside families.

          Semantic similarity measures
m      n  Correlation  Cosine  Jaccard   Wang  Resnik  Schlicker    Lin  Jiang

2   1022        6.915   6.511    3.267  1.856   2.370      2.669  2.331  1.524
3    562        8.986   8.446    3.827  1.988   2.680      3.100  2.641  1.629
4    360        9.608   8.760    4.027  2.037   2.799      3.247  2.761  1.656
5    240        9.359   9.135    4.131  2.065   2.843      3.324  2.827  1.662
6    182        9.997   9.214    4.224  2.105   2.901      3.410  2.888  1.692
7    141        10.10   9.741    4.363  2.106   2.952      3.476  2.918  1.690
8    110        9.921   9.409    4.432  2.101   2.853      3.409  2.895  1.661
9     89        9.880   9.321    4.445  2.094   2.857      3.419  2.908  1.643
10    75        9.814   9.273    4.516  2.090   2.846      3.430  2.898  1.644

m: minimum number of proteins in a family. n: number of protein families, each containing at least m proteins.
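
As an illustration of how a fold change of this kind could be computed, the following is a minimal sketch in Python. It assumes the fold change is the ratio of the mean pairwise similarity between proteins in the same family to the mean pairwise similarity between proteins in different families; the table itself does not state the exact definition, so this reading is an assumption. The function name fold_change, the families mapping, and the similarity callback are hypothetical names introduced for the example, and each protein is assumed to belong to exactly one family.

```python
from itertools import combinations
from statistics import mean


def fold_change(families, similarity, min_size=2):
    """Return (n, fold change) for families with at least min_size proteins.

    families:   dict mapping a family name to a list of protein identifiers
                (assumed: each protein appears in exactly one family).
    similarity: function taking two protein identifiers and returning a score.
    min_size:   the threshold m; smaller families are ignored.

    Assumed definition: mean within-family similarity divided by
    mean between-family similarity.
    """
    kept = {name: members for name, members in families.items()
            if len(members) >= min_size}
    labelled = [(protein, name) for name, members in kept.items()
                for protein in members]

    within, outside = [], []
    for (p, fam_p), (q, fam_q) in combinations(labelled, 2):
        score = similarity(p, q)
        (within if fam_p == fam_q else outside).append(score)

    # mean() raises StatisticsError if either list is empty,
    # e.g. when only one family passes the size threshold.
    return len(kept), mean(within) / mean(outside)


# Toy data: similarity is 1.0 inside a family and 0.25 across families.
toy_families = {"A": ["a1", "a2"], "B": ["b1", "b2", "b3"], "C": ["c1"]}


def toy_similarity(p, q):
    return 1.0 if p[0] == q[0] else 0.25


n, fc = fold_change(toy_families, toy_similarity, min_size=2)
```

Calling the function once per value of m, with one similarity function per measure, would fill a table shaped like the one above, one row per m and one column per measure.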