Research Article

Using SVD on Clusters to Improve Precision of Interdocument Similarity Measure

Table 4

Similarity measure on Chinese documents of SVD on clusters and other SVD based LSI methods. PR is the abbreviation for “preservation rate” and the best performances (measured by average precision) are marked in bold type.

PRSVDSVDC (-Means)SVDC (SOMs)SVRADEIRR

1.00.4312 ± 0.02130.4312 ± 0.02130.4312 ± 0.02130.4272 ± 0.02000.3632 ± 0.02860.2730 ± 0.0168
0.90.4312 ± 0.02790.4537 ± 0.02720.4463 ± 0.02450.4272 ± 0.01860.3394 ± 0.03030.2735 ± 0.0238
0.80.4358 ± 0.04220.4581 ± 0.02060.4458 ± 0.02390.4273 ± 0.02090.3136 ± 0.01370.2735 ± 0.0109
0.70.4495 ± 0.03870.4597 ± 0.01990.4573 ± 0.01460.4273 ± 0.01280.3075 ± 0.00680.2732 ± 0.0127
0.60.4550 ± 0.01760.4607 ± 0.02030.4547 ± 0.02940.4273 ± 0.03050.3006 ± 0.02080.2730 ± 0.0134
0.50.4573 ± 0.04060.4613 ± 0.01390.4588 ± 0.01640.4273 ± 0.03790.2941 ± 0.01730.2729 ± 0.0141
0.40.4587 ± 0.03950.4624 ± 0.00980.4659 ± 0.02550.4275 ± 0.02940.2857 ± 0.01940.2726 ± 0.290
0.30.4596 ± 0.01970.4644 ± 0.01830.4582 ± 0.02030.4285 ± 0.03050.2727 ± 0.02000.2666 ± 0.242
0.20.4602 ± 0.04010.4663 ± 0.03530.4432 ± 0.02760.4305 ± 0.01900.2498 ± 0.02280.2672 ± 0.0166
0.10.4617 ± 0.04090.4705 ± 0.00580.4513 ± 0.01880.4343 ± 0.01930.3131 ± 0.01460.2557 ± 0.0188