Research Article

A Similarity Search Using Molecular Topological Graphs

Table 1

The average values and the hit ratio with optimized coefficients for various λ values. The coefficient1 is the parameter in (4) for the matrix and the coefficient2 is the parameter in (4) for the matrix. The hit ratio is the % of the active compounds found within the first 1% of selected compounds of the database. The average and σ are the average value and the standard deviation of the scores. *: the top-ranked compounds when diphenhydramine is the template.

00.250.50.751

Hit ratio28.86%36.48%35.76%35.69%26.93%
value80.1482.5380.3782.0972.72
Coefficient10.0020.010.020.010
Coeffcient200.000050.000050.000010.0001
Average0.100*10-43.61*10-47.13*10-410.64*10-414.163*10-4
σ7.50*10-61.62*10-43.24*10-44.87*10-46.50*10-4
1st compound*     
-score1.3212.2182.1892.1792.174
Score0.000082950.0017230.0019440.0021640.002383
2nd compound*     
-score1.3152.2022.1792.1712.167
Score0.00012660.0044250.0053330.0062420.00715
3rd compound*     
-score1.3142.1922.1692.1612.157
Score0.00013380.0059440.0084020.010850.01331