Research Article

Machine-Learning Prediction of Oral Drug-Induced Liver Injury (DILI) via Multiple Features and Endpoints

Table 1

Paired -test results of AUC values during 10-fold cross-validations with or without using protein-binding features.

Logistic regressionRandom forest
DatabaseFeatures

DrugDexECFP6 fingerprints-3.511.96-03-2.481.80-02
PubChem fingerprints-3.095.38-03-2.561.48-02
Standard fingerprints-3.322.86-03-2.262.94-02
Constitutional descriptors-2.124.35-02-2.965.41-03
Electronic descriptors-4.441.14-04-6.107.04-07
Geometrical descriptors-5.754.22-06-8.306.47-10
Hybrid descriptors-3.501.90-03-8.795.96-10
Topological descriptors-2.352.43-02-1.936.11-02
All fingerprints-2.342.68-02-1.945.95-02
All descriptors-2.631.29-02-2.481.78-02
All combined-10.252.76-21-10.563.79-23

DrugPointsECFP6 fingerprints-2.065.60-02-2.998.91-03
PubChem fingerprints-3.269.78-030.109.19-01
Standard fingerprints-2.662.10-02-2.492.51-02
Constitutional descriptors-3.204.97-03-2.184.28-02
Electronic descriptors-3.315.00-03-3.512.98-03
Geometrical descriptors-5.424.06-05-5.216.70-05
Hybrid descriptors-4.809.79-04-2.313.55-02
Topological descriptors-4.048.19-04-3.047.08-03
All fingerprints-2.412.75-02-2.035.80-02
All descriptors-4.613.56-04-2.353.08-02
All combined-10.132.42-19-7.301.04-11

DailyMedECFP6 fingerprints-0.794.50-01-0.317.62-01
PubChem fingerprints-2.247.56-02-0.357.37-01
Standard fingerprints0.001.00+00-0.854.19-01
Constitutional descriptors-0.943.80-01-1.561.53-01
Electronic descriptors-1.252.58-01-1.651.30-01
Geometrical descriptors-2.108.66-02-4.807.95-04
Hybrid descriptors-2.813.74-02-1.491.79-01
Topological descriptors-0.277.97-01-0.268.00-01
All fingerprints0.109.26-01-0.238.24-01
All descriptors-0.903.97-01-0.565.87-01
All combined-3.162.06-03-2.884.74-03

For each -test, the AUC score vectors of model performance on all endpoints were paired up and compared. ; .