Research Article

Optimization Method of an Antibreast Cancer Drug Candidate Based on Machine Learning

Table 1

The contribution of the top 224 important molecular descriptors (from low to high).

DescriptorImportance

Smax110.00065
MATSp70.00065
CIC40.000652
MDEC-140.000656
S170.000659
minHCsats0.00066
Smin340.000661
SHaaCH0.000664
SIC30.000664
SHCsats0.000681
Smin0.000681
CrippenLogP0.000682
maxsOH0.000684
ATSc10.000684
bcutm130.000684
phi0.000686
MATSm30.000688
CIC30.000688
VSAEstate70.000691
SPC-40.000695
EstateVSA70.000702
Smin80.000704
WTPT-50.000706
TPSA10.000708
naccr0.00071
MATSm70.000712
maxdsN0.000712
CIC10.000713
Smin350.000714
ATSe50.000716
minHCsatu0.000725
GATSp30.000726
GATSm50.000727
ALogp20.000729
GATSp70.00073
EstateVSA10.000737
IDE0.000741
mindO0.000744
mChi10.000745
SaasC0.00076
bcute90.000761
nAtomLAC0.000762
maxdssC0.000771
GATSe70.000775
Smax0.000781
ETA_Epsilon_10.000787
MATSv50.000789
bcutp50.000793
IC10.000796
maxHBint70.000797
QCss0.000823
CIC60.000823
ALogP0.000826
bcutm30.00083
SsOH0.000847
BertzCT0.000851
EstateVSA40.000851
SdssC0.000855
bcutm20.000866
MAXDN0.000868
PC60.000872
MATSm60.000891
SHBint50.000897
SaaCH0.0009
MATSp50.000901
MRVSA60.000904
slogPVSA10.000904
MATSm50.000922
bcute120.000926
J0.000927
GATSm40.000927
MRVSA50.000933
MATSm10.000934
GATSm80.000939
Smin120.000946
hmin0.00095
VC-40.000962
MATSe50.000963
MATSp40.000964
PEOEVSA50.000967
minHBd0.000971
GATSv30.000974
bcutm90.000979
PEOEVSA80.00098
ECCEN0.000987
MATSm80.000988
IC20.000995
BCUTp-1l0.001004
minssCH20.001017
QHss0.001019
Smax160.00102
bcutm120.001026
ETA_EtaP_F0.001026
ETA_dEpsilon_D0.001038
bcute40.001038
WTPT-30.001042
MAXDP20.001042
knotpv0.001043
MDEO-110.001043
maxHCsats0.00105
Chiv5ch0.001063
GATSe50.001072
VPC-50.001081
MATSv80.00109
maxsF0.001096
QNmin0.001109
ETA_BetaP_s0.001109
Chiv6ch0.00111
IC30.001117
VPC-60.001119
VSAEstate20.001121
MATSp30.001137
slogPVSA20.00114
WTPT-40.001162
gmin0.001163
minHBint60.001176
minHBint70.001195
Smax240.001225
MATSp60.001229
PEOEVSA10.001234
SIC20.001237
S340.001257
bcute10.001278
MATSv30.001281
SC-50.001283
dchi00.00129
SIC10.001291
maxHBd0.001307
PEOEVSA70.001342
MDEC-240.001345
SCH-70.001347
SHBd0.001349
MATSe80.001375
MATSv10.001375
SHCsatu0.001386
Smin150.001398
BCUTp-1h0.00141
GATSm30.001461
bcutp120.001465
MLFER_BH0.001485
GATSv10.001559
QOmax0.001589
slogPVSA00.001592
bcute100.001605
Smin240.001609
MATSp10.001616
Chiv30.001662
QNmax0.001663
bcutv40.00168
VCH-50.001717
VSAEstate40.001785
ATSc50.001813
C3SP20.001831
mindssC0.001846
ATSc20.001859
minHBint100.001866
ATSc30.001892
MDEC-220.001909
MAXDP0.001935
knotp0.001942
GATSm10.002
GATSp40.002026
maxsssCH0.002031
S250.002032
bcutp10.002049
ETA_Shape_Y0.002124
bcutp90.002186
XLogP0.002237
ATSc40.002298
maxHBint50.002321
maxHBint80.002379
minsOH0.002424
GATSm20.002443
SPC-60.00248
MATSe30.002549
MLFER_S0.002597
SHBint60.002733
ndssC0.002743
bcutv10.002787
VCH-70.002879
BCUTc-1l0.002923
QCmax0.00298
Scar0.003191
minssO0.003312
BCUTc-1h0.003494
MLFER_A0.003711
TopoPSA0.003768
MDEO-120.003868
minHBa0.004054
Smin330.004253
SHsOH0.004402
GATSe80.004485
PEOEVSA60.00461
Mnc0.004826
MATSe10.004978
LDI0.005205
MDEC-330.005471
GATSe10.005843
bcute20.005866
VC-50.006197
nC0.0064
nHBAcc0.006408
LogP20.006781
SHBint100.006873
Hy0.007517
kappam30.007695
VSAEstate10.007799
QNss0.009891
minsssN0.01014
LogP0.011371
maxssO0.011647
ATSp4.10.013044
QHmax0.014923
C1SP20.01631
minHBint50.017225
ATSv50.017404
Smax350.018515
minHsOH0.025883
maxHsOH0.028849
LipoaffinityIndex0.031403
QOmin0.041317
Qmin0.043254
MDEC-230.049635
ATSp5.10.143226