Research Article

Comparative Analysis on Alignment-Based and Pretrained Feature Representations for the Identification of DNA-Binding Proteins

Table 3

Results of embedded FS methods using three regularizers of the linear model in the light of 5CV on PDB1616.

Feature setFS method#FeaturesFivefold cross-validation on PDB1616
ACC (%)MCCSP (%)SN (%)

PSSMR_AllNFS58071.470.430667.8275.12
ElasticNet18872.770.456369.6875.87
Lasso6172.830.457769.4376.24
LassoLars5873.510.470771.5375.50

PSSMS_AllNFS58072.770.459765.9779.58
ElasticNet20774.010.484767.2080.82
Lasso5473.820.479468.3279.33
LassoLars3872.770.458666.9678.59

ESM_AvgNFS128079.270.590872.5286.01
ElasticNet43081.930.644275.3788.49
Lasso14283.110.665677.9788.24
LassoLars15182.430.651477.7287.13

ESM_AllNFS3712078.900.584371.5386.26
ElasticNet88486.140.726780.9491.34
Lasso36787.870.759883.9191.83
LassoLars25086.700.735383.6689.73