Research Article

Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection

Table 2

Optimal 47 features for prediction of RNA-binding proteins.

RankFeature

1EIPP of ASP in protein sequence for the pKa values of amino group
2EIPP of GLU in protein sequence for the Balaban index
3BP(2)
4EIPP of TYR in protein sequence for the pKa values of amino group
5CT of class a, class b, and class e
6CT of class d, class b, and class e

7EIPP of HIS in protein sequence for the pKa values of amino group
8EIPP of LYS in protein sequence for the pKa values of carboxyl group
9CT of class b, class d, and class e
10CT of class d, class c, and class e
11EIPP of MET in protein sequence for the molecular mass
12CT of class b, class e, and class a
13EIPP of ARG in protein sequence for the pKa values of amino group
14NBP(2)
15CT of class c, class e, and class d
16BP(1)
17EIPP of TRP in protein sequence for the pKa values of amino group
18CT of class d, class d, and class e
19EIPP of LYS in protein sequence for the Balaban index
20NBP(1)
21CT of class c, class a, and class d
22CT of class b, class e, and class d
23CT of class e, class d, and class e
24EIPP of HIS in protein sequence for the pKa values of carboxyl group
25CT of class d, class c, and class f
26CT of class e, class f, and class d
27CT of class e, class b, and class d
28CT of class d, class e, and class c
29EIPP of GLY in protein sequence for the pKa values of carboxyl group
30EIPP of THR in protein sequence for the molecular mass
31CT of class c, class b, and class e
32CT of class c, class e, and class a
33EIPP of GLN in protein sequence for Wiener index
34EIPP of SER in protein sequence for Wiener index
35EIPP of ASN in protein sequence for the molecular mass
36CT of class b, class a, and class c
37CT of class e, class d, and class f
38CT of class e, class b, and class a
39EIPP of TRP in protein sequence for the pKa values of carboxyl group
40CT of class a, class e, and class c
41EIPP of ARG in protein sequence for the lowest free energy
42CT of class e, class c, and class d
43EIPP of LYS in protein sequence for the molecular mass

44CT of class e, class e, and class d
45EIPP of TYR in protein sequence for Wiener index
46CT of class e, class c, and class b
47CT of class f, class c, and class d