Research Article

Rule-Based Knowledge Acquisition Method for Promoter Prediction in Human and Drosophila Species

Table 4

Top 20 descriptors of 4-mer motifs. Descriptors among the top 20 that are contained in the reference set of 167 DNASDs are marked with +. The descriptor of the TATA motif ranks 199th and 98th for the HPL and DPL datasets, respectively.

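| Rank | HPL: 4-mer motif | HPL: Score | HPL: Included | DPL: 4-mer motif | DPL: Score | DPL: Included |
|---|---|---|---|---|---|---|
| 1 | TGAA | 1000 | + | AAAG | 1000 | + |
| 2 | TGAT | 941 | + | AAGA | 956 | + |
| 3 | CCGG | 878 | | TTCG | 948 | + |
| 4 | TATG | 843 | + | AGAA | 922 | |
| 5 | TGGA | 817 | | GAAA | 866 | |
| 6 | GATG | 770 | + | AAGG | 791 | + |
| 7 | TCAA | 739 | + | CGCC | 787 | |
| 8 | TACA | 702 | + | AGAT | 777 | |
| 9 | AGGC | 697 | | AATA | 759 | |
| 10 | ATGA | 694 | + | TCGC | 747 | |
| 11 | TTGA | 672 | + | TGAT | 744 | + |
| 12 | CGGC | 662 | | TGAA | 732 | |
| 13 | CAGG | 651 | | ATCG | 732 | + |
| 14 | ATGT | 634 | | TCGA | 724 | + |
| 15 | AGCG | 633 | | CGGT | 724 | |
| 16 | CGCG | 629 | | ATAA | 712 | + |
| 17 | AGCC | 618 | | CGAT | 710 | |
| 18 | TCAT | 595 | | CGCG | 703 | |
| 19 | GAGC | 592 | + | GAAG | 699 | + |
| 20 | AGGG | 582 | | ATAG | 697 | |
| 25 | | | | AAGT | 642 | |
| 199 (HPL), 98 (DPL) | TATA | 111 | | TATA | 365 | |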

+: included in the set of DNASDs.
−: not included in the set of DNASDs.