Research Article
Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees
Table 3
Base set of attributes.
| Attribute | Description of the attribute in relation to the control or candidate sequence |
| chromLen/position | The ratio of the length of the chromosome over the position on that chromosome | ShannonEntropyNorm | Shannon entropy normalized to the sequence length | G% | Percentage of G base composition | C% | Percentage of C base composition | T% | Percentage of T base composition | A% | Percentage of A base composition | DuplexEnergy | The duplex energy between the miRNAs:miRNAs* | DuplexEnergyNorm | The duplex energy normalized to the length of the duplex structure | MaxMismatch | Maximum number of mismatches in the duplex structure based on both sides of the structure | minMatchPercent | Minimum % match based on length of the duplex structure both sides of the structure | DeltaG | Minimum free energy for the stem loop | DeltaGnorm | Minimum free energy normalized to the length of the stem loop | longestDotSet | Longest run of mismatches in the stem loop | longestBracketSet | Longest run of matches in the stem loop | loopCountNorm | Number of loop heads normalized to the length of the stem loop |
|
|