Research Article

Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees

Table 3

Base set of attributes.

Attribute           Description of the attribute in relation to the control or candidate sequence

chromLen/position   Ratio of the chromosome length to the position on that chromosome
ShannonEntropyNorm  Shannon entropy normalized to the sequence length
G%                  Percentage of G base composition
C%                  Percentage of C base composition
T%                  Percentage of T base composition
A%                  Percentage of A base composition
DuplexEnergy        Duplex energy of the miRNA:miRNA* duplex
DuplexEnergyNorm    Duplex energy normalized to the length of the duplex structure
MaxMismatch         Maximum number of mismatches in the duplex structure, taken over both sides of the structure
minMatchPercent     Minimum percentage match relative to the length of the duplex structure, taken over both sides of the structure
DeltaG              Minimum free energy of the stem loop
DeltaGnorm          Minimum free energy normalized to the length of the stem loop
longestDotSet       Longest run of mismatches in the stem loop
longestBracketSet   Longest run of matches in the stem loop
loopCountNorm       Number of loop heads normalized to the length of the stem loop
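
Several attributes in Table 3 can be derived directly from a sequence and its predicted secondary structure. The following Python sketch illustrates one possible reading of the composition and structure attributes; it is not the implementation used in the article. The function names are hypothetical, and it assumes that (a) "normalized" means division by the sequence or structure length, (b) the stem loop is given in dot-bracket notation, where dots mark unpaired positions and brackets mark paired positions, (c) a run of matches may mix opening and closing brackets, and (d) a loop head is an opening bracket followed by one or more dots and a closing bracket. The energy attributes (DeltaG, DuplexEnergy) require an RNA folding tool and are not covered here.

```python
import math
import re
from collections import Counter


def base_percentages(seq):
    """Return the G%, C%, T% and A% attributes for a DNA sequence."""
    seq = seq.upper()
    counts = Counter(seq)
    n = len(seq)
    return {base + "%": 100.0 * counts.get(base, 0) / n for base in "GCTA"}


def shannon_entropy_norm(seq):
    """Shannon entropy (in bits) of the base distribution, divided by length.

    Assumption: 'normalized to the sequence length' means entropy / len(seq).
    """
    seq = seq.upper()
    n = len(seq)
    entropy = -sum((c / n) * math.log2(c / n) for c in Counter(seq).values())
    return entropy / n


def longest_run(structure, chars):
    """Length of the longest run of consecutive characters taken from chars."""
    runs = re.findall("[" + re.escape(chars) + "]+", structure)
    return max((len(run) for run in runs), default=0)


def loop_count_norm(structure):
    """Number of loop heads divided by the structure length.

    Assumption: a loop head is '(' + one or more '.' + ')'.
    """
    heads = re.findall(r"\(\.+\)", structure)
    return len(heads) / len(structure)


if __name__ == "__main__":
    seq = "GGCCAATT"                      # toy sequence, not real data
    structure = "(((..(((...)))..)))"     # toy dot-bracket string
    print(base_percentages(seq))
    print("ShannonEntropyNorm:", shannon_entropy_norm(seq))
    print("longestDotSet:", longest_run(structure, "."))
    print("longestBracketSet:", longest_run(structure, "()"))
    print("loopCountNorm:", loop_count_norm(structure))
```

Under the same division-by-length assumption, DeltaGnorm and DuplexEnergyNorm would be the corresponding energy divided by the stem-loop or duplex length.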