About this Journal Submit a Manuscript Table of Contents
Journal of Nucleic Acids
Volume 2012 (2012), Article ID 652979, 10 pages
http://dx.doi.org/10.1155/2012/652979
Research Article

Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees

Division of Plant Sciences, Research School of Biology, College of Medicine, Biology & Environment, The Australian National University, Canberra, ACT 0200, Australia

Received 6 July 2012; Revised 10 September 2012; Accepted 17 September 2012

Academic Editor: Thomas Litman

Copyright © 2012 Philip H. Williams et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary Material

Supplementary Figure S1: The graph shows the relationship of sensitivity and specificity over 28 boosting runs. The insert in the middle of the graph zooms in to show details. It can be seen that the accuracy improves only little with varying levels of boosting trials.

Supplementary Figure S2: A representative decision tree output from running C5.0. The terminal nodes end with classification of Not_miRNA or class_miRNA. The number in parentheses represents the count of sequences correctly classified terminating at the node. If there are two numbers the second is the count of incorrectly classified cases at that node based on training data of known outcome.

Supplementary Figure S3: Sequence TCCTGGCCTGATTGAGTGGCA (shown pulled out from the stem-loop) starting at position 8 from the 5' end is from Sorghum bicoloras [GenBank:CN127271] and found at position 57396890 on chromosome 5. Despite randomly picking short sequences from ESTs as negative controls for training, the model does not classify some as negative controls.

Supplementary Figure S4: Sequence TGCAAGCCTGTTGTTGAGCGA (shown pulled out from the stem-loop) starting at position 8 from the 5' end is from Vitis viniferaas [GenBank:EE095594] found on chromosome 6 at position 1508484. Some negative controls classified as miRNA exist within precursor-like predicted stem-loops.

  1. Supplementary Figures