Research Article

A De Novo Genome Assembly Algorithm for Repeats and Nonrepeats

Figure 5

Schematic of extending nonrepeats and boundary detection. The graphic illustration of extending repeats and boundary detection. Green line represents the extended nonrepeats, blue line represents the extending nonrepeats, and the red line represents the potential repeats. The yellow lines represent the supporting reads overlapped with the extension. We assume that the sequencing depth , and let . Therefore, in the process of extending nonrepeats, the mean value of dynamic overlapping interval filtered by sliding window should be smaller than or equal to . The dotted box represents the potential boundary of repeats. Consequently, if we set , the extension will be stopped at B1 or the extension will be stopped at B2.
736473.fig.005