Research Article
SCBI_MapReduce, a New Ruby Task-Farm Skeleton for Automated Parallelisation and Distribution in Chunks of Sequences: The Implementation of a Boosted Blast+
Table 1
SCBI_MAPREDUCE performance tests using three different datasets on the “x86 upgraded” cluster. Execution times are expressed in seconds. The number immediately before X indicates the number of reads grouped in a chunk for every parallel task.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
aIntegers were subjected to futile, intensive calculations that took at least 1 s on every object. bThe dataset of real-world sequences consisted of 261 304 sequence reads (mean: 276 nt; mode: 263 nt; coefficient of variation: 11%) obtained from a 454/FLX sequencer downloaded from the SRA database (AC# SRR069473). cThe dataset of artificial sequences consisted of 425 438 sequences obtained using the software ART with a 2 coverage simulating a 454/FLX sequencing from the Danio rerio chromosome 1 (AC# NC_007112.5). dUsing one core is equivalent to a linear job without any parallelisation or distribution; it acts as control reference. |