An Effective Big Data Supervised Imbalanced Classification Approach for Ortholog Detection in Related Yeast Species
Table 2
Big data framework, applications, and algorithms.
Big data framework
Application
Algorithms
Hadoop 2.0.0 (Cloudera CDH4.7.1) with the head node configured as name-node and job-tracker, and the rest as data-nodes and task-trackers
(i) MapReduce ROS implementation (ii) A cost-sensitive approach for Random Forest MapReduce algorithm (RF-BD) (iii) MapReduce RF implementation (Mahout library)
RF-BDCS ROS (100%) + RF-BD ROS (130%) + RF-BD
Apache Spark 1.0.0 with the head node configured as master and name-node, and the rest as workers and data-nodes