Research Article

An Effective Big Data Supervised Imbalanced Classification Approach for Ortholog Detection in Related Yeast Species

Table 2. Big data framework, applications, and algorithms.

| Big data framework | Application | Algorithms |
| --- | --- | --- |
| Hadoop 2.0.0 (Cloudera CDH4.7.1), with the head node configured as name-node and job-tracker and the remaining nodes as data-nodes and task-trackers | (i) MapReduce ROS implementation; (ii) a cost-sensitive approach for the Random Forest MapReduce algorithm (RF-BD); (iii) MapReduce RF implementation (Mahout library) | RF-BDCS; ROS (100%) + RF-BD; ROS (130%) + RF-BD |
| Apache Spark 1.0.0, with the head node configured as master and name-node and the remaining nodes as workers and data-nodes | Apache Spark Support Vector Machines (MLlib) | ROS (100%) + SVM-BD; ROS (130%) + SVM-BD |
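
The ROS (100%) and ROS (130%) entries in the table refer to random oversampling applied before the classifier is trained. The following is a minimal single-machine sketch of that idea, not the MapReduce implementation used in the study. It assumes that the percentage is the target size of the minority class relative to the majority class, which is an interpretation rather than something the table states. The function name `random_oversample` and its parameters are hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, minority_label, ratio=1.0, seed=0):
    """Replicate randomly chosen minority rows until the minority class
    holds about ratio * (number of majority rows) examples.

    Assumption: ratio=1.0 corresponds to "ROS (100%)" and ratio=1.3 to
    "ROS (130%)"; the table itself does not define the percentage.
    """
    rng = random.Random(seed)
    minority_idx = [i for i, y in enumerate(labels) if y == minority_label]
    if not minority_idx:
        raise ValueError("no minority examples to replicate")
    majority_count = len(labels) - len(minority_idx)

    # Number of minority rows wanted after resampling.
    target = int(round(ratio * majority_count))
    extra = max(0, target - len(minority_idx))

    # Sample with replacement from the existing minority rows.
    picks = [rng.choice(minority_idx) for _ in range(extra)]
    new_rows = list(rows) + [rows[i] for i in picks]
    new_labels = list(labels) + [labels[i] for i in picks]
    return new_rows, new_labels


if __name__ == "__main__":
    # Toy data: 10 majority examples (label 0) and 2 minority examples (label 1).
    rows = [[float(i)] for i in range(12)]
    labels = [0] * 10 + [1] * 2
    for ratio in (1.0, 1.3):
        _, resampled = random_oversample(rows, labels, minority_label=1, ratio=ratio)
        print(ratio, Counter(resampled))
```

The resampled rows would then be passed to whichever learner the row of the table names (Random Forest or SVM). In a distributed setting the replication step would run per partition instead of over an in-memory list.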