Research Article

A Distributed Framework for Predictive Analytics Using Big Data and MapReduce Parallel Programming

Algorithm 3

Map function of MapReduce-II
Function MAP-II (testing dataset)
   Input: testing dataset TD with k instances and n attributes
   Partition the dataset TD into z partitions as p1, p2, p3…..ps
   Read x_test[], y_test for each partitioned dataset
      Predict y_predict with the MR-MLR model
      Convert it into (key, value > pair as < Dataset_id, (y_predict, y_test)>
   Output < Dataset_id, (y_predict, y_test)>
End