Research Article

A Distributed Framework for Predictive Analytics Using Big Data and MapReduce Parallel Programming

Algorithm 2

Reduce function of MapReduce-I
Function REDUCE-I(MAP-I output)
   read <Dataset id, (intercept, coefficients[])> from HDFS
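   initialize sum_intercept = 0 and sum_coefficients[j] = 0 for j = 1 to n
   for i = 1 to s partitions
      sum_intercept += intercept_i
   end for
   for i = 1 to s partitions
      for j = 1 to n attributes
         sum_coefficients[j] += coefficients_i[j]
      end for
   end for
   compute avg_intercept = sum_intercept / s and avg_coefficients[j] = sum_coefficients[j] / s for j = 1 to n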
   construct a learned MR-MLR model
   output <MR-MLR model>
end
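
The reduce step of Algorithm 2 can be sketched in plain Python, outside Hadoop. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `reduce_average` and `predict` are hypothetical, and an in-memory list of (intercept, coefficients) pairs stands in for the records read from HDFS.

```python
from typing import List, Sequence, Tuple

# One Map task's output: (intercept, coefficients), assumed already parsed from HDFS.
PartitionModel = Tuple[float, Sequence[float]]


def reduce_average(partition_models: List[PartitionModel]) -> Tuple[float, List[float]]:
    """Average the regression parameters fitted on each of the s partitions."""
    if not partition_models:
        raise ValueError("need at least one partition model")
    s = len(partition_models)            # number of partitions
    n = len(partition_models[0][1])      # number of attributes
    sum_intercept = 0.0
    sum_coefficients = [0.0] * n
    for intercept, coefficients in partition_models:
        if len(coefficients) != n:
            raise ValueError("all partitions must have the same number of attributes")
        sum_intercept += intercept
        for j in range(n):
            sum_coefficients[j] += coefficients[j]
    avg_intercept = sum_intercept / s
    avg_coefficients = [c / s for c in sum_coefficients]
    return avg_intercept, avg_coefficients


def predict(model: Tuple[float, Sequence[float]], x: Sequence[float]) -> float:
    """Apply the averaged linear model to one attribute vector x."""
    intercept, coefficients = model
    return intercept + sum(c * v for c, v in zip(coefficients, x))


if __name__ == "__main__":
    # Two hypothetical partition models with two attributes each.
    model = reduce_average([(1.0, [2.0, 4.0]), (3.0, [4.0, 8.0])])
    print(model)                       # (2.0, [3.0, 6.0])
    print(predict(model, [1.0, 1.0]))  # 11.0
```

The two loops of the pseudocode are merged into a single pass over the partitions here. The result is the same, since both sums run over the same s records.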