Research Article
A Distributed Framework for Predictive Analytics Using Big Data and MapReduce Parallel Programming
Table 1
Summary description of the used datasets.
| S. No. | Dataset | #Attributes | #Data types | #Instances | #File size (MB) | #Year |
| 1 | Combined cycle power plant | 4 | Multivariate | 9568 | 1.93 | 2014 | 2 | Wave energy converters | 49 | Multivariate | 288000 | 123 | 2019 | 3 | Year prediction MSD (subset of million song dataset) | 90 | Multivariate | 515345 | 433 | 2011 | 4 | Superconductivity data | 81 | Multivariate | 21263 | 26.8 | 2018 |
|
|