Research Article
Distributed Nonparametric and Semiparametric Regression on SPARK for Big Data Forecasting
Table 1
Characteristics of the datasets.
| Dataset | Number of records | Number of factors |
| Synthetic data | 10,000 | 2 | Traffic data | 6,500 | 7 | Airlines delays | 120,000,000 (13,000) | 29 + 22 (10) |
|
|