Research Article

Distributed Nonparametric and Semiparametric Regression on SPARK for Big Data Forecasting

Table 1

Characteristics of the datasets.

Dataset Number of recordsNumber of factors

Synthetic data10,0002
Traffic data6,5007
Airlines delays120,000,000 (13,000)29 + 22 (10)