Research Article
Optimizing Hadoop Performance for Big Data Analytics in Smart Grid
Table 1
Hadoop core parameters as GEP variables.
| GEP variables | Hadoop parameters | Default values | Data types |
| 0 | io.sort.factor | 10 | Integer | 1 | io.sort.mb | 100 | Integer | 2 | io.sort.spill.percent | 0.80 | Float | 3 | mapred.reduce.tasks | 1 | Integer | 4 | mapreduce.tasktracker.map.tasks.maximum | 2 | Integer | 5 | mapreduce.tasktracker.reduce.tasks.maximum | 2 | Integer | 6 | mapred.child.java.opts | 200 | Integer | 7 | mapreduce.reduce.shuffle.input.buffer.percent | 0.70 | Float | 8 | mapred.inmem.merge.threshold | 1000 | Integer | 9 | Input data size (number of samples/MB) | User | Integer |
|
|