Research Article

Research on Parallel Support Vector Machine Based on Spark Big Data Platform

Table 2

Configuration of test cluster.

NumberNode nameHadoop’s configurationSpark’s configuration

1Masterdfs.replication = 3; map.tasks.maximum = 16; reduce.tasks.maximum = 2; child.java.opts = -Xmx4096 MSPARK_MEM = 20g
2Slave01
3Slave02
4Slave03
5Slave04
6Slave05