Table of Contents Author Guidelines Submit a Manuscript
Advances in Agriculture
Volume 2016, Article ID 7081491, 6 pages
Research Article

Analysis of Plant Breeding on Hadoop and Spark

1Jiaxing Vocational Technical College, No. 547 Tongxiang Road, Jiaxing, Zhejiang 314036, China
2Zhejiang University, No. 38 Zhejiang University Road Yuquan Campus, Hangzhou 310012, China

Received 7 December 2015; Revised 4 April 2016; Accepted 11 April 2016

Academic Editor: Tibor Janda

Copyright © 2016 Shuangxi Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS) and cluster based on memory iterative components. With this cluster, we achieve crop breeding large data analysis tasks in parallel through API provided by Spark. By experiments and tests of Indica and Japonica rice traits, plant breeding analysis platform can significantly improve the breeding of big data analysis speed, reducing the workload of concurrent programming.