Table of Contents Author Guidelines Submit a Manuscript
Journal of Electrical and Computer Engineering
Volume 2016, Article ID 3516358, 8 pages
http://dx.doi.org/10.1155/2016/3516358
Research Article

Heuristic Data Placement for Data-Intensive Applications in Heterogeneous Cloud

1Tianjin University of Science and Technology, Tianjin 300222, China
2Tianjin House Fund Management Center, Tianjin 300222, China

Received 1 December 2015; Accepted 12 April 2016

Academic Editor: Hui Cheng

Copyright © 2016 Qing Zhao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. E. Deelman, G. Singh, M. Livny, B. Berriman, and J. Good, “The cost of doing science on the cloud: the Montage example,” in Proceedings of the ACM/IEEE Conference on Supercomputing (SC '08), Austin, Tex, USA, November 2008.
  2. T. Kosar and M. Livny, “A framework for reliable and efficient data placement in distributed computing systems,” Journal of Parallel and Distributed Computing, vol. 65, no. 10, pp. 1146–1157, 2005. View at Publisher · View at Google Scholar · View at Scopus
  3. T. Kosar and M. Livny, “Stork: making data placement a first class citizen in the grid,” in Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS '04), pp. 342–349, Tokyo, Japan, March 2004.
  4. H. Liu and D. Orban, “GridBatch: cloud computing for large-scale data-intensive batch applications,” in Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGRID '08), pp. 295–305, Lyon, France, May 2008. View at Publisher · View at Google Scholar · View at Scopus
  5. E. Deelman and A. Chervenak, “Data management challenges of data-intensive scientific workflows,” in Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGRID '08), pp. 687–692, Lyon, France, May 2008. View at Publisher · View at Google Scholar · View at Scopus
  6. A. Weiss, “Computing in the clouds,” netWorker, vol. 11, no. 4, pp. 16–25, 2007. View at Publisher · View at Google Scholar
  7. Google App Engine, http://code.google.com/appengine/.
  8. Amazon Elastic Computing Cloud, http://aws.amazon.com/ec2/.
  9. Hadoop, http://hadoop.apache.org/.
  10. J. Xie, S. Yin, X. Ruan et al., “Improving MapReduce performance through data placement in heterogeneous Hadoop clusters,” in Proceedings of the IEEE International Parallel & Distributed Processing Symposium, April 2010.
  11. A. Chervenak, E. Deelman, M. Livny et al., “Data placement for scientific applications in distributed environments,” in Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, pp. 267–274, IEEE, Austin, Tex, USA, September 2007. View at Publisher · View at Google Scholar · View at Scopus
  12. G. Singh, K. Vahi, A. Ramakrishnan et al., “Optimizing workflow data footprint,” Scientific Programming, vol. 15, no. 4, pp. 249–268, 2007. View at Publisher · View at Google Scholar · View at Scopus
  13. Z. Wu, X. Liu, Z. Ni, D. Yuan, and Y. Yang, “A market-oriented hierarchical scheduling strategy in cloud workflow systems,” Journal of Supercomputing, vol. 63, no. 1, pp. 256–293, 2013. View at Publisher · View at Google Scholar · View at Scopus
  14. A. Talukder, M. Kirley, and R. Buyya, “Multiobjective differential evolution for scheduling workflow applications on global grids,” Concurrency & Computation Practice & Experience, vol. 21, no. 13, pp. 1742–1756, 2009. View at Google Scholar
  15. X. Liu, Z. Ni, Z. Wu, D. Yuan, J. Chen, and Y. Yang, “A novel general framework for automatic and cost-effective handling of recoverable temporal violations in scientific workflow systems,” Journal of Systems and Software, vol. 84, no. 3, pp. 492–509, 2011. View at Publisher · View at Google Scholar · View at Scopus
  16. T. Ghafarian and B. Javadi, “Cloud-aware data intensive workflow scheduling on volunteer computing systems,” Future Generation Computer Systems, vol. 51, pp. 87–97, 2014. View at Publisher · View at Google Scholar · View at Scopus
  17. D. Yuan, Y. Yang, X. Liu, and J. Chen, “A data placement strategy in scientific cloud workflows,” Future Generation Computer Systems, vol. 26, no. 8, pp. 1200–1214, 2010. View at Publisher · View at Google Scholar · View at Scopus
  18. E.-D. Zhao, Y.-Q. Qi, X.-X. Xiang, and Y. Chen, “A data placement strategy based on genetic algorithm for scientific workflows,” in Proceedings of the 8th International Conference on Computational Intelligence and Security (CIS '12), pp. 146–149, Guangzhou, China, November 2012. View at Publisher · View at Google Scholar · View at Scopus
  19. Q. Zhao and C. Xiong, “An improved data layout algorithm based on data correlation clustering in cloud,” in Proceedings of the International Symposium on Information Technology Convergence, October 2014.
  20. Q. Zhao, C. Xiong, X. Zhao, C. Yu, and J. Xiao, “A data placement strategy for data-intensive scientific workflows in cloud,” in Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid '15), pp. 928–934, IEEE, Shenzhen, China, May 2015. View at Publisher · View at Google Scholar · View at Scopus