Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2014 (2014), Article ID 859279, 10 pages
http://dx.doi.org/10.1155/2014/859279
Research Article

Cloud Based Metalearning System for Predictive Modeling of Biomedical Data

Faculty of Organizational Sciences, University of Belgrade, Jove Ilića 154, 11000 Belgrade, Serbia

Received 20 December 2013; Accepted 21 January 2014; Published 14 April 2014

Academic Editors: R. Colomo-Palacios and V. Stantchev

Copyright © 2014 Milan Vukićević et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. K. Kincade, “Data mining: digging for healthcare gold,” Insurance and Technology, vol. 23, no. 2, pp. IM2–IM7, 1998. View at Google Scholar
  2. H. C. Koh and G. Tan, “Data mining applications in healthcare,” Journal of Healthcare Information Management, vol. 19, no. 2, pp. 64–72, 2011. View at Google Scholar · View at Scopus
  3. K. Srinivas, B. K. Rani, and A. Govrdhan, “Applications of data mining techniques in healthcare and prediction of heart attacks,” International Journal on Computer Science and Engineering, vol. 2, no. 2, pp. 250–255, 2010. View at Google Scholar
  4. P. Maji, “Mutual information-based supervised attribute clustering for microarray sample classification,” IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 1, pp. 127–140, 2012. View at Publisher · View at Google Scholar · View at Scopus
  5. R. Das, “A comparison of multiple classification methods for diagnosis of Parkinson disease,” Expert Systems with Applications, vol. 37, no. 2, pp. 1568–1572, 2010. View at Publisher · View at Google Scholar · View at Scopus
  6. A. Thomas, N. H. Patterson, M. M. Marcinkiewicz, A. Lazaris, P. Metrakos, and P. Chaurand, “Histology-driven data mining of lipid signatures from multiple imaging mass spectrometry analyses: application to human colorectal cancer liver metastasis biopsies,” Analytical Chemistry, vol. 85, no. 5, pp. 2860–2866, 2013. View at Google Scholar
  7. R. L. Grossman and K. P. White, “A vision for a biomedical cloud,” Journal of Internal Medicine, vol. 271, no. 2, pp. 122–130, 2012. View at Publisher · View at Google Scholar · View at Scopus
  8. M. Vukićević, B. Delibasic, Z. Obradovic, M. Jovanović, and M. Suknović, “A method for design of data-tailored partitioning algorithms for optimizing the number of clusters in microarray analysis,” in Proceedings of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB '12), pp. 252–259, IEEE, San Diego, Calif, USA, 2012.
  9. H. Chae, I. Jung, H. Lee, S. Marru, S. W. Lee, and S. Kim, “Bio and health informatics meets cloud: BioVLab as an example,” Health Information Science and Systems, vol. 1, no. 1, article 6, 2013. View at Google Scholar
  10. J. Sun and C. K. Reddy, “Big data analytics for healthcare,” in Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1525–1525, ACM, August 2013.
  11. M. Matijaš, J. A. Suykens, and S. Krajcar, “Load forecasting using a multivariate meta-learning system,” Expert Systems With Applications, vol. 40, no. 11, pp. 4427–4437, 2013. View at Google Scholar
  12. S. Zhou, K. K. Lai, and J. Yen, “A dynamic meta-learning rate-based model for gold market forecasting,” Expert Systems with Applications, vol. 39, no. 6, pp. 6168–6173, 2012. View at Publisher · View at Google Scholar · View at Scopus
  13. J. Kanda, C. Soares, E. Hruschka, and A. de Carvalho, “A meta-learning approach to select meta-heuristics for the traveling salesman problem using MLP-based label ranking,” in Neural Information Processing, pp. 488–495, Springer, Berlin, Germany, 2012. View at Google Scholar
  14. A. Attig and P. Perner, “Meta-learning for image processing based on case-based reasoning,” in Computational Intelligence in Healthcare, vol. 4, pp. 229–264, Springer, Berlin, Germany, 2010. View at Google Scholar
  15. A. Vera-Baquero, R. Colomo-Palacios, and O. Molloy, “Business process analytics using a big data approach,” IT Professional, vol. 15, no. 6, pp. 29–35, 2013. View at Publisher · View at Google Scholar
  16. L. Dai, X. Gao, Y. Guo, J. Xiao, and Z. Zhang, “Bioinformatics clouds for big data manipulation,” Biology Direct, vol. 7, no. 1, article 43, 2012. View at Publisher · View at Google Scholar
  17. I. K. Lai, S. K. Tam, and M. F. Chan, “Knowledge cloud system for network collaboration: a case study in medical service industry in China,” Expert Systems with Applications, vol. 39, no. 15, pp. 12205–12212, 2012. View at Publisher · View at Google Scholar
  18. S. Sonnenburg, M. L. Braun, S. O. Cheng et al., “The need for open source software in machine learning,” Journal of Machine Learning Research, vol. 8, pp. 2443–2466, 2007. View at Google Scholar · View at Scopus
  19. S. Marston, Z. Li, S. Bandyopadhyay, J. Zhang, and A. Ghalsasi, “Cloud computing: the business perspective,” Decision Support Systems, vol. 51, no. 1, pp. 176–189, 2011. View at Publisher · View at Google Scholar · View at Scopus
  20. N. Sultan, “Cloud computing for education: a new dawn?” International Journal of Information Management, vol. 30, no. 2, pp. 109–116, 2010. View at Publisher · View at Google Scholar · View at Scopus
  21. W. Zhang and Q. Chen, “From E-government to C-government via cloud computing,” in Proceedings of the 1st International Conference on E-Business and E-Government (ICEE '10), pp. 679–682, IEEE, May 2010. View at Publisher · View at Google Scholar · View at Scopus
  22. A. Fernández, S. del Río, F. Herrera, and J. M. Benítez, “An overview on the structure and applications for business intelligence and data mining in cloud computing,” in Proceedings of the 7th International Conference on Knowledge Management in Organizations: Service and Cloud Computing, pp. 559–570, Springer, Berlin, Germany, January 2013.
  23. S. P. Ahuja, S. Mani, and J. Zambrano, “A survey of the state of cloud computing in healthcare,” Network and Communication Technologies, vol. 1, no. 2, pp. 12–19, 2012. View at Google Scholar
  24. W. Li, J. Yan, Y. Yan, and J. Zhang, “Xbase: cloud-enabled information appliance for healthcare,” in Proceedings of the 13th International Conference on Extending Database Technology: Advances in Database Technology (EDBT '10), pp. 675–680, March 2010. View at Publisher · View at Google Scholar · View at Scopus
  25. M. C. Schatz, B. Langmead, and S. L. Salzberg, “Cloud computing and the DNA data race,” Nature Biotechnology, vol. 28, no. 7, pp. 691–693, 2010. View at Publisher · View at Google Scholar · View at Scopus
  26. T. Oinn, M. Addis, J. Ferris et al., “Taverna: a tool for the composition and enactment of bioinformatics workflows,” Bioinformatics, vol. 20, no. 17, pp. 3045–3054, 2004. View at Publisher · View at Google Scholar · View at Scopus
  27. B. Giardine, C. Riemer, R. C. Hardison et al., “Galaxy: a platform for interactive large-scale genome analysis,” Genome Research, vol. 15, no. 10, pp. 1451–1455, 2005. View at Publisher · View at Google Scholar · View at Scopus
  28. M. C. Schatz, “CloudBurst: highly sensitive read mapping with MapReduce,” Bioinformatics, vol. 25, no. 11, pp. 1363–1369, 2009. View at Publisher · View at Google Scholar · View at Scopus
  29. B. D. O'Connor, B. Merriman, and S. F. Nelson, “SeqWare query engine: storing and searching sequence data in the cloud,” BMC Bioinformatics, vol. 11, supplement 12, article S2, 2010. View at Publisher · View at Google Scholar · View at Scopus
  30. D. Hong, A. Rhie, S. Park et al., “FX: an RNA-seq analysis tool on the cloud,” Bioinformatics, vol. 28, no. 5, pp. 721–723, 2012. View at Publisher · View at Google Scholar · View at Scopus
  31. S. V. Angiuoli, M. Matalka, A. Gussman et al., “CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing,” BMC Bioinformatics, vol. 12, no. 1, article 356, 2011. View at Publisher · View at Google Scholar · View at Scopus
  32. R. Zhang and L. Liu, “Security models and requirements for healthcare application clouds,” in Proceedings of the 3rd IEEE International Conference on Cloud Computing (CLOUD '10), pp. 268–275, IEEE, July 2010. View at Publisher · View at Google Scholar · View at Scopus
  33. R. Wooten, R. Klink, F. Sinek, Y. Bai, and M. Sharma, “Design and implementation of a secure healthcare social cloud system,” in Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid '12), pp. 805–810, IEEE, May 2012.
  34. W. Liu and E. K. Park, “E-healthcare cloud computing application solutions: cloud-enabling characteristics, challenges and adaptations,” in Proceedings of the International Conference on Computing, Networking and Communications (ICNC '13), pp. 437–443, IEEE, January 2013.
  35. T. Giles and R. Wilcox, “IBM Watson and medical records text analytics,” in Proceedings of the 2011 Healthcare Information and Management Systems Society Meeting, 2011.
  36. A. Abbasi, C. Albrecht, A. Vance, and J. Hansen, “Metafraud: a meta-learning framework for detecting financial fraud,” MIS Quarterly, vol. 36, no. 4, pp. 1293–1327, 2012. View at Google Scholar
  37. C. Lemke and B. Gabrys, “Meta-learning for time series forecasting and forecast combination,” Neurocomputing, vol. 73, no. 10-12, pp. 2006–2016, 2010. View at Publisher · View at Google Scholar · View at Scopus
  38. N. Iam-On, T. Boongoen, and S. Garrett, “LCE: a link-based cluster ensemble method for improved gene expression data analysis,” Bioinformatics, vol. 26, no. 12, pp. 1513–1519, 2010. View at Publisher · View at Google Scholar · View at Scopus
  39. K. A. Smith-Miles, “Towards insightful algorithm selection for optimisation using meta-learning concepts,” in Proceedings of the (IEEE World Congress on Computational Intelligence) IEEE International Joint Conference on Neural Networks (IJCNN '08), pp. 4118–4124, IEEE, Hong Kong, June 2008. View at Scopus
  40. K. A. Smith-Miles, “Cross-disciplinary perspectives on meta-learning for algorithm selection,” ACM Computing Surveys, vol. 41, no. 1, article 6, 2008. View at Publisher · View at Google Scholar · View at Scopus
  41. B. Delibašić, K. Kirchner, J. Ruhland, M. Jovanović, and M. Vukićević, “Reusable components for partitioning clustering algorithms,” Artificial Intelligence Review, vol. 32, no. 1–4, pp. 59–75, 2009. View at Publisher · View at Google Scholar
  42. B. Delibašić, M. Vukićević, M. Jovanović, K. Kirchner, J. Ruhland, and M. Suknović, “An architecture for component-based design of representative-based clustering algorithms,” Data and Knowledge Engineering, vol. 75, pp. 78–98, 2012. View at Google Scholar
  43. M. Suknović, B. Delibašić, M. Jovanović, M. Vukićević, D. Becejski-Vujaklija, and Z. Obradovic, “Reusable components in decision tree induction algorithms,” Computational Statistics, vol. 27, no. 1, pp. 127–148, 2012. View at Publisher · View at Google Scholar · View at Scopus
  44. M. Vukićević, B. Delibašić, M. Jovanović, M. Suknović, and Z. Obradović, “Internal evaluation measures as proxies for external indices in clustering gene expression data,” in Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM '11), pp. 574–577, IEEE Computer Society, 2011. View at Publisher · View at Google Scholar
  45. M. Vukićević, K. Kirchner, B. Delibašić, M. Jovanović, J. Ruhland, and M. Suknović, “Finding best algorithmic components for clustering microarray data,” Knowledge and Information Systems, vol. 35, no. 1, pp. 111–130, 2013. View at Publisher · View at Google Scholar
  46. A. Nascimiento, R. Prudencio, M. de Souto, and I. Costa, “Mining rules for the automatic selection process of clustering methods applied to cancer gene expression data,” in Proceedings of the 19th International Conference on Artificial Neural Networks: Part II, pp. 20–29, Springer, Limassol, Cyprus, 2009.
  47. N. X. Vinh, J. Epps, and J. Bailey, “Information theoretic measures for clusterings comparison: variants, properties, normalization and correction for chance,” Journal of Machine Learning Research, vol. 11, pp. 2837–2854, 2010. View at Google Scholar · View at Scopus
  48. S. Radovanović, M. Vukićević, M. Jovanović, B. Delibasic, and M. Suknović, “Meta-learning system for clustering gene expression microarray data,” in Proceedings of the 4th RapidMiner Community Meeting and Conference (RCOMM '13), pp. 97–111, Shaker, Porto, Portugal, 2013.
  49. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, and S. Anthony, “Hive: a warehousing solution over a map-reduce framework,” Proceedings of the VLDB Endowment, vol. 2, no. 2, pp. 1626–1629, 2009. View at Google Scholar
  50. R. Anil, T. Dunning, and E. Friedman, Mahout in Action, Manning, 2011.
  51. V. Stantchev and M. Malek, “Addressing dependability throughout the SOA life cycle,” IEEE Transactions on Services Computing, vol. 4, no. 2, pp. 85–95, 2011. View at Publisher · View at Google Scholar · View at Scopus
  52. I. Mierswa, M. Wurst, R. Klinkenberg, M. Scholz, and T. Euler, “YALE: rapid prototyping for complex data mining tasks,” in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '06), pp. 935–940, ACM, August 2006. View at Scopus
  53. Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2005.
  54. Z. Prekopcsák, G. Makrai, T. Henk, and C. Gáspár-Papanek, “Radoop: analyzing big data with rapidminer and hadoop,” in Proceedings of the 2nd RapidMiner Community Meeting and Conference (RCOMM '11), June 2011.
  55. J. N. van Rijn, B. Bischl, L. Torgo, B. Gao, V. Umaashankar, and S. Fischer, “OpenML: a collaborative science platform,” in Machine Learning and Knowledge Discovery in Databases, pp. 645–649, Springer, Berlin, Germany, 2013. View at Google Scholar