Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2017 (2017), Article ID 1729301, 9 pages
https://doi.org/10.1155/2017/1729301
Research Article

Protein Function Prediction Using Deep Restricted Boltzmann Machines

College of Computer and Information Science, Southwest University, Chongqing, China

Correspondence should be addressed to Guoxian Yu

Received 30 March 2017; Accepted 30 May 2017; Published 28 June 2017

Academic Editor: Peter J. Oefner

Copyright © 2017 Xianchun Zou et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. R. J. Roberts, “Identifying protein functiona call for community action,” PLoS Biology, vol. 2, no. 3, p. e42, 2004. View at Publisher · View at Google Scholar
  2. G. Pandey, V. Kumar, and M. Steinbach, in Computational approaches for protein function prediction: a survey, pp. 6–28, Department of Computer Science and Engineering, University of Minnesota, A survey, 2006.
  3. A. M. Schnoes, D. C. Ream, A. W. Thorman, P. C. Babbitt, and I. Friedberg, “Biases in the experimental annotations of protein function and their effect on our understanding of protein function space,” PLoS Computational Biology, vol. 9, no. 5, Article ID e1003063, 2013. View at Publisher · View at Google Scholar · View at Scopus
  4. P. D. Thomas, V. Wood, C. J. Mungall, S. E. Lewis, and J. A. Blake, “On the use of gene ontology annotations to assess functional similarity among orthologs and paralogs: a short report,” PLoS Computational Biology, vol. 8, no. 2, Article ID e1002386, 2012. View at Publisher · View at Google Scholar · View at Scopus
  5. P. Legrain, R. Aebersold, A. Archakov et al., “The human proteome project: current state and future direction,” Molecular & Cellular Proteomics, vol. 10, no. 7, article 009993, 2011. View at Publisher · View at Google Scholar
  6. P. Radivojac, W. Clark, T. Oron et al., “A large-scale evaluation of computational protein function prediction,” Nature Methods, vol. 10, no. 3, pp. 221–227, 2013. View at Publisher · View at Google Scholar
  7. D. Lee, O. Redfern, and C. Orengo, “Predicting protein function from sequence and structure,” Nature Reviews Molecular Cell Biology, vol. 8, no. 12, pp. 995–1005, 2007. View at Publisher · View at Google Scholar · View at Scopus
  8. R. Sharan, I. Ulitsky, and R. Shamir, “Network-based prediction of protein function,” Molecular Systems Biology, vol. 3, p. 88, 2007. View at Google Scholar · View at Scopus
  9. M. Cao, C. M. Pietras, X. Feng et al., “New directions for diffusion-based network prediction of protein function: Incorporating pathways with confidence,” Bioinformatics, vol. 30, no. 12, pp. I219–I227, 2014. View at Publisher · View at Google Scholar · View at Scopus
  10. N. Cesa-Bianchi, M. Re, and G. Valentini, “Synergy of multi-label hierarchical ensembles, data fusion, and cost-sensitive methods for gene functional inference,” Machine Learning, vol. 88, no. 1-2, pp. 209–241, 2012. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  11. G. Yu, C. Domeniconi, H. Rangwala, G. Zhang, and Z. Yu, “Transductive multi-label ensemble classification for protein function prediction,” in Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, pp. 1077–1085, chn, August 2012. View at Publisher · View at Google Scholar · View at Scopus
  12. G. Yu, G. Fu, J. Wang, and H. Zhu, “Predicting Protein Function via Semantic Integration of Multiple Networks,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 13, no. 2, pp. 220–232, 2016. View at Publisher · View at Google Scholar · View at Scopus
  13. M. Ashburner, C. A. Ball, J. A. Blake et al., “Gene ontology: tool for the unification of biology,” Nature Genetics, vol. 25, no. 1, pp. 25–29, 2000. View at Publisher · View at Google Scholar · View at Scopus
  14. O. D. King, R. E. Foulger, S. S. Dwight, J. V. White, and F. P. Roth, “Predicting gene function from patterns of annotation,” Genome Research, vol. 13, no. 5, pp. 896–904, 2003. View at Publisher · View at Google Scholar · View at Scopus
  15. P. Khatri, B. Done, A. Rao, A. Done, and S. Draghici, “A semantic analysis of the annotations of the human genome,” Bioinformatics, vol. 21, no. 16, pp. 3416–3421, 2005. View at Publisher · View at Google Scholar · View at Scopus
  16. B. Done, P. Khatri, A. Done, and S. Drǎghici, “Predicting novel human gene ontology annotations using semantic analysis,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 7, no. 1, pp. 91–99, 2010. View at Publisher · View at Google Scholar · View at Scopus
  17. Y. Tao, L. Sam, J. Li, C. Friedman, and Y. A. Lussier, “Information theory applied to the sparse gene ontology annotation network to predict novel gene function,” Bioinformatics, vol. 23, no. 13, pp. i529–i538, 2007. View at Publisher · View at Google Scholar · View at Scopus
  18. G. Yu, H. Zhu, C. Domeniconi, and J. Liu, “Predicting protein function via downward random walks on a gene ontology,” BMC Bioinformatics, vol. 16, no. 1, article no. 271, 2015. View at Publisher · View at Google Scholar · View at Scopus
  19. G. Salton, A. Wong, and C. S. Yang, “A vector space model for automatic indexing,” Communications of the ACM, vol. 18, no. 11, pp. 613–620, 1975. View at Publisher · View at Google Scholar · View at Scopus
  20. H. Tong, C. Faloutsos, and J.-Y. Pan, “Random walk with restart: Fast solutions and applications,” Knowledge and Information Systems, vol. 14, no. 3, pp. 327–346, 2008. View at Publisher · View at Google Scholar · View at Scopus
  21. G. Yu, H. Rangwala, C. Domeniconi, G. Zhang, and Z. Yu, “Protein function prediction with incomplete annotations,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 3, pp. 579–591, 2013. View at Publisher · View at Google Scholar
  22. G. Yu, C. Domeniconi, H. Rangwala, and G. Zhang, “Protein function prediction using dependence maximization,” in Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, vol. 8188 of Lecture Notes in Computer Science, pp. 574–589, Springer Berlin Heidelberg. View at Publisher · View at Google Scholar
  23. G. Yu, H. Zhu, and C. Domeniconi, “Predicting protein functions using incomplete hierarchical labels,” BMC Bioinformatics, vol. 16, no. 1, article no. 1, 2015. View at Publisher · View at Google Scholar · View at Scopus
  24. G. Fu, G. Yu, J. Wang, and Z. Zhang, “Novel protein function prediction using a direct hybrid graph,” Science China-Information Science, vol. 46, no. 4, pp. 461–475, 2016. View at Google Scholar
  25. L. Deng and D. Yu, “Deep learning: methods and applications,” Foundations and Trends in Signal Processing, vol. 7, no. 3-4, pp. 197–387, 2013. View at Publisher · View at Google Scholar · View at Scopus
  26. J. Eickholt and J. Cheng, “Predicting protein residue-residue contacts using deep networks and boosting,” Bioinformatics, vol. 28, no. 23, pp. 3066–3072, 2012. View at Publisher · View at Google Scholar · View at Scopus
  27. Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, 2015. View at Publisher · View at Google Scholar
  28. G. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of data with neural networks,” American Association for the Advancement of Science. Science, vol. 313, no. 5786, pp. 504–507, 2006. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  29. D. Chicco, P. Sadowski, and P. Baldi, “Deep autoencoder neural networks for gene ontology annotation predictions,” in Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014, pp. 533–540, usa, September 2014. View at Publisher · View at Google Scholar · View at Scopus
  30. G. E. Hinton, S. Osindero, and Y.-W. Teh, “A fast learning algorithm for deep belief nets,” Neural Computation, vol. 18, no. 7, pp. 1527–1554, 2006. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  31. I. Fasel and J. Berry, “Deep belief networks for real-time extraction of tongue contours from ultrasound during speech,” in Proceedings of the 20th International Conference on Pattern Recognition, ICPR 2010, pp. 1493–1496, tur, August 2010. View at Publisher · View at Google Scholar · View at Scopus
  32. A. Fischer and C. Igel, “An Introduction to Restricted Boltzmann Machines,” in Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, vol. 7441 of Lecture Notes in Computer Science, pp. 14–36, Springer Berlin Heidelberg, Berlin, Heidelberg, 2012. View at Publisher · View at Google Scholar
  33. R. Salakhutdinov, A. Mnih, and G. Hinton, “Restricted Boltzmann machines for collaborative filtering,” in Proceedings of the 24th International Conference on Machine learning (ICML '07), vol. 227, pp. 791–798, Corvallis, Oregon, June 2007. View at Publisher · View at Google Scholar · View at Scopus
  34. Y. Wang and J. Zeng, “Predicting drug-target interactions using restricted Boltzmann machines,” Bioinformatics, vol. 29, no. 13, pp. i126–i134, 2013. View at Publisher · View at Google Scholar · View at Scopus
  35. X. Li, F. Zhao, and Y. Guo, “Conditional restricted boltzmann machines for multi-label learning with incomplete labels,” in Proceedings of the in Proceedings of 18th International Conference on Artificial Intelligence and Statistics, pp. 635–643, 2015.
  36. G. E. Hinton, “Training products of experts by minimizing contrastive divergence,” Neural Computation, vol. 14, no. 8, pp. 1771–1800, 2002. View at Publisher · View at Google Scholar · View at Scopus
  37. G. Hinton, “A practical guide to training restricted Boltzmann machines,” in Neural Networks: Tricks of the Trade, G. Montavon, G. B. Orr, and K.-R. Müller, Eds., vol. 7700 of Lecture Notes in Computer Science, pp. 599–619, Springer, Berlin, Germany, 2nd edition, 2012. View at Publisher · View at Google Scholar
  38. R. Salakhutdinov and G. E. Hinton, “Deep Boltzmann Machines,” in Proceedings of the In Proceedings of 12th International Conference on Artificial Intelligence and Statistics, pp. 448–455, 2009.
  39. N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” The Journal of Machine Learning Research, vol. 15, no. 1, pp. 1929–1958, 2014. View at Google Scholar · View at MathSciNet
  40. M.-L. Zhang and Z.-H. Zhou, “A review on multi-label learning algorithms,” IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 8, pp. 1819–1837, 2014. View at Publisher · View at Google Scholar · View at Scopus
  41. Y. Jiang, “An expanded evaluation of protein function prediction methods shows an improvement in accuracy,” Genome Biology, vol. 17, no. 1–19, pp. 1819–1837, 2016. View at Google Scholar
  42. L. Wilcoxon, “Individual comparison by ranking methods,” Biometrics, vol. 1, no. 6, pp. 80–83, 1945. View at Publisher · View at Google Scholar
  43. J. Demsar, “Statistical comparisons of classifiers over multiple data sets,” The Journal of Machine Learning Research, vol. 7, no. 1, pp. 1–30, 2006. View at Google Scholar · View at MathSciNet