Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2016, Article ID 1793272, 8 pages
http://dx.doi.org/10.1155/2016/1793272
Research Article

Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier

School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China

Received 5 January 2016; Revised 30 March 2016; Accepted 31 March 2016

Academic Editor: Paul Harrison

Copyright © 2016 Xiao Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. M. D. Jacobson, M. Weil, and M. C. Raff, “Programmed cell death in animal development,” Cell, vol. 88, no. 3, pp. 347–354, 1997. View at Publisher · View at Google Scholar · View at Scopus
  2. G. Evan and T. Littlewood, “A matter of life and cell death,” Science, vol. 281, no. 5381, pp. 1317–1322, 1998. View at Publisher · View at Google Scholar · View at Scopus
  3. J. C. Reed and G. Paternostro, “Postmitochondrial regulation of apoptosis during heart failure,” Proceedings of the National Academy of Sciences of the United States of America, vol. 96, no. 14, pp. 7614–7616, 1999. View at Publisher · View at Google Scholar · View at Scopus
  4. J. B. Schulz, M. Weller, and M. A. Moskowitz, “Caspases as treatment targets in stroke and neurodegenerative diseases,” Annals of Neurology, vol. 45, no. 4, pp. 421–429, 1999. View at Publisher · View at Google Scholar · View at Scopus
  5. G.-Z. Li, X. Wang, X. H. Hu, J.-M. Liu, and R.-W. Zhao, “Multilabel learning for protein subcellular location prediction,” IEEE Transactions on Nanobioscience, vol. 11, no. 3, pp. 237–243, 2012. View at Publisher · View at Google Scholar · View at Scopus
  6. M. Suzuki, R. J. Youle, and N. Tjandra, “Structure of bax: coregulation of dimer formation and intracellular localization,” Cell, vol. 103, no. 4, pp. 645–654, 2000. View at Publisher · View at Google Scholar · View at Scopus
  7. K.-C. Chou and H.-B. Shen, “Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers,” Journal of Proteome Research, vol. 5, no. 8, pp. 1888–1897, 2006. View at Publisher · View at Google Scholar · View at Scopus
  8. K. Chou, “Impacts of bioinformatics to medicinal chemistry,” Medicinal Chemistry, vol. 11, no. 3, pp. 218–234, 2015. View at Publisher · View at Google Scholar
  9. K.-C. Chou and H.-B. Shen, “A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0,” PLoS ONE, vol. 5, no. 4, Article ID e9931, 2010. View at Publisher · View at Google Scholar · View at Scopus
  10. K.-C. Chou and H.-B. Shen, “Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization,” PLoS ONE, vol. 5, no. 6, Article ID e11335, 2010. View at Publisher · View at Google Scholar · View at Scopus
  11. H.-B. Shen and K.-C. Chou, “A top-down approach to enhance the power of predicting human protein subcellular localization: Hum-mPLoc 2.0,” Analytical Biochemistry, vol. 394, no. 2, pp. 269–274, 2009. View at Publisher · View at Google Scholar · View at Scopus
  12. H.-B. Shen and K.-C. Chou, “Gpos-mploc: a top-down approach to improve the quality of predicting subcellular localization of gram-positive bacterial proteins,” Protein and Peptide Letters, vol. 16, no. 12, pp. 1478–1484, 2009. View at Publisher · View at Google Scholar · View at Scopus
  13. H.-B. Shen and K.-C. Chou, “Gneg-mPLoc: a top-down strategy to enhance the quality of predicting subcellular localization of Gram-negative bacterial proteins,” Journal of Theoretical Biology, vol. 264, no. 2, pp. 326–333, 2010. View at Publisher · View at Google Scholar · View at Scopus
  14. H.-B. Shen and K.-C. Chou, “Virus-mPLoc: a fusion classifier for viral protein subcellular location prediction by incorporating multiple sites,” Journal of Biomolecular Structure and Dynamics, vol. 28, no. 2, pp. 175–186, 2010. View at Publisher · View at Google Scholar · View at Scopus
  15. Z.-C. Wu, X. Xiao, and K.-C. Chou, “ILoc-Gpos: a multi-layer classifier for predicting the subcellular localization of Singleplex and multiplex gram-positive bacterial proteins,” Protein and Peptide Letters, vol. 19, no. 1, pp. 4–14, 2012. View at Publisher · View at Google Scholar · View at Scopus
  16. Z.-C. Wu, X. Xiao, and K.-C. Chou, “ILoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites,” Molecular BioSystems, vol. 7, no. 12, pp. 3287–3297, 2011. View at Publisher · View at Google Scholar · View at Scopus
  17. X. Xiao, Z.-C. Wu, and K.-C. Chou, “iLoc-Virus: a multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites,” Journal of Theoretical Biology, vol. 284, no. 1, pp. 42–51, 2011. View at Publisher · View at Google Scholar · View at Scopus
  18. K.-C. Chou, Z.-C. Wu, and X. Xiao, “iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins,” PLoS ONE, vol. 6, no. 3, Article ID e18258, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. K.-C. Chou, Z.-C. Wu, and X. Xiao, “ILoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites,” Molecular BioSystems, vol. 8, no. 2, pp. 629–641, 2012. View at Publisher · View at Google Scholar · View at Scopus
  20. X. Xiao, Z.-C. Wu, and K.-C. Chou, “A multi-label classifier for predicting the subcellular localization of gram-negative bacterial proteins with both single and multiple sites,” PLoS ONE, vol. 6, no. 6, Article ID e20592, 2011. View at Publisher · View at Google Scholar · View at Scopus
  21. W.-Z. Lin, J.-A. Fang, X. Xiao, and K.-C. Chou, “ILoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins,” Molecular BioSystems, vol. 9, no. 4, pp. 634–644, 2013. View at Publisher · View at Google Scholar · View at Scopus
  22. X. Wang and G.-Z. Li, “Multilabel learning via random label selection for protein subcellular multilocations prediction,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 2, pp. 436–446, 2013. View at Publisher · View at Google Scholar · View at Scopus
  23. X. Wang, W. W. Zhang, Q. W. Zhang, and G.-Z. Li, “MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou's pseudo amino acid composition and a novel multi-label classifier,” Bioinformatics, vol. 31, no. 16, pp. 2639–2645, 2015. View at Publisher · View at Google Scholar · View at Scopus
  24. S. B. Wan, M.-W. Mak, and S.-Y. Kung, “MGOASVM: multi-label protein subcellular localization based on gene ontology and support vector machines,” BMC Bioinformatics, vol. 13, article 290, 2012. View at Publisher · View at Google Scholar · View at Scopus
  25. S. Wan, M.-W. Mak, and S.-Y. Kung, “HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins,” PLoS ONE, vol. 9, no. 3, Article ID e89545, 2014. View at Publisher · View at Google Scholar · View at Scopus
  26. H. Nakashima and K. Nishikawa, “Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies,” Journal of Molecular Biology, vol. 238, no. 1, pp. 54–61, 1994. View at Publisher · View at Google Scholar · View at Scopus
  27. K.-C. Chou, “Prediction of protein cellular attributes using pseudo-amino acid composition,” Proteins: Structure, Function and Genetics, vol. 43, no. 3, pp. 246–255, 2001. View at Publisher · View at Google Scholar · View at Scopus
  28. R. Mott, J. Schultz, P. Bork, and C. P. Ponting, “Predicting protein cellular localization using a domain projection method,” Genome Research, vol. 12, no. 8, pp. 1168–1174, 2002. View at Publisher · View at Google Scholar · View at Scopus
  29. M.-W. Mak, J. Guo, and S.-Y. Kung, “PairProSVM: protein subcellular localization based on local pairwise profile alignment and SVM,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 5, no. 3, pp. 416–422, 2008. View at Publisher · View at Google Scholar · View at Scopus
  30. K. Nakai and M. Kanehisa, “Expert system for predicting protein localization sites in gram-negative bacteria,” Proteins: Structure, Function and Genetics, vol. 11, no. 2, pp. 95–110, 1991. View at Publisher · View at Google Scholar · View at Scopus
  31. O. Emanuelsson, H. Nielsen, S. Brunak, and G. Von Heijne, “Predicting subcellular localization of proteins based on their N-terminal amino acid sequence,” Journal of Molecular Biology, vol. 300, no. 4, pp. 1005–1016, 2000. View at Publisher · View at Google Scholar · View at Scopus
  32. K.-C. Chou and Y.-D. Cai, “Using functional domain composition and support vector machines for prediction of protein subcellular location,” Journal of Biological Chemistry, vol. 277, no. 48, pp. 45765–45769, 2002. View at Publisher · View at Google Scholar · View at Scopus
  33. X. Wang and G.-Z. Li, “A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins,” PLoS ONE, vol. 7, no. 5, Article ID e36317, 2012. View at Publisher · View at Google Scholar · View at Scopus
  34. R. Nair and B. Rost, “Sequence conserved for subcellular localization,” Protein Science, vol. 11, no. 12, pp. 2836–2847, 2002. View at Publisher · View at Google Scholar · View at Scopus
  35. Z. Lu, D. Szafron, R. Greiner et al., “Predicting subcellular localization of proteins using machine-learned classifiers,” Bioinformatics, vol. 20, no. 4, pp. 547–556, 2004. View at Publisher · View at Google Scholar · View at Scopus
  36. K. Doctor and G. P. Zhou, “Subcellular location prediction of apoptosis proteins,” Proteins, vol. 50, pp. 40–48, 2003. View at Google Scholar
  37. A. Bulashevska and R. Eils, “Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains,” BMC Bioinformatics, vol. 7, article 298, 2006. View at Publisher · View at Google Scholar · View at Scopus
  38. Z.-H. Zhang, Z.-H. Wang, Z.-R. Zhang, and Y.-X. Wang, “A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine,” FEBS Letters, vol. 580, no. 26, pp. 6169–6174, 2006. View at Publisher · View at Google Scholar · View at Scopus
  39. Y.-L. Chen and Q.-Z. Li, “Prediction of the subcellular location of apoptosis proteins,” Journal of Theoretical Biology, vol. 245, no. 4, pp. 775–783, 2007. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  40. Y.-L. Chen and Q.-Z. Li, “Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition,” Journal of Theoretical Biology, vol. 248, no. 2, pp. 377–381, 2007. View at Publisher · View at Google Scholar · View at Scopus
  41. Y.-S. Ding and T.-L. Zhang, “Using Chou's pseudo amino acid composition to predict subcellular localization of apoptosis proteins: an approach with immune genetic algorithm-based ensemble classifier,” Pattern Recognition Letters, vol. 29, no. 13, pp. 1887–1892, 2008. View at Publisher · View at Google Scholar · View at Scopus
  42. L. Zhang, B. Liao, D. Li, and W. Zhu, “A novel representation for apoptosis protein subcellular localization prediction using support vector machine,” Journal of Theoretical Biology, vol. 259, no. 2, pp. 361–365, 2009. View at Publisher · View at Google Scholar · View at Scopus
  43. J.-D. Qiu, S.-H. Luo, J.-H. Huang, X.-Y. Sun, and R.-P. Liang, “Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine,” Amino Acids, vol. 38, no. 4, pp. 1201–1208, 2010. View at Publisher · View at Google Scholar · View at Scopus
  44. T. G. Liu, X. Q. Zheng, C. H. Wang, and J. Wang, “Prediction of subcellular location of apoptosis proteins using pseudo amino acid composition: an approach from auto covariance transformation,” Protein & Peptide Letters, vol. 17, no. 10, pp. 1263–1269, 2010. View at Publisher · View at Google Scholar · View at Scopus
  45. H. Lin, H. Wang, H. Ding, Y.-L. Chen, and Q.-Z. Li, “Prediction of subcellular localization of apoptosis protein using Chou's pseudo amino acid composition,” Acta Biotheoretica, vol. 57, no. 3, pp. 321–330, 2009. View at Publisher · View at Google Scholar · View at Scopus
  46. Q. Gu, Y.-S. Ding, X.-Y. Jiang, and T.-L. Zhang, “Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection,” Amino Acids, vol. 38, no. 4, pp. 975–983, 2010. View at Publisher · View at Google Scholar · View at Scopus
  47. X. Q. Yu, X. Q. Zheng, T. G. Liu, Y. Dou, and J. Wang, “Predicting subcellular location of apoptosis proteins with pseudo amino acid composition: approach from amino acid substitution matrix and auto covariance transformation,” Amino Acids, vol. 42, no. 5, pp. 1619–1625, 2012. View at Publisher · View at Google Scholar · View at Scopus
  48. V. Saravanan and P. T. V. Lakshmi, “APSLAP: an adaptive boosting technique for predicting subcellular localization of apoptosis protein,” Acta Biotheoretica, vol. 61, no. 4, pp. 481–497, 2013. View at Publisher · View at Google Scholar · View at Scopus
  49. S. Zhang, Y. Liang, and Z. Bai, “A novel reduced triplet composition based method to predict apoptosis protein subcellular localization,” Match, vol. 73, no. 2, pp. 559–571, 2015. View at Google Scholar · View at Scopus
  50. T. G. Liu, P. Y. Tao, X. W. Li, Y. Qin, and C. Wang, “Prediction of subcellular location of apoptosis proteins combining tri-gram encoding based on PSSM and recursive feature elimination,” Journal of Theoretical Biology, vol. 366, pp. 8–12, 2015. View at Publisher · View at Google Scholar · View at Scopus
  51. Gene Ontology Consortium, “The Gene Ontology (GO) database and informatics resource,” Nucleic Acids Research, vol. 32, supplement 1, pp. 258D–D261, 2004. View at Publisher · View at Google Scholar
  52. E. Camon, M. Magrane, D. Barrell et al., “The Gene Ontology Annotation (GOA) database: sharing knowledge in uniprot with gene oncology,” Nucleic Acids Research, vol. 32, pp. D262–D266, 2004. View at Publisher · View at Google Scholar · View at Scopus
  53. K.-C. Chou, “Some remarks on predicting multi-label attributes in molecular biosystems,” Molecular BioSystems, vol. 9, no. 6, pp. 1092–1100, 2013. View at Publisher · View at Google Scholar · View at Scopus
  54. K.-C. Chou and H.-B. Shen, “Cell-PLoc: a package of web servers for predicting subcellular localization of proteins in various organisms,” Nature Protocols, vol. 3, no. 2, pp. 153–162, 2008. View at Publisher · View at Google Scholar · View at Scopus
  55. S. B. Wan, M.-W. Mak, and S.-Y. Kung, “GOASVM: a subcellular location predictor by incorporating term-frequency gene ontology into the general form of Chou's pseudo-amino acid composition,” Journal of Theoretical Biology, vol. 323, pp. 40–48, 2013. View at Publisher · View at Google Scholar · View at Scopus
  56. X. Wang, G.-Z. Li, and W.-C. Lu, “Virus-ECC-mPLoc: a multi-label predictor for predicting the subcellular localization of virus proteins with both single and multiple sites based on a general form of chou's pseudo amino acid composition,” Protein & Peptide Letters, vol. 20, no. 3, pp. 309–317, 2013. View at Google Scholar · View at Scopus
  57. P.-M. Feng, W. Chen, H. Lin, and K.-C. Chou, “IHSP-PseRAAAC: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition,” Analytical Biochemistry, vol. 442, no. 1, pp. 118–125, 2013. View at Publisher · View at Google Scholar · View at Scopus
  58. H. Lin, E.-Z. Deng, H. Ding, W. Chen, and K.-C. Chou, “IPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition,” Nucleic Acids Research, vol. 42, no. 21, pp. 12961–12972, 2014. View at Publisher · View at Google Scholar · View at Scopus
  59. W. Chen, P.-M. Feng, H. Lin, and K.-C. Chou, “iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition,” Nucleic Acids Research, vol. 41, no. 6, article e68, 2013. View at Publisher · View at Google Scholar · View at Scopus
  60. W.-R. Qiu, X. Xiao, and K.-C. Chou, “iRSpot-TNCPseAAC: identify recombination spots with trinucleotide composition and pseudo amino acid components,” International Journal of Molecular Sciences, vol. 15, no. 2, pp. 1746–1766, 2014. View at Publisher · View at Google Scholar · View at Scopus
  61. W. Chen, H. Ding, P. Feng, H. Lin, and K. C. Chou, “iACP: a sequence-based tool for identifying anticancer peptides,” Oncotarget, vol. 7, no. 13, pp. 16895–16909, 2016. View at Publisher · View at Google Scholar
  62. J. Jia, Z. Liu, X. Xiao, B. Liu, and K. Chou, “iPPBS-Opt: a sequence-based ensemble classifier for identifying protein-protein binding sites by optimizing imbalanced training datasets,” Molecules, vol. 21, no. 1, p. 95, 2016. View at Publisher · View at Google Scholar
  63. X. Xiao, P. Wang, W.-Z. Lin, J.-H. Jia, and K.-C. Chou, “IAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types,” Analytical Biochemistry, vol. 436, no. 2, pp. 168–177, 2013. View at Publisher · View at Google Scholar · View at Scopus