Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2016 (2016), Article ID 6802832, 10 pages
http://dx.doi.org/10.1155/2016/6802832
Research Article

ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier

1College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
2Shanghai University of Medicine & Health Sciences, Shanghai 201318, China

Received 1 June 2016; Revised 15 July 2016; Accepted 7 August 2016

Academic Editor: Dariusz Mrozek

Copyright © 2016 Daozheng Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. H. S. Chan and K. A. Dill, “The protein folding problem,” Physics Today, vol. 46, no. 2, pp. 24–32, 1993. View at Google Scholar · View at Scopus
  2. D. T. Jones, W. R. Taylor, and J. M. Thornton, “A new approach to protein fold recognition,” Nature, vol. 358, no. 6381, pp. 86–89, 1992. View at Publisher · View at Google Scholar · View at Scopus
  3. J. Xu, M. Li, D. Kim, and Y. Xu, “RAPTOR: optimal protein threading by linear programming,” Journal of Bioinformatics and Computational Biology, vol. 1, no. 1, pp. 95–117, 2003. View at Publisher · View at Google Scholar · View at Scopus
  4. J. Söding, A. Biegert, and A. N. Lupas, “The HHpred interactive server for protein homology detection and structure prediction,” Nucleic Acids Research, vol. 33, supplement 2, pp. W244–W248, 2005. View at Publisher · View at Google Scholar · View at Scopus
  5. J. Cheng and P. Baldi, “A machine learning information retrieval approach to protein fold recognition,” Bioinformatics, vol. 22, no. 12, pp. 1456–1463, 2006. View at Publisher · View at Google Scholar · View at Scopus
  6. W. Zhang, S. Liu, and Y. Zhou, “SP5: improving protein fold recognition by using torsion angle profiles and profile-based gap penalty model,” PLoS ONE, vol. 3, no. 6, Article ID e2325, 2008. View at Publisher · View at Google Scholar · View at Scopus
  7. R.-X. Yan, J.-N. Si, C. Wang, and Z. Zhang, “DescFold: a web server for protein fold recognition,” BMC Bioinformatics, vol. 10, article 416, 2009. View at Publisher · View at Google Scholar · View at Scopus
  8. I. Dubchak, I. Muchnik, S. R. Holbrook, and S.-H. Kim, “Prediction of protein folding class using global description of amino acid sequence,” Proceedings of the National Academy of Sciences of the United States of America, vol. 92, no. 19, pp. 8700–8704, 1995. View at Publisher · View at Google Scholar · View at Scopus
  9. I. Dubchak, I. Muchnik, C. Mayor, I. Dralyuk, and S.-H. Kim, “Recognition of a protein fold in the context of the SCOP classification,” Proteins: Structure, Function, and Bioinformatics, vol. 35, no. 4, pp. 401–407, 1999. View at Publisher · View at Google Scholar · View at Scopus
  10. Q. Dong, S. Zhou, and J. Guan, “A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation,” Bioinformatics, vol. 25, no. 20, pp. 2655–2662, 2009. View at Publisher · View at Google Scholar · View at Scopus
  11. X. Guo and X. Gao, “A novel hierarchical ensemble classifier for protein fold recognition,” Protein Engineering, Design and Selection, vol. 21, no. 11, pp. 659–664, 2008. View at Publisher · View at Google Scholar · View at Scopus
  12. H.-B. Shen and K.-C. Chou, “Ensemble classifier for protein fold pattern recognition,” Bioinformatics, vol. 22, no. 14, pp. 1717–1722, 2006. View at Publisher · View at Google Scholar · View at Scopus
  13. C. H. Q. Ding and I. Dubchak, “Multi-class protein fold recognition using support vector machines and neural networks,” Bioinformatics, vol. 17, no. 4, pp. 349–358, 2001. View at Publisher · View at Google Scholar · View at Scopus
  14. K. Chen and L. Kurgan, “PFRES: protein fold classification by using evolutionary information and predicted secondary structure,” Bioinformatics, vol. 23, no. 21, pp. 2843–2850, 2007. View at Publisher · View at Google Scholar · View at Scopus
  15. L. Wei, M. Liao, Y. Gao, R. Ji, Z. He, and Q. Zou, “Improved and promising identificationof human microRNAs by incorporatinga high-quality negative set,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 1, pp. 192–201, 2014. View at Publisher · View at Google Scholar · View at Scopus
  16. A. G. Murzin, S. E. Brenner, T. Hubbard, and C. Chothia, “SCOP: a structural classification of proteins database for the investigation of sequences and structures,” Journal of Molecular Biology, vol. 247, no. 4, pp. 536–540, 1995. View at Publisher · View at Google Scholar · View at Scopus
  17. K.-C. Chou, “Prediction of protein cellular attributes using pseudo-amino acid composition,” Proteins: Structure, Function, and Bioinformatics, vol. 43, no. 3, pp. 246–255, 2001. View at Publisher · View at Google Scholar · View at Scopus
  18. T. Yang, V. Kecman, L. Cao, C. Zhang, and J. Zhexue Huang, “Margin-based ensemble classifier for protein fold recognition,” Expert Systems with Applications, vol. 38, no. 10, pp. 12348–12355, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. M. T. A. Shamim, M. Anwaruddin, and H. A. Nagarajaram, “Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs,” Bioinformatics, vol. 23, no. 24, pp. 3320–3327, 2007. View at Publisher · View at Google Scholar · View at Scopus
  20. P. Deschavanne and P. Tufféry, “Enhanced protein fold recognition using a structural alphabet,” Proteins: Structure, Function and Bioinformatics, vol. 76, no. 1, pp. 129–137, 2009. View at Publisher · View at Google Scholar · View at Scopus
  21. J. Lyons, K. K. Paliwal, A. Dehzangi, R. Heffernan, T. Tsunoda, and A. Sharma, “Protein fold recognition using HMM–HMM alignment and dynamic programming,” Journal of Theoretical Biology, vol. 393, pp. 67–74, 2016. View at Publisher · View at Google Scholar · View at Scopus
  22. H.-B. Shen and K.-C. Chou, “Predicting protein fold pattern with functional domain and sequential evolution information,” Journal of Theoretical Biology, vol. 256, no. 3, pp. 441–446, 2009. View at Publisher · View at Google Scholar
  23. Z. Feng, X. Hu, Z. Jiang, H. Song, and M. A. Ashraf, “The recognition of multi-class protein folds by adding average chemical shifts of secondary structure elements,” Saudi Journal of Biological Sciences, vol. 23, no. 2, pp. 189–197, 2016. View at Publisher · View at Google Scholar
  24. J.-Y. Yang and X. Chen, “Improving taxonomy-based protein fold recognition by using global and local features,” Proteins: Structure, Function, and Bioinformatics, vol. 79, no. 7, pp. 2053–2064, 2011. View at Publisher · View at Google Scholar · View at Scopus
  25. T. Damoulas and M. A. Girolami, “Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection,” Bioinformatics, vol. 24, no. 10, pp. 1264–1270, 2008. View at Publisher · View at Google Scholar · View at Scopus
  26. W. Chmielnicki and K. Stąpor, “A hybrid discriminative/generative approach to protein fold recognition,” Neurocomputing, vol. 75, no. 1, pp. 194–198, 2012. View at Publisher · View at Google Scholar · View at Scopus
  27. L. Liu, X.-Z. Hu, X.-X. Liu, Y. Wang, and S.-B. Li, “Predicting protein fold types by the general form of Chou's pseudo amino acid composition: approached from optimal feature extractions,” Protein and Peptide Letters, vol. 19, no. 4, pp. 439–449, 2012. View at Publisher · View at Google Scholar · View at Scopus
  28. A. Sharma, J. Lyons, A. Dehzangi, and K. K. Paliwal, “A feature extraction technique using bi-gram probabilities of position specific scoring matrix for protein fold recognition,” Journal of Theoretical Biology, vol. 320, pp. 41–46, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  29. K. K. Paliwal, A. Sharma, J. Lyons, and A. Dehzangi, “Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information,” BMC Bioinformatics, vol. 15, supplement 16, p. S12, 2014. View at Publisher · View at Google Scholar
  30. K. K. Paliwal, A. Sharma, J. Lyons, and A. Dehzangi, “A tri-gram based feature extraction technique using linear probabilities of position specific scoring matrix for protein fold recognition,” IEEE Transactions on Nanobioscience, vol. 13, no. 1, pp. 44–50, 2014. View at Publisher · View at Google Scholar · View at Scopus
  31. A. Dehzangi, K. Paliwal, J. Lyons, A. Sharma, and A. Sattar, “A segmentation-based method to extract structural and evolutionary features for protein fold recognition,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 3, pp. 510–519, 2014. View at Publisher · View at Google Scholar · View at Scopus
  32. J. Lyons, A. Dehzangi, R. Heffernan et al., “Advancing the accuracy of protein fold recognition by utilizing profiles from hidden markov models,” IEEE Transactions on NanoBioscience, vol. 14, no. 7, pp. 761–772, 2015. View at Publisher · View at Google Scholar
  33. H. Saini, G. Raicar, A. Sharma et al., “Probabilistic expression of spatially varied amino acid dimers into general form of Chou's pseudo amino acid composition for protein fold recognition,” Journal of Theoretical Biology, vol. 380, pp. 291–298, 2015. View at Publisher · View at Google Scholar · View at Scopus
  34. A. Dehzangi, S. Phon-Amnuaisuk, and O. Dehzangi, “Using random forest for protein fold prediction problem: an empirical study,” Journal of Information Science and Engineering, vol. 26, no. 6, pp. 1941–1956, 2010. View at Google Scholar · View at Scopus
  35. J. Li, J. Wu, and K. Chen, “PFP-RFSM: protein fold prediction by using random forests and sequence motifs,” Journal of Biomedical Science and Engineering, vol. 6, no. 12, pp. 1161–1170, 2013. View at Publisher · View at Google Scholar
  36. Z. Feng and X. Hu, “Recognition of 27-class protein folds by adding the interaction of segments and motif information,” BioMed Research International, vol. 2014, Article ID 262850, 9 pages, 2014. View at Publisher · View at Google Scholar · View at Scopus
  37. L. Wei, M. Liao, X. Gao, and Q. Zou, “An improved protein structural classes prediction method by incorporating both sequence and structure information,” IEEE Transactions on Nanobioscience, vol. 14, no. 4, pp. 339–349, 2015. View at Publisher · View at Google Scholar · View at Scopus
  38. A. Dehzangi, S. Phon-Amnuaisuk, M. Manafi, and S. Safa, “Using rotation forest for protein fold prediction problem: an empirical study,” in Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, C. Pizzuti, M. D. Ritchie, and M. Giacobini, Eds., vol. 6023 of Lecture Notes in Computer Science, pp. 217–227, Springer, Berlin, Germany, 2010. View at Publisher · View at Google Scholar
  39. G. Bologna and R. D. Appel, “A comparison study on protein fold recognition,” in Proceedings of the 9th International Conference on Neural Information Processing (ICONIP '02), pp. 2492–2496, IEEE, Singapore, November 2002. View at Publisher · View at Google Scholar
  40. L. Wei, M. Liao, X. Gao, and Q. Zou, “Enhanced protein fold prediction method through a novel feature extraction technique,” IEEE Transactions on NanoBioscience, vol. 14, no. 6, pp. 649–659, 2015. View at Publisher · View at Google Scholar · View at Scopus
  41. C. Lin, Y. Zou, J. Qin et al., “Hierarchical classification of protein folds using a novel ensemble classifier,” PLoS ONE, vol. 8, no. 2, article e56499, 2013. View at Publisher · View at Google Scholar · View at Scopus
  42. N. K. Fox, S. E. Brenner, and J.-M. Chandonia, “SCOPe: structural classification of proteins—extended, integrating SCOP and ASTRAL data and classification of new structures,” Nucleic Acids Research, vol. 42, no. 1, pp. D304–D309, 2014. View at Publisher · View at Google Scholar · View at Scopus
  43. S. F. Altschul, T. L. Madden, A. A. Schäffer et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Research, vol. 25, no. 17, pp. 3389–3402, 1997. View at Publisher · View at Google Scholar · View at Scopus
  44. W. Kabsch and C. Sander, “Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features,” Biopolymers, vol. 22, no. 12, pp. 2577–2637, 1983. View at Publisher · View at Google Scholar · View at Scopus
  45. H. Lin, E.-Z. Deng, H. Ding, W. Chen, and K.-C. Chou, “IPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition,” Nucleic Acids Research, vol. 42, no. 21, pp. 12961–12972, 2014. View at Publisher · View at Google Scholar · View at Scopus
  46. J. Jia, Z. Liu, X. Xiao, B. Liu, and K.-C. Chou, “iPPI-Esml: an ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC,” Journal of Theoretical Biology, vol. 377, pp. 47–56, 2015. View at Publisher · View at Google Scholar · View at Scopus
  47. W.-R. Qiu, X. Xiao, and K.-C. Chou, “iRSpot-TNCPseAAC: identify recombination spots with trinucleotide composition and pseudo amino acid components,” International Journal of Molecular Sciences, vol. 15, no. 2, pp. 1746–1766, 2014. View at Publisher · View at Google Scholar · View at Scopus
  48. W. Chen, H. Ding, P. Feng, H. Lin, and K.-C. Chou, “iACP: a sequence-based tool for identifying anticancer peptides,” Oncotarget, vol. 7, no. 13, pp. 16895–16909, 2016. View at Publisher · View at Google Scholar
  49. J. Jia, Z. Liu, X. Xiao, B. Liu, and K.-C. Chou, “iSuc-PseOpt: identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset,” Analytical Biochemistry, vol. 497, pp. 48–56, 2016. View at Publisher · View at Google Scholar
  50. B. Liu, L. Fang, F. Liu, X. Wang, and K.-C. Chou, “IMiRNA-PseDPC: microRNA precursor identification with a pseudo distance-pair composition approach,” Journal of Biomolecular Structure and Dynamics, vol. 34, no. 1, pp. 220–232, 2016. View at Publisher · View at Google Scholar · View at Scopus
  51. J. Jia, Z. Liu, X. Xiao, B. Liu, and K.-C. Chou, “pSuc-Lys: predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach,” Journal of Theoretical Biology, vol. 394, pp. 223–230, 2016. View at Publisher · View at Google Scholar · View at Scopus
  52. B. Liu, L. Fang, R. Long, X. Lan, and K.-C. Chou, “iEnhancer-2L: a two-layer predictor for identifying enhancers and their strength by pseudo k-tuple nucleotide composition,” Bioinformatics, vol. 32, no. 3, pp. 362–369, 2016. View at Publisher · View at Google Scholar
  53. J. Jia, Z. Liu, X. Xiao, B. Liu, and K. Chou, “iCar-PseCp: identify carbonylation sites in proteins by Monto Carlo sampling and incorporating sequence coupled effects into general PseAAC,” Oncotarget, vol. 7, no. 23, pp. 34558–34570, 2016. View at Publisher · View at Google Scholar
  54. B. Liu, R. Long, and K.-C. Chou, “iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework,” Bioinformatics, vol. 32, no. 16, pp. 2411–2418, 2016. View at Publisher · View at Google Scholar
  55. W.-R. Qiu, B.-Q. Sun, X. Xiao, D. Xu, and K.-C. Chou, “iPhos-PseEvo: identifying human phosphorylated proteins by incorporating evolutionary information into general PseAAC via grey system theory,” Molecular Informatics, 2016. View at Publisher · View at Google Scholar
  56. K.-C. Chou, “Some remarks on protein attribute prediction and pseudo amino acid composition,” Journal of Theoretical Biology, vol. 273, no. 1, pp. 236–247, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  57. N. J. Cheung, X.-M. Ding, and H.-B. Shen, “Protein folds recognized by an intelligent predictor based-on evolutionary and structural information,” Journal of Computational Chemistry, vol. 37, no. 4, pp. 426–436, 2016. View at Publisher · View at Google Scholar · View at Scopus
  58. J. Lyons, N. Biswas, A. Sharma, A. Dehzangi, and K. K. Paliwal, “Protein fold recognition by alignment of amino acid residues using kernelized dynamic time warping,” Journal of Theoretical Biology, vol. 354, pp. 137–145, 2014. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  59. K. Kavousi, B. Moshiri, M. Sadeghi, B. N. Araabi, and A. A. Moosavi-Movahedi, “A protein fold classifier formed by fusing different modes of pseudo amino acid composition via PSSM,” Computational Biology and Chemistry, vol. 35, no. 1, pp. 1–9, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  60. P. Du, X. Wang, C. Xu, and Y. Gao, “PseAAC-Builder: a cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions,” Analytical Biochemistry, vol. 425, no. 2, pp. 117–119, 2012. View at Publisher · View at Google Scholar · View at Scopus
  61. D.-S. Cao, Q.-S. Xu, and Y.-Z. Liang, “Propy: a tool to generate various modes of Chou's PseAAC,” Bioinformatics, vol. 29, no. 7, pp. 960–962, 2013. View at Publisher · View at Google Scholar · View at Scopus
  62. S.-X. Lin and J. Lapointe, “Theoretical and experimental biology in one—a symposium in honour of Professor Kuo-Chen Chou’s 50th anniversary and Professor Richard Giegé's 40th anniversary of their scientific careers,” Journal of Biomedical Science and Engineering, vol. 6, no. 4, pp. 435–442, 2013. View at Publisher · View at Google Scholar
  63. W. Chen, T.-Y. Lei, D.-C. Jin, H. Lin, and K.-C. Chou, “PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition,” Analytical Biochemistry, vol. 456, no. 1, pp. 53–60, 2014. View at Publisher · View at Google Scholar · View at Scopus
  64. W. Chen, X. Zhang, J. Brooker, H. Lin, L. Zhang, and K.-C. Chou, “PseKNC-General: a cross-platform package for generating various modes of pseudo nucleotide compositions,” Bioinformatics, vol. 31, no. 1, pp. 119–120, 2015. View at Publisher · View at Google Scholar · View at Scopus
  65. B. Liu, F. Liu, L. Fang, X. Wang, and K.-C. Chou, “RepDNA: a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects,” Bioinformatics, vol. 31, no. 8, pp. 1307–1309, 2015. View at Publisher · View at Google Scholar · View at Scopus
  66. B. Liu, F. Liu, X. Wang, J. Chen, L. Fang, and K.-C. Chou, “Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences,” Nucleic Acids Research, vol. 43, no. 1, pp. W65–W71, 2015. View at Publisher · View at Google Scholar
  67. L. Zhang, X. Zhao, and L. Kong, “Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition,” Journal of Theoretical Biology, vol. 355, pp. 105–110, 2014. View at Publisher · View at Google Scholar · View at Scopus
  68. S. Zhang, “Accurate prediction of protein structural classes by incorporating PSSS and PSSM into Chou's general PseAAC,” Chemometrics and Intelligent Laboratory Systems, vol. 142, pp. 28–35, 2015. View at Publisher · View at Google Scholar · View at Scopus
  69. L. Holm and C. Sander, “Removing near-neighbour redundancy from large protein sequence collections,” Bioinformatics, vol. 14, no. 5, pp. 423–429, 1998. View at Publisher · View at Google Scholar · View at Scopus
  70. I. Letunic, R. R. Copley, B. Pils, S. Pinkert, J. Schultz, and P. Bork, “SMART 5: domains in the context of genomes and networks,” Nucleic Acids Research, vol. 34, supplement 1, pp. D257–D260, 2006. View at Publisher · View at Google Scholar · View at Scopus
  71. R. D. Finn, J. Mistry, B. Schuster-Böckler et al., “Pfam: clans, web tools and services,” Nucleic Acids Research, vol. 34, supplement 1, pp. D247–D251, 2006. View at Publisher · View at Google Scholar · View at Scopus
  72. R. L. Tatusov, N. D. Fedorova, J. D. Jackson et al., “The COG database: an updated version includes eukaryotes,” BMC Bioinformatics, vol. 4, article 41, 2003. View at Publisher · View at Google Scholar · View at Scopus
  73. A. Marchler-Bauer, J. B. Anderson, M. K. Derbyshire et al., “CDD: a conserved domain database for interactive domain family analysis,” Nucleic Acids Research, vol. 35, no. 1, pp. D237–D240, 2007. View at Publisher · View at Google Scholar · View at Scopus
  74. A. A. Schäffer, L. Aravind, T. L. Madden et al., “Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements,” Nucleic Acids Research, vol. 29, no. 14, pp. 2994–3005, 2001. View at Publisher · View at Google Scholar · View at Scopus
  75. N. Landwehr, M. Hall, and E. Frank, “Logistic model trees,” Machine Learning, vol. 59, no. 1-2, pp. 161–205, 2005. View at Publisher · View at Google Scholar · View at Scopus
  76. C.-C. Chang and C.-J. Lin, “LIBSVM: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, article 27, 2011. View at Publisher · View at Google Scholar
  77. S. S. Keerthi, S. K. Shevade, C. Bhattacharyya, and K. R. K. Murthy, “Improvements to Platt's SMO algorithm for SVM classifier design,” Neural Computation, vol. 13, no. 3, pp. 637–649, 2001. View at Publisher · View at Google Scholar · View at Scopus
  78. G. H. John and P. Langley, “Estimating continuous distributions in Bayesian classifiers,” in Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence, pp. 338–345, Morgan Kaufmann, 1995.
  79. J. J. Rodríguez, L. I. Kuncheva, and C. J. Alonso, “Rotation forest: a new classifier ensemble method,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 10, pp. 1619–1630, 2006. View at Publisher · View at Google Scholar · View at Scopus
  80. J. Gama, “Functional trees,” Machine Learning, vol. 55, no. 3, pp. 219–250, 2004. View at Publisher · View at Google Scholar · View at Scopus
  81. A. Liaw and M. Wiener, “Classification and regression by randomForest,” R News, vol. 2, no. 3, pp. 18–22, 2002. View at Google Scholar
  82. K.-C. Chou and H.-B. Shen, “Predicting protein subcellular location by fusing multiple classifiers,” Journal of Cellular Biochemistry, vol. 99, no. 2, pp. 517–527, 2006. View at Publisher · View at Google Scholar · View at Scopus
  83. K.-C. Chou and H.-B. Shen, “Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers,” Journal of Proteome Research, vol. 5, no. 8, pp. 1888–1897, 2006. View at Publisher · View at Google Scholar · View at Scopus
  84. K.-C. Chou and H.-B. Shen, “Signal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides,” Biochemical and Biophysical Research Communications, vol. 357, no. 3, pp. 633–640, 2007. View at Publisher · View at Google Scholar · View at Scopus
  85. H.-B. Shen and K.-C. Chou, “Gpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins,” Protein Engineering, Design and Selection, vol. 20, no. 1, pp. 39–46, 2007. View at Publisher · View at Google Scholar · View at Scopus
  86. H.-B. Shen and K.-C. Chou, “Hum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites,” Biochemical and Biophysical Research Communications, vol. 355, no. 4, pp. 1006–1011, 2007. View at Publisher · View at Google Scholar · View at Scopus
  87. H.-B. Shen, J. Yang, and K.-C. Chou, “Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction,” Amino Acids, vol. 33, no. 1, pp. 57–67, 2007. View at Publisher · View at Google Scholar · View at Scopus
  88. H.-B. Shen and K.-C. Chou, “QuatIdent: a web server for identifying protein quaternary structural attribute by fusing functional domain and sequential evolution information,” Journal of Proteome Research, vol. 8, no. 3, pp. 1577–1584, 2009. View at Publisher · View at Google Scholar
  89. J. Jia, Z. Liu, X. Xiao, B. Liu, and K.-C. Chou, “iPPBS-Opt: a sequence-based ensemble classifier for identifying protein-protein binding sites by optimizing imbalanced training datasets,” Molecules, vol. 21, no. 1, p. 95, 2016. View at Publisher · View at Google Scholar