Table of Contents Author Guidelines Submit a Manuscript
Advances in Bioinformatics
Volume 2015, Article ID 843030, 7 pages
http://dx.doi.org/10.1155/2015/843030
Research Article

Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information

1Department of Bioinformatics, Sathyabama University, Chennai 600119, India
2Department of Biotechnology, IIT Madras, Chennai 600032, India
3Computational Biology Research Center (CBRC), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan

Received 28 June 2014; Revised 7 January 2015; Accepted 8 January 2015

Academic Editor: Paul Harrison

Copyright © 2015 M. Xavier Suresh et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. K. G. Fleming, “Riding the wave: structural and energetic principles of helical membrane proteins,” Current Opinion in Biotechnology, vol. 11, no. 1, pp. 67–71, 2000. View at Publisher · View at Google Scholar · View at Scopus
  2. J. Drews, “Drug discovery: a historical perspective,” Science, vol. 287, no. 5460, pp. 1960–1964, 2000. View at Publisher · View at Google Scholar · View at Scopus
  3. E. Wallin and G. von Heijne, “Genome-wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms,” Protein Science, vol. 7, no. 4, pp. 1029–1038, 1998. View at Google Scholar · View at Scopus
  4. T. J. Stevens and I. T. Arkin, “Do more complex organisms have a greater proportion of membrane proteins in their genomes?” Proteins: Structure, Function, and Genetics, vol. 39, no. 4, pp. 417–420, 2000. View at Publisher · View at Google Scholar
  5. S. J. Opella, A. Nevzorov, M. F. Mesleh, and F. M. Marassi, “Structure determination of membrane proteins by NMR spectroscopy,” Biochemistry and Cell Biology, vol. 80, no. 5, pp. 597–604, 2002. View at Publisher · View at Google Scholar · View at Scopus
  6. Y. Kyogoku, Y. Fujiyoshi, I. Shimada et al., “Structural genomics of membrane proteins,” Accounts of Chemical Research, vol. 36, no. 3, pp. 199–206, 2003. View at Publisher · View at Google Scholar · View at Scopus
  7. S. Jones and J. M. Thornton, “Searching for functional sites in protein structures,” Current Opinion in Chemical Biology, vol. 8, no. 1, pp. 3–7, 2004. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Folkertsma, P. van Noort, J. van Durme et al., “A family-based approach reveals the function of residues in the nuclear receptor ligand-binding domain,” Journal of Molecular Biology, vol. 341, no. 2, pp. 321–335, 2004. View at Publisher · View at Google Scholar · View at Scopus
  9. I. D. Pogozheva, M. J. Przydzial, and H. I. Mosberg, “Homology modeling of opioid receptor-ligand complexes using experimental constraints,” The AAPS Journal, vol. 7, no. 2, pp. E434–E448, 2005. View at Publisher · View at Google Scholar · View at Scopus
  10. Z. Zhang and M. G. Grigorov, “Similarity networks of protein binding sites,” Proteins: Structure, Function, and Bioinformatics, vol. 62, no. 2, pp. 470–478, 2006. View at Publisher · View at Google Scholar · View at Scopus
  11. J. D. Watson, R. A. Laskowski, and J. M. Thornton, “Predicting protein function from sequence and structural data,” Current Opinion in Structural Biology, vol. 15, no. 3, pp. 275–284, 2005. View at Publisher · View at Google Scholar · View at Scopus
  12. R. A. Palmer and H. Niwa, “X-ray crystallographic studies of protein-ligand interactions,” Biochemical Society Transactions, vol. 31, no. 5, pp. 973–979, 2003. View at Publisher · View at Google Scholar · View at Scopus
  13. J. Clarkson and I. D. Campbell, “Studies of protein-ligand interactions by NMR,” Biochemical Society Transactions, vol. 31, no. 5, pp. 1006–1009, 2003. View at Publisher · View at Google Scholar · View at Scopus
  14. T. Simonson, G. Archontis, and M. Karplus, “Free energy simulations come of age: protein-ligand recognition,” Accounts of Chemical Research, vol. 35, no. 6, pp. 430–437, 2002. View at Publisher · View at Google Scholar · View at Scopus
  15. H. J. Hinz, “Thermodynamics of protein-ligand interactions: calorimetric approaches,” Annual Review of Biophysics and Bioengineering, vol. 12, pp. 285–317, 1983. View at Publisher · View at Google Scholar · View at Scopus
  16. R. Perozzo, G. Folkers, and L. Scapozza, “Thermodynamics of protein-ligand interactions: history, presence, and future aspects,” Journal of Receptors and Signal Transduction, vol. 24, no. 1-2, pp. 1–52, 2004. View at Publisher · View at Google Scholar · View at Scopus
  17. L. Y. Lian, I. L. Barsukov, M. J. Sutcliffe, K. H. Sze, and G. C. K. Roberts, “Protein-ligand interactions: exchange processes and determination of ligand conformation and protein-ligand contacts,” Methods in Enzymology, vol. 239, pp. 657–700, 1994. View at Publisher · View at Google Scholar · View at Scopus
  18. M. Hendlich, A. Bergner, J. Günther, and G. Klebe, “Relibase: design and development of a database for comprehensive analysis of protein-ligand interactions,” Journal of Molecular Biology, vol. 326, no. 2, pp. 607–620, 2003. View at Publisher · View at Google Scholar · View at Scopus
  19. C. Taroni, S. Jones, and J. M. Thornton, “Analysis and prediction of carbohydrate binding sites,” Protein Engineering, vol. 13, no. 2, pp. 89–98, 2000. View at Publisher · View at Google Scholar · View at Scopus
  20. M. J. J. M. Zvelebil and M. J. E. Sternberg, “Analysis and prediction of the location of catalytic residues in enzymes,” Protein Engineering Design & Selection, vol. 2, no. 2, pp. 127–138, 1988. View at Publisher · View at Google Scholar · View at Scopus
  21. G. J. Bartlett, C. T. Porter, N. Borkakoti, and J. M. Thornton, “Analysis of catalytic residues in enzyme active sites,” Journal of Molecular Biology, vol. 324, no. 1, pp. 105–121, 2002. View at Publisher · View at Google Scholar · View at Scopus
  22. G. Mirceva and A. Kulakov, “Improvement of protein binding sites prediction by selecting amino acid residues’ features,” Journal of Structural Biology, vol. 189, no. 1, pp. 9–19, 2015. View at Publisher · View at Google Scholar
  23. P. Chen, J. Z. Huang, and X. Gao, “LigandRFs: random forest ensemble to identify ligand-binding residues from sequence information alone,” BMC Bioinformatics, vol. 15, supplement 15, p. S4, 2014. View at Publisher · View at Google Scholar
  24. C. Yan and Y. Wang, “A graph kernel method for DNA-binding site prediction,” BMC Systems Biology, vol. 8, supplement 4, article S10, 2014. View at Publisher · View at Google Scholar
  25. Y.-F. Liou, P. Charoenkwan, Y. Srinivasulu et al., “SCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptides,” BMC Bioinformatics, vol. 15, supplement 16, article S4, 2014. View at Publisher · View at Google Scholar
  26. S. C. Izidoro, R. C. de Melo-Minardi, and G. L. Pappa, “GASS: identifying enzyme active sites with genetic algorithms,” Bioinformatics, 2014. View at Publisher · View at Google Scholar
  27. B. Liu, B. Liu, F. Liu, and X. Wang, “Protein Binding Site Prediction by Combining Hidden Markov Support Vector Machine and Profile-Based Propensities,” The Scientific World Journal, vol. 2014, Article ID 464093, 6 pages, 2014. View at Publisher · View at Google Scholar
  28. B. Park, J. Im, N. Tuvshinjargal, W. Lee, and K. Han, “Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models,” Computer Methods and Programs in Biomedicine, vol. 117, no. 2, pp. 158–167, 2014. View at Publisher · View at Google Scholar
  29. Z.-R. Xie and M.-J. Hwang, “Methods for predicting protein—ligand binding sites,” in Molecular Modeling of Proteins, vol. 1215 of Methods in Molecular Biology, pp. 383–398, Springer, New York, NY, USA, 2015. View at Publisher · View at Google Scholar
  30. J. A. Capra and M. Singh, “Predicting functionally important residues from sequence conservation,” Bioinformatics, vol. 23, no. 15, pp. 1875–1882, 2007. View at Publisher · View at Google Scholar · View at Scopus
  31. J. D. Fischer, C. E. Mayer, and J. Söding, “Prediction of protein functional residues from sequence by probability density estimation,” Bioinformatics, vol. 24, no. 5, pp. 613–620, 2008. View at Publisher · View at Google Scholar · View at Scopus
  32. J.-O. Janda, M. Busch, F. Kück, M. Porfenenko, and R. Merkl, “CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure,” BMC Bioinformatics, vol. 13, no. 1, article 55, 2012. View at Publisher · View at Google Scholar · View at Scopus
  33. M. M. Gromiha and M. Suwa, “Influence of amino acid properties for discriminating outer membrane proteins at better accuracy,” Biochimica et Biophysica Acta, vol. 1764, no. 9, pp. 1493–1497, 2006. View at Publisher · View at Google Scholar · View at Scopus
  34. M. M. Gromiha and M. Suwa, “Discrimination of outer membrane proteins using machine learning algorithms,” Proteins: Structure, Function and Genetics, vol. 63, no. 4, pp. 1031–1037, 2006. View at Publisher · View at Google Scholar · View at Scopus
  35. M. X. Suresh, M. M. Gromiha, and M. Suwa, “Membrane protein-ligand interactions: new insights from statistical analysis and molecular modeling studies,” Seibutsu Butsuri, vol. 46, p. S408, 2006. View at Google Scholar
  36. M. X. Suresh, M. M. Gromiha, and M. Suwa, “Analysis and discrimination of ligand binding membrane proteins using a simple statistical approach,” in Proceedings of the 17th International Conference on Genome Informatics (GIW '06), p. 120, Yokohama, Japan, 2006.
  37. C. Elkan, “Boosting and naive Bayesian learning,” Tech. Rep. CS97-557, Department of Computer Science and Engineering, University of California, San Diego, Calif, USA, 1997. View at Google Scholar
  38. H. Zhang and J. Su, “Naive Bayesian classifiers for ranking,” in Proceedings of the 15th European Conference on Machine Learning (ECML '04), Pisa, Italy, 2004.
  39. H. M. Berman, J. Westbrook, Z. Feng et al., “The protein data bank,” Nucleic Acids Research, vol. 28, no. 1, pp. 235–242, 2000. View at Publisher · View at Google Scholar · View at Scopus
  40. G. E. Tusnády, Z. Dosztányi, and I. Simon, “PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank,” Nucleic Acids Research, vol. 33, supplement 1, pp. D275–D278, 2005. View at Publisher · View at Google Scholar · View at Scopus
  41. M. Ikeda, M. Arai, T. Okuno, and T. Shimizu, “TMPDB: a database of experimentally-characterized transmembrane topoligies,” Nucleic Acids Research, vol. 31, no. 1, pp. 406–409, 2003. View at Publisher · View at Google Scholar · View at Scopus
  42. P. Raman, V. Cherezov, and M. Caffrey, “The membrane protein data bank,” Cellular and Molecular Life Sciences, vol. 63, no. 1, pp. 36–51, 2006. View at Publisher · View at Google Scholar · View at Scopus
  43. S. H. White and W. C. Wimley, “Membrane protein folding and stability: physical principles,” Annual Review of Biophysics and Biomolecular Structure, vol. 28, pp. 319–365, 1999. View at Publisher · View at Google Scholar · View at Scopus
  44. M. G. McCammon and C. V. Robinson, “Structural change in response to ligand binding,” Current Opinion in Chemical Biology, vol. 8, no. 1, pp. 60–65, 2004. View at Publisher · View at Google Scholar · View at Scopus
  45. I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann, San Francisco, Calif, USA, 2005.
  46. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software: an update,” ACM SIGKDD Explorations Newsletter, vol. 11, no. 1, pp. 10–18, 2009. View at Publisher · View at Google Scholar
  47. S. F. Altschul, T. L. Madden, A. A. Schäffer et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Research, vol. 25, no. 17, pp. 3389–3402, 1997. View at Publisher · View at Google Scholar · View at Scopus
  48. M. M. Gromiha, M. Oobatake, and A. Sarai, “Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins,” Biophysical Chemistry, vol. 82, no. 1, pp. 51–67, 1999. View at Publisher · View at Google Scholar · View at Scopus
  49. M. M. Gromiha and M. X. Suresh, “Discrimination of mesophilic and thermophilic proteins using machine learning algorithms,” Proteins: Structure, Function and Genetics, vol. 70, no. 4, pp. 1274–1279, 2008. View at Publisher · View at Google Scholar · View at Scopus
  50. P. Baldi, S. Brunak, Y. Chauvin, C. A. F. Andersen, and H. Nielsen, “Assessing the accuracy of prediction algorithms for classification: an overview,” Bioinformatics, vol. 16, no. 5, pp. 412–424, 2000. View at Publisher · View at Google Scholar
  51. J. Zhang, P. Hellwig, J. P. Osborne et al., “Site-directed mutation of the highly conserved region near the Q-loop of the cytochrome bd quinol oxidase from Escherichia coli specifically perturbs heme b595,” Biochemistry, vol. 40, no. 29, pp. 8548–8556, 2001. View at Publisher · View at Google Scholar · View at Scopus
  52. J. Zhang, P. Hellwig, J. P. Osborne, and R. B. Gennis, “Arginine 391 in subunit I of the cytochrome bd quinol oxidase from Escherichia coli stabilizes the reduced form of the hemes and is essential for quinol oxidase activity,” The Journal of Biological Chemistry, vol. 279, no. 52, pp. 53980–53987, 2004. View at Publisher · View at Google Scholar · View at Scopus
  53. K. Yang, J. Zhang, A. S. Vakkasoglu et al., “Glutamate 107 in subunit I of the cytochrome bd quinol oxidase from Escherichia coli is protonated and near the heme d/heme b595 binuclear center,” Biochemistry, vol. 46, no. 11, pp. 3270–3278, 2007. View at Publisher · View at Google Scholar · View at Scopus
  54. M. M. Gromiha, Y. Yabuki, M. X. Suresh, A. M. Thangakani, M. Suwa, and K. Fukui, “TMFunction: database for functional residues in membrane proteins,” Nucleic Acids Research, vol. 37, no. 1, pp. D201–D204, 2009. View at Publisher · View at Google Scholar · View at Scopus