Advances in Artificial Neural Systems
Volume 2011 (2011), Article ID 617427, 8 pages
Research Article

Soft Topographic Maps for Clustering and Classifying Bacteria Using Housekeeping Genes

ICAR-CNR, Consiglio Nazionale delle Ricerche, Viale delle Scienze, Ed.11, 90128 Palermo, Italy

Received 11 May 2011; Revised 13 July 2011; Accepted 26 July 2011

Academic Editor: Tomasz G. Smolinski

Copyright © 2011 Massimo La Rosa et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. G. M. Garrity, B. A. Julia, and T. Lilburn, “The revised road map to the manual,” in Bergey's Manual of Systematic Bacteriology, G. M. Garrity, Ed., pp. 159–187, Springer, New York, NY, USA, 204.
  2. C. R. Woese, E. Stackebrandt, T. J. Macke, and G. E. Fox, “A phylogenetic definition of the major eubacterial taxa,” Systematic and Applied Microbiology, vol. 6, no. 2, pp. 143–151, 1985.
  3. I. T. Jolliffe, Principal Component Analysis, Springer, New York, NY, USA, 1986.
  4. J. E. Clarridge, “Impact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases,” Clinical Microbiology Reviews, vol. 17, no. 4, pp. 840–862, 2004.
  5. M. Drancourt, C. Bollet, A. Carlioz, R. Martelin, J.-P. Gayral, and D. Raoult, “16S ribosomal DNA sequence analysis of a large collection of environmental and clinical unidentifiable bacterial isolates,” Journal of Clinical Microbiology, vol. 38, pp. 3623–3630, 2000.
  6. M. Drancourt, P. Berger, and D. Raoult, “Systematic 16S rRNA gene sequencing of atypical clinical isolates identified 27 new bacterial species associated with humans,” Journal of Clinical Microbiology, vol. 42, no. 5, pp. 2197–2202, 2004.
  7. M. Oja, P. Somervuo, S. Kaski, and T. Kohonen, “Clustering of human endogenous retrovirus sequences with median self-organizing map,” in Proceedings of the Workshop on Self-Organizing Maps (WSOM '03), 2003.
  8. M. Oja, G. O. Sperber, J. Blomberg, and S. Kaski, “Self-organizing map-based discovery and visualization of human endogenous retroviral sequence groups,” International Journal of Neural Systems, vol. 15, no. 3, pp. 163–179, 2005.
  9. W. R. Pearson and D. J. Lipman, “Improved tools for biological sequence comparison,” Proceedings of the National Academy of Sciences of the United States of America, vol. 85, no. 8, pp. 2444–2448, 1988.
  10. T. Kohonen and P. Somervuo, “How to make large self-organizing maps for nonvectorial data,” Neural Networks, vol. 15, no. 8-9, pp. 945–952, 2002.
  11. T. Kohonen, Self-Organizing Maps, Springer, Berlin, Germany, 1995.
  12. P. Somervuo and T. Kohonen, “Clustering and visualization of large protein sequence databases by means of an extension of the self-organizing map,” in Proceedings of the 3rd International Conference on Discovery Science, pp. 76–85, 2000.
  13. Y. Chen, K. D. Reilly, A. P. Sprague, and Z. Guan, “Seqoptics: a protein sequence clustering method,” in Proceedings of the 1st International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06), vol. 1, pp. 69–75, June 2006.
  14. M. Ankerst, M. M. Breunig, H. P. Kriegel, and J. Sander, “Optics: ordering points to identify the clustering structure,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 49–60, Philadelphia, Pa, USA, June 1999.
  15. A. J. Butte and I. S. Kohane, “Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements,” in Proceedings of the Pacific Symposium on Biocomputing, vol. 5, pp. 415–426, 2000.
  16. M. Remm, C. E. V. Storm, and E. L. L. Sonnhammer, “Automatic clustering of orthologs and in-paralogs from pairwise species comparisons,” Journal of Molecular Biology, vol. 314, no. 5, pp. 1041–1052, 2001.
  17. S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, “Basic local alignment search tool,” Journal of Molecular Biology, vol. 232, pp. 584–599, 1993.
  18. G. M. Garrity and T. G. Lilburn, “Self-organizing and self-correcting classifications of biological data,” Bioinformatics, vol. 21, no. 10, pp. 2309–2314, 2005.
  19. C. Fyfe, W. Barbakh, W. C. Ooi, and H. Ko, “Topological mappings of video and audio data,” International Journal of Neural Systems, vol. 18, no. 6, pp. 481–489, 2008.
  20. M. La Rosa, G. Di Fatta, S. Gaglio, G. M. Giammanco, R. Rizzo, and A. M. Urso, “Soft topographic map for clustering and classification of bacteria,” in Advances in Intelligent Data Analysis VII, vol. 4723 of Lecture Notes in Computer Science, pp. 332–343, 2007.
  21. J. D. Thompson, D. G. Higgins, and T. J. Gibson, “CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice,” Nucleic Acids Research, vol. 22, no. 22, pp. 4673–4680, 1994.
  22. S. B. Needleman and C. D. Wunsch, “A general method applicable to the search for similarities in the amino acid sequence of two proteins,” Journal of Molecular Biology, vol. 48, no. 3, pp. 443–453, 1970.
  23. T. H. Jukes and C. R. Cantor, “Evolution of protein molecules,” in Mammalian Protein Metabolism, H. N. Munro, Ed., pp. 21–132, Academic Press, New York, NY, USA, 1969.
  24. A. N. Gorban and A. Zinovyev, “Principal manifolds and graphs in practice: from molecular biology to dynamical systems,” International Journal of Neural Systems, vol. 20, no. 3, pp. 219–232, 2010.
  25. W. Barbakh and C. Fyfe, “Online clustering algorithms,” International Journal of Neural Systems, vol. 18, no. 3, pp. 185–194, 2008.
  26. S. P. Luttrell, “A Bayesian analysis of self-organizing maps,” Neural Computation, vol. 6, pp. 767–794, 1994.
  27. T. Graepel, M. Burger, and K. Obermayer, “Self-organizing maps: generalizations and new optimization techniques,” Neurocomputing, vol. 21, no. 1–3, pp. 173–190, 1998.
  28. T. Graepel and K. Obermayer, “A stochastic self-organizing map for proximity data,” Neural Computation, vol. 11, no. 1, pp. 139–155, 1999.
  29. GenBank, 2007, http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Nucleotide.
  30. Fasta, 2007, http://www.ncbi.nlm.nih.gov/blast/fasta.shtml.
  31. A. Ultsch, “Maps for the visualization of high dimensional data spaces,” in Proceedings of the Workshop on Self-Organizing Maps (WSOM '03), vol. 3, pp. 225–230, 2003.
  32. D. Vidaurre and J. Muruzábal, “A quick assessment of topology preservation for SOM structures,” IEEE Transactions on Neural Networks, vol. 18, no. 5, pp. 1524–1528, 2007.
  33. E. W. Weisstein, The CRC Concise Encyclopedia of Mathematics, CRC Press, New York, NY, USA, 1999.
  34. M. Li, X. Chen, X. Li, B. Ma, and P. M. B. Vitanyi, “The similarity metric,” IEEE Transactions on Information Theory, vol. 50, no. 12, pp. 3250–3264, 2004.
  35. M. La Rosa, S. Gaglio, R. Rizzo, and A. Urso, “Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results,” International Journal of Knowledge Engineering and Soft Data Paradigms, vol. 1, no. 4, pp. 345–362, 2009.
  36. W. S. Torgerson, “Multidimensional scaling: I. Theory and method,” Psychometrika, vol. 17, pp. 401–419, 1952.