- About this Journal
- Abstracting and Indexing
- Aims and Scope
- Annual Issues
- Article Processing Charges
- Articles in Press
- Author Guidelines
- Bibliographic Information
- Citations to this Journal
- Contact Information
- Editorial Board
- Editorial Workflow
- Free eTOC Alerts
- Publication Ethics
- Reviewers Acknowledgment
- Submit a Manuscript
- Subscription Information
- Table of Contents
International Journal of Evolutionary Biology
Volume 2012 (2012), Article ID 342482, 5 pages
Comparative Analyses of Base Compositions, DNA Sizes, and Dinucleotide Frequency Profiles in Archaeal and Bacterial Chromosomes and Plasmids
Agricultural Bioinformatics Research Unit, Graduate School of Agricultural Sciences, University of Tokyo, Tokyo 113-8657, Japan
Received 25 November 2011; Revised 11 January 2012; Accepted 19 January 2012
Academic Editor: Hideaki Nojiri
Copyright © 2012 Hiromi Nishida. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
In the present paper, I compared guanine-cytosine (GC) contents, DNA sizes, and dinucleotide frequency profiles in 109 archaeal chromosomes, 59 archaeal plasmids, 1379 bacterial chromosomes, and 854 bacterial plasmids. In more than 80% of archaeal and bacterial plasmids, the GC content was lower than that of the host chromosome. Furthermore, most of the differences in GC content found between a plasmid and its host chromosome were less than 10%, and the GC content in plasmids and host chromosomes was highly correlated (Pearson’s correlation coefficient in bacteria and 0.917 in archaea). These results support the hypothesis that horizontal gene transfers have occurred frequently via plasmid distribution during evolution. GC content and chromosome size were more highly correlated in bacteria () than in archaea (). Interestingly, there was a tendency for archaea with plasmids to have higher GC content in the chromosome and plasmid than those without plasmids. Thus, the dinucleotide frequency profile of the archaeal plasmids has a bias toward high GC content.
DNA base composition, specifically guanine-cytosine (GC) content, is a bacterial taxonomic marker. For example, actinobacteria have high, whereas clostridia have low GC-containing genomes . In addition, assessing the dinucleotide frequency profile, a genome signature, of a genomic DNA sequence is a powerful tool to compare different chromosomes and plasmids [2–6]. In bacterial chromosomes, GC content and DNA size are correlated [7–10]. In bacterial phages, plasmids, and inserted sequences, the GC contents are lower than those of their host chromosomes .
Replication of and transcription from plasmid DNA are controlled mainly by factors encoded by the chromosome of the host organism. Therefore, it is hypothesized that the GC content and genome signature of a plasmid are similar to those of the chromosome of the host organism. In addition, it is believed that horizontal gene transfers have occurred frequently via plasmid distribution during evolution . For example, a cell-cell communication system may be distributed among the genus Streptomyces using horizontal gene transfer via plasmids .
Prokaryotes consist of 2 evolutionarily distinct groups: archaea and bacteria . Comparative genomics in bacteria is very advanced, while the whole genome sequence data of archaea is currently limited. Due to recent developments in DNA sequence technology, more than 100 archaeal genome sequences have been elucidated. In this study, I compared GC contents, DNA sizes, and dinucleotide frequency profiles in archaeal and bacterial chromosomes and plasmids.
2. Materials and Methods
In this study, 109 archaeal chromosomes, 59 archaeal plasmids, 1379 bacterial chromosomes, and 854 bacterial plasmids were used from the database OligoWeb, searching oligonucleotide frequencies (http://insilico.ehu.es/oligoweb/). According to the annotation of the database OligoWeb, chromosomes and plasmids were distinguished. Pearson’s correlation coefficient calculation, statistical tests, and drawing plots were performed using the software R (http://www.r-project.org/).
The 59 archaeal plasmids and 854 bacterial plasmids are distributed into 26 and 393 organisms, respectively. Some of the archaea and bacteria have 2 or 3 chromosomes. Therefore, in total, the 26 archaeal host organisms and 393 bacterial host organisms have 28 and 441 chromosomes, respectively. The GC contents of bacterial plasmids were found to be lower than those of the host chromosomes (Figure 1, Supplementary Table S1), which is consistent with a previous study . In addition, the GC contents of archaeal plasmids were also lower than those of the host chromosomes (Figure 2, Supplementary Table S2). Furthermore, 777 (81.5%) of the 953 pairs of bacterial chromosome and plasmid, and 57 (85.1%) of the 67 pairs of archaeal chromosome and plasmid showed that the plasmid GC content is lower than that of its host chromosome (Figure 3). In addition, 746 (78.3%) of the 953 bacterial pairs and 47 (70.1%) of the 67 archaeal pairs showed less than 10% difference between GC content of the plasmid and its host chromosome (Figure 3).
The GC contents in plasmids and the host chromosomes were highly correlated in both bacteria and archaea (Pearson’s correlation coefficient and , respectively; Figures 4 and 5, resp.). Furthermore, in terms of size, the GC content and chromosome size were more highly correlated in bacteria than archaea (Figures 6 and 7, Supplementary Tables S3 and S4). Pearson’s correlation coefficients between GC content and chromosome size of archaea and bacteria were 0.195 and 0.460, respectively. In archaea, organisms with high GC content chromosome tend to have plasmid (Figures 2 and 7). Thus, the dinucleotide frequency profile of the archaeal plasmids has a bias toward high GC content (Figure 8).
I hypothesize that GC content, a genomic signature, of a plasmid is related to host specificity and host range. Here, I showed that the GC content of a plasmid is lower than that of its host chromosome (Figures 1 and 2). However, in most cases, the difference in GC content between a plasmid and its host chromosome was less than 10% (Figure 3), strongly suggesting that host organisms cannot maintain and regulate plasmids with very different base compositions.
On the other hand, some organisms had a great difference in GC content between their chromosomes and plasmids. For example, in bacteria, Frankia symbiont of Datisca glomerata has the greatest difference (GC content of the chromosome is 70%; that of the plasmid pFSYMDG02 is 43.1%), and Desulfovibrio magneticus RS-1 has the second greatest difference (GC content of the chromosome is 62.8%; that of the plasmid pDMC2 is 37.2%) (Supplementary Table S1). I am so interested in the regulation system for these plasmids.
In this analysis, there was a tendency for plasmid-containing archaea to have higher GC content in the host chromosome and plasmid than those without plasmids (Figures 2, 5, and 7). I have no idea why archaea with mid- and low-GC chromosome tend to lack plasmids. The GC content bias was not found in bacteria (Figures 1, 4, and 6). Thus, although the dinucleotide frequency profiles between the bacterial chromosomes and plasmids were similar, those between the archaeal chromosomes and plasmids were different (Figure 8).
GC content and chromosome size in bacteria are weakly correlated (), which is consistent with previous reports [7–10]. However, the GC content and chromosome size in archaea are less correlated (). Considering these results, the relationship between GC content and chromosome size may differ in archaea and bacteria. In order to understand the high GC content bias of archaeal plasmids and elucidate the relationship between GC content and chromosome size in archaea, more archaeal genome sequence data are needed.
The author thanks Professor Teruhiko Beppu for his valuable comments.
- N. Sueoka, “Variation and heterogeneity of base composition of deoxyribonucleic acids: a compilation of old and new data,” Journal of Molecular Biology, vol. 3, no. 1, pp. 31–40, 1961.
- A. Campbell, J. Mrázek, and S. Karlin, “Genome signature comparisons among prokaryote, plasmid, and mitochondrial DNA,” Proceedings of the National Academy of Sciences of the United States of America, vol. 96, no. 16, pp. 9184–9189, 1999.
- M. W. J. van Passel, A. Bart, A. C. M. Luyf, A. H. C. van Kampen, and A. van der Ende, “Compositional discordance between prokaryotic plasmids and host chromosomes,” BMC Genomics, vol. 7, article 26, 2006.
- J. Mrázek, “Phylogenetic signals in DNA composition: limitations and prospects,” Molecular Biology and Evolution, vol. 26, no. 5, pp. 1163–1169, 2009.
- H. Suzuki, H. Yano, C. J. Brown, and E. M. Top, “Predicting plasmid promiscuity based on genomic signature,” Journal of Bacteriology, vol. 192, no. 22, pp. 6045–6055, 2010.
- H. Suzuki, M. Sota, C. J. Brown, and E. M. Top, “Using Mahalanobis distance to compare genomic signatures between bacterial plasmids and chromosomes,” Nucleic Acids Research, vol. 36, no. 22, article e147, 2008.
- H. Musto, H. Naya, A. Zavala, H. Romero, F. Alvarez-Valín, and G. Bernardi, “Genomic GC level, optimal growth temperature, and genome size in prokaryotes,” Biochemical and Biophysical Research Communications, vol. 347, no. 1, pp. 1–3, 2006.
- D. Mitchell, “GC content and genome length in Chargaff compliant genomes,” Biochemical and Biophysical Research Communications, vol. 353, no. 1, pp. 207–210, 2007.
- F. B. Guo, H. Lin, and J. Huang, “A plot of G + C content against sequence length of 640 bacterial chromosomes shows the points are widely scattered in the upper triangular area,” Chromosome Research, vol. 17, no. 3, pp. 359–364, 2009.
- S. D. Bentley and J. Parkhill, “Comparative genomic structure of prokaryotes,” Annual Review of Genetics, vol. 38, pp. 771–792, 2004.
- E. P. C. Rocha and A. Danchin, “Base composition bias might result from competition for metabolic resources,” Trends in Genetics, vol. 18, no. 6, pp. 291–294, 2002.
- J. Davison, “Genetic exchange between bacteria in the environment,” Plasmid, vol. 42, no. 2, pp. 73–91, 1999.
- H. Nishida, Y. Ohnishi, T. Beppu, and S. Horinouchi, “Evolution of γ-butyrolactone synthases and receptors in Streptomyces,” Environmental Microbiology, vol. 9, no. 8, pp. 1986–1994, 2007.
- C. R. Woese, O. Kandler, and M. L. Wheelis, “Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya,” Proceedings of the National Academy of Sciences of the United States of America, vol. 87, no. 12, pp. 4576–4579, 1990.