Functional Genomics, Genetics, and BioinformaticsView this Special Issue
Research Article | Open Access
Dong Yu, Yuan Jin, Zhiqiu Yin, Hongguang Ren, Wei Zhou, Long Liang, Junjie Yue, "A Genome-Wide Identification of Genes Undergoing Recombination and Positive Selection in Neisseria", BioMed Research International, vol. 2014, Article ID 815672, 9 pages, 2014. https://doi.org/10.1155/2014/815672
A Genome-Wide Identification of Genes Undergoing Recombination and Positive Selection in Neisseria
Currently, there is particular interest in the molecular mechanisms of adaptive evolution in bacteria. Neisseria is a genus of gram negative bacteria, and there has recently been considerable focus on its two human pathogenic species N. meningitidis and N. gonorrhoeae. Until now, no genome-wide studies have attempted to scan for the genes related to adaptive evolution. For this reason, we selected 18 Neisseria genomes (14 N. meningitidis, 3 N. gonorrhoeae and 1 commensal N. lactamics) to conduct a comparative genome analysis to obtain a comprehensive understanding of the roles of natural selection and homologous recombination throughout the history of adaptive evolution. Among the 1012 core orthologous genes, we identified 635 genes with recombination signals and 10 genes that showed significant evidence of positive selection. Further functional analyses revealed that no functional bias was found in the recombined genes. Positively selected genes are prone to DNA processing and iron uptake, which are essential for the fundamental life cycle. Overall, the results indicate that both recombination and positive selection play crucial roles in the adaptive evolution of Neisseria genomes. The positively selected genes and the corresponding amino acid sites provide us with valuable targets for further research into the detailed mechanisms of adaptive evolution in Neisseria.
Homologous recombination and positive selection are two indispensable sources of genetic variation and play central roles in the adaptive evolution of many bacteria species [1, 2]. Of the two mechanisms, homologous recombination occurs frequently in some bacteria, such as Streptomyces , Helicobacter pylori , and Neisseria , and could possibly speed adaptation by reducing competition between beneficial mutations . There is also evidence for positive selection in specific genes in certain pathogens, such as Listeria monocytogenes , Salmonella , Streptococcus , Campylobacter , and Actinobacilus pleuropneumoniae . These positively selected genes are usually involved in the dynamic interaction between host and pathogen [12, 13].
At present, there are well-developed methods for detecting genes undergoing recombination and selection. Phi  and GENECONV  are two common methods used to detect recombination based on different statistical tests. The -based method is typically used to estimate the ratio of the rate of nonsynonymous nucleotide substitutions to that of synonymous substitutions [16, 17]. This ratio indicates whether a gene has been under positive selection (), neutral selection (), or purifying selection (). Combined with the codon models developed by Nielsen and Yang [16, 18], which allow variation in among sites, this method can identify positive selection signals when there are only few positive sites. All these methods will be employed in this study to detect the genes with the history of recombination or positive selection.
Neisseria is a genus of bacteria that colonizes the mucosal surfaces of many animals. Of the known 14 species, only 2 species, Neisseria meningitides and Neisseria gonorrhoeae, are human pathogens; and the remainders are all commensal or nonpathogenic. Until now, there have been many comparative genomic studies on the genomic evolution of these two pathogenic species [5, 19–27]. Homologous recombination has been found to play a key role in the adaptive evolution of Neisseria; however, few studies have characterised the effect of positive selection on the Neisseria genome. Only two genes, porB  and pilE , have received attention, and both have undergone strong positive selection pressure. In this study, we used the genome sequences available for the strains of N. meningitidis, N. gonorrhoeae, and nonpathogenic N. lactamica to investigate the contributions of recombination and positive selection to the evolution of Neisseria genomes. Considering the high sequence diversity and open pan-genome, we focused on the core genome genes during our scan for recombined genes and positively selected genes. Statistical tests and a literature review were conducted to determine the association between genes and the properties of this genus.
2. Materials and Methods
2.1. Data Preparation
Eighteen genome sequences of Neisseria, including complete proteomes and the corresponding coding genes, were retrieved from the NCBI Genome database (http://www.ncbi.nlm.nih.gov/genome/bacteria/). Detailed information, such as Genbank ID and genome size, is listed in Table 1. The COGs (clusters of orthologous groups of proteins) functional classification for each proteome was conducted with ID mapping from the Uniprot database . Then, using Neisseria gonorrhoeae FA 1090 as the reference genome, stand-alone BLAST was performed against the proteomes of the remaining 17 strains for homologs (sequence identity > 80% and alignment coverage > 80%) of each of the FA_1090 proteins. For each of the core genes from FA_1090, BLAST was performed against all 18 genomes (including the reference genome) with the same thresholds, and multiple copies in any genome were reported and removed from further analysis. The remaining core proteins were defined as the core orthologs of Neisseria.
2.2. Alignment and Calculation of Nucleotide Diversity, Informative Sites, Codon Bias, dN, and dS
The orthologous protein sequences were aligned using the method implemented in muscle . Then, multiple codon alignments of genes corresponding to protein sequence alignments were obtained using PAL2NAL . Using the resulting gene alignments, the gene-by-gene number of informative sites and the nucleotide diversity were obtained from the output of the PhiPack program .
In this study, the effective number of codons (Nc) was used to measure the codon bias. The Nc value ranges from 20 for the strongest bias to 61 for no bias , and the program CodonW (http://sourceforge.net/projects/codonw/) was used to calculate the values of Nc for each gene. The number of synonymous nucleotide substitutions per synonymous site (dS) and the number of nonsynonymous nucleotide substitutions per nonsynonymous site (dN) were estimated from the gene alignments using the program SNAP .
2.3. Detection of Recombination
Four statistical procedures GENECONV , pairwise homoplasy index (Phi) , maximum , and neighbor similarity score (NSS)  were run on the aligned genes to discover the homologous recombination signals. For the analyses of GENECONV, the parameter -scale was set to 1, which allows mismatches within a recombining fragment. The values were calculated from 10000 random permutations of the data. The remaining three programs were implemented in the PhiPack package and were run with default parameters.
2.4. Detection of Selection
FastTree  was used to construct maximum likelihood phylogenetic trees with a general time-reversible (GTR) model of nucleotide substitution for each gene alignment. The resulting topologies of ML trees were applied to subsequent selection analysis.
The codeml program from PAML  was used to detect the genes under positive selection. Two site-specific models were applied: the null model M1a (nearly neutral) and the alternative model M2a (positive selection); the two models differ by the statistical distribution assumed for the ratio. The latter model allows sites with , whereas the former only allows sites with varying between 0 and 1. To ensure convergence to the best likelihood, all calculations were performed three times. A likelihood ratio test (LRT) was then carried out to infer the occurrence of sites under positive selection pressure through comparing M1a against M2a. values were determined from the LRT scores calculated by the module of the PAML package.
2.5. Statistical Analysis
Correction for multiple testing was performed using the method presented by Benjamini and Hochberg . For all genes tested for recombination and positive selection, -values were calculated for each value using the package [40, 41] (-value with the proportion of true null hypothesis set to 1). According to the conservation of tests, false discovery rates of 10% and 20% were used for the recombination analyses and positive selection detection, respectively.
The significance level for differences among the properties, including nucleotide diversity, codon bias, dS, and dN, between a COG and other COGs was determined using the nonparametric Mann-Whitney -test. Correlation between each COG and evolutionary forces (homologous recombination and positive selection) was estimated using a binomial test. Then, Bonferroni corrections for multiple comparisons were performed according to the number of one-sided tests. The significance level was set to 5%. All statistical tests were carried out using Python scripts and .
3. Results and Discussion
3.1. Characterization of the Orthologous Genes in 18 Neisseria Genomes
Previous studies [42–45] showed that both intraspecies and interspecies recombination could act as the important genetic mechanism in generating new clones and alleles in Neisseria. The genus Neisseria consists of two important pathogenic species and a dozen species that are never or rarely pathogenic. At present, there are only 18 completely sequenced genomes of genus Neisseria available, including 14 N. meningitidis, 3 N. gonorrhoeae, and 1 N. lactamics genomes. Thus, we selected all 18 genomes to conduct a genome-wide scan for the identification of genes exhibiting recombination or positively selected signals.
The phylogenetic relationships of the 18 strains were first established based on the 7 housekeeping genes frequently used for multilocus sequence typing (MLST) analysis of Neisseria: abcZ, adk, aroE, fumC, gdh, pdhC, and pgm . The 7 genes were concatenated to construct a maximum likelihood tree with high bootstrap values as shown in Figure 1. In the tree, the three species were divided into three clades and formed a monophyly, respectively.
In the next step, N. gonorrhoeae FA 1090 was used as the reference genome to perform a BLAST search against the other 17 Neisseria genomes for orthologs. Finally, 1034 genes were identified as present in all 18 genomes, containing the initial definition of the core genome for these Neisseria strains and accounting for 38.73% to 55.45% of the coding genes in each genome. This proportion is similar to that in previous analysis of Neisseria meningitides genomes [5, 27]. Of the 1034 core genes, 22 genes occurred as two or more copies in some genomes and were excluded from further analysis. The remaining 1012 genes with a single copy per genome were then defined as the core orthologous genes for subsequent analysis of homologous recombination and natural selection.
Among these genes, genes in COGs “Replication, recombination, and repair” were found to show higher nucleotide diversity than genes in other COGs (Table 2). For the association between the number of informative sites and COGs, the same result was obtained, which means genes in category “Replication, recombination, and repair” also had more informative sites than genes in other COGs (Table 2).
|“>” or “<” indicates the direction of the one-sided tests (i.e. “>Codon bias” shows Bonferroni-corrected -values for associations between genes in a given COG and higher codon bias as compared to the genes in other COGs, and “<Codon bias” represents a contrast tendency).|
Tests for codon bias were performed using Nc values (a lower Nc means increased codon bias).
The effective number of codons, abbreviated as Nc, was used to measure the codon bias for each orthologous gene. Genes categorised into the COG “Translation, ribosomal structure and biogenesis” were evident to have a significant higher codon bias compared with genes in other COG categories (Table 2). It is well known that genes with a lower Nc can have a strong bias and are more likely to be highly expressed [47–49]. So, the genes in the two COGs might present housekeeping features in the fundamental life cycle and essential physiological activities of Neisseria.
In the same way, an association between COGs and dN or dS was also observed. There were 4 COGs in which genes were found to have higher rates of synonymous nucleotide substitutions in comparison with other categories. On the other hand, genes in the other 4 COGs also showed a tendency to have higher rates of nonsynonymous substitutions in comparison with genes in other COGs (Table 2). It is worth noting that all the genes in the core genome in Neisseria had higher dS and dN rates than the genes in other bacteria, for example, E. coli  and A. pleuropneumoniae , indicating that strong natural selection might act on Neisseria.
3.2. A Considerable Number of Genes Showing Evidence of Recombination
Until now, there were several different strategies for identifying the homologous recombination regions in sequences. In this study, four common statistical test methods, including NSS, Max-χ2, Phi, and GENECONV, were employed to detect the recombination signals among the 1012 orthologous genes. As a result, a total of 996 genes (98.4% of all 1012 core genome genes) were found to show significant evidence (FDR < 10%) of recombination by at least one of the four tests. Overall, 951, 968, 842, and 727 genes were identified to show significant evidence of recombination by NSS, Max-χ2, Phi, and GENECONV, respectively. Additionally, a total of 635 genes (62.7% of 1012 core genome genes) were showed recombination signals in all four tests. The proportion of genes undergoing recombination ranged from 62.7% to 98.4%, which is higher than those typically observed in other bacteria, such as E. coli. The result suggests that homologous recombination plays an important role in the evolution of Neisseria genomes.
In a previous work , Joseph et al. identified 459 ortholog genes with signs of recombination in Neisseria meningitidis genomes, which accounts for 39.6% of all core genome genes. In this work, only Neisseria meningitidis genomes were for recombination test, the abovementioned 459 orthologous genes with signs of recombination could be considered intraspecies recombinations. In our present work, in addition to the N. meningitidis genomes, the genomes of Neisseria gonorrhoeae, and Neisseria lactamica were also selected for the recombination analyses and several interspecies recombination genes were identified. The interspecies recombination events in the genus Neisseria have been reported many times [44–46]. It is not surprising that the proportion of genes with recombination signals in the present work is markedly higher than the value observed by Joseph et al. It can be deduced that both intraspecies and interspecies recombination could act as important genetic mechanisms for generating new clones and alleles  in Neisseria.
To test whether the high percentage of core genome genes with a recombination signal is caused by the choice of genomes, we carried out the same analysis on the 14 N. meningitidis genome sequences with the same parameters. We first obtained 1211 orthologous genes with a single copy per genome. Among these orthologous genes, 634 (52.4%) genes were identified to show significant evidence of recombination by all the four tests. In this case, a lower percentage of genes with recombination signals were identified, confirming that the choice of genomes really has an impact on the percentage of recombined genes in the core genome. It also indicated that interspecies recombination indeed has a role in the evolution of Neisseria genomes. Additionally, a higher proportion of genes with recombination signals were observed in these 14 N. meningitidis genomes compared with the results in Joseph’s work. The reason could lie in the differences in the specific genomes in both analyses, suggesting that intraspecies recombination plays an unexpected role in the evolution of the N. meningitidis genome. In a word, recombination acts as an important and irreplaceable genetic mechanism in shaping the genomes of genus Neisseria.
Moreover, it is worth noting that the core genes identified as recombinants have high rates of dS and dN, nucleotide diversity and the number of information sites (, , and , respectively, one-sided -test). The association between COG categories and the number of recombined genes was also estimated (Figure 2). Only two COGs “general function prediction only” and “function unknown” were significantly overrepresented with recombined genes. However, after Bonferroni correction, all the genes exhibiting evidence of recombination were distributed with no significance in all COGs. This unbiasedness of recombined genes in function further confirmed the role of recombination in shaping genomes during the evolution of Neisseria.
3.3. 10 Genes Showing Evidence of Positive Selection
The detection of positive selection for the 1012 orthologs was conducted in PAML, and models M1a and M2a of variable selective pressure across codon sites were used to estimate selective pressure and test for positive selection. Based on LRT statistics for comparing the null model and alternative model with distribution and correction for multiple testing (FDR < 20%), a total of 10 genes were identified to be under strong selected pressure. Of the 10 genes, 4 belonged to the COG “Replication, recombination, and repair”, and 3 were in the COG “Inorganic ion transport and metabolism.” The remaining three genes were classified into the “cell wall/membrane/envelope biogenesis,” “nucleotide transport and metabolism,” and “function unknown”, respectively (Table 3).
In the same way, two obvious discrepancies were observed, respectively, for values of dS and the number of informative sites between genes under positive selection and the remaining genes ( and , one-sided -test). Furthermore, all 10 positively selected genes were found to show significant evidence of recombination detected by at least one recombination test. Only one gene was not in the genes identified by all four tests. The probable reason for this is that recombination could form phylogenetic incongruence [51, 52].
Compared to the high proportion of recombined genes, few positively selected genes (10) were identified, accounting for approximately 1% of the core genome. Similar proportion was also obtained in E. coli , but is smaller than those of other pathogenic bacteria, such as A. pleuropneumoniae .
Among the protein products encoded by the 10 positively selected genes, only 8 proteins were annotated with definite functions. We found that these proteins were either involved in DNA processing or inorganic transport and metabolism.
Of the 10 genes, recB, encoding the DNA helicase, is an integral part of recBCD homologous recombined enzyme. Mutations in recB are required for double-strand break repair  and can also reduce the frequency of many types of recombination events . dnaE, dnaX and polA are all DNA polymerase genes. The first two encode the polymerase iii subunits, and the last encodes polymerase I. All three play fundamental roles in DNA metabolism, including DNA replication, recombination, and repair. In a word, positive selection on the four genes might ensure the strain to adapt to frequent recombination in the genomes.
AmtB encodes an ammonium transporter and is involved in ammonium transmembrane transporter activity. uraA encodes a uracil permease involved in transmembrane transport as well and acts as a membrane-bound facilitator for the transport of uracil across the cell membrane into the cytoplasm ; it is therefore necessary for uracil uptake, especially at low exogenous uracil concentrations and even under conditions with high UPRTase activity.
Hup encodes a TonB-dependent receptor that utilizes heme as an iron source . It has been reported that mutations in the hemoglobin receptor gene have profound effects on the survival of N. meningitidis in an infant rat, indicating that this gene is important for the virulence of Neisseria .
FrpB is clearly a virulence gene, encoding an iron-regulated outer membrane protein. It is a member of the TonB-dependent transporter family and is responsible for iron uptake into the periplasm. FrpB is subject to a high degree of antigenic variation, principally through a region of hypervariable sequence exposed on the cell surface [58, 59].
In a word, the four genes play important roles in the uptake of nutrition. So the adaptive changes in these proteins might be beneficial for Neisseria to survive in the host.
Our analysis reported here indicates that both homologous recombination and positive selection play important roles in the evolution of the core genome in Neisseria. Additionally, homologous recombination has a greater contribution to the genetic variation of a large number of genes with recombination signals. Only 10 genes were identified to be under positive selection, which also showed significant evidence of recombination. However, the positively selected genes were found to be involved in DNA processing or located on the cell membrane. The former reduce the frequency of recombination and enables a stable genetic environment, while the latter maintain a dynamic interaction with the external environment, as well as with the host. Overall, the changes in these positively selected genes result in an improvement in bacterial fitness in response to a variety of environmental signals. These genes can be regarded as a screened gene set for further analysis of the mechanisms of adaptive evolution in Neisseria.
Conflict of Interests
The authors declare that they have no conflict of interests.
Junjie Yue and Long Liang formulated the study. Dong Yu performed the research. Yuan Jin and Zhiqiu Yin analysed the data. Hongguang Ren and Wei Zhou participated in analysis and discussion. Dong Yu wrote the paper. All authors read and approved the final paper.
This work was supported by the National Key Program for Infectious Diseases of China (2011ZX10004-001), the National Basic Research Program of China (2013CB910804), and the Innovation Foundation of AMMS (No. 2012CXJJ023).
- G. Bell, Selection: The Mechanism of Evolution, Chapman & Hall, 1997.
- B. Alberts, A. Johnson, J. Lewis et al., “DNA replication, repair, and recombination,” in Molecular Biology of the Cell, p. 845, Garland Science, 2002.
- J. R. Doroghazi and D. H. Buckley, “Widespread homologous recombination within and between Streptomyces species,” ISME Journal, vol. 4, no. 9, pp. 1136–1143, 2010.
- S. Suerbaum, J. Maynard Smith, K. Bapumia et al., “Free recombination within Helicobacter pylori,” Proceedings of the National Academy of Sciences of the United States of America, vol. 95, no. 21, pp. 12619–12624, 1998.
- B. Joseph, R. F. Schwarz, B. Linke et al., “Virulence evolution of the human pathogen neisseria meningitidis by recombination in the core and accessory genome,” PLoS ONE, vol. 6, no. 4, Article ID e18441, 2011.
- T. F. Cooper, “Recombination speeds adaptation by reducing competition between beneficial mutations in populations of Escherichia coli,” PLoS Biology, vol. 5, article e225, no. 9, 2007.
- Y. L. Tsai, S. B. Maron, P. McGann, K. K. Nightingale, M. Wiedmann, and R. H. Orsi, “Recombination and positive selection contributed to the evolution of Listeria monocytogenes lineages III and IV, two distinct and well supported uncommon L. monocytogenes lineages,” Infection, Genetics and Evolution, vol. 11, no. 8, pp. 1881–1890, 2011.
- Y. Soyer, R. H. Orsi, L. D. Rodriguez-Rivera, Q. Sun, and M. Wiedmann, “Genome wide evolutionary analyses reveal serotype specific patterns of positive selection in selected Salmonella serotypes,” BMC Evolutionary Biology, vol. 9, no. 1, article 264, 2009.
- T. Lefébure and M. J. Stanhope, “Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition,” Genome Biology, vol. 8, no. 5, article R71, 2007.
- T. Lefébure and M. J. Stanhope, “Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter,” Genome Research, vol. 19, no. 7, pp. 1224–1232, 2009.
- Z. Xu, H. Chen, and R. Zhou, “Genome-wide evidence for positive selection and recombination in Actinobacillus pleuropneumoniae,” BMC Evolutionary Biology, vol. 11, no. 1, article 203, 2011.
- L. Petersen, J. P. Bollback, M. Dimmic, M. Hubisz, and R. Nielsen, “Genes under positive selection in Escherichia coli,” Genome Research, vol. 17, no. 9, pp. 1336–1343, 2007.
- R. C. Brunham, F. A. Plummer, and R. S. Stephens, “Bacterial antigenic variation, host immune response, and pathogen-host coevolution,” Infection and Immunity, vol. 61, no. 6, pp. 2273–2276, 1993.
- T. C. Bruen, H. Philippe, and D. Bryant, “A simple and robust statistical test for detecting the presence of recombination,” Genetics, vol. 172, no. 4, pp. 2665–2681, 2006.
- S. Sawyer, “Statistical tests for detecting gene conversion,” Molecular Biology and Evolution, vol. 6, no. 5, pp. 526–538, 1989.
- Z. Yang, R. Nielsen, N. Goldman, and A. K. Pedersen, “Codon-substitution models for heterogeneous selection pressure at amino acid sites,” Genetics, vol. 155, no. 1, pp. 431–449, 2000.
- Z. Yang and J. R. Bielawski, “Statistical methods for detecting molecular adaptation,” Trends in Ecology and Evolution, vol. 15, no. 12, pp. 496–503, 2000.
- R. Nielsen and Z. Yang, “Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene,” Genetics, vol. 148, no. 3, pp. 929–936, 1998.
- J. C. Dunning Hotopp, R. Grifantini, N. Kumar et al., “Comparative genomics of Neisseria meningitidis: core genome, islands of horizontal transfer and pathogen-specific genes,” Microbiology, vol. 152, part 12, pp. 3733–3749, 2006.
- B. Joseph, S. Schneiker-Bekel, A. Schramm-Glück et al., “Comparative genome biology of a serogroup B carriage and disease strain supports a polygenic nature of meningococcal virulence,” Journal of Bacteriology, vol. 192, no. 20, pp. 5363–5377, 2010.
- M. Unemo and W. M. Shafer, “Antibiotic resistance in Neisseria gonorrhoeae: origin, evolution, and lessons learned for the future,” Annals of the New York Academy of Sciences, vol. 1230, pp. E19–E28, 2011.
- D. A. Caugant, “Genetics and evolution of Neisseria meningitidis: importance for the epidemiology of meningococcal disease,” Infection, Genetics and Evolution, vol. 8, no. 5, pp. 558–565, 2008.
- J. S. Bennett, S. D. Bentley, G. S. Vernikos et al., “Independent evolution of the core and accessory gene sets in the genus Neisseria: insights gained from the genome of Neisseria lactamica isolate 020-06,” BMC Genomics, vol. 11, no. 1, article 652, 2010.
- C. O. Buckee, K. A. Jolley, M. Recker et al., “Role of selection in the emergence of lineages and the evolution of virulence in Neisseria meningitidis,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 39, pp. 15082–15087, 2008.
- J. S. Bennett, K. A. Jolley, P. F. Sparling et al., “Species status of Neisseria gonorrhoeae: evolutionary and epidemiological inferences from multilocus sequence typing,” BMC Biology, vol. 5, article 35, 2007.
- K. A. Jolley, D. J. Wilson, P. Kriz, G. McVean, and M. C. J. Maiden, “The influence of mutation, recombination, population history, and selection on patterns of genetic diversity in Neisseria meningitidis,” Molecular Biology and Evolution, vol. 22, no. 3, pp. 562–569, 2005.
- C. Schoen, J. Blom, H. Claus et al., “Whole-genome comparison of disease and carriage strains provides insights into virulence evolution in Neisseria meningitidis,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 9, pp. 3473–3478, 2008.
- N. H. Smith, J. M. Smith, and B. G. Spratt, “Sequence evolution of the porB gene of Neisseria gonorrhoeae and Neisseria meningitidis: evidence of positive Darwinian selection,” Molecular Biology and Evolution, vol. 12, no. 3, pp. 363–370, 1995.
- T. D. Andrews and T. Gojobori, “Strong positive selection and recombination drive the antigenic variation of the PilE protein of the human pathogen Neisseria meningitidis,” Genetics, vol. 166, no. 1, pp. 25–32, 2004.
- “Activities at the Universal Protein Resource (UniProt),” Nucleic Acids Research, vol. 42, pp. D191–D198, 2014.
- R. C. Edgar, “MUSCLE: multiple sequence alignment with high accuracy and high throughput,” Nucleic Acids Research, vol. 32, no. 5, pp. 1792–1797, 2004.
- M. Suyama, D. Torrents, and P. Bork, “PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments,” Nucleic Acids Research, vol. 34, pp. W609–W612, 2006.
- F. Wright, “The “effective number of codons” used in a gene,” Gene, vol. 87, no. 1, pp. 23–29, 1990.
- T. Ota and M. Nei, “Variance and covariances of the numbers of synonymous and nonsynonymous substitutions per site,” Molecular Biology and Evolution, vol. 11, no. 4, pp. 613–619, 1994.
- J. M. Smith, “Analyzing the mosaic structure of genes,” Journal of Molecular Evolution, vol. 34, no. 2, pp. 126–129, 1992.
- I. B. Jakobsen and S. Easteal, “A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences,” Computer Applications in the Biosciences, vol. 12, no. 4, pp. 291–295, 1996.
- M. N. Price, P. S. Dehal, and A. P. Arkin, “Fasttree: computing large minimum evolution trees with profiles instead of a distance matrix,” Molecular Biology and Evolution, vol. 26, no. 7, pp. 1641–1650, 2009.
- Z. Yang, “PAML 4: phylogenetic analysis by maximum likelihood,” Molecular Biology and Evolution, vol. 24, no. 8, pp. 1586–1591, 2007.
- Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: a practical and powerful approach to multiple testing,” Journal of the Royal Statistical Society B, vol. 57, no. 1, pp. 289–300, 1995.
- R Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, 2014.
- J. D. Storey and R. Tibshirani, “Statistical significance for genomewide studies,” Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 16, pp. 9440–9445, 2003.
- L. D. Bowler, Q. Y. Zhang, J. Y. Riou, and B. G. Spratt, “Interspecies recombination between the penA genes of Neisseria meningitidis and commensal Neisseria species during the emergence of penicillin resistance in N. meningitidis: natural events and laboratory simulation,” Journal of Bacteriology, vol. 176, no. 2, pp. 333–337, 1994.
- J. Zhou, L. D. Bowler, and B. G. Spratt, “Interspecies recombination, and phylogenetic distortions, within the glutamine synthetase and shikimate dehydrogenase genes of Neisseria meningitidis and commensal Neisseria species,” Molecular Microbiology, vol. 23, no. 4, pp. 799–812, 1997.
- E. Feil, J. Zhou, J. M. Smith, and B. G. Spratt, “A comparison of the nucleotide sequences of the adk and recA genes of pathogenic and commensal Neisseria species: evidence for extensive interspecies recombination within adk,” Journal of Molecular Evolution, vol. 43, no. 6, pp. 631–640, 1996.
- E. C. Holmes, R. Urwin, and M. C. J. Maiden, “The influence of recombination on the population structure and evolution of the human pathogen Neisseria meningitidis,” Molecular Biology and Evolution, vol. 16, no. 6, pp. 741–749, 1999.
- M. C. J. Maiden, J. A. Bygraves, E. Feil et al., “Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms,” Proceedings of the National Academy of Sciences of the United States of America, vol. 95, no. 6, pp. 3140–3145, 1998.
- M. Gouy and C. Gautier, “Codon usage in bacteria: correlation with gene expressivity,” Nucleic Acids Research, vol. 10, no. 22, pp. 7055–7074, 1982.
- A. Carbone, F. Képès, and A. Zinovyev, “Codon bias signatures, organization of microorganisms in codon space, and lifestyle,” Molecular Biology and Evolution, vol. 22, no. 3, pp. 547–561, 2005.
- H. Willenbrock and D. W. Ussery, “Prediction of highly expressed genes in microbes based on chromatin accessibility,” BMC Molecular Biology, vol. 8, article 11, 2007.
- I. K. Jordan, I. B. Rogozin, Y. I. Wolf, and E. V. Koonin, “Essential genes are more evolutionarily conserved than are nonessential genes in bacteria,” Genome Research, vol. 12, no. 6, pp. 962–968, 2002.
- M. Anisimova, R. Nielsen, and Z. Yang, “Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites,” Genetics, vol. 164, no. 3, pp. 1229–1236, 2003.
- R. H. Orsi, Q. Sun, and M. Wiedmann, “Genome-wide analyses reveal lineage specific contributions of positive selection and recombination to the evolution of Listeria monocytogenes,” BMC Evolutionary Biology, vol. 8, no. 1, article 233, 2008.
- C. J. Saveson and S. T. Lovett, “Tandem repeat recombination induced by replication fork defects in Escherichia coli requires a novel factor, RadC,” Genetics, vol. 152, no. 1, pp. 5–13, 1999.
- S. T. Lovett, C. Luisi-DeLuca, and R. D. Kolodner, “The genetic dependence of recombination in recD mutants of Escherichia coli,” Genetics, vol. 120, no. 1, pp. 37–45, 1988.
- P. S. Andersen, D. Frees, R. Fast, and B. Mygind, “Uracil uptake in Escherichia coli K-12: isolation of uraA mutants and cloning of the gene,” Journal of Bacteriology, vol. 177, no. 8, pp. 2008–2013, 1995.
- D. Perkins-Balding, M. Ratliff-Griffin, and I. Stojiljkovic, “Iron transport systems in Neisseria meningitidis,” Microbiology and Molecular Biology Reviews, vol. 68, no. 1, pp. 154–171, 2004.
- I. Stojiljkovic, V. Hwa, L. de Saint Martin et al., “The Neisseria meningitidis haemoglobin receptor: Its role in iron utilization and virulence,” Molecular Microbiology, vol. 15, no. 3, pp. 531–541, 1995.
- P. van der Ley, J. van der Biezen, R. Sutmuller, P. Hoogerhout, and J. T. Poolman, “Sequence variability of FrpB, a major iron-regulated outer-membrane protein in the pathogenic neisseriae,” Microbiology, vol. 142, no. 11, part 1, pp. 3269–3274, 1996.
- M. Beucher and P. F. Sparling, “Cloning, sequencing, and characterization of the gene encoding FrpB, a major iron-regulated, outer membrane protein of Neisseria gonorrhoeae,” Journal of Bacteriology, vol. 177, no. 8, pp. 2041–2049, 1995.
Copyright © 2014 Dong Yu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.