Microbial Diversity for BiotechnologyView this Special Issue
Review Article | Open Access
Microbial Diversity in the Era of Omic Technologies
Human life and activity depends on microorganisms, as they are responsible for providing basic elements of life. Although microbes have such a key role in sustaining basic functions for all living organisms, very little is known about their biology since only a small fraction (average 1%) can be cultured under laboratory conditions. This is even more evident when considering that >88% of all bacterial isolates belong to four bacterial phyla, the Proteobacteria, Firmicutes, Actinobacteria, and Bacteroidetes. Advanced technologies, developed in the last years, promise to revolutionise the way that we characterize, identify, and study microbial communities. In this review, we present the most advanced tools that microbial ecologists can use for the study of microbial communities. Innovative microbial ecological DNA microarrays such as PhyloChip and GeoChip that have been developed for investigating the composition and function of microbial communities are presented, along with an overview of the next generation sequencing technologies. Finally, the Single Cell Genomics approach, which can be used for obtaining genomes from uncultured phyla, is outlined. This tool enables the amplification and sequencing of DNA from single cells obtained directly from environmental samples and is promising to revolutionise microbiology.
Microbes are essential for every part of the human life on Earth as they are responsible for converting the key elements of life—carbon, nitrogen, oxygen, and sulfur—into forms accessible to all other living things . Even more interestingly, the majority of the photosynthetic capacity of the planet does not depend on plants but on microbes . Microbial communities are closely associated with plants and animals making necessary nutrients, metals, and vitamins available to their hosts. For humans, the billions of gut microbes assist us to digest food, break down toxins, and fight off pathogens . Humanity not only depends on microbes for nutritional and health reasons but also for cleaning up pollutants in the environment, such as oil and chemical spills . Amazingly, these activities are not carried out by individual microbes but by microbial communities that can adapt and excel even under extreme environmental changes. These communities can live under extreme conditions, at pH level, pressure, and temperatures, in which no other organism can survive. This has been achieved through numerous strategies that have been developed by microbes for survival. Their genomes contain a vast array of biochemical transformations, and the microbial cells have accumulated DNA changes over a period of billion years of environmental change and evolution .
Human civilization has been greatly improved by the development of numerous technologies that have their source in microbes. For instance, they are being used to produce a vast array of antibiotics and drugs for clinical use, to remediate pollutants in soil and water, to produce biofuels, to enhance and protect agricultural crops, and to ferment human foods and even they are used as markers for the detection of diseases [2, 3].
Comparative analysis of ribosomal RNA (rRNA) sequences implied that all of the cellular life belonged to one of the three domains, namely, Bacteria, Archaea, and Eukarya . This enabled the definition of the major lineages (phyla or divisions) within the three primary domains . Microbes are the most diverse group on Earth on the basis of phylogeny and functionality, occupying every conceivable niche. The vast majority of these organisms are characterized through culture-independent molecular surveys using conserved marker genes like the small subunit ribosomal RNA or more recently the shotgun sequencing (metagenomics) [6, 7]. Although microbes are such an important group, the characterization, identification, and quantification remain an immense challenge even with the conventional molecular tools [8, 9].
The most important revolution in microbial ecology was the use of DNA sequencing in phylogenetic studies and the application of this technology to uncultured organisms in the 1980s and the 1990s. This transformed microbiology revealed that the prokaryotic diversity was vastly underestimated with the current classical cultivation-based techniques [4, 6, 7, 10, 11]. In the last ten years, metagenomic projects have been combined with next generation sequencing (NGS) technologies. This has boosted microbial ecology forward in a very fast pace [1, 12, 13]. Current NGS technologies provide a throughput, which is at least 100 times that of classical Sanger sequencing, and the technologies are quickly improving [14–16]. This makes NGS one of the hottest topics in biological sciences. With the assistance of the newly developed discipline of metagenomics and the high-throughput sequencing technologies, scientists can now unravel the mysteries of the life of the still uncultured microorganisms [6, 17]. The use of metagenomic approaches led to the discovery of a large array of new genes and enabled the genome sequence of various uncultured microbes [18–20]. This is true for low to medium complex ecosystems. In highly diverse environments, metagenomic approaches have not been so successful since assembly is extremely challenging due to the highly heterogeneity of these samples. A way to overcome this bottleneck is to use a single cell genomics approach that has been recently developed and allows the genome analysis of individual community members [21, 22].
Furthermore, powerful high-throughput tools can be provided with the use of phylogenetic oligonucleotide and functional gene arrays [23–25]. The so-called phylogenetic oligonucleotide arrays (POAs-PhyloChip) use a short oligonucleotide design against a phylogenetic marker gene (such as the 16S rRNA gene). They target polymerase chain reaction (PCR) amplified rRNA gene fragments, or directly retrieved community rRNA (genes) and can, at least in principle, be designed to detect any microorganism . In contrast, functional gene arrays (FGAs) detect selected genes or gene families that encode key enzymes that are diagnostic for a certain metabolic pathway [27–29]. Therefore, these arrays are confined to diversity analysis of selected microbial guilds, while PhyloChips are best suited for detecting changes in the taxonomic composition of microbial populations.
In this review, we present a synopsis on (a) the functional gene arrays and phylogenetic oligonucleotide arrays that are currently available to the scientific community, (b) the next generation sequencing technologies with an emphasis on 16S rRNA amplicon pyrosequencing and, (c) single cell genomics, a new approach to study the microbial dark matter.
2. PhyloChip and Functional Gene Arrays
DNA microarrays are able to detect microbial sequences from any sample in a parallel and very fast, high throughput way. This technology has found applications across most sectors of life sciences, including environmental microbiology and microbial ecology . Microbial ecological microarrays have been developed for investigating the composition and functions of microbial communities in environmental niches . This tool is valuable in bacterial diversity studies since a single array can contain thousands of DNA sequences with a high degree of specificity .
The small subunit ribosomal RNA gene (16S rRNA) is the biomarker of choice for characterizing complex microbial communities [12, 33]. This biomarker is extensively used for phylogenetic analysis since it contains highly conserved and variable regions that allow a reliable and detailed microbial classification. The most comprehensive POA is the PhyloChip, which uses the Affymetrix format (Santa Clara, CA) [34–36]. The PhyloChip is updated regularly, and currently two versions of the DNA microarray are available. The development of the second generation PhyloChip (G2) started in 2002 and it became available in 2006. The PhyloChip G3 is currently available through the Second Genome Inc. PhyloChip G2 contains 297,851 perfect match (PM) and mismatch (MM) 16S rRNA gene probes for the detection of 842 subfamilies or 8,741 taxa, covering 121 bacterial and archaeal orders . The remaining 209,093 probes are control probes or pathogen detection probes (Table 1) [25, 34, 38].
|GeoChip contains genes targeting human microbiomes in 139 functional gene families with 36,062 probes.|
The design of PhyloChip G2 was based on over 30,000 16S rRNA gene sequences retrieved from the “Greengenes” database in March 2002 . Chimeric sequences were filtered out and, subsequently, they were aligned resulting in 8,935 clusters (OTUs), which contained approximately 0–3% sequence divergence. The region selected for probe design was flanked on both sides by universally conserved segments that are used as PCR priming sites. These sites can be used to amplify bacterial and/or archaeal genomic materials. The probe selection strategy was to obtain an effective set of probes capable of correctly categorizing mixed amplicons into their proper OTUs. To correctly identify each OTU, a set of 11 or more specific 25-mers (probes) was designed. These probes were prevalent in members of a given OTU but dissimilar from sequences outside the given OTU. Probes that were complementary to target sequences were selected and termed perfect match (PM) probes. Each PM probe was paired with a control 25-mer termed mismatching (MM) probe, identical in all positions except the 13th base. The mismatching probe did not contain a central 17-mer complementary to sequences in any OTU. The target probe and MM probe constitute a probe pair analyzed together. The PhyloChip G2 is arranged in a grid of 732 columns and rows allowing the placement of 506,944 features. For the design of PhyloChip G3, a similar approach was adapted . The Greengenes database was used and the filtered rRNA gene sequences were clustered to enable selection of perfectly complementary probes representing each sequence of a cluster. Amplicons that were containing 17-mers with sequence identity to a cluster were positioned in that cluster. This analysis gave rise to 59,959 clusters, each capturing an average of 0.5% sequence divergence (Table 1). These clusters were considered as operational taxonomic units (OTUs). The PhyloChip G3 provides an analysis spanning in 2 domains, 147 phyla, 1,123 classes, 1,219 orders, and 10,993 subfamilies .
One of the main advantages that the PhyloChip provides is the great sensitivitythat it delivers. A typical 16S PCR reaction with a yield of approximately 500 ng of amplicons provides more than 600 billion sequences, which allows even the less abundant populations to be tracked in addition to the dominant ones. Also, the PhyloChip has been shown to reveal greater diversity within a community when compared with rRNA Sanger sequencing clone libraries due to the placement of the entire gene product on the microarray compared with the analysis of up to thousands of individual molecules by traditional sequencing methods . The main disadvantage of this technology is that the design of the PhyloChip, and in general of the DNA microarrays, does not allow the discovery and the characterization of novel taxa. This is true for the functional gene arrays as well since novel functions cannot be identified through this approach.
The PhyloChips G2 and G3 have been shown to provide identification resolution at the family to subfamily levels [34, 37, 40], and they have been used in over 80 publications. This technology has been used to successfully describe the microbial profile in a vast spectrum of complex ecosystems like solar salterns, industrial waste, olive-mill waste marine environments, coral reefs, air craft particulate air, soil, plant tissues, and various human microbiota [23, 25, 35, 36, 39, 41–44].
Utilization of the PhyloChip in olive-mill waste revealed a cultivar-dependent microbial profile . With the implementation of the PhyloChip, a broader diversity was identified dominated by members of all classes of Proteobacteria, Firmicutes, Bacteroidetes, Chloroflexi, Cyanobacteria and Actinobacteria, while members of the phyla Acidobacteria, Planctomycetes, Gemmatimonadetes, and Verrucomicrobia and the candidate divisions OP3 (Omnitrophica ), TM7, AD3, marine group A (Marinimicrobia ), and SPAM were minor constituents of the bacterial biota.
The PhyloChip was used to associate microbial communities in aerosols . Samples were collected over a 17-week period in San Antonio and Austin. Both sites were a part of a biosurveillance effort to detect bioterrorism threads. A diverse group of microorganisms associated with aerosol was detected with the PhyloChip. Sequences similar to or related to potential pathogens including Campylobacteraceae, Helicobacteraceae, and Francisella-like and bacteria related to Bacillus anthracis, Rickettsia, and Clostridium were identified.
Functional Gene Arrays (FGAs) are a special type of DNA microarrays containing probes for key genes involved in microbial functional processes. FGAs are composed of probes for key genes, involved in microbial functional processes of interest [29, 46, 47]. This type of array allows the simultaneous examination of many functional genes unlike PCR-based techniques that limit the number of genes that can be examined at one time [29, 46–48]. FGAs are especially useful for the study of environmental samples since the precise functions are not known due to the lack of cultured microorganisms and the high degree of diversity and metabolic flexibility that exists in microbial communities .
GeoChip 3.0 is the most comprehensive DNA microarray currently available for studying microbial communities associated with biogeochemical cycling, global climate change, bioenergy, agriculture, land use, ecosystem management, environmental cleanup and restoration, bioreactor systems, and human health. The design of GeoChip 3.0 involved the use of 56,990 gene sequences from 292 functional genes utilizing 27,812 probes. Eight degenerate probes for the 16S rRNA gene were used for positive controls, while 672 unique probes designed from hypothetical genes of seven sequenced genomes of hyperthermophiles were used as negative controls . In GeoChip 3.0, 292 key enzymes/genes were used to target a variety of microbial mediated processes. In brief, a total of 41 enzymes/genes are selected to detect different functional processes of the carbon cycle, and 16 enzymes/genes are targeting the nitrogen cycling processes. Four enzymes/genes are used to detect the sulfur cycling of microbial communities, while three enzymes were targeting the phosphorus cycling of microbial communities. The enzymes hydrogenase and cytochrome detect energy metabolism processes of microbial communities, while a total of 41 genes/enzymes cover resistance mechanisms for Ag, Al, As, Cd, Co, Cr, Cu, Hg, Ni, Pb, Se, Te and Zn .
In GeoChip, the ability to monitor degradation pathways is also incorporated with a total of 173 genes/enzymes being used to detect and monitor the degradation of 86 organic contaminants commonly found in the environment [24, 51]. Finally, eleven genes for antibiotic resistance were also included.
Based on GeoChip 3.0, the latest generation, GeoChip 4.0 in the NimbleGen format has been developed, which not only contains functional categories from GeoChip 3.0 but also includes additional functional categories, such as genes involved in stress responses, bacterial phages, and virulence [50, 52].
It can be postulated that future FGAs will be more comprehensive for the survey of diverse microbial communities, and at the same time they will be more specific for the detection and identification of microbial communities for particular ecosystems or functional processes of interest .
The microbial communities of a Gulf of Mexico coastal salt marsh were recently examined during and after the influx of petroleum hydrocarbons following the Deepwater Horizon oil spill using PhyloChip and GeoChip microarray analyses . The abundance of phyla containing previously described hydrocarbon-degrading bacteria like Proteobacteria, Bacteroidetes, and Actinobacteria, increased in hydrocarbon-contaminated sediments and decreased once the hydrocarbon was below detection. Interestingly, the functional genes involved in the hydrocarbon degradation were enriched in hydrocarbon-contaminated sediments. Once the hydrocarbon concentration was reduced, the detection of functional genes involved in degrading alkanes, cycloalkanes, aromatic carboxylic acids, chlorinated aromatics, polycyclic aromatics, and other aromatics decreased significantly.
In conclusion, DNA microarrays, that use multiple rRNAs or other phylogenetic markers, can be deployed to track variations in population structure and community function over time and space. The use of DNA microarrays based on selected genes (and gene variants) that are involved in interesting processes can be used to assess a community’s ability to perform a collective function, such as biodegradation of contaminants, and monitor, for example, during bioremediation changes over relevant periods.
3. Next Generation Sequencing Technologies
The beginning of the modern DNA sequencing era began with the completion of the first draft of the human genome [54–56]. This was the turning point that led to further innovation and improved development of new advanced strategies of high-throughput DNA sequencing. These technologies are called the next generation sequencing (NGS), and this is the term that we are going to use throughout this review. The main principle in NGS involves DNA molecules that are being sequenced in a massively parallel fashion in a flow cell. The sequencing is conducted either in a continuous real-time manner or in a stepwise iterative process. During this highly parallel process, each clonal template or single molecule is independently sequenced and can be counted among the total sequences generated .
At the moment, six platforms from the second and the third generation sequencing technologies are available with most platforms requiring short template DNAs (200–1000 bp) and with each template containing a forward and reverse primer binding site .
The GS FLX Pyrosequencer utilizes next generation sequencing technology known as pyrosequencing. The technique was first developed by Pal Nyren and his student Mostafa Ronhaghi at the Royal Institute of Technology in 1996 [59, 60] and is now available through Roche 454 Life Technologies. Pyrosequencing uses beads and starts with a single template molecule, which is amplified with emulsion PCR (emPCR). Millions of beads after the emPCR are loaded onto a picolitre plate, which is specially designed so that each well can hold only a single bead. The beads are sequenced in a parallel way by flowing pyrosequencing reagents across the plate. During pyrosequencing, the DNA is synthesized under a complex reaction that includes ATP sulfurylase, luciferase enzymes, adenosine 5′ phosphosulfate and luciferin substrates. These are incorporated in such a way that the pyrophosphate group releases upon addition of a nucleotide that results in the production of detectable light [61, 62]. The amount of light produced with pyrosequencing is proportional to the number of nucleotides incorporated. The FLX instrument provides 100 flows of each nucleotide during an 8 h run. This produces an average read length of 250 nucleotides. Analysis software examines the raw reads using various quality filters for removing poor quality sequences and mixed sequences that contain more than one initial DNA fragment per bead. Sequences that do not contain the initiating TCGA sequence are removed through the quality control test. The filtered reads yield approximately 100 Mb of quality data. It is accepted that FLX reads are of adequate length to assemble small genomes such as bacterial and viral genomes to high quality and contiguity . For the specifications of the 454 technology please see Table 2.
|2 Gb for the MiSeq and 600 Gb for the HiSeq2000.|
Solexa developed the second commercial NGS platform. Solexa was subsequently acquired by Illumina and is now known by the name Illumina. Roche/454 and Illumina engage the principle of sequencing by synthesis. Illumina uses a solid glass surface that is very similar to a microscope slide, for capturing individual molecules and bridge PCR to amplify DNA into small clusters of identical molecules. These clusters at the end are sequenced with a strategy that is equivalent to Sanger sequencing. The difference lies in the use of dye-labelled terminators. 3′-O-Fluorophore-labeled nucleotides are used as reversible terminators of DNA polymerization. This reversible terminator ensures that, in one step, only one nucleotide can be incorporated. After the template is flooded with nucleotides and the binding step is accomplished, the unincorporated reagents are washed away and another round of dye-labelled terminators are added . When compared with 454 sequencing, the Illumina sequencing technology achieves (a) much higher throughput (~1.5 Gbp/run) at the cost of significantly smaller read lengths and (b) high accuracy with error rates of less than 1% (Table 2). The sequencing approach is not affected by homopolymer runs to the same extent as the 454 technology .
The third commercial NGS technology was developed by SOLiD, and it is using ligation to determine sequences. Until recently, SOLiD was producing more reads than Illumina. Read lengths for SOLiD are user defined and range between 25 and 35 bp, while each sequencing run yields between 2 and 4 Gb of DNA sequence data [66, 67].
Ion Torrent uses a sequencing strategy similar to the 454, except that (i) hydrogen ions () are detected (instead of a pyrophosphatase cascade) and (ii) sequencing chips conform to common design and manufacturing standards used for commercial microchips . No cameras, lasers or fluorescent dyes are required with the Ion Torrent technology, while the common microchip design standards means that low-cost manufacturing can be used. Ion Torrent was purchased by Life Technologies in 2010. The first early instruments, the Ion Personal Genome Machine (PGM), was deployed in late 2010, while in September 2012, the Ion Proton was launched which is capable of producing larger outputs. Field effect transistors (FETs) are used to measure a change in pH in a microwell structure. To increase the throughput, the Ion Torrent sequencing chip makes use of a highly dense microwell array in which each well acts as an individual DNA polymerization reaction chamber. These chambers contain a DNA polymerase and a sequencing fragment. Below this layer of microwells an ion-sensitive layer is present, followed by a sublayer composed of a highly dense FET array aligned with the microwell array. Sequential cycling of the four nucleotides into the microwells enables primary sequence resolution since the FET detector senses the change in pH created during nucleotide incorporation and converts this signal to a recordable voltage change. While this method of ion sensing-based sequencing by synthesis offers great potential to reduce the cost of sequencing, there are several limitations with regards to sequencing complete genomes. The Ion PGM was mainly targeting small genomes given the output capability of the instrument (currently up to 1 Gb). The newly launched Ion Proton uses larger chips with higher densities and is said to be able to generate 10 Gb per run (Table 2). These characteristics make the technology suitable for exome and whole genome sequencing. Currently, the short read lengths place a burden on the reassembly process and limit the assembly of de novo sequencing projects due to an inability to read through long repetitive regions in the genome. Finally, error accumulation can occur if reaction wells are not properly purged between reaction steps, with an error rate of 1.78% being reported for Ion Torrent [63, 69].
The first commercial single-molecule sequencer (3rd generation) has been developed by Helicos. The high cost of the instruments and short read lengths unfortunately limited adoption of this platform, and at the moment Helicos no longer sells instruments; instead it conducts sequencing via a service centre model.
PacBio has developed an instrument that sequences individual DNA molecules in real time . Individual DNA polymerases are attached to the bottom of 50 nm wide wells that are termed zero-mode waveguides (ZMWs). Each polymerase is allowed to carry out second strand DNA synthesis in the presence of γ-phosphate fluorescently-labeled nucleotides. The width of the ZMW is such that light cannot proliferate through the waveguide, but energy can penetrate a short distance and excite the fluorophores attached to those nucleotides that are in the vicinity of the polymerase at the bottom of the well. As each base is incorporated, a distinctive pulse of fluorescence is detected in real time. The first instruments were deployed in late 2010. The low cost per experiment and fast run times have generated much enthusiasm for this platform, especially among investors. Although high accuracy can be achieved through circular consensus sequencing, which involves sequencing shorter templates multiple times, this instrument generates single-pass reads that average less than 85% nucleotide accuracy .
For each technology, there is a trade-off between advantages and disadvantages. The 454 technology delivers the longest read length but with the lowest throughput (8 MB/h during a 9 h run—Table 2) and suffers from errors in homopolymeric tracts, even when assembles are at high coverage. MiSeq (Illumina) generates the highest throughput per run and lowest error rate of the instruments but delivers shorter read lengths than those of the 454. Ion Torrent currently produces short reads and the worst performance with homopolymers, although the new chemistry has improved performance. Ion Torrent delivers the fastest throughput (80–100 Mb/h—Table 2) and shortest run time of an approx. 3 h. This platform has also shown the greatest improvement in performance in recent months [69, 71].
4. 16S rRNA Pyrosequencing and Hypervariable Regions
Using the 16S ribosomal RNA gene as a phylogenetic marker was a real breakthrough for microbial ecology studies, with several culture-independent methods being developed since Pace et al.  proposed the direct cloning of environmental DNA. PCR-based molecular techniques enabled the description of microbial taxonomic diversity: (a) by means of fingerprinting methods, which separate rDNA fragments according to their length and/or nucleotide composition like denaturing/temperature gradient gel electrophoresis (DGGE/TGGE) , restriction fragment length polymorphisms (RFLP) , terminal restriction fragment length polymorphism (T-RFLP) , single-strand conformation polymorphism (SSCP) , and automated rRNA intergenic spacer analysis (ARISA); (b) by microscopy using FISH (fluorescence in situ hybridization) and derived methods (CARD-FISH, MAR-FISH); and (c) by cloning 16S rRNA gene fragments and subsequently sequencing the clones following the Sanger sequencing method. It is true that fingerprinting technologies enable the processing of many samples, but they are inadequate for taxonomic identification and suffer from a lack of resolution. Finally, cloning/sequencing and FISH are not compatible with high-throughput approaches.
Cloning and sequencing of the 16S ribosomal RNA gene using conserved broad-range PCR primers was and still is the most common molecular approach for estimating the microbial diversity. But, with the development of NGS technologies, direct sequencing of PCR amplicons became feasible [77, 78]. It is true that the rapid development of sequencing technologies has opened a new dimension in biodiversity analysis, but the most critical step for an accurate rDNA amplicon analysis remains the correct choice of primers and the hypervariable regions that will be targeted [78, 79].
The 16S rRNA gene in bacteria is comprised of interspersed conserved and variable sequences including eight (8) hypervariable regions (V1–V8) (Figure 1). The eight hypervariable regions spanned nucleotides 69–99, 137–242, 433–497, 576–682, 822–879, 986–1043, 1117–1173, and 1243–1294 for V1 through V8, respectively [23, 80]. These hypervariable regions range in size from approximately 50 to 100 bases in length, while sequences differ with respect to variation and corresponding utility for universal microbial identification (Figure 1). Hypervariable regions of the 16S rRNA gene are flanked by conserved sequences (Figure 1) and this enables the design of “universal” PCR primers that can amplify 16S rRNA hypervariable regions from a large number of different bacteria species [39, 80, 81] (see Table 3).
The most critical step for accurate characterization of bacterial and archaeal communities using rDNA amplicon analysis is the choice of primers. Using suboptimal primers pairs will lead to underrepresentation or underselection against single species or even whole groups which can lead to questionable biological conclusions . Different hypervariable regions evolve at different rates, and different species of the same genus may be similar in some hypervariable regions and more divergent in others . Primer bias occurs when the selected primers do not anneal to the DNA from all members of the community equally, but preferentially amplify certain taxonomic groups . This can lead to the failure in detecting some bacterial/archaeal species since in rare biospheres bacteria and archaea can never be identified if the employed primers are not applicable to them. This will lead to incomplete surveys in metagenomic studies .
It has been shown that sequences of 500–700 bp are required for phylogenetic discrimination at the species levels [84, 85]. With the NGS technologies, fragments of up to 700 bp are being sequenced regularly with investigations supporting that use of the V1, V2, and V3 regions for deep sequencing and characterization of bacterial and archaeal sequences [86, 87]. Others suggest that regions generated using primer pairs 8F-338R and 967F-1046R for V6 overestimate species richness and promote the V4–V6 generated using primer pairs 530F-805R, 805F-1046R and 967F-1220R as the most appropriate [82, 88] (Table 3). Also, fragments encompassing the V3, V7, and V7+V8 hypervariable regions (generated using primer pairs 338F-530R, 1046F-1220R, and 1046F-1392R) underestimated species richness  (Table 3). Recent studies demonstrated that the V7-V8 fragments achieve better microbial community coverage from a complex ecosystem . It is highly recommended to use primers that are targeting two regions of the 16S rRNA gene in all deep-sequencing efforts when trying to characterize highly heterogeneous microbial communities [81, 88], although good representation of a microbial community has been achieved by targeting single hypervariable regions .
5. NGS Amplicon Sequencing in Microbial Ecology
In the recent years, mass sequencing of environmental samples has been the leading approach for microbial ecology studies. Irrespective of the ecosystem studied, the vast majority of the studies deployed the 454 pyrosequencing platform, although Illumina-based studies are in the increase.
Soil bacterial diversity was examined using NGS technologies, and this approach revealed that the agricultural management of the soil can influence the diversity of bacteria and archaea [90–92]. The pH is the principal diversity driver for both Bacteria and Archaea [92–94] with the Archaea being highly correlated only with pH. The fungal community composition was less strongly affected by pH . Soil fungal diversity was the focus of other studies in forest and agricultural ecosystems. Analysis using ITS amplicons revealed that in forests most species belong to the Dikarya subkingdom (Ascomycota and Basidiomycota), with the Agaricomycetes being the most dominant fungal class . In agricultural ecosystems, it has been revealed that the diversity of the fungal community declines with soil depth with communities forming distinct groups among the strata .
Marine environments have been used in studies with NGS technologies. Analysis of the 18S rRNA gene identified members from all six eukaryotic supergroups. It also revealed that the eukaryotic microbiota was dominated by dinoflagellates and close relatives which demonstrates the importance of this group to marine ecosystems . The use of 18S rDNA pyrosequencing enabled the characterization of uncultured eukaryotes like flagellates, which are known as MArine STramenopiles (MAST) . Finally, the use of 16S rRNA tag pyrosequencing analysis from a temperate marine coastal site over a period of 6 years, suggested that seasonal changes in environmental variables are more important than trophic interactions .
NGS technologies have been applied in freshwater environmental samples. Monchy et al.  used cloning/sequencing and SSU tag pyrosequencing to study the fungal diversity in freshwater lake ecosystems. This study indicated that geographical, physical, and chemical factors of the biotope influence the species community structure and spatial variability. In a very interesting study by Logares et al. , the bacterioplankton communities in a unique system of coastal Antarctic lakes exposed to progressive long-term environmental change was examined using 454 pyrosequencing of the 16S rDNA gene (V3-V4 regions). Progressive long-term salinity change appears to have promoted the diversification of bacterioplankton communities by modifying the composition of ancestral communities and by allowing the establishment of new taxa . NGS technologies are more robust in describing the structuring of understudied or highly divergent populations. For instance, new putative clades belonging to Mamiellophyceae, Foraminifera, Dictyochophyceae, and Euglenida were recently detected in eight freshwater ecosystems using rDNA pyrotag data .
Air microbial diversity has been recently studied using pyrosequencing technologies in the New York City subway platforms and associated sites . Eukaryotic diversity was mainly fungal, dominated by organisms of types associated with wood rot. Bacterial diversity was dominated by human skin bacterial species including Staphylococcus epidermidis (the most abundant and prevalent commensal of the human integument), S. hominis, S. cohnii, S. caprae, and S. haemolyticus, while no organisms of public health concern were identified .
6. Single Cell Genomics
Recent estimates predict that the number of microbial species in the world are well into millions, and based on the rRNA phylogeny, these species fall within approximately 60 major lines of descent within the bacterial and archaeal domains [33, 104]. From the 60 major lines, at least half have no cultivated representatives, and they are called “candidate” phyla. Even from the phyla with culturable representatives, 88% belong to only four bacterial phyla, the Proteobacteria, Firmicutes, Actinobacteria, and Bacteroidetes. One approach for sequencing candidate phyla is by deploying a metagenomic approach, thus obtaining genome sequences from the microbial dark mater through direct sequencing of DNA from microbial communities . The use of such approach enabled the draft to complete genome recovery from candidate divisions: (a) WWE1/Cloacimonetes (Wastewater Evry 1) with the Cloacamonas acidaminovorans, (b) NC10 with the Methylomirabilis oxyfera, (c) OP1/Acetothermia with the Acetothermum autotrophicum, and (d) from Korarchaeota the Korachaeum cryptofilum [105–108]. With the development of new bioinformatic tools in combination with deep metagenomic sequencing low abundant genomes have been recovered including members of candidate phyla like OP11 (Microgenomates), OD1 (Parcubacteria), and GNO2 (Gracilibacteria) .
Another approach that can be deployed in order to obtain genomic data from candidate divisions is the single cell genomics technique. With this approach, cells from any environmental sample can be isolated, and after amplification the DNA can be sequenced . In more detail, almost any environmental sample can be processed immediately or stored in the presence of betaine or glycerol so that the integrity of the cells be preserved . The next step involves cell separation and this is currently achieved with the use of fluorescence-activated cell sorting (FACS) [111–113]. In comparison to micromanipulation, FACS minimizes the risk of contamination since a few picoliters of sample are sorted each time. Cell lysis follows (Figure 2) with the most effective method being the alkaline lysis . Whole Genome Amplification can be achieved through the Multiple Displacement Amplification (MDA) approach producing long, overlapping amplicons that can be used with the NGS technologies. In silico DNA normalization and specialized software have been developed to counteract the drawbacks of MDA [111, 115]. Before the genome sequencing of the Single Amplified Genomes (SAGs) using NGS technologies, a PCR step can be included for screening purposes. Amplification and Sanger sequencing of the 16S rRNA enables phylogenetic characterization of the SAGs (Figure 2). The recovery of genomic information from single cells varies from 0% up to complete genomes and depends on the intrinsic properties of the cell and on the components of the SCG pipeline deployed. For instance, the above described approach has been successful to target members of the candidate phyla TM7, OP11 (Microgenomates) and Poribacteria [116–118] and led to the recovery of genomic sequences of microorganisms from several deep-branching phylogenetic groups with no cultured representatives. Other examples include Picobiliphytes and divergent groups of aquatic Proteobacteria, Flavobacteria, and Archaea [111, 112, 119–122].
With the single cell genomics (SCG) approach, for, first time, a direct link between phylogenetic and metabolic markers of uncultured bacteria and archaea is possible. A recent example is the discovery of chemolithoautotrophical pathways in uncultured Proteobacteria , which reconciliate the current discrepancies in dark ocean’s carbon budget. SCG is also capable of producing reference genomes of the uncultured microorganisms, enabling the study of complex ecosystems. For instance, Single Cell Genomics have been deployed to investigate biogeographic distribution of uncultured marine Flavobacteria, and marine bacterioplankton that were involved in the degradation of hydrocarbons during the Deepwater Horizon oil spill [112, 123].
In a recent study by Rinke et al. , an SCG approach was deployed targeting 201 uncultivated archaeal and bacterial cells that belong to 29 major mostly uncharted branches of the tree of life, the so-called “microbial dark matter.” Sequencing of 201 single amplified genomes enabled the resolution of many intra- and interphylum-level relationships and enabled the characterization of new superphyla. The first one, Terrabacteria, comprises terrestrial bacterial phyla of Actinobacteria, Cyanobacteria, Thermi (Deinococcus-Thermus), Chloroflexi, Firmicutes, and Armatimonadetes. This superphylum comprises monoderm (single membrane) and atypical lineages. The second superphylum is the Patescribacteria (patesco (Latin), meaning bare), which reflects the reduced metabolic capacities of these lineages. Finally, the superphylum DPANN was identified composed by the Diapherotrites (pMC2A384), Parvarchaeota, Aenigmarchaeota (DSEG), and Nanohaloarchaeota. Finally, substantive genomic data for 11 bacterial candidate divisions and several highly divergent archaeal groups related to Nanoarchaeota were resolved as monophyletic groups. This enabled the proposition of names for these candidate divisions based on their inferred physiology and distinguishing properties (Table 4).
SCG generates a whole new way for exploiting bacterial diversity since, to date, biotechnological applications rely almost exclusively on the part of the microbial world that can be cultured, that is, less than 1% of the microbial diversity. Although metagenomic-based bioprospecting provides an alternative [124–126], the main advantage that the single cell genomics approach offers is that rather than individual genes of the uncultured microorganisms the whole genome is sequenced. This approach enables the construction of complex metabolic pathways, ensuring that all discovered genes are originating from the same cell. Some early examples of biotechnological exploitations through the use of the single cell genomic technique include recoveries of polyketide biosynthesis pathways from sponge symbionts [118, 127] and the discovery of uncultured microorganisms that degrade specific macromolecules and fix CO2 through chemoautotrophy [111, 128, 129].
The study of the microbial diversity is important for understanding the link between diversity, community structure, and function. The most advanced technologies in this quest are DNA microarrays, NGS, and single cell genomics. DNA microarrays provide a fast and high-throughput approach for the parallel detection of microbes from any sample. The most comprehensive phylogenetic oligonucleotide array is the PhyloChip, which uses the Affymetrix format with a current analysis of 59,959 OTUs. GeoChip 3.0 is an advanced Functional Gene Array with a resolution of 292 key enzymes/genes containing more than 28,000 probes. These DNA microarrays revolutionized the way that complex microbial communities are being studied. The development of the new-generation sequencing technologies challenged the use of DNA microarrays in microbial community studies. It appears that in low to medium complexity ecosystems use of the NGS technologies is the most promising approach. In highly complex ecosystems, the sequence-based technologies suffer from random sampling, under sampling, and rRNA interference [130–132]. DNA microarrays like PhyloChip and GeoChip are excellent tools when highly complex ecosystems are examined due to their unique features like (a) rapid output, (b) quick sample preparation, (c) community comparison, (d) quick data analysis, and (d) resistance to contaminants.
NGS technologies will continue to improve both accuracy and throughput with benchtop sequencers becoming the standard equipment in individual labs. We believe that amplicon analysis will become in the near future a quick screening technique, preliminary to more detailed metagenomic studies, rather than the final stage in ecological analysis.
SCG provides the ability to read genetic information at the basic level of biological organization. The capacity to sequence any genomic region of an uncultured cell provides for the first time a direct link between phylogenetic and metabolic markers. The power of SCG has been demonstrated by revealing metabolic features and in situ interactions that have not been able to be characterized before with any other molecular approach. SCG offers a unique opportunity to obtain genomic information from major uncultivated microbial lineages.
Finally, we believe that new approaches exploiting NGS technologies and Single Cell Genomics jointly will be developed targeting genomes and associating them with quantitative measurements. This approach can target all active members (abundant and rare) of a given ecosystem, measure the transcribed genes, and obtain the full genome.
This work was partially supported by EU PEOPLE-2012-IAPP 324349, by the Hellenic Ministry of Education ARCHIMEDES 409-12, and by intramural funds of the University of Patras to George Tsiamis.
- J. C. Venter, K. Remington, J. F. Heidelberg et al., “Environmental genome shotgun sequencing of the Sargasso Sea,” Science, vol. 304, no. 5667, pp. 66–74, 2004.
- National Research Council (US) Committee on Metagenomics: Challenges and Functional Applications, The New Science of Metagenomics: Revealing the Secrets of Our Microbial Planet, National Academies Press, Washington, DC, USA, 2007.
- J. Handelsman, “Metagenomics: application of genomics to uncultured microorganisms,” Microbiology and Molecular Biology Reviews, vol. 68, no. 4, pp. 669–685, 2004.
- C. R. Woese and G. E. Fox, “Phylogenetic structure of the prokaryotic domain: the primary kingdoms,” Proceedings of the National Academy of Sciences of the United States of America, vol. 74, no. 11, pp. 5088–5090, 1977.
- C. R. Woese, O. Kandler, and M. L. Wheelis, “Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya,” Proceedings of the National Academy of Sciences of the United States of America, vol. 87, no. 12, pp. 4576–4579, 1990.
- J. Rajendhran and P. Gunasekaran, “Microbial phylogeny and diversity: small subunit ribosomal RNA sequence analysis and beyond,” Microbiological Research, vol. 166, no. 2, pp. 99–110, 2011.
- J. A. Gilbert and C. L. Dupont, “Microbial metagenomics: beyond the genome,” Annual Review of Marine Science, vol. 3, pp. 347–371, 2011.
- D. Wu, P. Hugenholtz, K. Mavromatis et al., “A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea,” Nature, vol. 462, no. 7276, pp. 1056–1060, 2009.
- N. C. Kyrpides, “Fifteen years of microbial genomics: meeting the challenges and fulfilling the dream,” Nature Biotechnology, vol. 27, no. 7, pp. 627–632, 2009.
- D. J. Lane, B. Pace, and G. J. Olsen, “Rapid determination of 16S ribosomal RNA sequences for phylogenetic analyses,” Proceedings of the National Academy of Sciences of the United States of America, vol. 82, no. 20, pp. 6955–6959, 1985.
- J. L. Stein, T. L. Marsh, K. Y. Wu, H. Shizuya, and E. F. Delong, “Characterization of uncultivated prokaryotes: isolation and analysis of a 40-kilobase-pair genome fragment from a planktonic marine archaeon,” Journal of Bacteriology, vol. 178, no. 3, pp. 591–599, 1996.
- S. G. Tringe and P. Hugenholtz, “A renaissance for the pioneering 16S rRNA gene,” Current Opinion in Microbiology, vol. 11, no. 5, pp. 442–446, 2008.
- D. B. Rusch, A. L. Halpern, G. Sutton et al., “The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific,” PLoS Biology, vol. 5, no. 3, article e77, 2007.
- R. Drmanac, A. B. Sparks, M. J. Callow et al., “Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays,” Science, vol. 327, no. 5961, pp. 78–81, 2010.
- E. R. Mardis, “The impact of next-generation sequencing technology on genetics,” Trends in Genetics, vol. 24, no. 3, pp. 133–141, 2008.
- M. L. Metzker, “Sequencing technologies the next generation,” Nature Reviews Genetics, vol. 11, no. 1, pp. 31–46, 2010.
- J. Handelsman, M. R. Rondon, S. F. Brady, J. Clardy, and R. M. Goodman, “Molecular biological access to the chemistry of unknown soil microbes: a new frontier for natural products,” Chemistry and Biology, vol. 5, no. 10, pp. R245–R249, 1998.
- G. W. Tyson, J. Chapman, P. Hugenholtz et al., “Community structure and metabolism through reconstruction of microbial genomes from the environment,” Nature, vol. 428, no. 6978, pp. 37–43, 2004.
- D. Chivian, E. L. Brodie, E. J. Alm et al., “Environmental genomics reveals a single-species ecosystem deep within earth,” Science, vol. 322, no. 5899, pp. 275–278, 2008.
- T. Woyke, H. Teeling, N. N. Ivanova et al., “Symbiosis insights through metagenomic analysis of a microbial consortium,” Nature, vol. 443, no. 7114, pp. 950–955, 2006.
- F. B. Dean, J. R. Nelson, T. L. Giesler, and R. S. Lasken, “Rapid amplification of plasmid and phage DNA using Phi29 DNA polymerase and multiply-primed rolling circle amplification,” Genome Research, vol. 11, no. 6, pp. 1095–1099, 2001.
- R. S. Lasken, “Single-cell genomic sequencing using multiple displacement amplification,” Current Opinion in Microbiology, vol. 10, no. 5, pp. 510–516, 2007.
- T. C. Hazen, E. A. Dubinsky, T. Z. DeSantis et al., “Deep-sea oil plume enriches indigenous oil-degrading bacteria,” Science, vol. 330, no. 6001, pp. 204–208, 2010.
- Z. He, Y. Deng, J. D. Van Nostrand et al., “GeoChip 3.0 as a high-throughput tool for analyzing microbial community composition, structure and functional activity,” ISME Journal, vol. 4, no. 9, pp. 1167–1179, 2010.
- C. A. Kellogg, Y. M. Piceno, L. M. Tom, T. Z. DeSantis, D. G. Zawada, and G. L. Andersen, “PhyloChip microarray comparison of sampling methods used for coral microbial ecology,” Journal of Microbiological Methods, vol. 88, no. 1, pp. 103–109, 2012.
- T. Z. DeSantis, I. Dubosarskiy, S. R. Murray, and G. L. Andersen, “Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA,” Bioinformatics, vol. 19, no. 12, pp. 1461–1468, 2003.
- L. Wu, D. K. Thompson, G. Li, R. A. Hurt, J. M. Tiedje, and J. Zhou, “Development and evaluation of functional gene arrays for detection of selected genes in the environment,” Applied and Environmental Microbiology, vol. 67, no. 12, pp. 5780–5790, 2001.
- J. Adamczyk, M. Hesselsoe, N. Iversen et al., “The isotope array, a new tool that employs substrate-mediated labeling of rrna for determination of microbial community structure and function,” Applied and Environmental Microbiology, vol. 69, no. 11, pp. 6875–6887, 2003.
- P. Dennis, E. A. Edwards, S. N. Liss, and R. Fulthorpe, “Monitoring gene expression in mixed microbial communities by using DNA microarrays,” Applied and Environmental Microbiology, vol. 69, no. 2, pp. 769–778, 2003.
- L. Bodrossy and A. Sessitsch, “Oligonucleotide microarrays in microbial diagnostics,” Current Opinion in Microbiology, vol. 7, no. 3, pp. 245–254, 2004.
- T. Majtán, G. Bukovská, and J. Timko, “DNA microarrays—techniques and applications in microbial systems,” Folia Microbiologica, vol. 49, no. 6, pp. 635–664, 2004.
- V. Torsvik and L. Øvreås, “Microbial diversity and function in soil: from genes to ecosystems,” Current Opinion in Microbiology, vol. 5, no. 3, pp. 240–245, 2002.
- P. Hugenholtz and N. C. Kyrpides, “A changing of the guard: genomics update,” Environmental Microbiology, vol. 11, no. 3, pp. 551–553, 2009.
- E. L. Brodie, T. Z. DeSantis, J. P. Moberg Parker, I. X. Zubietta, Y. M. Piceno, and G. L. Andersen, “Urban aerosols harbor diverse and dynamic bacterial populations,” Proceedings of the National Academy of Sciences of the United States of America, vol. 104, no. 1, pp. 299–304, 2007.
- G. Tsiamis, K. Katsaveli, S. Ntougias et al., “Prokaryotic community profiles at different operational stages of a Greek solar saltern,” Research in Microbiology, vol. 159, no. 9-10, pp. 609–627, 2008.
- G. Tsiamis, G. Tzagkaraki, A. Chamalaki et al., “Olive-mill wastewater bacterial communities display a cultivar specific profile,” Current Microbiology, vol. 64, no. 2, pp. 197–203, 2012.
- E. L. Brodie, T. Z. DeSantis, D. C. Joyner et al., “Application of a high-density oligonucleotide microarray approach to study bacterial population dynamics during uranium reduction and reoxidation,” Applied and Environmental Microbiology, vol. 72, no. 9, pp. 6288–6298, 2006.
- D.-Y. Lee, K. Shannon, and L. A. Beaudette, “Detection of bacterial pathogens in municipal wastewater using an oligonucleotide microarray and real-time quantitative PCR,” Journal of Microbiological Methods, vol. 65, no. 3, pp. 453–467, 2006.
- K. Katsaveli, D. Vayenas, G. Tsiamis, and K. Bourtzis, “Bacterial diversity in Cr(VI) and Cr(III)-contaminated industrial wastewaters,” Extremophiles, vol. 16, no. 2, pp. 285–296, 2012.
- T. Z. DeSantis, E. L. Brodie, J. P. Moberg, I. X. Zubieta, Y. M. Piceno, and G. L. Andersen, “High-density universal 16S rRNA microarray analysis reveals broader diversity than typical clone library when sampling the environment,” Microbial Ecology, vol. 53, no. 3, pp. 371–383, 2007.
- T. M. Korves, Y. M. Piceno, L. M. Tom et al., “Bacterial communities in commercial aircraft high-efficiency particulate air (HEPA) filters assessed by PhyloChip analysis,” Indoor Air, vol. 23, no. 1, pp. 50–61, 2013.
- F. Reith, J. Brugger, C. M. Zammit et al., “Influence of geogenic factors on microbial communities in metallogenic Australian soils,” The ISME Journal, vol. 6, pp. 2107–2118, 2012.
- U. S. Sagaram, K. M. Deangelis, P. Trivedi, G. L. Andersen, S.-E. Lu, and N. Wang, “Bacterial diversity analysis of huanglongbing pathogen-infected citrus, using phyloChip arrays and 16S rRNA gene clone library sequencing,” Applied and Environmental Microbiology, vol. 75, no. 6, pp. 1566–1574, 2009.
- M. J. Cox, Y. J. Huang, K. E. Fujimura et al., “Lactobacillus casei abundance is associated with profound shifts in the infant gut microbiome,” PLoS One, vol. 5, no. 1, Article ID e8745, 2010.
- C. Rinke, P. Schwientek, A. Sczyrba et al., “Insights into the phylogeny and coding potential of microbial dark matter,” Nature, vol. 499, no. 7459, pp. 431–437, 2013.
- C. Palmer, E. M. Bik, M. B. Eisen et al., “Rapid quantitative profiling of complex microbial populations,” Nucleic Acids Research, vol. 34, no. 1, p. e5, 2006.
- J.-C. Cho and J. M. Tiedje, “Bacterial species determination from DNA-DNA hybridization by using genome fragments and DNA microarrays,” Applied and Environmental Microbiology, vol. 67, no. 8, pp. 3677–3682, 2001.
- E. A. Greene and G. Voordouw, “Analysis of environmental microbial communities by reverse sample genome probing,” Journal of Microbiological Methods, vol. 53, no. 2, pp. 211–219, 2003.
- J. D. Van Nostrand, Z. He, and J. Zhou, “Dynamics of microbes in the natural setting: development of the Geochip,” in Environmental Microbiology, K. Sen and N. J. Ashbolt, Eds., Caister Academic, Norfolk, UK, 2011.
- Z. He, Y. Deng, and J. Zhou, “Development of functional gene microarrays for microbial community analysis,” Current Opinion in Biotechnology, vol. 23, no. 1, pp. 49–55, 2012.
- J. Xie, Z. He, X. Liu et al., “GeoChip-based analysis of the functional gene diversity and metabolic potential of microbial communities in acid mine drainage,” Applied and Environmental Microbiology, vol. 77, no. 3, pp. 991–999, 2011.
- Z. He, J. D. Van Nostrand, and J. Zhou, “Applications of functional gene microarrays for profiling microbial communities,” Current Opinion in Biotechnology, vol. 23, no. 3, pp. 460–466, 2012.
- M. J. Beazley, R. J. Martinez, S. Rajan et al., “Microbial community analysis of a coastal salt marsh affected by the Deepwater Horizon oil spill,” PLoS One, vol. 7, no. 7, Article ID e41305, 2012.
- G. Yamey, “Scientists unveil first draft of human genome,” British Medical Journal, vol. 321, no. 7252, p. 7, 2000.
- J. Craig Venter, M. D. Adams, E. W. Myers et al., “The sequence of the human genome,” Science, vol. 291, no. 5507, pp. 1304–1351, 2001.
- E. S. Lander, L. M. Linton, B. Birren et al., “Initial sequencing and analysis of the human genome,” Nature, vol. 409, no. 6822, pp. 860–921, 2001.
- C. S. Pareek, R. Smoczynski, and A. Tretyn, “Sequencing technologies and genome sequencing,” Journal of Applied Genetics, vol. 52, no. 4, pp. 413–435, 2011.
- T. P. Niedringhaus, D. Milanova, M. B. Kerby, M. P. Snyder, and A. E. Barron, “Landscape of next-generation sequencing technologies,” Analytical Chemistry, vol. 83, no. 12, pp. 4327–4341, 2011.
- M. Ronaghi, S. Karamohamed, B. Pettersson, M. Uhlén, and P. Nyrén, “Real-time DNA sequencing using detection of pyrophosphate release,” Analytical Biochemistry, vol. 242, no. 1, pp. 84–89, 1996.
- M. Ronaghi, M. Uhlén, and P. Nyrén, “A sequencing method based on real-time pyrophosphate,” Science, vol. 281, no. 5375, pp. 363–365, 1998.
- M. Margulies, M. Egholm, W. E. Altman et al., “Genome sequencing in microfabricated high-density picolitre reactors,” Nature, vol. 437, no. 7057, pp. 376–380, 2005.
- E. R. Mardis, “A decade's perspective on DNA sequencing technology,” Nature, vol. 470, no. 7333, pp. 198–203, 2011.
- O. Morozova, M. Hirst, and M. A. Marra, “Applications of new sequencing technologies for transcriptome analysis,” Annual Review of Genomics and Human Genetics, vol. 10, pp. 135–151, 2009.
- H. L. Harris, L. J. Brennan, B. A. Keddie, and H. R. Braig, “Bacterial symbionts in insects: balancing life and death,” Symbiosis, vol. 51, no. 1, pp. 37–53, 2010.
- N. Whiteford, T. Skelly, C. Curtis et al., “Swift: primary data analysis for the Illumina Solexa sequencing platform,” Bioinformatics, vol. 25, no. 17, pp. 2194–2199, 2009.
- F. Tang, C. Barbacioru, Y. Wang et al., “mRNA-Seq whole-transcriptome analysis of a single cell,” Nature Methods, vol. 6, no. 5, pp. 377–382, 2009.
- N. Cloonan, A. R. R. Forrest, G. Kolle et al., “Stem cell transcriptome profiling via massive-scale mRNA sequencing,” Nature Methods, vol. 5, no. 7, pp. 613–619, 2008.
- J. M. Rothberg, W. Hinz, T. M. Rearick et al., “An integrated semiconductor device enabling non-optical genome sequencing,” Nature, vol. 475, no. 7356, pp. 348–352, 2011.
- M. A. Quail, M. Smith, P. Coupland et al., “A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers,” BMC Genomics, vol. 13, article 341, 2012.
- J. Eid, A. Fehr, J. Gray et al., “Real-time DNA sequencing from single polymerase molecules,” Science, vol. 323, no. 5910, pp. 133–138, 2009.
- N. J. Loman, R. V. Misra, T. J. Dallman et al., “Performance comparison of benchtop high-throughput sequencing platforms,” Nature Biotechnology, vol. 30, no. 5, pp. 434–439, 2012.
- N. R. Pace, D. A. Stahl, D. J. Lane, and G. J. Olsen, “The analysis of natural microbial populations by ribosomal RNA sequences,” ASM News, vol. 51, no. 4, pp. 4–12, 1985.
- G. Muyzer, “DGGE/TGGE a method for identifying genes from natural ecosystems,” Current Opinion in Microbiology, vol. 2, no. 3, pp. 317–322, 1999.
- G. Laguerre, M.-R. Allard, F. Revoy, and N. Amarger, “Rapid identification of rhizobia by restriction fragment length polymorphism analysis of PCR-amplified 16S rRNA genes,” Applied and Environmental Microbiology, vol. 60, no. 1, pp. 56–63, 1994.
- J. Dunbar, L. O. Ticknor, and C. R. Kuske, “Assessment of microbial diversity in four Southwestern United States soils by 16S rRNA gene terminal restriction fragment analysis,” Applied and Environmental Microbiology, vol. 66, no. 7, pp. 2943–2950, 2000.
- D.-H. Lee, Y.-G. Zo, and S.-J. Kim, “Nonradioactive method to study genetic profiles of natural bacterial communities by PCR-single-strand-conformation polymorphism,” Applied and Environmental Microbiology, vol. 62, no. 9, pp. 3112–3120, 1996.
- D. Medini, D. Serruto, J. Parkhill et al., “Microbiology in the post-genomic era,” Nature Reviews Microbiology, vol. 6, no. 6, pp. 419–430, 2008.
- A. Fabrice and R. Didier, “Exploring microbial diversity using 16S rRNA high-throughput methods,” Journal of Computer Science and Systems Biology, pp. 074–092, 2009.
- P. D. Schloss, D. Gevers, and S. L. Westcott, “Reducing the effects of PCR amplification and sequencing Artifacts on 16s rRNA-based studies,” PLoS One, vol. 6, no. 12, Article ID e27310, 2011.
- B. J. Baker and J. F. Banfield, “Microbial communities in acid mine drainage,” FEMS Microbiology Ecology, vol. 44, no. 2, pp. 139–152, 2003.
- C. W. Nossa, W. E. Oberdorf, L. Yang et al., “Design of 16S rRNA gene primers for 454 pyrosequencing of the human foregut microbiome,” World Journal of Gastroenterology, vol. 16, no. 33, pp. 4135–4144, 2010.
- N. Youssef, C. S. Sheik, L. R. Krumholz, F. Z. Najar, B. A. Roe, and M. S. Elshahed, “Comparison of species richness estimates obtained using nearly complete fragments and simulated pyrosequencing-generated fragments in 16S rRNA gene-based environmental surveys,” Applied and Environmental Microbiology, vol. 75, no. 16, pp. 5227–5236, 2009.
- Y. Wang and P.-Y. Qian, “Conservative fragments in bacterial 16S rRNA genes and primer design for 16S ribosomal DNA amplicons in metagenomic studies,” PLoS One, vol. 4, no. 10, Article ID e7401, 2009.
- B. J. Paster, S. K. Boches, J. L. Galvin et al., “Bacterial diversity in human subgingival plaque,” Journal of Bacteriology, vol. 183, no. 12, pp. 3770–3783, 2001.
- J. E. Clarridge III, “Impact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases,” Clinical Microbiology Reviews, vol. 17, no. 4, pp. 840–862, 2004.
- B. Wang, J. Wang, W. Zhang, and D. R. Meldrum, “Application of synthetic biology in cyanobacteria and algae,” Frontiers in Microbiology, vol. 3, article 344, 2012.
- S. Chakravorty, D. Helb, M. Burday, N. Connell, and D. Alland, “A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria,” Journal of Microbiological Methods, vol. 69, no. 2, pp. 330–339, 2007.
- P. S. Kumar, M. R. Brooker, S. E. Dowd, and T. Camerlengo, “Target region selection is a critical determinant of community fingerprints generated by 16S Pyrosequencing,” PLoS One, vol. 6, no. 6, Article ID e20956, 2011.
- S. Vasileiadis, E. Puglisi, M. Arena, F. Cappa, P. S. Cocconcelli, and M. Trevisan, “Soil bacterial diversity screening using single 16S rRNA gene V regions coupled with multi-million read generating sequencing technologies,” PLoS One, vol. 7, no. 8, Article ID e42671, 2012.
- L. F. W. Roesch, R. R. Fulthorpe, A. Riva et al., “Pyrosequencing enumerates and contrasts soil microbial diversity,” ISME Journal, vol. 1, no. 4, pp. 283–290, 2007.
- V. Acosta-Martínez, S. Dowd, Y. Sun, and V. Allen, “Tag-encoded pyrosequencing analysis of bacterial diversity in a single soil type as affected by management and land use,” Soil Biology and Biochemistry, vol. 40, no. 11, pp. 2762–2770, 2008.
- S. Vasileiadis, E. Puglisi, M. Arena et al., “Soil microbial diversity patterns of a lowland spring environment,” FEMS Microbiology Ecology, 2013.
- J. Rousk, E. Bååth, P. C. Brookes et al., “Soil bacterial and fungal communities across a pH gradient in an arable soil,” ISME Journal, vol. 4, no. 10, pp. 1340–1352, 2010.
- H. Nacke, A. Thürmer, A. Wollherr et al., “Pyrosequencing-based assessment of bacterial community structure along different management types in German forest and grassland soils,” PLoS One, vol. 6, no. 2, Article ID e17000, 2011.
- M. Buée, M. Reich, C. Murat et al., “454 Pyrosequencing analyses of forest soils reveal an unexpectedly high fungal diversity,” New Phytologist, vol. 184, no. 2, pp. 449–456, 2009.
- A. Jumpponen, K. L. Jones, and J. Blair, “Vertical distribution of fungal communities in tallgrass prairie soil,” Mycologia, vol. 102, no. 5, pp. 1027–1041, 2010.
- T. Stoeck, D. Bass, M. Nebel et al., “Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water,” Molecular Ecology, vol. 19, no. 1, pp. 21–31, 2010.
- R. Logares, S. Audic, S. Santini, M. C. Pernice, C. de Vargas, and R. Massana, “Diversity patterns and activity of uncultured marine heterotrophic flagellates unveiled with pyrosequencing,” ISME Journal, vol. 6, no. 10, pp. 1823–1833, 2012.
- J. A. Gilbert, J. A. Steele, J. G. Caporaso et al., “Defining seasonal marine microbial community dynamics,” ISME Journal, vol. 6, no. 2, pp. 298–308, 2012.
- S. Monchy, G. Sanciu, M. Jobard et al., “Exploring and quantifying fungal diversity in freshwater lake ecosystems using rDNA cloning/sequencing and SSU tag pyrosequencing,” Environmental Microbiology, vol. 13, no. 6, pp. 1433–1453, 2011.
- R. Logares, E. S. Lindström, S. Langenheder et al., “Biogeography of bacterial communities exposed to progressive long-term environmental change,” The ISME Journal, vol. 7, no. 5, pp. 937–948, 2013.
- N. Taib, J. F. Mangot, I. Domaizon, G. Bronner, and D. Debroas, “Phylogenetic affiliation of SSU rRNA genes generated by massively parallel sequencing: new insights into the freshwater protist diversity,” PLoS One, vol. 8, article e58950, no. 3, 2013.
- C. E. Robertson, L. K. Baumgartner, J. K. Harris et al., “Culture-independent analysis of aerosol microbiology in a metropolitan subway system,” Applied and Environmental Microbiology, vol. 79, no. 11, pp. 3485–3493, 2013.
- C. Pedrós-Alió, “Marine microbial diversity: can it be determined?” Trends in Microbiology, vol. 14, no. 6, pp. 257–263, 2006.
- E. Pelletier, A. Kreimeyer, S. Bocs et al., “‘Candidatus Cloacamonas acidaminovorans’: genome sequence reconstruction provides a first glimpse of a new bacterial division,” Journal of Bacteriology, vol. 190, no. 7, pp. 2572–2579, 2008.
- K. F. Ettwig, M. K. Butler, D. Le Paslier et al., “Nitrite-driven anaerobic methane oxidation by oxygenic bacteria,” Nature, vol. 464, no. 7288, pp. 543–548, 2010.
- J. G. Elkins, M. Podar, D. E. Graham et al., “A korarchaeal genome reveals insights into the evolution of the Archaea,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 23, pp. 8102–8107, 2008.
- H. Tamaki, Y. Tanaka, H. Matsuzawa et al., “Armatimonas rosea gen. nov., sp. nov., of a novel bacterial phylum, Armatimonadetes phyl. nov., formally called the candidate phylum OP10,” International Journal of Systematic and Evolutionary Microbiology, vol. 61, no. 6, pp. 1442–1447, 2011.
- K. C. Wrighton, B. C. Thomas, I. Sharon et al., “Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla,” Science, vol. 337, no. 6102, pp. 1661–1665, 2012.
- T. Kalisky and S. R. Quake, “Single-cell genomics,” Nature Methods, vol. 8, no. 4, pp. 311–314, 2011.
- B. K. Swan, M. Martinez-Garcia, C. M. Preston et al., “Potential for chemolithoautotrophy among ubiquitous bacteria lineages in the dark ocean,” Science, vol. 333, no. 6047, pp. 1296–1300, 2011.
- T. Woyke, G. Xie, A. Copeland et al., “Assembling the marine metagenome, one cell at a time,” PLoS One, vol. 4, no. 4, Article ID e5299, 2009.
- T. Woyke, D. Tighe, K. Mavromatis et al., “One bacterial cell, one complete genome,” PLoS One, vol. 5, no. 4, Article ID e10314, 2010.
- A. Raghunathan, H. R. Ferguson Jr., C. J. Bornarth, W. Song, M. Driscoll, and R. S. Lasken, “Genomic DNA amplification from a single bacterium,” Applied and Environmental Microbiology, vol. 71, no. 6, pp. 3342–3347, 2005.
- K. Leung, H. Zahn, T. Leaver et al., “A programmable droplet-based microfluidic device applied to multiparameter analysis of single microbes and microbial communities,” Proceedings of the National Academy of Sciences of the United States of America, vol. 109, no. 20, pp. 7665–7670, 2012.
- M. Podar, C. B. Abulencia, M. Walcher et al., “Targeted access to the genomes of low-abundance organisms in complex microbial communities,” Applied and Environmental Microbiology, vol. 73, no. 10, pp. 3205–3214, 2007.
- N. H. Youssef, P. C. Blainey, S. R. Quake, and M. S. Elshahed, “Partial genome assembly for a candidate division OP11 single cell from an anoxic spring (Zodletone spring, Oklahoma),” Applied and Environmental Microbiology, vol. 77, no. 21, pp. 7804–7814, 2011.
- A. Siegl, J. Kamke, T. Hochmuth et al., “Single-cell genomics reveals the lifestyle of Poribacteria, a candidate phylum symbiotically associated with marine sponges,” ISME Journal, vol. 5, no. 1, pp. 61–70, 2011.
- H. S. Yoon, D. C. Price, R. Stepanauskas et al., “Single-cell genomics reveals organismal interactions in uncultivated marine protists,” Science, vol. 332, no. 6030, pp. 714–717, 2011.
- R. Ghai, L. Pašić, A. B. Fernández et al., “New abundant microbial groups in aquatic hypersaline environments,” Scientific Reports, vol. 1, 135, 2011.
- H. Chitsaz, J. L. Yee-Greenbaum, G. Tesler et al., “Efficient de novo assembly of single-cell bacterial genomes from short-read data sets,” Nature Biotechnology, vol. 29, no. 10, pp. 915–922, 2011.
- P. C. Blainey, A. C. Mosier, A. Potanina, C. A. Francis, and S. R. Quake, “Genome of a low-salinity ammonia-oxidizing archaeon determined by single-cell and metagenomic analysis,” PLoS One, vol. 6, no. 2, Article ID e16626, 2011.
- O. U. Mason, T. C. Hazen, S. Borglin et al., “Metagenome, metatranscriptome and single-cell sequencing reveal microbial response to Deepwater Horizon oil spill,” The ISME Journal, vol. 6, no. 9, pp. 1715–1727, 2012.
- M. Hess, A. Sczyrba, R. Egan et al., “Metagenomic discovery of biomass-degrading genes and genomes from cow rumen,” Science, vol. 331, no. 6016, pp. 463–467, 2011.
- P. Lorenz and J. Eck, “Metagenomics and industrial applications,” Nature Reviews Microbiology, vol. 3, no. 6, pp. 510–516, 2005.
- P. D. Schloss and J. Handelsman, “Biotechnological prospects from metagenomics,” Current Opinion in Biotechnology, vol. 14, no. 3, pp. 303–310, 2003.
- A. Siegl and U. Hentschel, “PKS and NRPS gene clusters from microbial symbiont cells of marine sponges by whole genome amplification,” Environmental Microbiology Reports, vol. 2, no. 4, pp. 507–513, 2010.
- M. Hess, A. Sczyrba, R. Egan et al., “Metagenomic discovery of biomass-degrading genes and genomes from cow rumen,” Science, vol. 331, no. 6016, pp. 463–467, 2011.
- M. Martinez-Garcia, D. M. Brazel, B. K. Swan et al., “Capturing single cell genomes of active polysaccharide degraders: an unexpected contribution of verrucomicrobia,” PLoS One, vol. 7, no. 4, Article ID e35314, 2012.
- M. A. Moran, “Metatranscriptomics: eavesdropping on complex microbial communities,” Microbe, vol. 4, no. 7, pp. 329–335, 2009.
- J. Zhou, L. Wu, Y. Deng et al., “Reproducibility and quantitation of amplicon sequencing-based detection,” ISME Journal, vol. 5, no. 8, pp. 1303–1313, 2011.
- J. Zhou, S. Kang, C. W. Schadt, and C. T. Garten Jr., “Spatial scaling of functional gene diversity across various microbial taxa,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 22, pp. 7768–7773, 2008.
- S. Turner, K. M. Pryer, V. P. W. Miao, and J. D. Palmer, “Investigating deep phylogenetic relationships among cyanobacteria and plastids by small subunit rRNA sequence analysis,” Journal of Eukaryotic Microbiology, vol. 46, no. 4, pp. 327–338, 1999.
- D. J. Lane, “16S/23S rRNA sequencing,” in Nucleic Acid Techniques in Bacterial Systematics, E. Stackebrandt and M. Goodfellow, Eds., pp. 115–175, John Wiley & Son, New York, NY, USA, 1991.
- M. T. Suzuki and S. J. Giovannoni, “Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR,” Applied and Environmental Microbiology, vol. 62, no. 2, pp. 625–630, 1996.
- J. C. Makemson, N. R. Fulayfil, W. Landry et al., “Shewanella woodyi sp. nov., an Exclusively respiratory luminous bacterium isolated from the Alboran sea,” International Journal of Systematic Bacteriology, vol. 47, no. 4, pp. 1034–1039, 1997.
- N. Boon, W. De Windt, W. Verstraete, and E. M. Top, “Evaluation of nested PCR-DGGE (denaturing gradient gel electrophoresis) with group-specific 16S rRNA primers for the analysis of bacterial communities from different wastewater treatment plants,” FEMS Microbiology Ecology, vol. 39, no. 2, pp. 101–112, 2002.
- H. Heuer, K. Hartung, G. Wieland, I. Kramer, and K. Smalla, “Polynucleotide probes that target a hypervariable region of 16S rRNA genes to identify bacterial isolates corresponding to bands of community fingerprints,” Applied and Environmental Microbiology, vol. 65, no. 3, pp. 1045–1049, 1999.
- J. Jonasson, M. Olofsson, and H.-J. Monstein, “Classification, identification and subtyping of bacteria based on pyrosequencing and signature matching of 16S rDNA fragments,” APMIS, vol. 110, no. 3, pp. 263–272, 2002.
Copyright © 2013 Sofia Nikolaki and George Tsiamis. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.