Computational Approaches for Microalgal Biofuel Optimization: A Review

Koussa, Joseph; Chaiboonchoe, Amphun; Salehi-Ashtiani, Kourosh

doi:https://doi.org/10.1155/2014/649453

BioMed Research International

On this page

Abstract Introduction Conclusion Acknowledgments References Copyright Related Articles

Special Issue

Renewable Energy and Alternative Fuel Technologies

View this Special Issue

Review Article | Open Access

Volume 2014 | Article ID 649453 | https://doi.org/10.1155/2014/649453

Computational Approaches for Microalgal Biofuel Optimization: A Review

Joseph Koussa,¹Amphun Chaiboonchoe,¹and Kourosh Salehi-Ashtiani¹

Academic Editor: Meisam Tabatabaei

Received06 Jun 2014

Revised28 Aug 2014

Accepted01 Sept 2014

Published21 Sept 2014

Abstract

The increased demand and consumption of fossil fuels have raised interest in finding renewable energy sources throughout the globe. Much focus has been placed on optimizing microorganisms and primarily microalgae, to efficiently produce compounds that can substitute for fossil fuels. However, the path to achieving economic feasibility is likely to require strain optimization through using available tools and technologies in the fields of systems and synthetic biology. Such approaches invoke a deep understanding of the metabolic networks of the organisms and their genomic and proteomic profiles. The advent of next generation sequencing and other high throughput methods has led to a major increase in availability of biological data. Integration of such disparate data can help define the emergent metabolic system properties, which is of crucial importance in addressing biofuel production optimization. Herein, we review major computational tools and approaches developed and used in order to potentially identify target genes, pathways, and reactions of particular interest to biofuel production in algae. As the use of these tools and approaches has not been fully implemented in algal biofuel research, the aim of this review is to highlight the potential utility of these resources toward their future implementation in algal research.

1. Introduction

Biofuel production from microalgae has been receiving attention as an alternative energy source due to its high biomass productivity and minimal land resource requirement. However, there is still a need to improve algal productivity in order to make algal-based bioproducts economically viable. Metabolic network reconstructions of algae can offer insight into genetic modification strategies that can be used to improve microalgal strains. A large number of computational tools have been developed, allowing a range of analyses and predictions, based on genetic and thermodynamic constraints embedded in in the network, to identify bioengineering strategies that can result in enhanced biofuel production of the engineered algal strain. Although a fair number of algal genomes have been fully sequenced, only a few metabolic network models have been reconstructed for these species, hampering algal bioengineering progress [1].

The utilities of metabolic network models span over several types of applications. On one hand, these models help contextualizing high throughput experimental data, for example, integrating gene expression data with metabolic pathways under different growth conditions [2]. Metabolic models can also unveil targets for metabolic engineering approaches, which can lead to increased production of target metabolites [3] or preferentially increase respiration rates [4]. On the other hand, with the availability of large and diverse biological data sets, metabolic network models can provide a framework to integrate such omics data and allow the formulation and testing of downstream hypotheses. Last, cross-species metabolic comparison represents one more utility of such reconstructions through which identification of differentially activated metabolic pathways can be achieved among other comparative analyses [5]. Herein we review the reconstruction of metabolic network models and major computational tools and pipelines that hold the potential to contribute to the optimization of algal strains for biofuel production. We describe a number of tools that remain mostly unused by the algal research community. This is reflected from the observation that only 7 algal-based PGDBs (Pathway/Genome Database) are available in Pathway Tools [6], while approximately 3,500 PGDBs are available for nonalgal species (please see below for more information). The use of some of the herein discussed tools, already applied to the multitude nonalgal organisms, ranging from human to E. coli, provides strategies for algal biofuels optimization with major enhancement potential.

2. Metabolic Network Model Reconstruction

Metabolic network reconstruction from genomic and large-scale experimental data can help understand and predict metabolic processes and pathways. A number of tools and databases have been developed specifically to facilitate metabolic network reconstruction. In addition, new analysis tools and approaches are being developed along with the expansion of relevant databases and resources. Table 1 presents some of the existing databases and tools for metabolic network reconstruction.

Metabolic network reconstruction requires information on gene-protein-reaction associations to reconstruct evidence-based, species-specific networks. Protein database resources and tools help to link information between enzymes, EC numbers, genes, proteins, pathways, and substrates. These include BRENDA [7], ExPASy [8], and UniProt (Universal Protein Resource) [9]. BRENDA (BRaunschweig ENzyme DAtabase) enzyme portal is the enzyme information system, which integrates information from seven databases to provide functional biochemical and molecular data. To explore and visualize metabolic networks as maps of metabolic pathways, a number of freely available pathway databases exist. For example, BioCyc, MetaCyc [10], KEGG (Kyoto Encyclopedia of Genes and Genomes) [11], Reactome [12], and BiGG [13] can be named. In turn, common metabolic reconstruction tools include COBRA (more specifically its rBioNet component) [14–16], Model SEED [17], and Pathway Tools [6].

Pathway Tools [6, 18] is an integrated software tool that can create in a semiautomated manner organism-specific network and pathways databases (called Pathway/Genome Database, or PGDB). The PGDBs are essentially knowledge bases that users can query and visualize. For instance, dead-end metabolite analysis and visualization of predicted reaction fluxes can be done easily under “cellular overview” option of the software (Figure 1(a)). A collection of approximately 3,530 PGDBs can be found in BioCyc, which users can visualize, manage, and analyze. Out of these 3,530 PGDBs, only 7 relate to algae (both prokaryotic and eukaryotic), namely, Thalassiosira pseudonana, Nannochloropsis gaditana, Acaryochloris marina, Anabaena cylindrica, Anabaena variabilis, Synechococcus elongatus, and Chlamydomonas reinhardtii. None of the aforementioned algal PGDBs are well-curated with most of them having had slight validation. One of the intensively curated PGDBs is MetaCyc [19–21], which serves as a generic knowledge base that organism-specific networks can be reconstructed from. Homo sapiens (HumanCyc), E. coli (EcoCyc), and Arabidopsis (AraCyc) are some examples of curated, species-specific knowledge bases that can be found in BioCyc (http://biocyc.org/). Kbase (http://kbase.us/) and Biomart [22] are other examples of knowledge bases and knowledge-management platforms that are freely available and allow integration and reconciliation of a variety of data sources.

(a)

(b)

(c)

Genome-scale metabolic reconstructions have continued to expand along with the increased availability of sequenced, annotated genomes. Recent reviews describe the timeline of the appearance of publicly available metabolic models since 1999 for eukaryotes, prokaryotes and archea, and the algorithms that were used [23, 24]. The processes require inputs from different databases and experimental validations. A standard procedure for the reconstruction of genome-scale metabolic networks has been described in detail by Thiele and Palsson [25].

The process of network reconstruction, starting from genome sequences to the finished reconstructed network, is generally time-consuming and labor-intensive. Therefore, automation of the process has been of interest. A limited number of software tools for automated reconstruction are currently available (some examples are given in Table 2); for instance, AUTOGRAPH [26], GEMSiRV [27], MicrobesFlux [28], MetRxn [29], Model SEED [17, 30], SuBliMinaL Toolbox [31], FAME [32], and RAVEN Toolbox [33] can be named. A systematic comparison between some of these platforms can be found in [34]. While draft metabolic models can be generated through such software tools, intensive manual curation is still needed to resolve errors; wrong assignments, fill gaps and reconcile inconsistencies in the generated network.

3. Pathway Visualization

Visualization is a powerful approach to leverage understanding of pathways and reconstructed metabolic networks. In metabolic networks, nodes represent metabolites and edges denote reactions. There are a number of web-based tools to visualize biochemical and metabolic pathways; for example, Biocarta (http://www.biocarta.com/), ExPaSy (Expert Protein Analysis System, http://www.expasy.org/), and KEGG (Kyoto Encyclopedia of Genes and Genomes) can be named; however, most are static pages with only a few resources allowing authorized users to edit the pathways. The advantages that BioCyc/MetaCyc offer compared to KEGG include the ability to carry out pathway analysis, operon prediction, or comparative pathway analysis (for more details see [35]) and visualize the results.

Cytoscape [36, 37] is a biological network visualization and data integration tool that can be used to visualize the results from FBA studies (please see Constraint Based Analysis section for information on FBA). CytoSEED [38] is a Cytoscape plug-in to visualize results from the Model SEED. Fluxviz [39] is another Cytoscape plug-in to visualize flux distribution in the molecular interaction network. VANTED [40, 41] is another data visualization and data integration tool which can be utilized as a stand-alone tool. FluxMap [42] and FBA-SimVis [43] are VANTED plug-in for visualization of metabolic flux after FBA analysis. In addition, Paint4net [44] is a tool to automatically generate maps of reaction fluxes in conjunction with COBRA toolbox (Figure 1(c)).

Most recently, MetDraw [45], a new tool for visualization of genome-scale metabolic networks, has been developed (Figure 1(b)). This tool is compatible with systems biology markup language (SBML) file inputs and allows export of the map image as SVG files. It also allows visualization of metabolomics and reaction fluxes added to gene-protein expression data and overlays all of them on the reconstructed network map. The range of file formats available for data export render the postmodification of the maps, with commonly used image editing software, a simple task.

Although the generation of metabolic network models has been gaining momentum, these models may not provide a complete or accurate representation of metabolism. Particularly, automated modeling has allowed the faster generation of network models, yet reconciliation between the model itself and the biochemical and genomic data is invariably needed. Such model refinements lead to a more accurate reconstruction, allowing more accurate downstream analyses. A common step in such reconstruction refinements is filling reaction gaps to decrease the numbers of dead-end metabolites and enhance the network connectivity. Several tools and algorithms have been set in place to address gap finding and gap filling in metabolic network reconstructions. Some of these tools include, but are not limited to, Gapfill, MEP, GrowMatch, BNICE, and the hole filler in Pathway tools.

4.1. Gapfind and Gapfill

These tools have been developed using two distinct algorithms that initially identify (Gapfind) what the authors have called a “no production” or “no consumption” metabolites [46] through analyzing the production or consumption fluxes in the metabolic model. Subsequently, the identified no production/consumption metabolites are considered as “gaps” and the Gapfill algorithm will attempt to fill them through four major ways. Initially, the algorithm will consider all of the available reactions in the model and reverse them; it will then attempt to import reactions that involve the metabolites from well-curated databases such as MetaCyc [10]. Lastly, it will attempt to fill these gaps by adding transport reactions either internal transport ones, as in from one cellular compartment to the other, or external transport reactions that can either take from or excrete to the extracellular medium.

4.2. MEP and Pathway Tools Hole Filler

On the other hand MEP and Pathway Tools hole filler represent an alternative approach that tackles the gap filling issue identifying missing genes rather than missing reactions, and these tools achieve this goal using expression data and species homology, respectively. As such, this will eventually lead to the expansion of the reconstructed model to include more genes and enzymes and possibly rewire the connectivity of the network [47, 48].

4.3. GrowMatch

This tool has been developed as a model refinement tool rather than a gap filler tool where the aim of such an application would be to reconcile inconsistencies between metabolic model predictions in silico and growth data in vivo. This computational tool can suggest suppression of specific genes to resolve what is referred to as Growth No Growth (GNG) inconsistencies and alternatively adds functionalities to genes to resolve No Growth/Growth (NGG) inconsistencies [49].

4.4. BNICE

It is a framework that considers specific pathways rather than the full-scale model and allows for the optimization of the pathways. It identifies all possible chemical compounds that can be produced by the reactions and enzymes of the pathway [50]. Although this tool is not a model refinement tool per se, the outcome of the pathway optimization can ultimately lead to provisional addition of compounds to the metabolic model and subsequent searches (independently from the tool) for the corresponding genes to provide genomic evidence for the pathway. This approach is similar in outcome to the Gapfind/Gapfill approach.

All of the above and many more tools are of critical importance in the manual curation of metabolic network models. Although the above-mentioned tools ultimately lead to a similar outcome, each may present unique advantages and has specific requirements (Table 3). The choice and use of such tools would thus lead to a higher quality reconstruction and most importantly a higher predictive power.

5. Constraint-Based Modeling, FBA, and Integration of Expression Data

Subsequent to generation of well-curated metabolic network models of organisms, several downstream applications can be used to explore the emergent system’s properties. Having a network set in place, the fluxes of each of the component reactions can be evaluated and moreover modified in an attempt to increase or decrease the production or consumption of key metabolites. In the case of algal biofuel optimization, it is of high interest to achieve directional overproduction of lipids that constitute the basis for algal biofuels. Making use of the known metabolic networks and via a constraint based modeling approach, the identification of genes, pathways, and knockout strategies, that interfere or alter, the expression profiles relevant to production of enzymes related to lipid synthesis and metabolites involved in lipid synthesis pathways is readily achievable. This can be done through a number of computational tools with the outcomes evaluated in silico using flux balance analysis (FBA) [51] and further validated by in vivo experiments.

FBA constrains the metabolite fluxes and their biochemical reactions by four main parameters: mass conservation, thermodynamics (reaction reversibility), steady state assumption for internal metabolite concentrations, and nutrient availability. Based on these constraints, reaction boundaries are set, and a system of linear differential equations is solved with a biologically relevant objective function optimized. The solution space for an FBA can be reduced in size by more constraints and boundaries imposed on reactions and fluxes where the optimal flux distribution achieving the optimized function is a feasible solution for the problem.

Some of the available tools and algorithms that are able to perform such tasks include (but are not restricted to) Optknock, Optstrain, Optflux, MTA, iMAT, BioMet toolbox, PROM, GIMME, E-Flux, MADE, SIMUP, and TIGER, with some allowing the integration of expression data to the metabolic model. These tools are described below.

5.1. GIMME, iMAT, and MADE

Gene inactivity moderated by metabolism and expression (GIMME) [52] is a tool that allows for the integration of expression data to metabolic networks yet optimizing the functionality of the model towards a set objective function by minimizing the use of reaction categorized as inactive. GIMME reduces the sets of reactions to a binary on/off mode whereas each reaction flux is compared to a set threshold and deemed “off” if the flux does not reach that value [53, 54]. Similarly, integrative metabolic analysis tool (iMAT) [55] performs the same task as GIMME in such a way that transcript levels of genes are compared and the corresponding reactions are then assigned value of −1, 0, and 1 to refer to low, moderate, or high levels of expression. Further ahead, the algorithm will then optimize the model to make use of as many reactions having “1” coefficient and decreases the reactions with “−1” coefficient in order to achieve a set objective function. Here too, a threshold needs to be set for expression data comparison to be done. As both iMAT and GIMME require a manually set threshold, this gives rise to biases. In an attempt to evade such a complication, MADE [56], or metabolic adjustment by differential expression, has been developed to carry out similar tasks as the previous two tools yet without the need of manual assignment of a threshold. It will rather require as input expression data originating from more than one condition and will then comparatively, based on the differential expression of each of the genes under each of the conditions, set a threshold based on which the reactions will then be reduced to binary on/off code [53, 54].

5.2. E-Flux

While the above-mentioned tools allow the incorporation of expression data to metabolic model reconstructions and subsequently allow optimization of these models towards a set objective function by suppressing reactions categorized as inactive or of low activity, E-flux allows for this optimization through constraining the upper bounds of the metabolite fluxes based on the expression data by imposing tight constraints on metabolites and reactions where the fluxes will not reach a set value and vice versa [57].

5.3. Optknock, Optstrain, and Optflux

These tools have been used to identify gene knockout strategies (Optknock) [58] that lead to the overproduction of a target metabolite or overexpression strategies (Optstrain) [59] that result into an optimized strain with respect to a set objective function. Optflux on the other hand uses evolutionary algorithms and the previously mentioned Optknock algorithm to identify metabolic engineering targets as well as a range of other applications from phenotype simulations to metabolic flux analysis and calculation of elementary flux mode [60].

5.4. BioMet Toolbox

It is a web-based resource that can be used to perform stoichiometric analyses and integration of transcriptome and interactome data to a metabolic network. It also allows performing linear programming simulations, optimizing for an increased or decreased growth rate, as well as substrate consumption and production. Single or double knockout simulations can also be achieved as well as the detection of key metabolites around which high transcriptional activity is noted [61].

5.5. MTA

Metabolic transformation algorithm [62] is an alternative approach that leads to the prediction of gene knockout strategies able to shift the metabolism of a cell and alter its state from a “source” state to a “target” state. Gene expression profiles are used in order to predict knockouts that modify the flux distribution of the source state in a way to match the desired target state.

5.6. TIGER

It is a toolbox that can be used to integrate expression, metabolic and regulatory information into a genome scale model. It also accounts for gene-protein-reaction associations and couples it with its regulatory profile. One of its added values is its ability to identify model inconsistencies and thus it allows for a modification of the reconstructed network above and beyond being an integration tool [63].

5.7. SIMUP

Most recently, this algorithm was reported offering one unique feature with respect to all of the above introduced tools. The algorithm aids in identifying metabolic engineering strategies that can force the cell to coutilize two different sugar substrates thus, in effect, placing the cell in a “synthetic survival” state in a way that the cell is now forced to metabolize two different sugars simultaneously instead of preferentially consuming one. The net effect can be to simplify the fermentation cycle [64, 65].

In the context of biofuels, all of the above algorithms and tools present huge potential for achieving higher production of the desired bioproducts in microorganisms. The preferential use of one tool over the other may depend on the nature of data available rather than the ultimate goal (Table 4). The identification of knockout strategies that could alter the lipid metabolism by overproducing it, or the detection of highly regulated key metabolites in the lipid pathway, or even achieving a strain able to coutilize two separate sources of energy for its survival, all represent promising outcomes of such applications and several attempts have been already made making use of such algorithms (the results could be found in more detail in the published articles [66, 67]).

6. Omics Data Integration Tools

Beyond the integration of expression data to network models, a deeper understanding of the functional model requires further integration of proteomics, metabolomics, fluxomics, and phenotypic data with transcriptomics data. Computational tools and algorithms have been recently set forth to achieve the aforementioned integrations. IOMA, MASS, and MBA are examples of such endeavors.

6.1. IOMA

Integrative omics-metabolic analysis is an algorithm that allows the integration of metabolomics and proteomic data to the metabolic network model and also evaluates the kinetics of the reactions included [68].

6.2. MASS

Mass action stoichiometric simulation [69] achieves integration of fluxomic data on top of the metabolomics and proteomics data sets which leads to the dynamic reconstruction of the model in place.

6.3. MBA

Model-building algorithm [70] has been recently reported with an added feature allowing it to also integrate phenotypic data on top of all the above-mentioned omics data sets, thus potentially leading to tissue-specific model reconstruction.

With respect to phenotypic data, one interesting tool that may generate such type of data and can be used in conjunction with MBA, for example, is the Biolog phenotype microarray technology [71, 72]. The Biolog is a powerful technology providing high-throughput quantitation of phenotypic data, useful in identifying additional biochemical assays and improving a metabolic model reconstruction.

The phenotype microarray (PM) technology developed by Biolog (Hayward, CA, USA) can be used for the phenotypic analysis. Biolog is an in vitro assay that measures the respiration of cells as a function of time in hundreds of microwells simultaneously. Each PM plate contains 96 wells seeded with different metabolite and monitored automatically over time via the OmniLog machine. Metabolite utilization within the cell is determined by the amount of color development produced by a tetrazolium-based redox dye. Various 96-well metabolite plates (or PMs) can be used to measures carbon source, nitrogen, sulfur, and phosphorus utilization phenotypes. Some plates were used to test for osmotic/ion and pH effects. Data analysis is performed using the opm software package [73]. The Biolog technology has also been successfully used to fill gaps in metabolic networks to enhance models [74].

7. Bioengineering, Parts and Circuits

With all of the above tools readily available to use and many others currently in use but not described in this review, the identification of new pathways and reactions has been made easier than ever before. In the context of bioengineering, the significance of these computational tools is in guiding wet-bench experimental design as opposed to providing solely theoretical insight into the system as a whole. More specifically, with regard to biofuel production, the identification of knockout strategies or differential expression of genes or enzymes that might lead to overproduction of biofuels would be only of theoretical value if not coupled with more applicable approaches to achieve the targets in vivo. This is where the contributions of synthetic biology approaches are of crucial importance and significance. Once the target pathways have been identified, the parts forming those pathways, in engineering terms, are to be made available in order to mimic the cell metabolic circuitry and alter it. Parts are defined as genes and ribosomal binding sites, promoters, terminators, and polymerases [75]. Most recently, Talebi et al. have successfully achieved a 12% increase in the total lipid content of the microalgae Dunaliella salina, transforming it witha bioengineered plasmid comprising specific parts, genes, and inducible promoters, driving the cellular carbon flux into the fatty acids biosynthesis pathway [76].

Biological circuits are furthermore defined as a designed device made out of a set of parts and engineered in a way to confer an added functionality to a system. Figure 2 illustrates, in a comparative approach to electrical circuits, what a newly designed biological circuit can achieve. A number of biological circuits have been previously realized [77–79] and genetic parts are now made available through a number of databases such as the MIT Registry of Standard Biological Parts’ (http://partsregistry.org/). A more in-depth review on the tools and applications that lead to the design of circuits was published by Marchicio et al. and can be referred to for more details [80].

8. Emerging Algal-Specific Computational and Experimental Resources

Optimizing algae for biofuel production requires a deep understanding of algal metabolic networks with genomic, fluxomics, proteomics, and metabolomics data integration. Figure 3 conceptualizes an integrative approach to build, refine, and validate an algae based metabolic model with predictive power to guide potential bioengineering targets aimed at optimizing algae for biofuel production.

Furthermore, a better understanding of the biological system through functional modeling using data generated from the sequencing technologies is still one of the research challenges. Functional modeling requires gene ontology (GO) annotation for enrichment analysis. GO enrichment analysis tools identify GO terms with statistical significance in the reference set. Algal Functional Annotation Tool is the algae-specific genome annotation tool that uses gene lists from AUGUSTUS, JGI, or phytozome gene models for Chlamydomonas reinhardtii and Chlorella NC64A [81] to perform functional term enrichment. This functional annotation tool provides analytical power for interpretation of obtained large-scale experimental data.

Interestingly, a new approach in bioengineering, transcription factor engineering approach (TFE) [67], is regarded as a highly promising approach and considers transcription factors as parts able to modify biological circuits. An ongoing work (in the authors’ laboratory) is now attempting to systematically clone transcription and chromatin factors (TF and CF) of C. reinhardtii thus making available to the scientific community a full library of TF and CF parts that can easily be introduced as part of a new design. Figure 4 represents one step further downstream the initial cloning and describes the transfer of cloned ORFs from the entry vector to the destination vector of choice. These ORFs can be considered as potential parts to be used in bioengineering endeavors when model-based predictions call for their use. Furthermore, the metabolic ORFeome of C. reinhardtii has been previously generated and the reconstruction of its central metabolic network has been done [82–84]. Following that, genome-scale reconstructed networks of C. reinhardtii were released accounting for around 2000 reactions and their associated genes and metabolites [82, 85]. Added to these models, a PGDB for C. reinhardtii has been made available as ChlamyCyc [86] making use of Pathway Tools platform and thus making the investigations of the metabolic and regulatory networks of such algae far more at hand. Prior and in parallel to these advances a species specific resource, Chlamydomonas Resource Center (http://chlamycollection.org/), has served the algal community offering a library of Chlamydomonas strains amongst other parts and tools, which provide needed resources for experimental protocols targeting various aspects of algal biology, including the metabolism of lipids and biofuels in this organism.

9. Conclusion

The above reviewed computational tools and approaches in conjunction with the high interests of the scientific community in synthetic biology offer a new perspective in accelerating biofuel production and microalgal optimization research. The pressing economical and environmental challenges of the use of fossil fuels will furthermore lead to a positive selective pressure towards the use of these strategies aiming at the optimization of biofuel producing strains. A large set of biofuel types can serve as alternative energy sources which currently include ethanol, n-butanol, iso-butanol, short chain alcohols, short chain alkanes, biodiesel (FAMEs), and fatty alcohols. These tools and applications are promising yet much more optimizations need to be achieved in order for biofuel production to compete with available fossil fuels. With the “green revolution” and the more environmentally conscious population, we expect this field to expand significantly in the coming years, building on the available resources for systems and synthetic biology and achieving the generation of strains optimized for biofuel production.

Conflict of Interests

The authors declare that there is no conflict of interests.

Acknowledgments

Major support for this work was provided by New York University Abu Dhabi Institute grant G1205 and NYU Abu Dhabi Faculty Research Funds AD060. The authors thank Basel Khraiwesh and Ashish Jaiswal for designing some of the figures and Bushra Saeed Dohai for discussions.

References

J. E. Koskimaki, A. S. Blazier, A. F. Clarens, and J. A. Papin, “Computational models of algae metabolism for industrial applications,” Industrial Biotechnology, vol. 9, no. 4, pp. 185–195, 2013.
View at: Publisher Site | Google Scholar
R. Usaite, K. R. Patil, T. Grotkjær, J. Nielsen, and B. Regenberg, “Global transcriptional and physiological responses of Saccharomyces cerevisiae to ammonium, L-alanine, or L-glutamine limitation,” Applied and Environmental Microbiology, vol. 72, no. 9, pp. 6194–6203, 2006.
View at: Publisher Site | Google Scholar
R. M. Zelle, E. de Hulster, W. A. van Winden et al., “Malic acid production by Saccharomyces cerevisiae: engineering of pyruvate carboxylation, oxaloacetate reduction, and malate export,” Applied and Environmental Microbiology, vol. 74, no. 9, pp. 2766–2777, 2008.
View at: Publisher Site | Google Scholar
M. Izallalen, R. Mahadevan, A. Burgard et al., “Geobacter sulfurreducens strain engineered for increased rates of respiration,” Metabolic Engineering, vol. 10, no. 5, pp. 267–275, 2008.
View at: Publisher Site | Google Scholar
M. A. Oberhardt, B. Ø. Palsson, and J. A. Papin, “Applications of genome-scale metabolic reconstructions,” Molecular Systems Biology, vol. 5, no. 1, article 320, 2009.
View at: Publisher Site | Google Scholar
P. D. Karp, S. M. Paley, M. Krummenacker et al., “Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology,” Briefings in Bioinformatics, vol. 11, no. 1, Article ID bbp043, pp. 40–79, 2009.
View at: Publisher Site | Google Scholar
I. Schomburg, A. Chang, S. Placzek et al., “BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA,” Nucleic Acids Research, vol. 41, no. 1, pp. D764–D772, 2013.
View at: Publisher Site | Google Scholar
P. Artimo, M. Jonnalagedda, K. Arnold et al., “ExPASy: SIB bioinformatics resource portal,” Nucleic Acids Research, vol. 40, no. 1, pp. W597–W603, 2012.
View at: Publisher Site | Google Scholar
T. U. Consortium, “Ongoing and future developments at the Universal Protein Resource,” Nucleic Acids Research, vol. 39, supplement 1, pp. D214–D219, 2011.
View at: Google Scholar
R. Caspi, T. Altman, R. Billington et al., “The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome databases,” Nucleic Acids Research, vol. 42, no. 1, pp. D459–D471, 2014.
View at: Publisher Site | Google Scholar
M. Kanehisa, S. Goto, Y. Sato, M. Furumichi, and M. Tanabe, “KEGG for integration and interpretation of large-scale molecular data sets,” Nucleic Acids Research, vol. 40, no. 1, pp. D109–D114, 2012.
View at: Publisher Site | Google Scholar
D. Croft, G. O'Kelly, G. Wu et al., “Reactome: a database of reactions, pathways and biological processes,” Nucleic Acids Research, vol. 39, supplement 1, pp. D691–D697, 2011.
View at: Publisher Site | Google Scholar
J. Schellenberger, J. O. Park, T. M. Conrad, and B. T. Palsson, “BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions,” BMC Bioinformatics, vol. 11, article 213, 2010.
View at: Publisher Site | Google Scholar
S. A. Becker, A. M. Feist, M. L. Mo, G. Hannum, B. Ø. Palsson, and M. J. Herrgard, “Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox,” Nature Protocols, vol. 2, no. 3, pp. 727–738, 2007.
View at: Publisher Site | Google Scholar
J. Schellenberger, R. Que, R. M. T. Fleming et al., “Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0,” Nature Protocols, vol. 6, no. 9, pp. 1290–1307, 2011.
View at: Publisher Site | Google Scholar
S. G. Thorleifsson and I. Thiele, “rBioNet: a COBRA toolbox extension for reconstructing high-quality biochemical networks,” Bioinformatics, vol. 27, no. 14, Article ID btr308, pp. 2009–2010, 2011.
View at: Publisher Site | Google Scholar
S. Devoid, R. Overbeek, M. DeJongh et al., “Automated genome annotation and metabolic model reconstruction in the SEED and model SEED,” in Systems Metabolic Engineering, pp. 17–45, 2013.
View at: Publisher Site | Google Scholar
P. D. Karp, S. Paley, and P. Romero, “The pathway tools software,” Bioinformatics, vol. 18, supplement 1, pp. S225–S232, 2002.
View at: Publisher Site | Google Scholar
R. Caspi, T. Altman, J. M. Dale et al., “The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases,” Nucleic Acids Research, vol. 38, supplement 1, Article ID gkp875, pp. D473–D479, 2009.
View at: Publisher Site | Google Scholar
R. Caspi, T. Altman, K. Dreher et al., “The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases,” Nucleic Acids Research, vol. 40, no. 1, pp. D742–D753, 2012.
View at: Publisher Site | Google Scholar
R. Caspi, H. Foerster, C. A. Fulcher et al., “The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome databases,” Nucleic Acids Research, vol. 36, supplement 1, pp. D623–D631, 2008.
View at: Publisher Site | Google Scholar
A. Kasprzyk, “BioMart: driving a paradigm change in biological data management,” Database, vol. 2011, Article ID bar049, 2011.
View at: Publisher Site | Google Scholar
T. Y. Kim, S. B. Sohn, Y. B. Kim, W. J. Kim, and S. Y. Lee, “Recent advances in reconstruction and applications of genome-scale metabolic models,” Current Opinion in Biotechnology, vol. 23, no. 4, pp. 617–623, 2012.
View at: Publisher Site | Google Scholar
M. A. Oberhardt, J. Puchałka, V. A. P. M. dos Santos, and J. A. Papin, “Reconciliation of genome-scale metabolic reconstructions for comparative systems analysis,” PLoS Computational Biology, vol. 7, no. 3, Article ID e1001116, 2011.
View at: Publisher Site | Google Scholar
I. Thiele and B. Ø. Palsson, “A protocol for generating a high-quality genome-scale metabolic reconstruction,” Nature Protocols, vol. 5, no. 1, pp. 93–121, 2010.
View at: Publisher Site | Google Scholar
R. A. Notebaart, F. H. J. van Enckevort, C. Francke, R. J. Siezen, and B. Teusink, “Accelerating the reconstruction of genome-scale metabolic networks,” BMC Bioinformatics, vol. 7, article 296, 2006.
View at: Publisher Site | Google Scholar
Y.-C. Liao, M.-H. Tsai, F.-C. Chen, and C. A. Hsiung, “GEMSiRV: a software platform for GEnome-scale metabolic model simulation, reconstruction and visualization,” Bioinformatics, vol. 28, no. 13, Article ID bts267, pp. 1752–1758, 2012.
View at: Publisher Site | Google Scholar
X. Feng, Y. Xu, Y. Chen, and Y. J. Tang, “MicrobesFlux: a web platform for drafting metabolic models from the KEGG database,” BMC Systems Biology, vol. 6, article 94, 2012.
View at: Publisher Site | Google Scholar
A. Kumar, P. F. Suthers, and C. D. Maranas, “MetRxn: A knowledgebase of metabolites and reactions spanning metabolic models and databases,” BMC Bioinformatics, vol. 13, no. 1, article 6, 2012.
View at: Publisher Site | Google Scholar
C. S. Henry, M. Dejongh, A. A. Best, P. M. Frybarger, B. Linsay, and R. L. Stevens, “High-throughput generation, optimization and analysis of genome-scale metabolic models,” Nature Biotechnology, vol. 28, no. 9, pp. 977–982, 2010.
View at: Publisher Site | Google Scholar
N. Swainston, K. Smallbone, P. Mendes, D. Kell, and N. Paton, “The SuBliMinaL Toolbox: automating steps in the reconstruction of metabolic networks,” Journal of Integrative Bioinformatics, vol. 8, article 186, no. 2, 2011.
View at: Google Scholar
J. Boele, B. G. Olivier, and B. Teusink, “FAME, the flux analysis and modeling environment,” BMC Systems Biology, vol. 6, no. 1, p. 8, 2012.
View at: Publisher Site | Google Scholar
R. Agren, L. Liu, S. Shoaie, W. Vongsangnak, I. Nookaew, and J. Nielsen, “The RAVEN toolbox and its use for generating a genome-scale metabolic model for penicillium chrysogenum,” PLoS Computational Biology, vol. 9, no. 3, Article ID e1002980, 2013.
View at: Publisher Site | Google Scholar
J. J. Hamilton and J. L. Reed, “Software platforms to facilitate reconstructing genome-scale metabolic networks,” Environmental Microbiology, vol. 16, no. 1, pp. 49–59, 2014.
View at: Publisher Site | Google Scholar
T. Altman, M. Travers, A. Kothari, R. Caspi, and P. D. Karp, “A systematic comparison of the MetaCyc and KEGG pathway databases,” BMC Bioinformatics, vol. 14, article 112, 2013.
View at: Publisher Site | Google Scholar
M. E. Smoot, K. Ono, J. Ruscheinski, P.-L. Wang, and T. Ideker, “Cytoscape 2.8: new features for data integration and network visualization,” Bioinformatics, vol. 27, no. 3, pp. 431–432, 2011.
View at: Publisher Site | Google Scholar
R. Saito, M. E. Smoot, K. Ono et al., “A travel guide to Cytoscape plugins,” Nature Methods, vol. 9, no. 11, pp. 1069–1076, 2012.
View at: Publisher Site | Google Scholar
M. DeJongh, B. Bockstege, P. Frybarger, N. Hazekamp, J. Kammeraad, and T. McGeehan, “CytoSEED: a Cytoscape plugin for viewing, manipulating and analyzing metabolic models created by the model SEED,” Bioinformatics, vol. 28, no. 6, pp. 891–892, 2012.
View at: Publisher Site | Google Scholar
M. König and H. Holzhütter, “Fluxviz-cytoscape plug-in for visualization of flux distributions in networks,” in Proceedings of the International Conference on Genome Informatics, 2010.
View at: Google Scholar
H. Rohn, A. Junker, A. Hartmann et al., “VANTED v2: a framework for systems biology applications,” BMC Systems Biology, vol. 6, article 139, 2012.
View at: Publisher Site | Google Scholar
B. H. Junker, C. Klukas, and F. Schreiber, “Vanted: a system for advanced data analysis and visualization in the context of biological networks,” BMC Bioinformatics, vol. 7, article 109, 13 pages, 2006.
View at: Publisher Site | Google Scholar
H. Rohn, A. Hartmann, A. Junker, B. H. Junker, and F. Schreiber, “FluxMap: A VANTED add-on for the visual exploration of flux distributions in biological networks,” BMC Systems Biology, vol. 6, article 33, 2012.
View at: Publisher Site | Google Scholar
E. Grafahrend-Belau, C. Klukas, B. H. Junker, and F. Schreiber, “FBA-SimVis: interactive visualization of constraint-based metabolic models,” Bioinformatics, vol. 25, no. 20, pp. 2755–2757, 2009.
View at: Publisher Site | Google Scholar
A. Kostromins and E. Stalidzans, “Paint4Net: COBRA Toolbox extension for visualization of stoichiometric models of metabolism,” BioSystems, vol. 109, no. 2, pp. 233–239, 2012.
View at: Publisher Site | Google Scholar
P. A. Jenseny and J. A. Papin, “MetDraw: automated visualization of genome-scale metabolic network reconstructions and high-throughput data,” Bioinformatics, vol. 30, no. 9, pp. 1327–1328, 2014.
View at: Publisher Site | Google Scholar
V. Satish Kumar, M. S. Dasika, and C. D. Maranas, “Optimization based automated curation of metabolic reconstructions,” BMC Bioinformatics, vol. 8, article 212, 2007.
View at: Publisher Site | Google Scholar
P. Kharchenko, D. Vitkup, and G. M. Church, “Filling gaps in a metabolic network using expression information,” Bioinformatics, vol. 20, supplement 1, pp. i178–i185, 2004.
View at: Publisher Site | Google Scholar
M. L. Green and P. D. Karp, “A Bayesian method for identifying missing enzymes in predicted metabolic pathway databases,” BMC Bioinformatics, vol. 5, article 76, 2004.
View at: Publisher Site | Google Scholar
V. S. Kumar and C. D. Maranas, “GrowMatch: an automated method for reconciling In Silico/In Vivo growth predictions,” PLoS Computational Biology, vol. 5, no. 3, 2009.
View at: Publisher Site | Google Scholar
V. Hatzimanikatis, C. Li, J. A. Ionita, C. S. Henry, M. D. Jankowski, and L. J. Broadbelt, “Exploring the diversity of complex metabolic networks,” Bioinformatics, vol. 21, no. 8, pp. 1603–1609, 2005.
View at: Publisher Site | Google Scholar
M. Lakshmanan, G. Koh, B. K. S. Chung, and D.-Y. Lee, “Software applications for flux balance analysis,” Briefings in Bioinformatics, vol. 15, no. 1, pp. 108–122, 2014.
View at: Publisher Site | Google Scholar
S. A. Becker and B. O. Palsson, “Context-specific metabolic networks are consistent with experiments,” PLoS Computational Biology, vol. 4, no. 5, Article ID e1000082, 2008.
View at: Publisher Site | Google Scholar | MathSciNet
A. S. Blazier and J. A. Papin, “Integration of expression data in genome-scale metabolic network reconstructions,” Frontiers in Physiology, vol. 3, article 299, 2012.
View at: Publisher Site | Google Scholar
D. Machado and M. Herrgård, “Systematic evaluation of methods for integration of transcriptomic data into constraint-based models of metabolism,” PLoS Computational Biology, vol. 10, no. 4, Article ID e1003580, 2014.
View at: Publisher Site | Google Scholar
H. Zur, E. Ruppin, and T. Shlomi, “iMAT: an integrative metabolic analysis tool,” Bioinformatics, vol. 26, no. 24, pp. 3140–3142, 2010.
View at: Publisher Site | Google Scholar
P. A. Jensen and J. A. Papin, “Functional integration of a metabolic network model and expression data without arbitrary thresholding,” Bioinformatics, vol. 27, no. 4, pp. 541–547, 2011.
View at: Publisher Site | Google Scholar
C. Colijn, A. Brandes, J. Zucker et al., “Interpreting expression data with metabolic flux models: predicting Mycobacterium tuberculosis mycolic acid production,” PLoS Computational Biology, vol. 5, no. 8, Article ID e1000489, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
A. P. Burgard, P. Pharkya, and C. D. Maranas, “Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization,” Biotechnology and Bioengineering, vol. 84, no. 6, pp. 647–657, 2003.
View at: Publisher Site | Google Scholar
P. Pharkya, A. P. Burgard, and C. D. Maranas, “OptStrain: a computational framework for redesign of microbial production systems,” Genome Research, vol. 14, no. 11, pp. 2367–2376, 2004.
View at: Publisher Site | Google Scholar
I. Rocha, P. Maia, P. Evangelista et al., “OptFlux: an open-source software platform for in silico metabolic engineering,” BMC Systems Biology, vol. 4, no. 1, article 45, 2010.
View at: Publisher Site | Google Scholar
M. Cvijovic, R. Olivares-Hernandez, R. Agren et al., “BioMet Toolbox: genome-wide analysis of metabolism,” Nucleic Acids Research, vol. 38, supplement 2, Article ID gkq404, pp. W144–W149, 2010.
View at: Publisher Site | Google Scholar
K. Yizhak, O. Gabay, H. Cohen, and E. Ruppin, “Model-based identification of drug targets that revert disrupted metabolism and its application to ageing,” Nature Communications, vol. 4, 2013.
View at: Publisher Site | Google Scholar
P. A. Jensen, K. A. Lutz, and J. A. Papin, “TIGER: toolbox for integrating genome-scale metabolic models, expression data, and transcriptional regulatory networks,” BMC Systems Biology, vol. 5, no. 1, article 147, 2011.
View at: Publisher Site | Google Scholar
P. Gawand, P. Hyland, A. Ekins, V. J. J. Martin, and R. Mahadevan, “Novel approach to engineer strains for simultaneous sugar utilization,” Metabolic Engineering, vol. 20, pp. 63–72, 2013.
View at: Publisher Site | Google Scholar
J.-H. Kim, D. E. Block, and D. A. Mills, “Simultaneous consumption of pentose and hexose sugars: an optimal microbial phenotype for efficient fermentation of lignocellulosic biomass,” Applied Microbiology and Biotechnology, vol. 88, no. 5, pp. 1077–1085, 2010.
View at: Publisher Site | Google Scholar
S. K. Lee, H. Chou, T. S. Ham, T. S. Lee, and J. D. Keasling, “Metabolic engineering of microorganisms for biofuels production: from bugs to synthetic biology to fuels,” Current Opinion in Biotechnology, vol. 19, no. 6, pp. 556–563, 2008.
View at: Publisher Site | Google Scholar
N. M. D. Courchesne, A. Parisien, B. Wang, and C. Q. Lan, “Enhancement of lipid production using biochemical, genetic and transcription factor engineering approaches,” Journal of Biotechnology, vol. 141, no. 1-2, pp. 31–41, 2009.
View at: Publisher Site | Google Scholar
K. Yizhak, T. Benyamini, W. Liebermeister, E. Ruppin, and T. Shlomi, “Integrating quantitative proteomics and metabolomics with a genome-scale metabolic network model,” Bioinformatics, vol. 26, no. 12, Article ID btq183, pp. i255–i260, 2010.
View at: Publisher Site | Google Scholar
N. Jamshidi and B. Ø. Palsson, “Mass action stoichiometric simulation models: incorporating kinetics and regulation into stoichiometric models,” Biophysical Journal, vol. 98, no. 2, pp. 175–185, 2010.
View at: Publisher Site | Google Scholar
L. Jerby, T. Shlomi, and E. Ruppin, “Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism,” Molecular Systems Biology, vol. 6, article 401, 2010.
View at: Publisher Site | Google Scholar
B. R. Bochner, “Global phenotypic characterization of bacteria,” FEMS Microbiology Reviews, vol. 33, no. 1, pp. 191–205, 2009.
View at: Publisher Site | Google Scholar
B. R. Bochner, “New technologies to assess genotype-phenotype relationships,” Nature Reviews Genetics, vol. 4, no. 4, pp. 309–314, 2003.
View at: Publisher Site | Google Scholar
L. A. I. Vaas, J. Sikorski, V. Michael, M. Göker, and H.-P. Klenk, “Visualization and curve-parameter estimation strategies for efficient exploration of phenotype microarray kinetics,” PLoS ONE, vol. 7, no. 4, Article ID e34846, 2012.
View at: Publisher Site | Google Scholar
J. L. Reed, T. R. Patel, K. H. Chen et al., “Systems approach to refining genome annotation,” Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 46, pp. 17480–17484, 2006.
View at: Publisher Site | Google Scholar
M. H. Medema, R. van Raaphorst, E. Takano, and R. Breitling, “Computational tools for the synthetic design of biochemical pathways,” Nature Reviews Microbiology, vol. 10, no. 3, pp. 191–202, 2012.
View at: Publisher Site | Google Scholar
A. F. Talebi, M. Tohidfar, A. Bagheri et al., “Manipulation of carbon flux into fatty acid biosynthesis pathway in Dunaliella salina using AccD and ME genes to enhance lipid content and to improve produced biodiesel quality,” Biofuel Research Journal, vol. 1, no. 3, pp. 91–97, 2014.
View at: Google Scholar
E. Andrianantoandro, S. Basu, D. K. Karig, and R. Weiss, “Synthetic biology: new engineering rules for an emerging discipline,” Molecular Systems Biology, vol. 2, no. 1, Article ID msb4100073, 2006.
View at: Publisher Site | Google Scholar
S. A. Benner and A. M. Sismour, “Synthetic biology,” Nature Reviews Genetics, vol. 6, no. 7, pp. 533–543, 2005.
View at: Publisher Site | Google Scholar
D. A. Drubin, J. C. Way, and P. A. Silver, “Designing biological systems,” Genes and Development, vol. 21, no. 3, pp. 242–254, 2007.
View at: Publisher Site | Google Scholar
M. A. Marchisio and J. Stelling, “Computational design of synthetic gene circuits with composable parts,” Bioinformatics, vol. 24, no. 17, pp. 1903–1910, 2008.
View at: Publisher Site | Google Scholar
D. Lopez, D. Casero, S. J. Cokus, S. S. Merchant, and M. Pellegrini, “Algal functional annotation tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data,” BMC Bioinformatics, vol. 12, no. 1, article 282, 2011.
View at: Publisher Site | Google Scholar
R. L. Chang, L. Ghamsari, A. Manichaikul et al., “Metabolic network reconstruction of Chlamydomonas offers insight into light-driven algal metabolism,” Molecular Systems Biology, vol. 7, article 518, 2011.
View at: Publisher Site | Google Scholar
N. R. Boyle and J. A. Morgan, “Flux balance analysis of primary metabolism in Chlamydomonas reinhardtii,” BMC Systems Biology, vol. 3, article 4, 2009.
View at: Publisher Site | Google Scholar
A. Manichaikul, L. Ghamsari, E. F. Y. Hom et al., “Metabolic network analysis integrated with transcript verification for sequenced genomes,” Nature Methods, vol. 6, no. 8, pp. 589–592, 2009.
View at: Publisher Site | Google Scholar
C. G. de Oliveira Dal'Molin, L.-E. Quek, R. W. Palfreyman, and L. K. Nielsen, “AlgaGEM—a genome-scale metabolic reconstruction of algae based on the Chlamydomonas reinhardtii genome,” BMC Genomics, vol. 12, no. 4, article S5, 2011.
View at: Publisher Site | Google Scholar
P. May, J. O. Christian, S. Kempa, and D. Walther, “ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii,” BMC Genomics, vol. 10, article 209, 2009.
View at: Publisher Site | Google Scholar
A. J. M. Walhout, G. F. Temple, M. A. Brasch et al., “GATEWAY recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes,” Methods in Enzymology, vol. 328, pp. 575–592, 2000.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2014 Joseph Koussa et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

4066

Downloads

2046

Citations

BioMed Research International

Renewable Energy and Alternative Fuel Technologies

Computational Approaches for Microalgal Biofuel Optimization: A Review

Abstract

1. Introduction

2. Metabolic Network Model Reconstruction

3. Pathway Visualization

4. Model Refinement and Gap Filling

4.1. Gapfind and Gapfill

4.2. MEP and Pathway Tools Hole Filler

4.3. GrowMatch

4.4. BNICE

5. Constraint-Based Modeling, FBA, and Integration of Expression Data

5.1. GIMME, iMAT, and MADE

5.2. E-Flux

5.3. Optknock, Optstrain, and Optflux

5.4. BioMet Toolbox

5.5. MTA

5.6. TIGER

5.7. SIMUP

6. Omics Data Integration Tools

6.1. IOMA

6.2. MASS

6.3. MBA

7. Bioengineering, Parts and Circuits

8. Emerging Algal-Specific Computational and Experimental Resources

9. Conclusion

Conflict of Interests

Acknowledgments

References

Copyright