Abstract

Maintenance and manipulation of large DNA and RNA virus genomes had presented an obstacle for virological research. BAC vectors provided a solution to both problems as they can harbor large DNA sequences and can efficiently be modified using well-established mutagenesis techniques in Escherichia coli. Numerous DNA virus genomes of herpesvirus and pox virus were cloned into mini-F vectors. In addition, several reverse genetic systems for RNA viruses such as members of Coronaviridae and Flaviviridae could be established based on BAC constructs. Transfection into susceptible eukaryotic cells of virus DNA cloned as a BAC allows reconstitution of recombinant viruses. In this paper, we provide an overview on the strategies that can be used for the generation of virus BAC vectors and also on systems that are currently available for various virus species. Furthermore, we address common mutagenesis techniques that allow modification of BACs from single-nucleotide substitutions to deletion of viral genes or insertion of foreign sequences. Finally, we review the reconstitution of viruses from BAC vectors and the removal of the bacterial sequences from the virus genome during this process.

1. Introduction

For many years, genetic manipulation of the genomes of large DNA viruses was extremely tedious and dependent on rare recombination events in susceptible eukaryotic cells. Infected cells were transfected with plasmids or linear DNA fragments containing a selection marker flanked by sequences homologous to the target locus. This process allowed the deletion or mutation of the gene of interest in the virus genome. However, purification of resulting recombinant viruses was laborious and often required several passages. Deletion of essential viral genes was usually not feasible due to the strong selection for progeny viruses that efficiently replicate in vitro. Constant selective pressure during serial virus passage often resulted in compensatory mutations in the viral genome. To overcome the obstacle of essentiality, transcomplementing cell lines were generated that would compensate for the absence of the gene in the virus context. This solution would work in most cases but was not always possible due to the toxicity of some viral proteins. A solution to the problems was the maintenance and modification of virus genomes in bacteria where the accuracy of the bacterial polymerase allows clonal maintenance of viral sequences in E. coli. As viral promoters are not functional in bacteria, there is no selective pressure on the virus genome in E. coli. In rare cases, high-copy plasmids containing virus sequences can be toxic for bacteria; however, this problem could also be overcome by the use of single or low-copy vectors, such as bacterial artificial chromosomes (BACs). Another advantage of BAC replicons is the high capacity of up to 300 kbp that is required for the cloning of large DNA and RNA virus genomes. Maintenance and faithful replication of the BAC construct in E. coli are facilitated by components encoded in the minimal fertility factor replicon (mini-F), the backbone of the BAC vector. Replication of the BAC is initiated at the origin of replication S (oriS) and stringently regulated by the repE and repF gene products encoded in the mini-F. Resulting copies of the replicon are subsequently allocated to the two daughter cells. This dynamic process is facilitated by the mini-F-encoded proteins SopA and SopB and the centromere region sopC [1]. Bacterial antibiotic resistance cassettes present in the BAC vector backbone allow the selection in E. coli. Besides the essential sequences required for replication and maintenance, many BAC vectors harbor a variety of expression cassettes that allow visualization of BAC-containing sequences in transfected cells selection in mammalian cells and, hence, ease the generation of recombinant viruses that contain the mini-F.

The major advantage of the maintenance of virus genomes in BAC vectors is the availability of well-established mutagenesis techniques in E. coli. Over the last decade, a number of methods have been developed that allow the generation of virtually any mutation in a virus genome. Within a few days, genes or sequence elements can be removed. Foreign sequences can readily be inserted into the genome, allowing detection of viral proteins via epitope tags, or expression of foreign genes for development of vector vaccines [27]. Furthermore, reporter genes such as the green fluorescent protein (GFP) can be incorporated into the viral genome to visualize infected cells in vitro and in vivo or fused to a viral protein to determine its localization in living cells [3, 6, 8]. In addition, luciferase reporter genes can be inserted which allows tracking of the virus in live animals [9, 10]. Even single nucleotides can be edited in the virus genome, a modification that is almost impossible with conventional virus mutagenesis in mammalian cells.

Until now, BACs have been generated for a large number of DNA but also some RNA viruses. In this paper we provide an overview on strategies for the generation of virus BAC clones for both DNA and RNA viruses. Furthermore, we review the viral BAC systems that are currently available to the research community. As modification of virus genomes is an important tool in virological research, we outline the available mutagenesis techniques for viral BAC vectors. Last but not least, we describe various techniques that can be used for the removal of the mini-F from viral genomes upon reconstitution in eukaryotic cells.

2. Generation of Bacterial Artificial Chromosomes (BACs)

2.1. Homologous Recombination in Mammalian Cells

One of the most common methods for the insertion of the mini-F vector into the genomes of DNA viruses utilizes the recombination machinery of mammalian cells. For this approach, a transfer vector is generated that harbors the mini-F cassette flanked by sequences identical to the insertion site in the virus genome. The choice of the mini-F insertion site is critical as essential genes may not be affected in the process [11]. Circular or linearized transfer plasmid is either transfected into virus-infected cells or is co-transfected with virus genomic DNA. In a small portion of transfected cells, a recombination event facilitated by cellular recombinases allows the insertion of the mini-F sequences into the virus genome. Upon virus reconstitution, cells producing recombinant viruses can be visualized by fluorescent markers such as green fluorescent protein (GFP), which is often introduced in standard mini-F plasmids. Alternatively, cells infected with recombinant viruses can be enriched using selection markers such as gpt, hygromycin, or neomycin that are encoded by resistance genes present in the mini-F vector. Upon purification of recombinant viruses harboring mini-F vector sequences, susceptible cells are infected and circular virus DNA is isolated. In many cases, viral DNA is prepared using a method previously published by Hirt (1967), which uses fractionated precipitation with SDS and NaCl to separate viral from cellular DNA [12]. Residual cellular DNA can be fragmented using restriction enzymes that do not cut in the virus genome. Resulting linearized DNA may be removed by incubation with λ-exonuclease, an enzyme that exclusively digests linear DNA, leaving circular DNA intact [13]. This approach allows an enrichment of circular virus DNA. Isolated viral DNA harboring the mini-F is then electroporated into E. coli K12 strains, and BAC-containing bacteria are selected using antibiotic resistance genes present in the mini-F backbone [14]. Resulting BAC clones are routinely analyzed by restriction fragment length polymorphism (RFLP) to ensure the integrity of the virus genome. In addition, the insertion site of the mini-F is sequenced to determine if mutations that are present may have occurred in the recombination process. Recently, sequencing of entire BAC clones has become an attractive alternative, as the cost of high-throughput sequencing has dropped dramatically.

2.2. Cosmid-Based Approach

An alternative strategy often used for BAC generation of cell-associated viruses utilizes cosmid vectors to initially maintain overlapping parts of the genome of DNA viruses. The mini-F is subsequently inserted into one of the cosmids by ligation or homologous recombination in E. coli. Transfection of the overlapping cosmids into eukaryotic cells results in recombination between homologous sequences and reconstitution of infectious virus. During the process, the cosmid containing the mini-F cassette is incorporated into the virus genome, all resulting viruses harbor the mini-F, and no laborious selection steps to obtain recombinant clones are required. As described above, circular virus DNA is then isolated and transformed into E. coli and clones are screened for the integrity of the virus genomes they contain [13].

2.3. In Vitro Ligation

Recently, it has been shown that the mini-F replicon can be inserted into herpesvirus genomes by direct ligation [29]. For this purpose, concatemeric virus DNA is isolated from herpesvirus infected-cells and cleaved with a restriction enzyme that cuts only in a single locus within the virus genome. The resulting full-length viral genome is then ligated with a linearized mini-F vector containing compatible DNA ends. To avoid ligation of the mini-F with cellular fragments, restriction enzymes that recognize an interrupted palindrome and allow the generation of desired directional sticky ends such as SfiI or BstXI can be used. This strategy has been successfully applied to the generation of a BAC system for human herpesvirus 6A (HHV-6A) [29]. There are, however, several disadvantages to this method. Firstly, the strategy requires a completely sequenced virus genome in order to determine potential restriction sites that can be used for the ligation procedure. Secondly, many virus genomes do not possess a unique restriction site that is suitable for the strategy. Thirdly, the mini-F insertion site is limited to the location of the unique restriction site. Insertion into open reading frames (ORFs) or promoters of the virus genome can impair or abrogate infectivity of BAC-derived viruses. Last but not least, ligation and transformation procedures for large BAC vectors are very inefficient, therefore hampering cloning attempts.

2.4. Strategy for Poxviruses

As described in Section 2.1, insertion of the mini-F sequences into the poxvirus genome can be facilitated by the cellular recombination machinery in mammalian cells [4043]. However, unlike herpesviruses, poxviruses do not produce a circular form of the virus genome during replication. This poses a major hurdle for the transfer of the recombinant poxvirus constructs into E. coli. To overcome the problem, infected cells are treated with isatin-β-thiosemicarbazone that promotes accumulation of nonresolved, genomic concatemers [4044]. For the generation of some poxvirus BAC clones, it was sufficient to transform E. coli with concatemeric DNA, a procedure that likely resulted in a recombination event allowing circularization of the replicon [40, 43]. Alternatively, isolated poxvirus DNA was circularized prior to transformation of E. coli using the Cre/loxP- or Flp/FRT-recombination system that will be further described in Section 4.3 [41, 42].

2.5. Generation of BACs for RNA Viruses

BAC vectors can also be used to maintain and modify the genome of RNA viruses. This has been successfully shown for several members of the Coronaviridae and Flaviviridae [4550]. To generate a full-length BAC clone for these nonsegmented RNA viruses, viral RNA is isolated from infected cells and cDNA synthesized using reverse transcriptase. Full-length genomic cDNA can be purified by gel electrophoresis and subsequently cloned into a mini-F vector by a standard ligation reaction. If synthesis of a full-length cDNA is not possible due to the size of the virus genome, fragments of the genome can be transcribed into cDNA and subsequently combined by ligation to obtain a complete virus genome [51]: similar to the situation for DNA viruses, some RNA viruses can be reconstituted by transfection of (cloned) viral nucleic acid into eukaryotic cells. Expression of viral genomic RNA can be facilitated by cellular RNA polymerase II under the control of the major immediate-early promoter of human cytomegalovirus (HCMV) [45, 46]. Alternatively, a T7 promoter can be employed to drive the expression of viral RNA, where T7 polymerase is either stably expressed in the cell line used for virus reconstitution or delivered in trans by transfection or using vaccinia virus as a delivery vehicle [52, 53]. Correct processing of the 3′-ends of the virus genomes is often achieved by the bovine growth hormone (BGH) terminator and sequences of the hepatitis delta virus (HDV) ribozyme [45, 46, 49]. Full-length viral RNA can also be transcribed in vitro using a T7 promoter/polymerase system with the virus BAC clone as a template. Transfection into susceptible cells of the viral RNA transcribed in vitro usually allows virus reconstitution [51].

Currently, several laboratories are working on the generation of BAC systems for segmented RNA viruses. For this approach, cDNA clones of viral RNA segments are combined into a BAC vector. Alternatively, segments of the virus genome can be generated by de novo synthesis [54]. Transcription of genomic RNA of segmented viruses utilizes similar promoter/terminator systems as described above for nonsegmented RNA viruses. For example, all eight influenza A virus genome segments were recently cloned into a single high-capacity vector. This system allows a more efficient reconstitution of the virus in eukaryotic cells and could be used for the production of recombinant influenza vaccines [55].

3. BACs Available for Members of Various Herpesvirus Families

3.1. Herpesvirales

Over the last two decades, BAC vectors have become an important tool for herpesvirus research. In 1997, the first virus BAC system was developed by Messerle and colleagues for the murine cytomegalovirus (MCMV), one of the largest herpesviruses with a genome size of 230 kbp [14]. Shortly after the generation of the MCMV BAC, the technology was applied to many other species in the order Herpesvirales. These include numerous members of the Herpesviridae with species from all three subfamilies, Alpha-, Beta- and Gammaherpesvirinae (Table 1). Until now, BAC clones have been generated for all human herpesviruses with the exception of human herpesvirus 7 (HHV-7). Besides the Herpesviridae, two members of the Alloherpesviridae, koi herpesvirus and channel catfish herpesvirus, have been cloned into mini-F vectors [38, 39]. All in all, BAC systems are available for at least 27 herpesvirus species that infect hosts as diverse as fish, birds, and humans (Table 1). For many of these herpesviruses, several strains have been cloned as BACs, allowing the maintenance and manipulation of laboratory strains and clinical isolates with varying virulence. The plethora of BAC-based genetic systems has eased the analysis of herpesvirus-encoded genes and has immensely contributed to our understanding of the viruses’ life cycles and pathogenesis.

3.2. Poxviridae

So far, several full-length poxvirus genomes of the genus Orthopoxvirus have been cloned into BAC vectors. The first BAC construct was generated for the vaccinia virus strain Western Reserve [41, 56]. After this proof-of-principle for poxvirus BAC generation, two BAC vectors were developed for the highly attenuated modified vaccinia virus Ankara (MVA) and one for its more virulent parental strain, chorioallantois vaccinia virus Ankara (CVA). The MVA and CVA BAC constructs allowed an analysis of the effect of six major deletions present in MVA genome on poxvirus pathogenesis and the differences in cellular tropisms of MVA and CVA [40, 42]. Recently, a BAC clone was established for cowpox virus (CPXV), a zoonotic, rodent-borne poxvirus that has the largest and most complete genome of all orthopoxviruses [43]. The broad spectrum of mutagenesis techniques described in Section 4 facilitated many studies that shed light on the role of viral genes in the poxvirus lifecycle and allowed the establishment of recombinant next-generation vector vaccines [40, 43, 56, 57].

3.3. RNA Viruses

The establishment of stable genetic systems for RNA viruses was, and in some cases still is, one of the major challenges. Cloning of cDNA sequences into expression vectors allows maintenance and manipulation of the RNA virus genomes. However, in case of large nonsegmented RNA viruses, the capacity of regular plasmids is often not sufficient. Furthermore, high-copy vectors containing virus-derived cDNA fragments can be instable or exhibit toxic effects on the bacterial host. To circumvent the obstacles, cDNAs of several nonsegmented viruses were inserted into mini-F vectors. The first BAC of an RNA virus was developed by Almazán and colleagues in 2000 for transmissible gastroenteritis coronavirus (TGEV), a member of the Coronaviridae [45]. The large TGEV genome of almost 29 kbp was transcribed into cDNA and was successfully cloned and efficiently maintained as a BAC construct. In the following years, BAC systems were generated for other coronaviruses such as human coronavirus (HCoV) and SARS-related coronavirus (SARS-CoV) [46, 49]. Similarly, the cDNA sequence of Japanese encephalitis virus (JEV), a member of the Flaviviridae, was cloned into a mini-F vector in 2003 [50]. Since then, BAC-based genetic systems have been generated for the pestiviruses bovine viral diarrhea virus (BVDV) and classical swine fever virus (CSFV) (Table 2) [47, 48].

4. BAC Mutagenesis

4.1. Transposon Mutagenesis

One major advantage of virus BACs systems is the availability of well-established genetic tools that allow random and specific modifications of the virus genome in E. coli. One method that allows modification of BACs constructs in a nontargeted fashion is transposon-mediated mutagenesis. Random integration of transposable elements (Tn) into virus BACs can result in the interruption of viral genes and sequence elements. The integration of Tn is mediated by the transposase (tnpA) and resolvase (tnpR) gene products of the transposon system [58, 59]. Transposon vectors were generated which contain all required components for the mutagenesis system. Insertion of various antibiotic resistance cassettes into the Tn sequence allows the selection of BAC clones that harbor a transposon insertion. A temperature-sensitive origin of replication can be utilized for a rapid removal of the transposon vector. To ensure that Tn transposition occurs in the BAC construct and not in the cellular genome, transposons such as Tn1721 have been developed with a strong preference for circular DNA. The optimized transposon system can be used to generate recombinant BAC libraries. The phenotype of mutagenized, BAC-derived viruses can be subsequently analyzed and can be used for genome-wide screens. Such screens are dependent on the available readout systems, but have been performed for viral genes involved in replication, immune evasion, and other processes important for completion of life cycles of viruses [60].

4.2. RecA-Based Mutagenesis

To allow a more detailed characterization of viral genes, targeted sequence modifications are necessary. Due to the large size of the BAC constructs, ligation and transformation procedures are usually very inefficient [61]. To overcome the obstacles, most modifications of BAC constructs are facilitated by homologous recombination techniques in E. coli. The two well-established RecA and Red/RecET recombination systems allow rapid and convenient modifications of BAC constructs [7, 62, 63]. The RecA system utilizes cellular RecA recombinase expressed in bacteria. In order to introduce the desired modification through RecA, long homologous sequences of 500 bp to 3 kbp are required for recombination events [28, 64]. However, as repetitive sequences are present in many viruses such as herpesviruses, RecA expression can lead to rapid destabilization of viral BAC clones. This often results in the loss of portions of the virus genome. For this reason, BACs are usually maintained in RecA-deficient E. coli strains, and the recombinase is only transiently expressed during the mutagenesis procedure [14, 28, 64].

A common RecA-based mutagenesis technique is termed shuttle mutagenesis. The system makes use of vectors that harbor the desired modifications flanked by sequences homologous to the target site in the BAC (Figure 1) [28, 64]. Recombination of homologous sites is facilitated by the RecA that usually is encoded by the shuttle vector [28]. The plasmid regularly harbors a positive and negative selection marker and often a temperature-sensitive origin of replication. The plasmid is then transformed into E. coli containing the desired BAC construct for mutagenesis to proceed, and bacteria harboring the shuttle vector are selected using the positive selection marker. Expression of RecA facilitates recombination of sequences that are identical in the shuttle plasmid and the target site in the BAC construct. Replication of the plasmid can be repressed in a temperature-dependent manner, and enrichment is achieved of bacteria in which the plasmid sequences were integrated into the BAC. In a second recombination event, the backbone of shuttle vector is excised from the BAC construct. This can occur with either homologous flank present in shuttle vector sequences. If the second recombination utilizes the homologous sequence that was not utilized in the first step, then the desired modification is maintained in the BAC vector (Figure 1) [28, 64]. Negative selection markers such as rspL [65], sacB [66], and tetR [67] in the shuttle vector backbone can be used to suppress nonresolved BAC clones.

One of the major advantages of shuttle mutagenesis is that it can be used for the generation of a wide variety of mutations without leaving any unwanted selection markers or other bacterial sequences behind. Deletions or modification of the target sequence in the BAC can be introduced within a matter of days. Standard shuttle vectors for specific target sites can be generated and rapidly modified to be used for introduction of various mutations into a single locus. Furthermore, the system allows the insertion of short and long sequences that can be utilized for the development of vector vaccines. However, the RecA system has also a number of disadvantages. The major problem poses the instability of the BAC clones upon induction of the recombination system, often resulting in the loss of large portions of the BAC. The frequency of unwanted recombination events is increased when negative selection markers are used. Finally, construction of the shuttle plasmids can be very laborious if several loci in a BAC construct are targeted for mutagenesis.

4.3. Recombineering of BAC Constructs

An alternative system for the modification of BACs in E. coli is the well-established Red and RecE/T recombination systems that are derived from bacteriophage λ and the Rac prophage, respectively [7, 63]. Both systems utilize double-strand DNA (dsDNA) ends as substrate for the recombination reaction and consist of three components [7, 63, 68]. The first component is the Gam protein that protects dsDNA ends from degradation in bacteria [69]. A second component is the 5′–3′ exonuclease, alpha or RecE, that generates single-strand 3′ DNA overhangs despite the presence of Gam [7072]. The last component is the single-strand binding protein Beta or RecT. The protein can bind and protect single-strand DNA from degradation [7375]. In addition, Beta/RecT aids in annealing of single-strand DNA to complementary sequences and in invasion into replication forks with a preference for lagging DNA strands [76, 77]. The major advantage of the Red or RecE/T recombination system is that only short homologous sequences of 30 to 50 bp are required for the recombination to proceed. Furthermore, unwanted recombination events or rearrangements occur rarely, as only homologous double-strand ends can be used as a substrate. For the recombination in bacteria, components of the Red or RecE/T recombination system can be delivered in trans by plasmids such as pKD46 that allow inducible expression of Alpha, Beta and Gam. Once the mutagenesis procedure is completed, pKD46 can be cured from bacteria by its temperature-sensitive replication mechanism [7, 78]. A more convenient alternative is the use of bacteria containing a chromosomally encoded λ prophage, such as E. coli strain DY380 and its derivates. In this case Alpha, Beta, and Gam can be induced in a temperature-dependent manner [77, 79].

The Red or RecE/T mutagenesis system usually utilizes PCR products that contain a positive selection marker and sequences homologous to the target site in the BAC at either end as substrate for recombination. The short homologous sequences needed for targeted insertion into the BAC construct can readily be inserted by 5′ overhangs of the primers used for PCR amplification of the selection marker. The PCR products are then electroporated into E. coli that harbor the desired BAC construct. Clones that incorporate the cassette can be selected for the presence of the selection marker. Several mutagenesis techniques have been developed that often combine Red or RecE/T with other recombination systems and strategies that allow the removal of unwanted sequences.

One of these recombination systems that utilize specific recognition sites is the Cre/loxP system of bacteriophage P1. It consists of the Cre recombinase that facilitates recombination between two 34 bp loxP sites [80]. An orthologous system is based on Flp recombinase derived from Saccharomyces cerevisiae that utilizes FRT recognition sites for recombination [81]. The presence of two loxP or FRT sites within a vector results in the excision of sequences flanked by the recognition sites and can be used for the removal of unwanted bacterial sequences or selection markers that were introduced by Red or RecE/T recombination. The Cre/loxP and FLP/FRT system can also be used for insertion of sequences. A recombination event between one loxP or FRT in the BAC construct and the donor sequence allows insertion of the desired sequences [82, 83]. Expression plasmids can be used to transiently express the recombinases; alternatively, a number of E. coli strains, such as EL250 and EL350, that harbor an inducible form of the cre or flp gene in the chromosome are available [7, 79].

Besides recombination at specific recognition sites, other strategies have been developed that allow complete removal of the introduced marker sequences. One approach utilizes a combination of positive and negative selection markers [7]. In a first Red recombination step, the dual selection cassette is inserted into the target site. Resulting clones are selected for the presence of the positive selection marker. In a second step, a PCR product with the desired modification and flanking homologous sequences for the Red recombination results in the replacement of the positive and the negative selection marker. Desired clones can then be enriched by negative selection, but one disadvantage of the system is the low efficiency of counterselection. In addition, mutations in the negative selection marker can result in clones that are resistant to counterselection. Similarly, illegitimate recombination events removing the counterselection marker without the insertion of the sequence modification can result in many false positive clones.

An alternative approach uses two-way selectable markers, such as galactokinase (GalK) [84]. The galK gene can be used as a positive selection marker in E. coli strains such as SW102 that contain the galactose operon but lack cellular galK and, therefore, cannot utilize galactose as a carbon source. In the first recombination step, the galK cassette is introduced into the BAC construct and desired clones are selected on minimal media containing only galactose as an energy source. However, galK can also be used as a negative selection marker as it converts 2-deoxy-galactose (DOG) to a toxic metabolite, 2-deoxy-galactose-1-phosphate, that suppresses bacterial growth [84]. This property is used in the second recombination step as clones can be selected in which galK is replaced with the desired sequence modification. Besides galK, two other two-way selection markers, thyA and tolC, can be used in bacteria lacking the corresponding gene product [85, 86].

There is an additional Red-based recombination method, en passant mutagenesis, that allows removal of an initially introduced selection marker cassette [6, 77]. The method is based on the insertion of a positive selection marker with an adjacent 18 bp I-SceI restriction site and short-sequence duplications. Short homologous sequences at either end or the linear marker cassette allow the insertion into the target site by Red recombination as described above. Next, expression of the homing endonuclease I-SceI is induced that allows linearization of the BAC construct by the cleavage of the I-SceI restriction site E. coli. The resulting dsDNA ends of the BAC serve as a substrate for a second Red recombination of the duplicated sequences, resulting in the complete removal of all foreign sequences including the selection marker [6, 77]. Inducible expression of the I-SceI enzyme can be accomplished by the use of expression plasmids or E. coli strains that harbor the I-SceI cassette chromosomally (e.g., GS1783) [77].

4.4. Deletion of Sequences in BACs

As discussed in Section 4.3, Red or RecE/T recombination systems provide a stable and more efficient alternative for the generation of BAC mutants. For the deletion of sequences from a BAC construct, a positive selection marker is amplified by PCR using primers with 30 to 50 bp extensions that are homologous to the target site in the BAC (Figure 2(a), light blue and orange boxes) [7]. This PCR product is electroporated into recombination-competent bacteria that harbor the BAC clone. The short homologous sequences at the end of the PCR product are used for two individual Red recombination events that result in the replacement of the target sequence with the selection marker. The resistance gene cassette remains in the BAC and cannot be used for the mutagenesis of a second locus in the construct (Figure 2(a)).

Another strategy that allows the removal of the resistance marker utilizes the Cre/loxP and FLP/FRT recombination systems as described in Section 4.3. For this approach, a positive selection marker flanked by loxP or FRT sites is amplified and introduced into the BAC by homologous recombination. Induction of the Cre or FLP recombinase subsequently results in the removal of the marker cassette from the BAC construct, while only a single loxP or FRT site remains at the site of deletion (Figure 2(b)). Due to this residual recognition site, the system can only be applied once in one and the same BAC construct.

The complete removal of all foreign sequences can be accomplished by en passant mutagenesis [6, 77]. As described in Section 4.3, a cassette is generated that contains a positive selection marker with an adjacent I-SceI restriction site. The cassette is amplified with primers that not only contain homologous sequences for the insertion into the target site, but also a sequence duplication (Figure 2(c), pink and dark blue boxes) in the 5′ overhang. This duplication contains the site to be deleted and allows the removal of entire marker cassette in a second Red recombination step [6, 77]. As all foreign sequences are removed from the site of deletion, en passant mutagenesis can be utilized for consecutive modifications of any locus in a BAC construct.

4.5. Insertion of Sequences

Several strategies that allow the insertion of a sequence of interest (soi) including reporter genes, fluorescent tags, or foreign, viral antigens into BAC constructs have been developed (Figure 3). One approach utilizes a transfer construct that contains the soi and a positive selection marker flanked again by two loxP or FRT recognition sites. The construct is inserted into the target site by Red recombination. In a second step, the selection marker can be removed by the induction of the Flp or Cre recombinase, while one recognition site remains in the BAC construct downstream of the soi (Figure 3(a)) [7].

Alternatively, a cassette that allows positive and negative selection or a two-way marker can be inserted at the target side in a first Red recombination event. Subsequently, the soi with sequences homologous to the target sequence at either end of the soi facilitates the replacement of the dual marker cassette with the soi. Clones that have incorporated the soi can be selected for the absence of the negative selection marker (Figures 3(b) and 3(c)) [7, 84].

Another method that allows the insertion of a soi into BAC constructs is en passant mutagenesis [6, 77]. Here, a transfer construct is generated by the insertion of a positive selection marker, an adjacent I-SceI site, and a sequence duplication into a unique restriction site within the soi. Upon insertion of the transfer cassette, expression of I-SceI is induced which results in the linearization of the BAC construct. The sequence duplications allow a second Red recombination resulting in the removal of all operational sequences [6, 77]. Existing transfer constructs can be used for the insertion of soi into any target sequence (Figure 3(d)). The advantage of the en passant techniques described in this section is that they allow the removal of selection marker and can be used to introduce multiple modifications in a BAC construct.

4.6. Sequence Editing

The compact organization of viral genomes is often a challenge for genetic manipulation of virus BACs. Overlapping coding sequences and regulatory elements require a strategy that allows the introduction of minimal sequence modifications that do not interfere with the expression and function of neighboring genes or sequences. In this section, we describe the available techniques that can be used for specific, minimal editing of sequences. The methods allow the manipulation of BACs on the nucleotide level; they include start codon mutation or insertion of a stop codon to abrogate gene expression as well as modification of functional domains of proteins or regulatory elements.

Sequence editing can be achieved by techniques described in Section 4.3. As described above, a dual or two-way selection marker is initially inserted into the target site in the BAC. In addition, a construct that contains the target sequence with the desired sequence modification is generated. In a second Red recombination step, the modified target sequence is used to replace the negative selection marker. Counterselection allows the enrichment of clones that contain the desired mutation (Figures 4(a) and 4(b)) [7, 84]. The advantage of the system is that the same intermediate clones obtained upon the insertion of the selection marker cassette can be used to introduce a variety of modifications into the same locus. However, false positive clones can present a problem for the selection of correct clones as discussed in Section 4.3 [7, 84].

An alternative technique that allows sequence editing of the target site is en passant mutagenesis, in which the selection marker with an adjacent I-SceI is amplified by PCR using primers that contain homologous sequences for the insertion into the target sequence and duplications for the removal of the marker cassette. The desired modifications are included in the center of the duplicated sequences that are present within primers used for PCR amplification of the selectable marker (Figure 4(c), pink and dark blue boxes). In the first Red recombination, the marker cassette including the sequence duplications and modifications are incorporated into the target site. Upon induction of I-SceI expression, the sequence duplications allow the removal of the marker cassette while one duplicated sequence with the desired sequence modifications remains in the final construct (Figure 4(c)). The technique can be used to insert modifications of up to 50 bp in length. Larger modification can be facilitated by the en passant approach described in Section 4.5 and Figure 3(d) [6, 77].

5. Reconstitution of Virus and Mini-F Removal

5.1. Reconstitution of Recombinant Viruses

After mutagenesis of the viral genomes in E. coli, recombinant virus can be reconstituted by transfection of purified virus BAC DNA or in vitro transcribed RNA into susceptible eukaryotic cells. Upon uptake of the virus genome into transfected cells, virus proteins are expressed and virus replication is initiated. In some cases, cotransfection of BAC DNA and expression plasmids encoding transcriptional activators are required to stimulate virus replication [13]. For poxviruses, infection of susceptible cells with helperviruses prior to or after transfection is required to achieve reconstitution of recombinant virus [4043]. As discussed in Section 2.5, expression of full-length RNA virus genomes from transfected BAC DNA can be dependent on the presence of T7 polymerase that is provided to the transfected cell in trans.

5.2. Site-Specific Excision of Mini-F Sequences

In case of DNA viruses, BAC sequences usually remain in the genome upon reconstitution of the virus. This can have negative effects on virus replication as, for example, the additional mini-F sequences might challenge the packaging capacity of the herpesvirus capsid [87]. In addition, residual bacterial sequences are often unfavorable for some applications including the development and licensing of life-attenuated vaccines. For this reason, several strategies have been developed that allow the excision of the mini-F sequences. One frequently used approach utilizes the Cre/loxP or FLP/FRT recombination system (see Section 4.3). For removal of the mini-F sequences, loxP or FRT sites are inserted at either end of the mini-F sequences. Cotransfection of the virus BAC with a Cre or FLP expression plasmid allows the transient production of the recombinase genes. Upon transfection, Cre or FLP facilitates removal of the mini-F sequences, only leaving a scar of a single loxP or FRT site of 34 bp (Figure 5(a)) [8, 14].

5.3. Delivery of Homologous Sequences In Trans

A second method that allows the removal of the mini-F uses the recombination machinery of eukaryotic cells as described in Section 2. For this approach, a repair vector (or linear PCR product) is generated that contains a 1–4 kb fragment representing the original locus that was used for the insertion of the mini-F. The construct is then cotransfected with virus BAC DNA into susceptible cells. In transfected cells, homologous sequences in the repair vector can recombine with sequences up- and downstream of the mini-F resulting in the removal of all vector sequences. Virus plaques that lost the mini-F sequences can be detected by the loss of visual markers encoded in the mini-F such as GFP [88]. Mini-F negative virus is then isolated, amplified, and used for further experiments. The major advantage of this method is that all bacterial sequences are removed, and no loxP or FRT sites remain in the virus genome. However, laborious purification steps are required to obtain clonal virus in which mini-F sequences were removed. This problem can be minimized by the insertion of the mini-F vector in an essential gene thereby generating a growth advantage for viruses that eliminate bacterial sequences (Figure 5(b)) [11, 88].

5.4. Removal of Mini-F via Duplications

Another method that allows complete removal of the mini-F replicon and does not require laborious purification steps utilizes sequence duplications that are inserted into the virus genome. Viral sequences flanking one end of the mini-F backbone are directly duplicated at the other end. An intramolecular recombination event during virus DNA replication allows the removal of all bacterial sequences, thus restoring the original insertion locus in the virus. However, one disadvantage of direct duplications is that they can serve as a substrate for bacterial recombinases, resulting in the instability of the BAC construct in E. coli (Figure 5(c)) [87]. To circumvent this problem, sequence duplications can be inserted in an inverted orientation. For this approach, a 1 to 3 kbp fragment corresponding to the original insertion site of the mini-F vector is inserted in inverse orientation between the mini-F replicon and the antibiotic resistance gene of the backbone. This antiparallel duplication allows stable maintenance and modification of the BAC in E. coli, while the mini-F sequences are completely removed from the virus genome by two intra- or intermolecular recombination events facilitated by recombinases in eukaryotic cells (Figure 5(d)) [13, 43, 57, 89].

6. Conclusions

Since the establishment of the first BAC system in 1997, BAC technology has contributed substantially to our understanding of the life cycle of large DNA and RNA viruses. Several techniques have been developed that facilitate the insertion of mini-F sequences into the virus genome. The methods allowed the generation of BAC systems for a plethora of virus species including members of the Herpesvirales, Poxviridae, Coronaviridae, and Flaviviridae. Well-established mutagenesis techniques described in this paper facilitate a site-specific manipulation of the virus genome in E. coli. Several strategies can be used to introduce any desired modification including deletions of viral or insertions of foreign sequences. Reconstitution of recombinant viruses can be accomplished by transfection of purified BAC DNA into susceptible mammalian cells, while in some cases helperviruses or additional expression vectors are required in this process. Finally, various techniques have been established that allow the excision of the mini-F sequences from the virus genomes without leaving unwanted sequence behind.

Acknowledgments

The authors thank Matthias Sieber and Nikolaus Osterrieder for editing the manuscript. This work was supported by the DFG Grant TI732/1-1 and unrestricted funding from the Freie Universität Berlin.