Abstract

The γ-radiation-induced intrastrand guanine-thymine cross-link, G[8,5-Me]T, hinders replication in vitro and is mutagenic in mammalian cells. Herein we report in vitro translesion synthesis of G[8,5-Me]T by human and yeast DNA polymerase η (hPol η and yPol η). dAMP misincorporation opposite the cross-linked G by yPol η was preferred over correct incorporation of dCMP, but further extension was 100-fold less efficient for G*:A compared to G*:C. For hPol η, both incorporation and extension were more efficient with the correct nucleotides. To evaluate translesion synthesis in the presence of all four dNTPs, we developed a plasmid-based DNA sequencing assay, which showed that yPol η was more error-prone. Mutational frequencies of yPol η and hPol η were 36% and 14%, respectively. Targeted G→T was the dominant mutation by both DNA polymerases, but yPol η induced targeted G→T at 23% frequency relative to 4% by hPol η. For yPol η, targeted G→T and G→C constituted 83% of the mutations. By contrast, with hPol η, semitargeted mutations (7.2%), that is, mutations at bases near the lesion, occurred at about the same frequency as the targeted mutations (6.9%). The kinds of mutations detected with hPol η showed significant similarities with the mutational spectrum of G[8,5-Me]T in human embryonic kidney cells.

1. Introduction

DNA-DNA interstrand and intrastrand cross-links are strong blocks of DNA replication, and understanding the details of polymerase bypass of these complex lesions is of major interest [1–5]. The double base DNA lesions are formed at substantial frequency by ionizing radiation and by metal-catalyzed H2O2 reactions (reviewed in [6]). A major type of DNA damage, in anoxic conditions, is an intrastrand cross-linked species in which C8 of Gua is linked to the 5-methyl group of an adjacent thymine, but the G[8,5-Me]T cross-link is formed at a much higher rate than the T[5-Me,8]G cross-link (Figure 1) [7]. Additional thymine-purine cross-links have been isolated from γ-irradiated DNA in oxygen-free aqueous solution [8]. Wang and coworkers identified structurally similar guanine-cytosine and guanine-5-methylcytosine cross-links in DNA exposed to γ- or X-rays [9–11]. The G[8,5-Me]T cross-link is formed in a dose-dependent manner in human cells when exposed to γ-rays [12], and the G[ ]C cross-link is formed at a slightly lower level [13].

These intrastrand cross-links destabilize the DNA double helix [14], and UvrABC, the excision nuclease proteins from Escherichia coli, can excise them [15, 16]. Using purified DNA polymerases, it was shown that G[8,5-Me]T and G[ ]C are strong blocks of replication in vitro [12, 17]. For G[8,5-Me]T, primer extension is terminated after incorporation of dAMP opposite the 3′-T by exo-free Klenow fragment and Pol IV (dinB) of Escherichia coli, whereas Taq polymerase is completely blocked at the nucleotide preceding the cross-link [17]. However, yeast polymerase η (yPol η), a member of the Y-family of DNA polymerases from Saccharomyces cerevisiae, can bypass both G[8,5-Me]T and G[ ]C cross-links with reduced efficiency [12, 18]. For both lesions, nucleotide incorporation opposite the 3′-base of the cross-link is accurate, but yPol η favors incorporation of dAMP and dGMP opposite the cross-linked G over that of the correct nucleotide, dCMP [12, 18].

We have recently compared translesion synthesis of G[8,5-Me]T with T[5-Me,8]G in simian and human embryonic kidney cells and found that both cross-links are strongly mutagenic and that the two lesions show an interesting pattern of mutations, which included a high frequency of semitargeted mutations that occurred a few bases 5′ or 3′ to the cross-link [19]. One can anticipate a role of one or more Y-family DNA polymerases in bypassing these replication-blocking lesions, and we noted that purified human DNA polymerase η (hPol η) incorporates dCMP preferentially opposite the G of the G[8,5-Me]T cross-link, in contrast to yPol η, which incorporates dAMP and dGMP much more readily [12, 19]. However, the previous preliminary studies did not examine the kinetics of polymerase extension beyond the lesion site; nor were the full-length extension products analyzed. The kinetics of nucleotide incorporation are influenced by DNA damage not only at the lesion site but also at least up to 3 bases away from the lesion [20]. Therefore, the incorporation pattern opposite the lesion provides only part of the information on lesion bypass. In the current paper, we have evaluated translesion synthesis of the G[8,5-Me]T cross-link by these two DNA polymerases more critically by determining single nucleotide incorporation kinetics and characterizing the full-length extension products in the presence of all four dNTPs. We report herein that G[8,5-Me]T bypass by yPol η is much more error-prone than that by hPol η. We also show that the mutational signatures of these two polymerases are different.

2. Materials and Methods

2.1. Materials

[γ-32P]ATP was supplied by Du Pont New England Nuclear (Boston, MA). Recombinant human and yeast DNA polymerases were purchased from Enzymax, LLC. (Lexington, KY). EcoR V restriction endonuclease, T4 DNA ligase, and T4 polynucleotide kinase were obtained from New England Biolabs (Beverly, MA). E. coli DL7 (AB1157, , ) was from J. Essigmann (MIT, Cambridge, MA). The pMS2 phagemid was a gift from Masaaki Moriya (SUNY, Stony Brook, NY).

2.2. Methods
2.2.1. Synthesis and Characterization of Oligonucleotides

The lesion-containing oligonucleotides were synthesized and characterized as reported in [15]. Unmodified oligonucleotides were analyzed by MALDI-TOF MS, which gave a molecular ion with a mass within 0.005% of theoretical, whereas adducted oligonucleotides were analyzed by ESI-MS in addition to digestion followed by HPLC analysis.

2.2.2. Construction of 26-mer and 36-mer Containing G[8,5-Me]T Cross-Link

The 26-mer G[8,5-Me]T template, 5′-GTGCG TGTTTGTATCGCTTGCAGGGG-3′, was constructed by ligating a 5′-phosphorylated 14-mer, 5′-ATCGCTTGCAGGGG-3′ ( 7.5 nmol), to the G[8,5-Me]T cross-linked 12-mer, 5′-GTGCG TGTTTGT-3′ ( 5 nmol), in the presence of an 18-nucleotide complementary oligonucleotide, 5′-GCAAGCGATACAAACACG-3′ ( 7.5 nmol), as described [19, 21]. Similarly, a 12-mer, 5′-CCUGGAAGCGAU-3′ ( 7.5 nmol), a 5′-phosphorylated G[8,5-Me]T 12-mer ( 5 nmol), and a 5′-phosphorylated 12-mer, 5′-AUCGCUGCUACC-3′ ( 7.5 nmol), were annealed to a complementary 26-mer, 5′-GCAGCGATACAAACACGCACATCGCT-3′ ( 7.5 nmol), and ligated in the presence of T4 DNA ligase to prepare a G[8,5-Me]T cross-linked 36-mer, 5′-CCUGGAAGCGAUGTGCG TGTTTGTAUCGCUGCUACC-3′. The oligonucleotides were separated by electrophoresis on a 16% polyacrylamide-8 M urea gel. The ligated product bands were visualized by UV shadowing and excised. The 26-mers and the 36-mers were desalted on a Sephadex G-25 (Sigma) column and stored at C until further use.
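The design of the splint-directed ligation can be checked with a short script. The following is a minimal sketch that uses only the sequences quoted above; the helper name `reverse_complement` and the variable names are illustrative, and the cross-link itself is not modeled (the two cross-linked bases are treated as ordinary G and T).

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


# Sequences as given in the text (cross-link between G5 and T6 not modeled).
lesion_12mer = "GTGCGTGTTTGT"
donor_14mer = "ATCGCTTGCAGGGG"        # 5'-phosphorylated 14-mer
splint_18mer = "GCAAGCGATACAAACACG"   # complementary 18-mer

# Ligation joins the 3' end of the 12-mer to the 5' end of the 14-mer.
template_26mer = lesion_12mer + donor_14mer

# Locate the region of the 26-mer that the splint base-pairs with.
footprint = reverse_complement(splint_18mer)
start = template_26mer.find(footprint)   # 0-based index, -1 if absent
junction = len(lesion_12mer)             # index of the first 14-mer base

# The splint must cover bases on both sides of the ligation junction.
spans_junction = 0 <= start < junction < start + len(footprint)
paired_on_12mer = junction - start
paired_on_14mer = start + len(footprint) - junction

print(template_26mer, start, spans_junction, paired_on_12mer, paired_on_14mer)
```

With these sequences the splint pairs with nine nucleotides on each side of the junction, which is the symmetric arrangement one would expect for a ligation scaffold.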

2.2.3. In Vitro Nucleotide Incorporation and Chain Extension

To determine the nucleotide preferentially incorporated opposite the G[8,5-Me]T cross-link, steady-state kinetic analyses were performed by the method of Goodman and coworkers [22, 23]. The primed template was obtained by annealing a 5-fold molar excess of the modified or control 26-mer template ( 20 ng) to a complementary 5′-32P-labeled primer. Primer extension under standing start conditions was carried out with hPol η or yPol η (6.4 nM) with individual dNTPs or a mixture of all four dNTPs in 25 mM Tris-HCl buffer (pH 7.5), 5 mM MgCl2, and 5 mM dithiothreitol at C for various times. The reactions were terminated by adding an equal volume of 95% (v/v) formamide, 20 mM EDTA, 0.02% (w/v) xylene cyanol, and 0.02% (w/v) bromophenol blue and heating at C for 2 min, and the products were resolved on a 20% polyacrylamide gel containing 8 M urea. The DNA bands were visualized and quantitated using a Phosphorimager. The dNTP concentration and time of incubation were optimized to ensure that primer extension was less than 20%. The kcat and Km were extrapolated from the Michaelis-Menten plot of the kinetic data.
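As an illustration of how kcat and Km can be obtained from such data, the sketch below fits the Michaelis-Menten equation by the Hanes-Woolf linearization ([S]/v plotted against [S]). This is an assumption made for the example, not the fitting procedure used in this work, which is not specified beyond the Michaelis-Menten plot. The concentrations and velocities are synthetic values generated from hypothetical parameters; only the 6.4 nM enzyme concentration is taken from the text.

```python
def fit_michaelis_menten(conc, rate):
    """Estimate (Vmax, Km) by least squares on [S]/v = [S]/Vmax + Km/Vmax."""
    x = list(conc)
    y = [s / v for s, v in zip(conc, rate)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx                    # equals 1 / Vmax
    intercept = mean_y - slope * mean_x  # equals Km / Vmax
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km


# Synthetic, noise-free data from hypothetical parameters (not measured values).
true_vmax_nM_per_min = 2.0
true_km_uM = 50.0
dntp_uM = [5, 10, 25, 50, 100, 250, 500]
velocity = [true_vmax_nM_per_min * s / (true_km_uM + s) for s in dntp_uM]

vmax, km = fit_michaelis_menten(dntp_uM, velocity)

enzyme_nM = 6.4                  # polymerase concentration stated in the text
kcat_per_min = vmax / enzyme_nM  # kcat = Vmax / [E]
efficiency = kcat_per_min / km   # catalytic efficiency, kcat / Km

print(round(vmax, 3), round(km, 3), round(kcat_per_min, 4))
```

With real gel data, a nonlinear fit of the untransformed velocities is usually preferred because linearizations weight the errors unevenly; the linear form is used here only to keep the example free of external dependencies.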

2.2.4. Analysis of the Full-Length Bypass Products Using pMS2 Vector

The ss pMS2 shuttle vector DNA (58 pmol, 100 μg) was digested with an excess of EcoR V (300 pmol, 4.84 μg) for 1 h at C followed by room temperature overnight. A 36-mer scaffold oligonucleotide containing the G[8,5-Me]T cross-link (or a control) was annealed overnight at C to form the gapped DNA. The gapped plasmid was incubated with hPol η or yPol η and a mixture of all four dNTPs (25 mM each) in 25 mM Tris-HCl buffer (pH 7.5), 5 mM MgCl2, and 5 mM dithiothreitol at C for various times. DNA ligase (200 units) was added, and the pMS2 mixture, still containing the DNA polymerase and dNTPs, was ligated overnight at C. The scaffold oligonucleotide was digested by treatment with uracil glycosylase and exonuclease III, the proteins were extracted with phenol/chloroform, and the DNA was precipitated with ethanol. The final construct was dissolved in deionized water and used to transform E. coli DL7 cells. The transformants were randomly picked and analyzed by DNA sequencing.

3. Results

3.1. In Vitro Bypass by DNA Polymerase

A 26-mer template, 5′-GTGCG TGTTTGTATCGCTTGCAGGGG-3′, which contained the G[8,5-Me]T cross-link (G T) at the 5th and 6th bases from the 5′ end, was constructed. The DNA sequence of the first 12 nucleotides in this template was taken from codons 272–275 of the p53 gene, in which the G[8,5-Me]T cross-link was incorporated at the second and third nucleotides of codon 273, a well-known mutational hotspot for human cancer [24]. We used both running start and standing start conditions to evaluate bypass of the cross-link. Template-primer complex (50 nM) was incubated with increasing concentrations of hPol η and yPol η at C for 30 min in the presence of all four dNTPs (100 μM). For the running start experiments, a 5′-32P-radiolabeled 14-mer primer, 5′-CTGCAAGCGATACA-3′, was annealed to the template so that it was 3 bases 3′ to the cross-link. As shown in Figure 2, G[8,5-Me]T was a strong block of both DNA polymerases. With 5 nM hPol η, 80% of the control template extended to 22-mer and 23-mer (full-length) products, whereas for G[8,5-Me]T less than 1% extended to the full-length product, and a major block was at the cross-linked G (19-mer). With 20 nM hPol η, nearly 75% was blocked after incorporating a base opposite the cross-linked G (19-mer), and the full-length product increased only to 10%. The full-length product increased to 18% with 50 nM hPol η. In a similar experiment using yPol η, unlike the human enzyme, the major blocks were at the 19-mer and 20-mer (i.e., opposite the cross-linked G and its neighbor). With 50 nM yPol η, 8% of the primer extended to the full-length 23-mer product.
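The correspondence between product length and template position in the running start experiment can be worked out from the two sequences above. The sketch below is an illustrative calculation, not part of the experimental procedure; the function names are hypothetical, and the cross-linked bases are treated as ordinary G and T at positions 5 and 6.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq))


template = "GTGCGTGTTTGTATCGCTTGCAGGGG"  # 26-mer, cross-link at positions 5 and 6
primer = "CTGCAAGCGATACA"                # 14-mer running start primer

# 0-based index of the template base paired with the primer's 3'-terminal base.
primer_3prime_site = template.find(reverse_complement(primer))


def last_copied_base(product_length: int):
    """Return (1-based template position, base) copied last for a given product."""
    added = product_length - len(primer)
    if added < 1:
        raise ValueError("product must be longer than the primer")
    index = primer_3prime_site - added   # synthesis moves toward the template 5' end
    if index < 0:
        raise ValueError("product is longer than the template allows")
    return index + 1, template[index]


for length in (18, 19, 20, 23):
    print(length, last_copied_base(length))
```

Under these assumptions the 18-mer ends opposite the cross-linked T (position 6), the 19-mer opposite the cross-linked G (position 5), the 20-mer opposite the neighboring C (position 4), and the 23-mer at the 5′-terminal base of the template, in agreement with the band assignments given in the text.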

With concentrations of hPol η and yPol η at 50 nM, a substantial fraction (18% and 8%, resp.) of the primer extended to full-length products in 30 min. We therefore chose to use 50 nM Pol η for the subsequent experiments. As shown in Figure 3, in the presence of all four dNTPs, extension of a 14-mer primer on the control template rapidly generated a full-length extension product (a 23-mer) as well as a blunt-end addition product (a 24-mer) in 5 min with 50 nM hPol η, whereas on the cross-linked template the extension of the primer stalled after adding a base opposite the cross-linked T and G, generating a 19-mer. It is interesting that hPol η did not stall before either of the cross-linked bases; it was unable to continue synthesis only after incorporating a dNMP opposite the cross-linked G. Longer incubation allowed further extension, including a small fraction of full-length product, but even after 2 h the 19-mer band was the most pronounced extension product. The result was qualitatively similar with yPol η, except that the extent of full-length product was only marginally increased with time and it stalled both after incorporation of a nucleotide opposite the cross-linked G (19-mer) and after incorporation of a nucleotide opposite its 5′-neighbor (20-mer). Standing start experiments were carried out, and the amount of extension of the primer by one nucleotide was plotted against increasing dNTP concentration to determine the initial velocity of the polymerase-catalyzed reaction, which is shown in Figure 4. From these plots, the steady-state kinetic parameters, kcat and Km, for nucleotide incorporation opposite the cross-linked G and the same for the control were determined (Tables 1 and 2). For hPol η, catalytic efficiency (kcat/Km) of dCMP incorporation was decreased 17-fold opposite the cross-linked G, whereas extension to the next base was decreased 5-fold relative to control. 
By contrast, for yPol η dCMP incorporation was decreased 1,000-fold, and extension to the next base was decreased 12-fold relative to control. This suggests that yPol η had more difficulty in bypassing G[8,5-Me]T than hPol η. As was reported before [12, 19], in contrast to hPol η, which incorporates the correct nucleotide preferentially opposite G[8,5-Me]T, yPol η was much more error-prone, and insertion of dAMP opposite the cross-linked G was favored over that of the correct nucleotide, dCMP (Tables 1 and 2). In fact, dAMP misincorporation opposite the cross-linked G was more than 20 times more efficient than dCMP incorporation by yPol η. However, with yPol η the extension was 100-fold slower for the G*:A pair compared to the G*:C pair, whereas the same for hPol η was about 13-fold slower. In each case, the higher catalytic efficiency was due to a much smaller Km. When nucleotide incorporation fidelity opposite the cross-linked G and its 5′ base was considered, dCMP incorporation over dAMP misincorporation was 200-fold more efficient for hPol η, whereas the same was only 5-fold more efficient for yPol η. Nevertheless, it seems that dCMP was preferred opposite the cross-linked G for bypass of G[8,5-Me]T by both DNA polymerases, although the ability of yPol η to discriminate against the wrong nucleotide was not high.

3.2. Analysis of the Full-Length Bypass Products

Although steady-state kinetics provides useful information on the ability to incorporate a nucleotide opposite a lesion and on further extension, it is important to determine the sequences of full-length bypass products in the presence of all four dNTPs. In mammalian cells, replication of G[8,5-Me]T-containing DNA also generates a significant level of semitargeted mutations [19], and it would be of interest to determine if pol η causes errors not only opposite the cross-link but also near the lesion. Guengerich and colleagues have developed an elegant LC-ESI/MS/MS-based method to analyze the polymerase extension products [25–30]. In the current paper, we report a plasmid-based approach to accomplish the same goal. The principle of this approach is shown in Scheme 1. The pMS2 plasmid was linearized by digestion with EcoR V. A scaffold 36-mer, containing two 12-nucleotide regions complementary to the two ends of the digested plasmid, was annealed to generate a gapped circular DNA, in which the G[8,5-Me]T cross-link was located in the middle of a 12-nucleotide gap. The scaffold G[8,5-Me]T-36-mer contained the same local DNA sequence near the G[8,5-Me]T cross-link as the 26-mer used in the steady-state kinetic assay. It also contained several uracils replacing thymines at the two ends where it annealed with the plasmid. The circular scaffold plasmid DNA was incubated with 50 nM hPol η or yPol η and a mixture of all four dNTPs (25 mM each) in 25 mM Tris-HCl buffer (pH 7.5), 5 mM MgCl2, and 5 mM dithiothreitol at C for various times. We expected a large fraction of the control construct to extend to full-length circular product and a much smaller fraction of the cross-linked construct to do the same. The full-length extension product extended up to the end of the circular DNA, and the nick between the two ends was sealed by ligation overnight at C in the presence of an excess of T4 DNA ligase to generate covalently closed circular ss plasmid. 
Although the DNA polymerase was not inactivated, both hPol η and yPol η were inefficient in continuing further extension at C (data not shown). The scaffold 36-mer was digested by treatment with uracil DNA glycosylase and exonuclease III. The removal of the lesion-containing scaffold was considered critical to avoid any potential in vivo replication of the lesion. Therefore, we analyzed the products by agarose gel electrophoresis after uracil DNA glycosylase followed by exonuclease III treatment and confirmed that the plasmid was quantitatively linearized when either Pol η or DNA ligase was absent (data not shown). The proteins were extracted with phenol and chloroform, and the DNA was precipitated with ethanol. The DNA was used to transform repair-competent E. coli DL7, and the transformants were analyzed by DNA sequencing.

Scheme 1

The number of colonies recovered upon transformation in E. coli of the plasmid incubated with hPol η for different times is shown in Figure 5. Since linear ss DNA is inefficient in transfecting E. coli, no colonies were recovered at the zero time point from either the control or the G[8,5-Me]T scaffold, whereas increasing numbers of colonies were recovered as incubation times with the DNA polymerase were increased. The number of colonies reflected the extent of full-length product that was ligated, and relative to the control 36-mer scaffold, the G[8,5-Me]T scaffold generated only 9% progeny at 15 min, which increased to 18% at 30 min and to 27% after 2 h (Figure 5). (For this calculation, the number of colonies obtained from the 120 min extension of the control 36-mer was considered 100%.) This suggests that with increased time of incubation, more DNA polymerase can bypass the G[8,5-Me]T cross-link, as we have also noted in the primer extension experiment with the G[8,5-Me]T 26-mer.

DNA sequencing results of the 2 h incubation products from two independent experiments with hPol η and yPol η are shown in Figure 6. The types and numbers of mutants from two different experiments are shown in Figure 6(a), whereas Figure 6(b) shows the combined result in a bar graph. As noted in the kinetic studies, yPol η was found to be more error-prone than hPol η. Mutational frequencies of yPol η and hPol η were 36% and 14%, respectively, for the G[8,5-Me]T cross-link, whereas no mutants were recovered from the control after sequencing in excess of one hundred colonies following extension with each DNA polymerase. The pattern of mutagenesis from the G[8,5-Me]T cross-link was significantly different for these two polymerases. yPol η induced targeted G→T as the major mutagenic event, followed by targeted G→C; these two base substitutions, taken together, constituted 83% of the mutations. By contrast, in the case of hPol η, semitargeted mutations (7.2%) occurred at about the same frequency as the targeted mutations (6.9%). With hPol η, though the most frequent mutation was G→T (4%), approximately half as many (2.2%) was also detected. It is interesting that even a single targeted could not be detected in the extension by yPol η. Similarly, targeted was completely absent with hPol η. For the cross-linked T, yPol η bypass was completely error-free, whereas a low (0.6%) level of transversions was detected with hPol η. With yPol η, semitargeted mutations were restricted to the immediate 5′-C and 3′-G of the cross-link, but with hPol η, errors were noted as far as two bases and five bases to the cross-link. In sum, despite the similarity of targeted transversions, the mutational profiles of the two Y-family DNA polymerases exhibited distinct patterns.
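The distinction between targeted and semitargeted mutations used above can be expressed as a simple classification of each sequenced clone against the 12-nucleotide gap sequence. The following is a minimal sketch under stated assumptions: clone sequences are reported in the sense of the lesion-containing strand, only base substitutions are handled, and the example clones are hypothetical and are not data from this study.

```python
REFERENCE = "GTGCGTGTTTGT"   # gap region; cross-link at positions 5 (G) and 6 (T)
LESION_POSITIONS = {5, 6}    # 1-based positions of the cross-linked bases


def classify_substitutions(progeny: str):
    """List (position, change, class) for each base substitution in one clone."""
    if len(progeny) != len(REFERENCE):
        raise ValueError("insertions and deletions are not handled in this sketch")
    calls = []
    for position, (ref, obs) in enumerate(zip(REFERENCE, progeny), start=1):
        if ref != obs:
            kind = "targeted" if position in LESION_POSITIONS else "semitargeted"
            calls.append((position, f"{ref}->{obs}", kind))
    return calls


def mutation_frequency(clones):
    """Fraction of clones carrying at least one substitution."""
    mutants = sum(1 for clone in clones if classify_substitutions(clone))
    return mutants / len(clones)


# Hypothetical clones for illustration only.
example_clones = [
    "GTGCGTGTTTGT",  # no mutation
    "GTGCTTGTTTGT",  # substitution at the cross-linked G
    "GTGAGTGTTTGT",  # substitution at the base 5' to the cross-link
    "GTGCGTGTTTGT",  # no mutation
]

for clone in example_clones:
    print(clone, classify_substitutions(clone))
print(mutation_frequency(example_clones))
```

A mutation frequency computed this way counts clones rather than individual substitutions, so a clone with both a targeted and a semitargeted change would be counted once; whether the frequencies reported here follow that convention is not stated in the text.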

4. Discussion

In earlier studies it was shown that hPol η preferentially incorporates the correct nucleotide opposite each of the cross-linked bases, whereas yPol η, though it accurately incorporates dAMP opposite the cross-linked T, is highly error-prone in nucleotide incorporation opposite the cross-linked G [12, 19]. However, neither the kinetics of further extension of the primer nor the sequences of the full-length extension products were determined. Miller and Grollman [20] have shown that DNA polymerase functions can be affected by replication-blocking lesions remote from the lesion site. In the current investigation, using steady-state kinetics, we determined that though dAMP incorporation opposite the G of G[8,5-Me]T by yPol η was more than 20-fold preferred over dCMP incorporation, further extension of the G*:A pair was 100-fold less efficient than extension of the G*:C pair. As a result, dCMP incorporation followed by further extension was 5 times as efficient as dAMP incorporation followed by extension by yPol η. For hPol η, on the other hand, it was nearly 200 times as efficient.
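The 5-fold figure for yPol η follows from combining the two ratios quoted in this paragraph. The short calculation below makes that step explicit, on the assumption that the overall efficiency of a bypass route is the product of the insertion and extension efficiencies; the function name is illustrative, and the 20-fold value is a lower bound ("more than 20-fold") in the text.

```python
def net_preference_for_correct_route(misinsertion_preference: float,
                                     correct_extension_advantage: float) -> float:
    """Ratio of (insert dCMP, then extend) to (insert dAMP, then extend).

    misinsertion_preference: how many fold dAMP insertion is favored over dCMP.
    correct_extension_advantage: how many fold extension from G*:C is favored
    over extension from G*:A.
    """
    return correct_extension_advantage / misinsertion_preference


# Values quoted in the text for yPol eta.
ypol_eta = net_preference_for_correct_route(
    misinsertion_preference=20.0,
    correct_extension_advantage=100.0,
)
print(ypol_eta)
```

Because the insertion preference is stated only as exceeding 20-fold, the result should be read as an approximate upper estimate of the net advantage of the dCMP route.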

In order to characterize the full-length extension products in the presence of all four dNTPs, we developed a novel method to sequence them. In this approach, as shown in Scheme 1, a single-stranded plasmid (e.g., pMS2) containing a restriction endonuclease site in a hairpin region is digested and linearized by the enzyme. A DNA adduct-containing scaffold is annealed to the linear DNA to create a gapped plasmid, in which the lesion is situated in the middle of the gap. A DNA polymerase is allowed to extend the 3′-end of the plasmid to fill in the gap, which is then enzymatically ligated to create a closed-circular plasmid or viral genome. The ss circular DNA is replicated in E. coli, and the progeny is subjected to DNA sequencing. The scaffold is quantitatively removed prior to transformation in E. coli to avoid biological processing of the lesion in vivo. The DNA sequence of the area that originally contained the gap reveals the nature of the extension products. It is worth mentioning that other plasmid-based sequencing techniques using PCR amplification have been developed and successfully used in recent years [31, 32]. However, we believe that the hallmark of our current approach is its simplicity. It neither requires expensive instrumentation nor is technically demanding. While the sensitivity of the mass spectral analysis is limited by the signal-to-noise ratio, which varies from experiment to experiment, the plasmid-based sequencing approach enables determination of misincorporations occurring at a frequency of less than 1%. However, the sequence determination is dependent on the efficiency of ligation, which is only proficient with full-length extension products. As a result, a limitation of the current plasmid-based approach is that it does not offer any information on the incomplete extension products, which may be readily available by the MS approach. 
Using this method of sequencing, we showed that yPol η was much more error-prone in bypassing G[8,5-Me]T than hPol η. The targeted G→T was the major type of mutation by both DNA polymerases, but yPol η induced it nearly 6 times more frequently than hPol η. With hPol η, semitargeted mutations, that is, mutations near the lesion, occurred at approximately the same frequency as the targeted mutations, whereas more than 80% of the mutations were targeted mutations with yPol η.

Several studies have established differences between the yeast and the human enzyme. For translesion synthesis of γ-hydroxypropanodeoxyguanosine, yPol η synthesizes past the adduct relatively accurately, whereas hPol η discriminates poorly between incorporation of correct and wrong nucleotides opposite the adduct [33]. The mechanistic basis of catalysis by these two enzymes has been examined, which showed that they differ in several important respects [34]. hPol η has a 50-fold-faster rate of nucleotide incorporation than yPol η but binds the nucleotide with an approximately 50-fold-lower level of affinity. It is unclear how these differences influence the nucleotide incorporation opposite the G[8,5-Me]T cross-link.

When the hPol η mutational spectrum was compared with the mutations detected in human embryonic kidney cells [19], significant similarities in the two results were apparent. Notably, the high frequency of followed by and the semitargeted mutations to the cross-link such as - and - reflect a similar pattern in the in vitro studies using purified hPol η and the cellular studies. These similarities notwithstanding, certain variations in the mutation profiles are also noteworthy. Targeted and substitutions at adjacent -G and the thymines noted in the mammalian cells were absent in the hPol η extensions. It was suggested that in a cell the binding to proliferating cell nuclear antigen (PCNA) via its PCNA-interacting protein domain is a prerequisite for hPol η's ability to function in translesion synthesis in human cells [35]. Therefore, certain differences between bypass of a DNA lesion by purified hPol η in vitro and that in a cell should be anticipated. Although there is insufficient evidence to conclude that hPol η is responsible for the observed mutations of G[8,5-Me]T in human cells, it seems reasonable to postulate that this Y-family DNA polymerase is one of the DNA polymerases involved in the cellular bypass of this cross-link.

Abbreviations

G[8,5-Me]T or G T: Guanine-thymine intrastrand cross-link where C8 of guanine is covalently bonded to the methyl carbon of the 3′-thymine
T[5-Me,8]G or T G: Thymine-guanine intrastrand cross-link where C8 of guanine is covalently bonded to the methyl carbon of the 5′-thymine
hPol η and yPol η: Human and yeast DNA polymerase η, respectively.

Acknowledgment

This work was supported by the National Institute of Environmental Health Sciences, NIH (Grant ES013324).