Review Article

Critical Analysis of Strand-Biased Somatic Mutation Signatures in TP53 versus Ig Genes, in Genome-Wide Data and the Etiology of Cancer

Table 2

Somatic point mutation patterns in the TP53 coding region in “All Breast Cancers,” “All Bladder Cancers,” and “All Lung Cancers.”
(a) All Breast Cancers

OriginalMutant base
baseATCGTotal Statistics

A (22%)2.61.911.015.5A T1.3x
T (21%)3.14.83.911.8A>G versus T>C2.3x
C (29%)2.820.43.626.8G C1.7x
G (28%)31.09.35.545.9G>A versus C>T1.5x
G>T versus C>A3.3x
G>C versus C>G1.5x

(b) All Bladder Cancers

Original baseMutant base
ATCGTotal Statistics

A (22%)3.11.49.614.1A T2.5x
T (21%)1.81.72.25.7A>G versus T>C5.8x
C (29%)2.918.95.227.0G C2.0x
G (28%)34.49.59.353.3G>A versus C>T1.8x
G>T versus C>A3.3x
G>C versus C>G1.8x

(c) All Lung Cancers

Original baseMutant base
ATCGTotal Statistics

A (22%)5.41.610.317.3A T2.5x
T (21%)2.12.32.56.9A>G versus T>C4.5x
C (29%)3.115.55.023.9G C2.2x
G (28%)14.930.47.052.2G>A versus C>T 1.0xNS
G>T versus C>A12.1x
G>C versus C>G1.4x

Values in (a) represent the percentage of the total of 2279 somatic point mutations scored in category “All Breast Cancers” (R15); values in (b) represent the percentage of the total of 1215 somatic point mutations scored in category “All Bladder Cancers” (R15); values in (c) represent the percentage of the total of 2471 somatic point mutations scored in category “All Lung Cancers” (R15). The percentage base composition in the TP53 coding region for codons 130–300 inclusive—the region which contains the vast majority of mutations spanning the DNA binding region. The Chi-Squared statistics (significance levels) are essentially unaltered if mutation frequencies are corrected for base composition. The breakdown of G-to-A and C-to-T mutations at CpG and non-CpG sites is as follows. In (a) for C-to-T mutations 199 occur at CpG sites and 267 occur at non-CpG sites. For G-to-A 320 occur CpG and 387 occur at non-CpG sites. In (b) for C-to-T 104 occur at CpG sites and 126 occur at non-CpG sites. For G-to-A 153 occur at CpG sites and 265 occur at non-CpG sites. In (c) for C-to-T 209 occur at CpG and 173 occur at non-CpG sites. For G-to-A 214 occur at CpG and 154 occur at non-CpG sites.