Comparative Analysis of Context-Dependent Mutagenesis in Humans and Fruit Flies
Table 3
Over- and underrepresentation of genomic frequencies for several words in H. sapiens and D. melanogaster. Data are taken from the supplementary table of a previous study [20] (available at http://mouse.genebee.msu.ru/words/Supple3(contrast_k).xls). The numbers represent the value [(Obs(W) − Exp(W))/Exp(W)] × 100%, where Obs(W) is the observed frequency of word W and Exp(W) is its expected frequency (based on the frequencies of all of its subwords).
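As a rough illustration of how such a percentage could be computed, the sketch below counts overlapping words in a sequence and compares the observed frequency with an expected one. The caption does not specify how [20] derives the expectation from the subwords, so the maximal-order Markov estimator used here is an assumption, as are the function names `word_frequencies`, `expected_frequency`, and `contrast_percent`; the values in the table come from [20], not from this code.

```python
from collections import Counter


def word_frequencies(seq, k):
    """Relative frequency of every overlapping k-letter word in seq."""
    total = len(seq) - k + 1
    counts = Counter(seq[i:i + k] for i in range(total))
    return {word: count / total for word, count in counts.items()}


def expected_frequency(word, seq):
    """Expected frequency of `word` from its longest subwords.

    ASSUMPTION: maximal-order Markov estimate,
        Exp(w1..wk) = f(w1..wk-1) * f(w2..wk) / f(w2..wk-1),
    which reduces to f(w1) * f(w2) for two-letter words.
    The scheme actually used in [20] may differ.
    """
    k = len(word)
    if k < 2:
        raise ValueError("word must have at least two letters")
    shorter = word_frequencies(seq, k - 1)
    left = shorter.get(word[:-1], 0.0)
    right = shorter.get(word[1:], 0.0)
    if k == 2:
        return left * right
    middle = word_frequencies(seq, k - 2).get(word[1:-1], 0.0)
    return left * right / middle if middle else 0.0


def contrast_percent(obs, exp):
    """[(Obs - Exp) / Exp] * 100%, the quantity reported in the table."""
    return (obs - exp) / exp * 100.0


if __name__ == "__main__":
    toy = "ACGTACGT"  # toy sequence, not genomic data
    obs = word_frequencies(toy, 2)["CG"]
    exp = expected_frequency("CG", toy)
    print(f"CG contrast in toy sequence: {contrast_percent(obs, exp):.2f}%")
```

A negative result means the word occurs less often than its subwords predict (as for CG in H. sapiens), and a positive result means it occurs more often.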
Genomic word over- and underrepresentation in
Word    H. sapiens    D. melanogaster
Words containing a mutation context with increased mutation bias in H. sapiens
CG      −76.37%       −5.93%
ATAG    −0.79%        4.38%
ATTG    −7.07%        −2.35%
ACAA    1.62%         3.75%
Words derived from mutation contexts with increased mutation bias in H. sapiens
TG      20.10%        10.67%
ACAG    1.51%         −4.94%
ACTG    −2.07%        −0.46%
CCAA    −6.17%        −1.61%
Words containing mutation contexts with increased mutation bias in D. melanogaster
CCAC    0.19%         1.52%
CACC    1.18%         −4.24%
CCCA    5.63%         0.09%
GCCA    −2.77%        3.63%
ACC     2.28%         −2.39%
CCA     14.82%        9.90%
Words derived from mutation contexts with increased mutation bias in D. melanogaster