Research Article

Factors Affecting Splicing Strength of Yeast Genes

Table 3

Site-specific frequencies and position weight matrix (PWM) for 275 5′ ss. The consensus sequence (UAAAG GUAUGUU UAAUU) can be obtained from those large site-specific PWM entries, with the most important sites in bold italics . The χ 2 test is performed for each site against the background frequencies (A = 0.3279, C = 0.1915, G = 0.2043, and U = 0.2763). The nucleotide sites are labeled with the five exon nucleotides as −5 to −1 and the 12 intron nucleotides as 1 to 12. The PWM is nearly identical when the introns in 5′ UTR were excluded.

SiteACGUχ 2 𝑃 ACGU

−59432579211.7980.00810880.0641−0.71170.02450.2792
−411947486114.1170.00275050.4032−0.1599−0.2225−0.3115
−313938435539.6720.00000010.6268−0.4651−0.3805−0.4601
−213840366138.8990.00000010.6164−0.3915−0.6355−0.3115
−19145885127.2700.00000520.0174−0.22230.6492−0.5685
10127401060.4260.0000004−8.1042−5.46752.2855−8.1044
2090266658.0960.0000003−8.1042−2.5200−8.10481.8081
3268124522.7540.00000031.5723−5.4675−4.6732−4.1523
417291228428.6070.0000002−2.3805−0.8528−5.54541.5859
52027211041.0470.0000004−5.2765−8.10492.2750−5.8967
61082255583.5450.0000003−3.1271−2.6862−4.67321.7472
797183912155.5700.00000010.1092−1.5351−0.52060.6734
89554359111.3630.00991800.07930.0397−0.67590.2635
912345347322.1720.00006010.4508−0.2223−0.7175−0.0534
1011841387817.3340.00060340.3911−0.3560−0.55790.0418
1110533439417.3670.00059400.2232−0.6676−0.38050.3101
129044429912.1090.00701800.0015−0.2546−0.41420.3847