Review Article

Position Weight Matrix, Gibbs Sampler, and the Associated Significance Tests in Motif Characterization and Prediction

Table 1

Site-specific frequencies and position weight matrix (PWM) for 246 donor splice sites (each represented by 5 sites on the exon side and 12 sites on the intron side). The test is performed for each site against the expected background frequencies with , , , and . Sites that have been experimentally verified to be important are in bold.

SiteACGUχ 2PACGU

18330498410.100.01770.0525−0.6332−0.02600.3143
210344465310.040.01820.3613−0.0878−0.1162−0.3434
312136385130.010.00000.5920−0.3739−0.3886−0.3981
412238335332.160.00000.6038−0.2969−0.5893−0.3434
58140814428.330.00000.0177−0.22380.6933−0.6081
6012450948.340.0000−6.6464−5.0056 2.2841 −6.6469
7090237582.230.0000−6.6464−2.3190−6.64801.8032
8239124462.460.0000 1.5693 −5.0056−4.3320−3.8633
916241205387.810.0000−2.2655−0.9496−5.0680 1.5946
10202431928.960.0000−4.8476−6.6483 2.2723 −5.3416
11972228521.060.0000−3.0427−2.6612−4.3320 1.7475
1287153411053.660.00000.1198−1.6111−0.54680.7006
138449308311.710.00850.06960.0659−0.72460.2971
1411139336319.090.00030.4684−0.2599−0.5893−0.0969
1510638317117.240.00060.4024−0.2969−0.67810.0738
169230408413.690.00340.1997−0.6332−0.31550.3143
178038369214.320.0025−0.0001−0.2969−0.46550.4445