Research Article

ASPic-GeneID: A Lightweight Pipeline for Gene Prediction and Alternative Isoforms Detection

Table 1

Accuracy of gene finding programs on the complete C. elegans genome.

Evaluation at gene level
ProgramSNgSPgSSgWGMGSNeSPeSSeWEMESNiSPiSSiWIMISNnSPnSSn

GeneID0.970.830.900.190.030.670.690.680.160.150.700.740.720.260.300.870.880.87
ASPic0.340.960.650.030.660.290.910.600.020.690.300.970.630.030.700.310.980.64
ASPic-GeneID0.990.930.960.090.010.850.810.830.080.030.930.880.900.120.070.950.950.95
ASPic-GeneID_AS10.990.750.870.210.010.870.780.820.120.020.930.860.890.140.070.960.910.93
ASPic-GeneID_AS20.980.980.980.070.020.860.830.840.070.030.940.890.910.110.060.960.930.94
TWINSCAN0.950.870.910.120.050.760.770.760.110.110.810.830.820.170.190.900.910.90
TWINSCAN_EST0.950.880.910.090.050.790.810.800.090.090.840.870.850.130.160.910.920.91
FGENESH0.970.880.920.100.030.760.740.750.130.090.800.790.790.210.200.930.890.91
Genefinder0.950.970.960.050.050.770.740.750.130.080.830.780.800.220.170.930.890.91
SNAP0.960.690.820.220.040.700.660.680.180.120.740.730.730.270.260.900.860.88

Evaluation at transcript level
ProgramSNtSPtSStWTMTSNetSPetSSetWEtMEtSNitSPitSSitWItTMItSNntSPntSSnt

GeneID0.230.230.230.050.190.680.700.690.170.180.700.720.710.280.300.840.870.85
ASPic0.260.710.480.010.290.810.950.880.010.150.820.980.900.020.180.810.990.90
ASPic-GeneID0.440.470.450.030.170.860.810.830.100.050.920.850.880.150.080.930.930.93
ASPic-GeneID_AS10.530.440.480.080.110.870.850.860.080.060.910.880.890.120.090.920.940.93
ASPic-GeneID_AS20.460.500.480.030.170.880.810.840.110.040.940.850.890.150.060.950.900.92
TWINSCAN0.350.360.350.040.150.770.780.770.110.120.810.820.810.180.190.880.910.89
TWINSCAN_EST0.430.450.440.040.130.800.830.810.080.110.840.870.850.130.160.890.930.91
FGENESH0.320.330.320.060.160.750.760.750.130.130.780.790.780.210.220.880.890.88
Genefinder0.300.350.320.050.180.790.730.760.160.100.830.750.790.250.170.910.860.88
SNAP0.270.220.240.090.110.670.730.700.110.190.700.770.730.230.300.820.920.87

The highest values are shown in bold. SN indicates sensitivity. SP indicates specificity. SS indicates the average between SN and SP. Gene (g), transcript (t), exon (e), intron (i), and nucleotide (n) were assessed.