Research Article

K-mer-Based Motif Analysis in Insect Species across Anopheles, Drosophila, and Glossina Genera and Its Application to Species Classification

Table 2

CC statistics for k-mers of lengths 7–9 bp for different combinations of the genera under study.

Group comparisonMinMedianMeanMaxStd. dev.No. of comparisons

Heptamers
Anopheles0.9130.9570.9550.9990.022231
Non-Anopheles0.5900.8330.8370.9990.087630
Drosophila0.5900.8740.8690.9990.072435
Non-Drosophila0.6770.9380.8820.9990.104378
Glossina0.9650.9940.9860.9990.01415
Non-Glossina0.4410.7390.7720.9990.1441326
Anopheles vs. Drosophila0.4410.6480.6440.7700.059660
Anopheles vs. Glossina0.6770.7400.7440.7870.027132
Drosophila vs. Glossina0.6420.7490.7450.8120.033180
C. briggsae vs. Anopheles0.5280.5590.5620.6430.03022
C. briggsae vs. Drosophila0.2660.6200.5730.6670.10230
C. briggsae vs. Glossina0.4850.4920.4990.5340.0186
A. mellifera vs. Anopheles0.5680.6170.6290.7020.04322
A. mellifera vs. Drosophila0.2420.4840.4740.5670.06530
A. mellifera vs. Glossina0.5700.5900.5890.6170.0176

Octamers
Anopheles0.9040.9500.9480.9990.023231
Non-Anopheles0.5880.8240.8220.9980.089630
Drosophila0.5880.8580.8570.9970.069435
Non-Drosophila0.6550.930.8690.9990.113378
Glossina0.9480.9880.9780.9980.02015
Non-Glossina0.4430.7230.7610.9990.1431326
Anopheles vs. Drosophila0.4430.6370.6330.7600.055660
Anopheles vs. Glossina0.6550.7160.7190.7550.026132
Drosophila vs. Glossina0.6210.7280.7230.7910.034180
C. briggsae vs. Anopheles0.5210.5540.5560.6340.02922
C. briggsae vs. Drosophila0.2790.6100.5670.6520.09430
C. briggsae vs. Glossina0.4770.4840.4900.5220.0176
A. mellifera vs. Anopheles0.5640.6110.6240.6960.04222
A. mellifera vs. Drosophila0.2590.4810.4770.5650.06130
A. mellifera vs. Glossina0.5640.5850.5830.6080.0166

Nonamers
Anopheles0.8860.9390.9380.9960.025231
Non-Anopheles0.5770.8050.8010.9930.092630
Drosophila0.5770.8380.8390.9920.069435
Non-Drosophila0.6290.9190.8520.9960.121378
Glossina0.9190.9750.9610.9930.02815
Non-Glossina0.4390.7050.7470.9960.1431326
Anopheles vs. Drosophila0.4390.6240.6190.7460.053660
Anopheles vs. Glossina0.6290.6890.6910.7240.024132
Drosophila vs. Glossina0.5890.6970.6940.7660.034180
C. briggsae vs. Anopheles0.5100.5440.5450.6190.02722
C. briggsae vs. Drosophila0.2850.5940.5530.6360.08630
C. briggsae vs. Glossina0.4640.4700.4750.5030.0146
A. mellifera vs. Anopheles0.5550.6020.6150.6850.04122
A. mellifera vs. Drosophila0.2700.4750.4740.5580.05830
A. mellifera vs. Glossina0.5510.5720.5700.5920.0146

CC values were calculated for the genera Anopheles, Drosophila, and Glossina as well as between these three genera and between two outliers, Apis mellifera and Caenorhabditis elegans, and these two genera. For each combination, the minimum, mean, median, maximum CC values were calculated as well as the standard deviation and the number of species comparisons.