Research Article

Transcriptome Profile of the Asian Giant Hornet (Vespa mandarinia) Using Illumina HiSeq 4000 Sequencing: De Novo Assembly, Functional Annotation, and Discovery of SSR Markers

Table 1

Summary statistics of the transcriptome assembly for Vespa mandarinia.

Total number of raw reads
 Number of sequences: 60,723,154
 Number of bases: 9,169,196,254
Total number of clean reads
 Number of sequences: 59,184,811
 Number of bases: 8,297,028,222
 Mean length (bp): 140.2
 N50 length (bp): 151
 GC %: 37.93
High-quality reads (%): 97.47 (sequences), 90.49 (bases)
Number of reads discarded (%): 2.53 (sequences), 9.51 (bases)
Contig information
 Total number of contigs: 147,400
 Number of bases: 181,439,800
 Mean length of contig (bp): 1,230.9
 N50 length of contig (bp): 2,578
 GC % of contig: 35.50
 Largest contig (bp): 30,772
 Number of large contigs (≥500 bp): 75,428
Unigene information
 Total number of unigenes: 66,837
 Number of bases: 95,657,681
 Mean length of unigene (bp): 1,431.2
 N50 length of unigene (bp): 3,112
 GC % of unigene: 35.26
 Length ranges (bp): 224–52,946
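
Several entries in Table 1 are derived from the counts reported alongside them. The following minimal Python sketch shows how the percentages and mean lengths can be recomputed from those counts, and how an N50 value is conventionally obtained from a list of sequence lengths. It assumes that the high-quality percentages are the ratios of clean to raw totals, which the reported figures are consistent with. The function names `percent` and `n50` are illustrative only and are not taken from the study's pipeline; the lengths passed to `n50` are toy values, because the per-sequence lengths are not given in the table.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


def n50(lengths):
    """Conventional N50: the length L such that sequences of length >= L
    together contain at least half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Counts copied from Table 1.
raw_reads, raw_bases = 60_723_154, 9_169_196_254
clean_reads, clean_bases = 59_184_811, 8_297_028_222
contigs, contig_bases = 147_400, 181_439_800
unigenes, unigene_bases = 66_837, 95_657_681

# Assumption: high-quality (%) = clean / raw; discarded (%) = (raw - clean) / raw.
hq_reads_pct = percent(clean_reads, raw_reads)
hq_bases_pct = percent(clean_bases, raw_bases)
discarded_reads_pct = percent(raw_reads - clean_reads, raw_reads)
discarded_bases_pct = percent(raw_bases - clean_bases, raw_bases)

# Mean length = total bases / number of sequences.
mean_read_len = round(clean_bases / clean_reads, 1)
mean_contig_len = round(contig_bases / contigs, 1)
mean_unigene_len = round(unigene_bases / unigenes, 1)

# Toy lengths (hypothetical): total is 20 bases, and 6 + 5 = 11 covers half.
toy_n50 = n50([2, 3, 4, 5, 6])

print(hq_reads_pct, hq_bases_pct, discarded_reads_pct, discarded_bases_pct)
print(mean_read_len, mean_contig_len, mean_unigene_len, toy_n50)
```

Under these assumptions the recomputed percentages and mean lengths agree with the tabulated values to the reported precision. The N50 values themselves cannot be checked this way, since they depend on the full length distribution of the reads, contigs, and unigenes.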