Research Article

Automated Dataset Generation System for Collaborative Research of Cyber Threat Analysis

Table 2

Number of data items of each type, by year.

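Columns Hash through Total are the data types; Report and Malware are listed separately.

| Year | Hash | IP | URL | Email | Date, time | CVE | File name | PDB | Code sign | Others | Total | Report | Malware |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2008 | 0 | 3 | 171 | 0 | 0 | 0 | 17 | 0 | 0 | 0 | 191 | 2 | 0 |
| 2009 | 2 | 7 | 84 | 2 | 0 | 0 | 10 | 0 | 0 | 0 | 105 | 2 | 0 |
| 2010 | 223 | 79 | 280 | 14 | 32 | 2 | 213 | 0 | 0 | 0 | 800 | 7 | 32 |
| 2011 | 1,440 | 412 | 478 | 17 | 319 | 7 | 713 | 2 | 38 | 25 | 3,340 | 14 | 319 |
| 2012 | 2,240 | 433 | 637 | 46 | 465 | 30 | 828 | 2 | 43 | 7 | 4,524 | 22 | 465 |
| 2013 | 8,329 | 2,505 | 3,032 | 599 | 1,798 | 45 | 3,003 | 97 | 802 | 61 | 19,571 | 47 | 1,798 |
| 2014 | 5,614 | 5,484 | 3,282 | 476 | 1,116 | 83 | 2,804 | 22 | 438 | 28 | 18,842 | 100 | 1,116 |
| 2015 | 6,801 | 2,752 | 2,658 | 334 | 1,554 | 48 | 3,077 | 28 | 206 | 34 | 17,258 | 78 | 1,554 |
| 2016 | 8,001 | 525,020 | 3,449 | 235 | 1,833 | 81 | 4,873 | 43 | 154 | 14 | 543,703 | 79 | 1,974 |
| 2017 | 4,343 | 3,316 | 3,582 | 534 | 935 | 49 | 2,780 | 13 | 99 | 9 | 15,660 | 72 | 1,017 |
| 2018 | 3,900 | 3,296 | 2,582 | 229 | 0 | 74 | 2,660 | 34 | 404 | 31 | 13,210 | 125 | 1,300 |
| 2019 (–Jun.) | 2,046 | 719 | 1,439 | 194 | 0 | 51 | 1,110 | 9 | 36 | 2 | 5,606 | 64 | 628 |
| Total | 42,939 | 544,026 | 21,674 | 2,680 | 8,052 | 470 | 22,088 | 250 | 2,220 | 211 | 642,810 | 612 | 10,203 |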