Research Article

Detecting Multielement Algorithmically Generated Domain Names Based on Adaptive Embedding Model

Table 3: Datasets in the experiments.

| Dataset type | DGA description | Quantity |
| --- | --- | --- |
| DGA | 20 types of DGAs are collected, namely: Gameover, Dircrypt, Tinba, Necurs, Ramdo, Cryptolocker, Emotet, Corebot, Banjori, Qakbot, Rovnix, Kraken, Ramnit, Locky, Pykspa, Simda, Symmi, Virut, Matsnu, Suppobox | 200,000 |
| SDGA | 8 types of SDGAs are collected, namely: DNL1, DNL2, DNL3, DNL4, 9ML1, 500KL1, 500KL2, 500KL3 | 88,000 |
| ME | ME-DGA: generating domain names based on characters, numbers, special characters, two-character combinations, three-character combinations, and words, using a variety of DGAs. ME-SDG: generating domain names based on characters, numbers, special characters, two-character combinations, three-character combinations, and words, using a variety of SDGAs. | 60,000 |
| Legitimate | Top 1 million domain names updated daily by Alexa from January 2009 to February 2019 | Prepared in 4:1 quantity with dynamic domain names |
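
The ME entry in the table describes domain names assembled from several element types (characters, numbers, special characters, two- and three-character combinations, and words). As a rough illustration only, the following Python sketch composes one such name from a seed. The element pools, the function name `generate_me_domain`, and the parameters `n_elements` and `tld` are all hypothetical and are not taken from the paper; the DGAs and SDGAs actually used to build the dataset may work quite differently.

```python
import random
import string

# Hypothetical element pools (assumption): the element sets used in the
# experiments are not specified in the table.
ELEMENT_POOLS = {
    "char": list(string.ascii_lowercase),
    "digit": list(string.digits),
    "special": ["-"],  # hostnames allow only letters, digits, and hyphens
    "bigram": ["th", "er", "on", "an", "re"],
    "trigram": ["the", "and", "ing", "ion", "ent"],
    "word": ["cloud", "mail", "shop", "news", "data"],
}


def generate_me_domain(seed, n_elements=6, tld="com"):
    """Build one illustrative multielement domain name from a seed."""
    rng = random.Random(seed)  # seeded, so the same seed yields the same name
    kinds = sorted(ELEMENT_POOLS)
    parts = []
    for _ in range(n_elements):
        kind = rng.choice(kinds)  # pick an element type at random
        parts.append(rng.choice(ELEMENT_POOLS[kind]))  # then pick one element
    # A hostname label may not start or end with a hyphen.
    label = "".join(parts).strip("-")
    if not label:
        label = "a"  # fallback if every element drawn was a hyphen
    return f"{label}.{tld}"
```

Calling `generate_me_domain(seed)` for a range of seeds would give a small set of mixed-element names, which is the kind of sample the ME row refers to, under the assumptions stated above.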