Research Article

Deep Neural Embedding for Software Vulnerability Discovery: Comparison and Optimization

Table 3

Reviewed studies which applied various embedding techniques for software engineering.

PaperType of dataEmbedding modelWhether to consider contextual information

Pradel and Sen [23]150,000 JavaScript files collected from various open-source projectsWord2VecNo
Harer et al. [13]C/C++ packages distributed with the Debian Linux distribution C/C++ functions collected from githubWord2VecNo
Henkel et al. [24]19,000 API-usage analogies extracted from the Linux kernelGloVeNo
Fang et al. [25]Projects are extracted from open-source intelligence data such as NVDFastTextNo
Kanade et al. [26]150k Python files from githubCuBERTYes
Karampatsis and Sutton [27]150,000 JavaScript files consisting of various open-source projectsSCELMoYes