Review Article

A Survey of Automatic Software Vulnerability Detection, Program Repair, and Defect Prediction Techniques

Table 3

Technical characteristics of grammar-based clone detection.

| System/author | Data preprocessing | Code representation | Network | Clone types | Classification object |
|---|---|---|---|---|---|
| CCLearner [3] | Use ANTLR and ASTParser to parse each method | Token, AST | DNN | T1–T3 | Method pairs |
| CLDH [4] | Parse each code fragment into an AST | AST | LSTM | T1–T4 | Code fragments |
| White et al. [5] | Use ANTLR to tokenize code | AST, FBT, olive trees | RtvNN | T1–T4 | Method/file level |
| Marastoni et al. [6] | Use the Tigress C obfuscator to obfuscate the dataset | Binary image | CNN | T4 | Image level |