Research Article

Computation of Program Source Code Similarity by Composition of Parse Tree and Call Graph

Table 1

Simple statistics on the real data set.

Information Value

Number of total assignments 36
Number of submitted source codes 555
Average number of submitted codes per assignment 15.42

Minimum number of lines in source code 49
Maximum number of lines in source code 2,863
Average number of lines per source code 305.07

Minimum number of nodes in source code 12
Maximum number of nodes in source code 447
Average number of nodes in source code 64.29

Number of marked plagiarism pairs 175