Research Article

Design and Development of a Large Cross-Lingual Plagiarism Corpus for Urdu-English Language Pair

Table 3

Comparison of rewrite levels of medium documents from Pak Study domain.

ā€‰MPCAPC
2-gram3-gram4-gram2-gram3-gram4-gram

Document 0002.txt0.1530.04200.6250.5210.457
Document 0005.txt0.1100.0490.0250.6590.5190.388
Document 0006.txt0.1430.0400.0080.5870.4480.347
Document 0009.txt0.1140.02300.4660.3220.209
Document 0011.txt0.2100.06600.3870.2620.167