Research Article

Design and Implementation of a Machine Learning-Based Authorship Identification Model

Table 2

PAN12 datasets used in the experiments.

DatasetDescriptionTraining documentsTesting documents

A1Instance-based (original text)66
A2Instance-based (n-grams)66
A3Profile-based (original text)36
A4Profile-based (n-grams)36
C1Instance-based (original text)168
C2Instance-based (n-grams)168
C3Profile-based (original text)88
C4Profile-based (n-grams)88
I1Instance-based (original text)2814
I2Instance-based (n-grams)2814
I3Profile-based (original text)1414
I4Profile-based (n-grams)1414