Table 4: Data distribution for model training, testing, and cross-verification.

Input data for modeling   Number of records   Number of desired outputs

Training data             300                 101 Negative
                                              199 Positive in various levels

Testing data              100                 34 Negative
                                              66 Positive

Cross verification        30                  10 Negative
                                              20 Positive