Research Article

Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis

Table 11

The ratio of the co-occurrence descriptors between Latin and Cyrillic documents.

Descriptor        Doc 1  Doc 2  Doc 3  Doc 4  Doc 5  Doc 6  Doc 7  Doc 8  Doc 9  Doc 10

Uniformity        0.61   0.59   0.62   0.61   0.60   0.66   0.72   0.51   0.72   0.76
Entropy           1.29   1.25   1.23   1.35   1.34   1.20   1.17   1.48   1.07   1.07
Max. probability  0.70   0.64   0.68   0.69   0.68   0.70   0.77   0.58   0.62   0.76
Dissimilarity     1.15   1.13   1.17   1.17   1.14   1.06   0.96   1.28   1.08   0.98
Contrast          0.88   0.86   0.92   0.92   0.89   0.86   0.75   0.94   0.85   0.78

The final processing of the above results is based on cumulative measures, namely the average, maximum, and minimum of the script-type co-occurrence descriptors over the database. Certain criteria are established from these measures. All of these are shown in Table 12.
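
As an illustration of the cumulative measures, the following minimal sketch computes the average, maximum, and minimum of each descriptor ratio over the ten documents listed in Table 11. The names `RATIOS`, `summarize`, and `summary` are hypothetical and chosen for this example only. The sketch covers only the ten documents above and does not reproduce the criteria of Table 12.

```python
from statistics import mean

# Latin/Cyrillic descriptor ratios copied from Table 11 (Doc 1 to Doc 10).
RATIOS = {
    "Uniformity":       [0.61, 0.59, 0.62, 0.61, 0.60, 0.66, 0.72, 0.51, 0.72, 0.76],
    "Entropy":          [1.29, 1.25, 1.23, 1.35, 1.34, 1.20, 1.17, 1.48, 1.07, 1.07],
    "Max. probability": [0.70, 0.64, 0.68, 0.69, 0.68, 0.70, 0.77, 0.58, 0.62, 0.76],
    "Dissimilarity":    [1.15, 1.13, 1.17, 1.17, 1.14, 1.06, 0.96, 1.28, 1.08, 0.98],
    "Contrast":         [0.88, 0.86, 0.92, 0.92, 0.89, 0.86, 0.75, 0.94, 0.85, 0.78],
}


def summarize(ratios):
    """Return the average, max and min of each descriptor's ratio across documents."""
    return {
        name: {"avg": mean(values), "max": max(values), "min": min(values)}
        for name, values in ratios.items()
    }


# Hypothetical summary over the ten sample documents only.
summary = summarize(RATIOS)
for name, stats in summary.items():
    print(f"{name:<16} avg={stats['avg']:.2f} max={stats['max']:.2f} min={stats['min']:.2f}")
```

For these ten documents, the uniformity and contrast ratios stay below 1 in every document, while the entropy ratio stays above 1, so the minimum and maximum of each descriptor give a simple range within which the ratio falls for this sample.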