Research Article
Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis
Table 4
Percentage of script type occurrence in document.
| ||||||||||||||||||||||||||||||||
It is obvious that the Latin document compared to Cyrillic one has slightly smaller number of short (S), descender (D), and full (F) letters. Nonetheless, the crucial margin is seen in ascender (A) letters. Hence, it can be a measure of confidence for detection of the script in a document given in Serbian language. |