Research Article
Mixed Script Identification Using Automated DNN Hyperparameter Optimization
Table 2
Corpus statistics show that (tokens and %age in corpus).
| Corpus statistics | Language | Tokens | %age in corpus |
| Eng | 102311 | 22.7 | Roman Urdu | 97235 | 21.6 | Hindi | 87563 | 19.4 | Bengali | 85672 | 19.0 | Saraiki | 78412 | 17.4 | Total tokens in corpus | 451193 |
|
|