Research Article

iSentenizer- : Multilingual Sentence Boundary Detection Model

Table 6

Performance of systems on different languages of Europarl corpus.

CorpusCandidatesRecallPrecision -Score

DanishiSentenizer98.84%92.88%95.77%
Punkt97.69%79.37%87.59%
MxTerminator35.48%94.13%51.54%

GermaniSentenizer97.61%95.77%97.61%
Punkt97.87%87.53%92.41%
MxTerminator81.00%93.69%86.89%

EnglishiSentenizer98.98%95.79%97.36%
Punkt97.95%93.34%95.59%
MxTerminator96.09%93.97%95.02%

SpanishiSentenizer99.40%94.21%96.74%
Punkt98.11%89.80%93.77%
MxTerminator96.67%90.09%93.26%

DutchiSentenizer99.34%96.24%97.77%
Punkt97.79%92.34%94.99%
MxTerminator91.95%95.32%93.61%

FrenchiSentenizer98.82%95.77%97.28%
Punkt97.84%91.37%94.49%
MxTerminator95.04%91.88%93.44%

ItalianiSentenizer98.90%95.99%97.42%
Punkt98.25%93.69%95.92%
MxTerminator94.96%94.43%94.70%

PortugueseiSentenizer99.58%96.60%98.07%
Punkt98.50%95.76%97.11%
MxTerminator94.88%96.50%95.68%

GreekiSentenizer97.83%96.44%97.13%
Punkt96.98%95.36%96.16%
MxTerminator97.24%93.97%95.58%

FinnishiSentenizer98.98%95.76%97.34%
Punkt98.33%92.34%95.24%
MxTerminator92.46%95.32%93.87%

SwedishiSentenizer95.91%94.30%95.10%
Punkt98.94%89.45%93.95%
MxTerminator99.49%88.33%93.57%