Research Article

Rule-Based Information Extraction from Free-Text Pathology Reports Reveals Trends in South African Female Breast Cancer Molecular Subtypes and Ki67 Expression

Table 2

Evaluation of the matching algorithm extractions on 300 random validation samples.

ParameterCategoryPrecision (CI)Recall (CI)-score (CI)Kappa (CI)

ERNegative0.96 (0.91-0.98)0.93 (0.87-0.97)0.95 (0.89-0.98)0.92 (0.89-0.95)
Positive0.97 (0.94-0.98)0.98 (0.96-0.99)0.98 (0.95-0.99)
PRNegative0.95 (0.91-0.98)0.99 (0.96-0.99)0.97 (0.93-0.99)0.98 (0.96-0.99)
Positive0.99 (0.97-0.99)0.96 (0.92-0.98)0.98 (0.94-0.99)
HER2Negative1.00 (0.98-1.00)1.00 (0.99-1.00)1.00 (0.98-1.00)0.99 (0.97-1.00)
Positive1.00 (0.98-1.00)0.99 (0.94-0.99)0.99 (0.95-0.99)
Ki67<140.84 (0.73-0.91)0.98 (0.90-0.97)0.91 (0.81-0.95)0.95 (0.91-0.97)
≥140.99 (0.97-0.99)0.94 (0.91-0.98)0.97 (0.93-0.98)
GradeI0.83 (0.60-0.93)1.00 (0.89-1.00)0.91 (0.91-0.98)0.97 (0.94-0.98)
II0.97 (0.92-0.99)0.97 (0.92-0.99)0.97 (0.92-0.99)
III0.99 (0.94-0.99)0.95 (0.89-0.98)0.97 (0.91-0.98)
TypeIDC1.00 (0.99-0.99)1.00 (0.99-0.99)1.00 (0.99-0.99)1.00(0.99-1.00)
Others1.00 (0.95-1.00)1.00 (0.95-1.00)1.00 (0.95-1.00)
LateralityLeft breast0.99(0.95-0.99)0.99 (0.96-0.99)0.99 (0.96-0.99)0.99 (0.95-0.99)
Right breast0.99 (0.96-0.99)0.98 (0.94-0.99)0.99 (0.95-0.99)