Sequential Pattern Mining to Predict Medical In-Hospital Mortality from Administrative Data: Application to Acute Coronary Syndrome
Table 3
Means of area under the ROC curve (AURC), F-measure, and error rate for the different types of models and similarities in the modeling of ICD-10 code trajectories.
| Model | Similarity | AURC mean | AURC 95% CI | F-measure mean | F-measure 95% CI | Error rate mean | Error rate 95% CI |
|-------|------------|-----------|-------------|----------------|------------------|-----------------|-------------------|
| NB    | Edition    | 0.77 | 0.68–0.86 | 0.70 | 0.62–0.82 | 0.26 | 0.16–0.34 |
|       | q-gram     | 0.72 | 0.64–0.77 | 0.64 | 0.58–0.70 | 0.33 | 0.28–0.38 |
|       | Heuristic  | 0.73 | 0.64–0.82 | 0.66 | 0.60–0.77 | 0.32 | 0.24–0.39 |
| KNN   | Edition    | 0.44 | 0.38–0.53 | 0.58 | 0.53–0.63 | 0.38 | 0.35–0.43 |
|       | q-gram     | 0.50 | 0.45–0.55 | 0.57 | 0.52–0.61 | 0.40 | 0.37–0.44 |
|       | Heuristic  | 0.54 | 0.46–0.59 | 0.55 | 0.52–0.65 | 0.41 | 0.38–0.46 |
| Tree  | Edition    | 0.74 | 0.66–0.83 | 0.66 | 0.56–0.79 | 0.28 | 0.19–0.35 |
|       | q-gram     | 0.67 | 0.62–0.71 | 0.63 | 0.57–0.70 | 0.34 | 0.30–0.39 |
|       | Heuristic  | 0.70 | 0.64–0.80 | 0.65 | 0.57–0.77 | 0.31 | 0.22–0.38 |
| LR    | Edition    | 0.77 | 0.68–0.88 | 0.70 | 0.62–0.83 | 0.27 | 0.16–0.35 |
|       | q-gram     | 0.75 | 0.65–0.82 | 0.69 | 0.62–0.77 | 0.29 | 0.23–0.38 |
|       | Heuristic  | 0.74 | 0.64–0.82 | 0.69 | 0.62–0.80 | 0.30 | 0.21–0.39 |
| SVM   | Edition    | 0.83 | 0.76–0.92 | 0.70 | 0.61–0.82 | 0.25 | 0.16–0.33 |
|       | q-gram     | 0.80 | 0.72–0.89 | 0.66 | 0.60–0.73 | 0.31 | 0.26–0.37 |
|       | Heuristic  | 0.84 | 0.77–0.92 | 0.70 | 0.64–0.81 | 0.27 | 0.20–0.36 |
| ANN   | Edition    | 0.82 | 0.72–0.94 | 0.70 | 0.59–0.85 | 0.25 | 0.14–0.33 |
|       | q-gram     | 0.81 | 0.72–0.90 | 0.70 | 0.62–0.79 | 0.28 | 0.21–0.37 |
|       | Heuristic  | 0.83 | 0.71–0.96 | 0.73 | 0.63–0.86 | 0.26 | 0.14–0.35 |
CI = confidence interval. Best results are in bold.
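
For readers unfamiliar with the three metrics reported in Table 3, the following is a minimal sketch of their usual definitions. It assumes the F-measure is the balanced F1 score and that the area under the ROC curve is computed as the probability that a randomly chosen positive case scores higher than a randomly chosen negative one; the table does not state either choice, so both are assumptions. The function names and the example counts are hypothetical and are not taken from the study.

```python
def f_measure(tp, fp, fn):
    """F1 score (assumed variant): harmonic mean of precision and recall.

    Written as 2*TP / (2*TP + FP + FN), which is algebraically the same.
    """
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def error_rate(tp, fp, tn, fn):
    """Share of misclassified cases among all cases."""
    return (fp + fn) / (tp + fp + tn + fn)


def area_under_roc(scores_pos, scores_neg):
    """Rank-based area under the ROC curve.

    Fraction of (positive, negative) pairs in which the positive case
    receives the higher score; ties count as half a win.
    """
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical confusion-matrix counts, for illustration only.
f1 = f_measure(tp=70, fp=30, fn=30)
err = error_rate(tp=70, fp=30, tn=70, fn=30)

# Hypothetical classifier scores for deceased (positive) and surviving (negative) stays.
auc = area_under_roc([0.9, 0.8, 0.4], [0.5, 0.3])
```

A higher area under the ROC curve and F-measure indicate better discrimination, whereas a lower error rate is better, which is why the three columns of Table 3 should not be read in the same direction.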