Research Article

Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis

Table 5

Best feature sets for human-machine correlation and their weights in the regression formulae.

FeatureContext
w/o CFx

w/o CFx

DurNormWPW0.3770.4990.378
DurNormW0.5130.402
F0MinW
F0MeanW
F0OnsetW0.173
F0OffPosW0.3220.1200.1850.236
EnNormWPW0.343
EnNormW0.155
MeanJitter15 W0.1180.1860.1130.2490.2390.3660.3680.3200.208
MeanShimmer15 W0.1440.1380.1450.114
StandDevShimmer15 W
#+Voiced15 W0.3210.3470.3340.3240.0940.122
RelNum+/−Voiced15 W0.2180.082
CFx0.2100.206
CQx0.6430.4950.506

0.710.660.710.670.360.530.470.450.49
ρ0.570.490.580.490.270.540.460.450.55
Significance level<0.001<0.001<0.001<0.0010.003<0.001<0.001<0.001<0.001

Contexts: W: word, WPW: word-pause-word, 15 W: 15 words (“global” feature). The correlations of the respective set to the human reference are given by (Pearson) and (Spearman).