Research Article

Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis

Table 7

Correlations of prosodic and Laryngograph measures, which were in the best models for the human rating, with each other.

FeatureDurNormDurNorm F0Min F0MeanF0Onset F0OffPos EnNorm EnNormMeanJitter MeanShimmer StandDevShimmer #+Voiced RelNum+/−Voiced CFxCQx
ContextWPWWWWWWWPWW15 W15 W15 W15 W15 W

DurNormWPW0.020.010.930.030.070.010.200.04
DurNormW0.10−0.30−0.300.010.780.220.000.080.130.130.05
F0MinW0.02−0.300.530.560.34−0.54−0.31−0.70−0.58−0.26
F0MeanW−0.310.620.680.39−0.320.020.12
F0OnsetW0.000.620.620.290.050.070.020.06
F0OffPosW−0.320.330.200.27−0.260.08−0.32−0.320.01
EnNormWPW0.920.070.020.070.060.140.020.080.000.140.00
EnNormW0.190.680.240.150.070.02
MeanJitter15 W0.18−0.300.000.080.030.400.380.620.570.350.23
MeanShimmer15 W−0.27−0.360.210.750.430.400.15
StandDevShimmer15 W−0.280.170.750.340.370.150.04
#+Voiced15 W0.13−0.63−0.29−0.350.000.510.340.310.890.300.15
RelNum+/−Voiced15  W0.16−0.56−0.300.510.310.310.930.200.07
CFx0.080.24−0.350.050.100.400.120.000.450.370.65
CQx0.020.180.070.150.110.080.64

Upper right triangle: Pearson’s ; lower left triangle: Spearman’s ρ.
Contexts: W: word, WPW: word-pause-word, 15 W: 15 words (“global” feature).
All and ρ correlations with an absolute value of larger than 0.25 (0.33) are significant on the 0.05 (0.01) level.