Abstract

The vocal extent measure (VEM) represents a new diagnostic tool to express vocal capacity by quantifying the dynamic performance and frequency range of voice range profiles (VRPs). For VEM calculation, the VRP area is multiplied by the quotient of the theoretical perimeter of a circle with equal VRP area and the actual VRP perimeter. Since different diseases affect voice function to varying degrees, pathology-related influences on the VEM should be investigated more detailed in this retrospective study, three years after VEM implementation. Data was obtained in a standardized voice assessment comprising videolaryngostroboscopy, voice handicap index (VHI-9i), and acoustic-aerodynamic analysis with automatic calculation of VEM and dysphonia severity index (DSI). The complete dataset comprised 1030 subjects, from which 994 adults (376 male, 618 female; 18-86 years) were analyzed more detailed. The VEM differed significantly between pathology subgroups () and correlated with the corresponding DSI values. Regarding VHI-9i, the VEM reflected the subjective impairment better than the DSI. We conclude that the VEM proved to be a comprehensible and easy-to-use interval-scaled parameter for objective VRP evaluation in all pathology subgroups. As expected, exclusive consideration of the measured pathology-related influences on the VEM does not allow conclusions regarding the specific underlying diagnosis.

1. Introduction

Phoniatric voice diagnostics require sophisticated and highly specialized investigation methods, including objective and subjective measuring procedures [13]. Within this multifactorial approach, the recording of a voice range profile (VRP) under standardized conditions represents an established objective and noninvasive tool for the examination of vocal function [46]. A stable and straightforward VRP description succeeds when the vocal extent measure (VEM) is applied [7, 8]. This recently developed innovative parameter describes the vocal function as a positive one-dimensional measure, in contrast to the evaluation of dysphonia as a negative criterion by the established multidimensional parameter dysphonia severity index (DSI) [911]. The VEM documents the vocal capacity by quantification of the dynamic and frequency range. It is indicated as interval-scaled value usually ranging between 0 and 120, whereas these limits may be exceeded on both sides in severely impaired and exceptionally great voices. Hence, a large vocal capacity results in a high VEM value, conversely, a small VRP is characterized by low VEM. The calculation of this measure is based on the VRP area and perimeter; the detailed mathematical derivation was presented previously [7, 8, 12]. The VEM is recognized as a user-friendly and easily implementable diagnostic instrument. The AVA software [13] computes the VEM automatically after VRP measurement and displays it without additional manual effort or delay.

So far, only a few investigations have been conducted on the evaluation of VEM values. Four studies analyzed the VEM to assess treatment outcomes. Three investigations dealt with benign organic changes of the vocal folds (VF: 61 polyps [12], 37 nodules [14], and 60 edema [15]), another one included 152 patients with various laryngeal pathologies () and functional dysphonia () [1]. All of these clinical trials revealed significantly increased VEM values after phonomicrosurgery or conservative treatment. Besides, two other studies evaluated reference ranges of the VEM. The first investigation [8] presented a rough VEM classification deduced from 564 patients with different types of voice disorders, based on the quartiles of the VEM data (≥106: normal vocal capacity; <106 to ≥89: mildly reduced; <89 to ≥64: moderately reduced; <64: severely reduced vocal capacity). The second study [16] introduced gender-specific VEM means () and reference ranges () for young and healthy males (; RR: 99-148) and females (; RR: 88-141). Even though it would simplify the interpretation of vocal complaints in different pathologies and allow for better evaluation of expectable treatment success, more precise and comprehensive data regarding different types of voice disorders as well as healthy voices are not yet available.

Therefore, the purpose of our study was to gather this important information by exploring the influences of pathologic findings on the VEM, three years after the implementation of this parameter in voice diagnostics. It was planned to consider all male and female patients as well as healthy controls in our total study cohort, to examine the effect of specific underlying pathologies compared to other established parameters. Our hypotheses were as follows: the VEM (1) varies between different pathology subgroups, (2) correlates with the corresponding DSI values, and (3) reflects the patients’ subjective voice impairment. Furthermore, the VEM should also be investigated with regard to voice usage, gender, and age differences. Based on the findings of the literature, we hypothesized that professional voice use, male gender, and younger age are associated with higher VEM values [7, 8, 14, 16].

2. Material and Methods

2.1. Participants and Selection Criteria

Three years after the implementation of the VEM [7], a dataset of 1030 consecutive subjects who visited our phoniatric outpatient clinic was analyzed in a clinical retrospective study. Within this 36-month acquisition period, the sample consisted of mainly adult patients with vocal problems, but also of smaller amounts of children, transgender patients, and healthy participants who presented to receive a vocal fitness examination (control group). All children and transgender subjects were excluded from further analysis to avoid distortion of results considering the well-known effects of vocal mutation and maturation [17, 18] as well as conservative or phonosurgical gender-transforming procedures [19, 20] on acoustic and aerodynamic measures. The study was conducted in accordance with the Declaration of Helsinki and approved by the local ethical review board (reference number: EA4/140/10). Selection criteria involved informed consent and completion of the standard phoniatric examination procedures, comprising videolaryngostroboscopy (VLS), acoustic-aerodynamic voice function diagnostics including VRP measurement, DSI, and VEM calculation, as well as subjective self-evaluation using the voice handicap index VHI-9i.

2.2. Pathology-Specific Group Assignment

After taking the medical history, a digital VLS followed to assess the etiological diagnosis of the voice disorder and to evaluate the VF vibration, esp. glottal closure, regularity, mucosal wave propagation, and symmetry. It was carried out using a high-resolution rigid videolaryngoscope (10 mm; 70°) with integrated microphone connected to the Endo-STROB control unit (XION Medical, Berlin, Germany). VLS enabled us to divide the cohort into 2 sample groups: structural and functional dysphonia. The diagnoses were categorized and numbered according to the Classification Manual for Voice Disorders by the American Speech Language and Hearing Association (ASHA) [21]. We arranged the study subgroups according to the affected structures and visual pathologies of the VF. Due to the clinical prevalence and importance of malignant laryngeal lesions, we decided to investigate them separately from benign epithelial changes. In these patients, the specific group assignment was based on the postinterventional histopathological result after microlaryngoscopic excision. Healthy participants who passed the vocal fitness examination served as a control group. Finally, the following pathology subgroups were investigated: group I—malignant pathologies, group II—lesions of the lamina propria (e.g., VF nodules, polyps, cysts, and scars; Reinke’s edema; and sulcus vocalis), group III—benign changes of the epithelium, group IV—inflammatory changes, group V—neurogenic voice disorders (e.g., unilateral paralyses of the recurrent laryngeal nerve, vagal nerve, or external branch of the superior laryngeal nerve; adductor spasmodic dysphonia; and Parkinson’s disease), group VI—functional disorders, and group VII—healthy control group.

2.3. Acoustic-Aerodynamic Analysis

Regarding instrument-assisted objective procedures, the following well-established methods of measuring were applied: (1) maximum phonation time (MPT) as a statement of aerodynamic capacity [22, 23], (2) jitter as a common perturbation measure in acoustic analysis to assess frequency instability [24, 25], and (3) VRP to display the functional interactions of different voice production components with regard to vocal frequency and intensity [26, 27]. Recordings were performed in the sound-treated (one room) voice lab of our department with a background  dB (A), using the DiVAS software (XION Medical, Berlin, Germany). A head-mounted microphone with a stable mouth-microphone distance of 30 cm was applied. Relevant instructions of individuals performing objective acoustic measurements are given in Table 1.

The detailed procedure of VRP measurement is described in a previous publication [7]. The parameters’ highest tone (F0high), lowest intensity (SPLmin), MPT, and jitter were used for DSI calculation [9]. The Gonnermann-classification served for DSI interpretation, discriminating healthy voices (≥4.2) from mildly (<4.2 to ≥1.8), moderately (<1.8 to ≥-1.2), or severely (<-1.2) dysphonic voices [28]. VEM calculation was conducted automatically after VRP recording by the proprietary AVA software [8, 13].

2.4. Subjective Vocal Impairment and Voice Usage

To take the subject’s self-assessment into account, every study participant had to fill out the VHI-9i questionnaire [29]. This item-reduced VHI consists of nine questions addressing functional, physical, and emotional impairments, describing the impact of a voice disorder on a person’s quality of life. Each question can be rated on a scale from 0 to 4 (0: never; 1: almost never; 2: sometimes; 3: almost always; 4: always). The VHI-9i total score ranges from 0 to 36 and allows an impairment-related severity classification (0-5: no dysphonia; 6-13: mild dysphonia; 14-22: moderate dysphonia; 23-36: severe dysphonia) as proposed by Seipelt and Nawka [30]. Furthermore, the questionnaire registers the subject’s profession as well as the individual use of speaking and singing voice. Simplifying the vocal use classification system by Koufman and Isaacson [31], the cohort was divided into “professional voice users” (PVU) and “non-professional voice users” (nPVU) depending on the reported occupation and voice usage.

2.5. Statistical Analysis

All data were statistically analyzed using SPSS version 25 (IBM Corp., Armonk, N.Y., USA). Descriptive statistics were applied to evaluate the basic quantitative features of the data collected. We documented the minimum and maximum values, calculated medians, first and third quartiles, and means and standard deviations (SD) for all objective and subjective parameters as well as patients’ sociodemographic data. By using the one-way analysis of variance (ANOVA) with post hoc analysis, we examined the differences of mean values and their significance considering pathology subgroups, gender, and voice usage. Scatterplots with regression lines were made for females and males to display the dependence of VEM and other objective parameters on VHI-9i. Boxplots were chosen as graphical technique to present pathology-related VEM and DSI ranges. Pearson’s correlation coefficients were computed to investigate the strength and direction of association between the individual parameters in all study participants and in each pathology subgroup. The level of significance was set at . The following abbreviations were used to show different significance levels: %; %; %.

3. Results

3.1. Description of the Study Cohort

The complete dataset comprised 994 study participants, 376 male (18-86 years, median 57) and 618 female subjects (18-82 years, median 47). Women were on average 9 years younger than men at the time of consultation ( vs. , , and ). The age and gender distribution of the examined cohort is shown in Figure 1.

VLS revealed in 691 subject (69.5%) organic diseases at VF level. Classification of the resulting structural dysphonia according to the underlying pathology showed in 26.5% diseases of the lamina propria (e.g., nodules, polyps, cysts, and edema), in 15.9% neurogenic voice disorders (e.g., VF paralysis and spasmodic dysphonia), in 14.7% diseases of the epithelium (e.g., leukoplakia, hyperkeratosis, carcinoma, and papillomatosis), and in 12.4% inflammatory findings (e.g., laryngitis). Additionally, VLS did not show any organic changes in 303 study participants (30.5%). From those individuals with normal laryngeal anatomy, 22.5% suffered from a vocal load-induced functional dysphonia, while 8.0% were completely healthy without any voice complaints (control group). Comparing our pathology subgroups, most voice disorders occurred in the 5th decade of life, except lamina propria changes (4th decade) and malignant lesions (5th to 6th decade). A detailed description of the investigated parameters in our study population is presented in Table 2.

3.2. Effects of Gender, Voice Use, and Age

In both genders, the overall vocal, aerodynamic, and psychometric parameters indicated mild to moderate impairments. Males and females had mildly dysphonic voices with slightly reduced vocal capacities. According to the mean VHI-9i scores ( vs. ), the self-assessed vocal impairment was comparable. Females revealed higher mean values for F0high and DSI, as well as lower values for SPLmin, jitter, and VHI-9i. Males reached higher values for MPT and VEM. These gender-specific differences were statistically significant for the parameters F0high, SPLmin, MPT, DSI (all ), and jitter (), but not significant for VEM and VHI-9i.

Regarding voice usage, this cohort consisted of 3 groups: 29% PVU (, e.g., teachers, preschool teachers, lecturers, instructors, salespersons, singers, and actors), 59% nPVU (, e.g., administration employees, laborers, business (wo)men, pensioners, and unemployed people), and 12% with missing answers concerning occupation (). When consulting a voice specialist in our department, PVU were on average 8 years younger than nPVU ( vs. years, ). The voice use had a significant influence on the objective parameters: compared to nPVU, PVU generally reached higher mean values for F0high, MPT, DSI, and VEM (all ), as well as the lowest mean values for SPLmin () and jitter (). However, comparison of the VHI-9i scores revealed that the subjective impairment did not differ significantly between PVU and nPVU.

Focusing on VHI-9i results, 22.4% of all subjects felt highly (score 23-36), 33.2% moderately (score 14-22), and 30.5% mildly affected (score 6-13), while 13.9% of the participants did not indicate relevant voice impairment (score 0-5). However, subjective complaints rose with age. According to the severity classification, the mean age continuously increased as follows: (no dysphonia) vs. (mild dysphonia) vs. (moderate dysphonia) vs. years (severe dysphonia). These age-dependent VHI deteriorations were significant in women () and showed a trend in men (). In accordance with rising subjective impairment levels, the objective parameters gradually worsened as well (i.e., constant decline of F0high, MPT, DSI, and VEM vs. increase of SPLmin and jitter). In both genders, these changes were highly significant (), apart from men’s SPLmin () and jitter (). Exemplarily, Figure 2 visualizes the scatterplots for the parameters MPT, DSI, and VEM on the -axis related to VHI-9i on the -axis.

3.3. Correlation Analysis and VEM Grading System

Correlation analysis showed that VEM correlated significantly () with F0high (), SPLmin (), MPT (), jitter (vs. ), and DSI (). DSI and VEM correlated furthermore with VHI-9i ( vs. ). The strength of these relationships was moderate, apart from the weak negative relationship between VEM and jitter. Regarding age, correlations showed significant () moderate relationships with F0high (), SPLmin (), and DSI (), compared to weak relationships with jitter (), VEM (), and VHI-9i ().

Finally, derived from the VEM percentiles of the total cohort (25%: 69; 50%: 93; 75%: 108), we created a grading system to compare the means of the investigated parameters on different VEM levels (Table 3). As a result, the higher the VEM, the higher the mean values for F0high, MPT, and DSI, and the lower the mean values for SPLmin, jitter, and VHI-9i. The VEM also seems to decrease as the voice ages. Regardless of the underlying pathology, subgroups I-VI comprised every VEM level. The mean values of all vocal parameters differed significantly between the VEM levels ().

3.4. Effects of Pathology in all Subgroups

Subgroup I contained patients with carcinoma in situ (), pT1a (), and pT1b () squamous cell carcinoma. Males were significantly more often affected than females (average ratio 6.5: 1). At initial diagnosis, women were on average 12 years younger than men (). The aerodynamic and acoustic objective measurements confirmed the highly impaired voices in both genders. Compared to men, women had “better” mean values for all objective parameters apart from MPT and VEM. In contrast to that, their subjective suffering was higher ( vs. ), but not on a significant level (). In comparison with other pathology subgroups, mean values were the worst for F0high, SPLmin, jitter, DSI, and VEM.

Subgroup II involved the following pathologies: VF nodules (), Reinke’s edema (), VF polyps (), sulcus vocalis (), VF cysts (), VF scars (), vascular malformation (), bamboo nodes (), and VF hematoma (). Females were significantly more often affected than males (average ratio 3.3: 1), especially concerning Reinke’s edema (11.7: 1) and VF nodules (7.3: 1). Apart from the latter diagnoses, we found no relevant gender-specific differences for age, size, or location of findings at initial consultation. Women showed significantly higher values for F0high () and lower values for MPT () as well as VEM (). Gender-independent comparison between nodules, polyps, and Reinke’s edema regarding mean VEM ( vs. vs. ) and DSI ( vs. vs. ) revealed highly significant differences ().

Subgroup III included patients with laryngeal papillomatosis (), leukoplakia (), and VF granuloma (). Men were more often affected than women (average ratio 1.6: 1) and on average 9 years older at first presentation (). Apart from the known differences regarding F0high () and MPT (), both genders revealed similar results for SPLmin, jitter, and VHI-9i. However, higher values for VEM in males and DSI in females did not reach the level of significance.

Subgroup IV patients had acute () or chronic laryngitis (). Females and males were nearly equally affected (average ratio 1.3: 1) and showed comparable means in age, SPLmin, jitter, and VHI-9i. Gender-specific differences were significant for F0high (), MPT, and VEM (), but not for DSI ().

Subgroup V involved the following heterogeneous pathologies: unilateral paralyses of the recurrent laryngeal nerve (), vagal nerve (), or external branch of the superior laryngeal nerve (), adductor spasmodic dysphonia (), Parkinson’s disease (), and amyotrophic lateral sclerosis (). Females were more often affected than males (average ratio 1.7: 1). Both genders matched in age and presented with substantially impaired vocal parameters. SPLmin, MPT, jitter, and VHI-9i did not expose relevant gender-specific differences. Higher mean values reached the level of significance in women for F0high () and DSI (), but not in men for VEM (). In comparison with other pathology subgroups, mean values were the worst for MPT and VHI-9i.

Subgroup VI included patients with dysodia (), hypo-/hyperfunctional dysphonia (), and presbyphonia (), as well as paradoxical VF movement disorder (). Females were more often affected than males (average ratio 1.8: 1) and on average 7 years younger when consulting the voice specialist (). Despite moderately impaired VHI-9i scores, both genders reached the highest values of all dysphonic patients for F0high, MPT, DSI, and VEM. Significant gender-specific differences were not detected, apart from F0high and DSI (). Gender-independent comparison between presbyphonic, dysodic, and hyperfunctional patients regarding mean VEM ( vs. vs. ) and DSI ( vs. vs. ) exposed highly significant differences ().

Subgroup VII consisted of considerably more women than men (average ratio 5.6: 1). All subjects showed normal laryngeal findings in the VLS and passed the vocal fitness examination. Compared to pathology subgroups, both genders presented at a much younger age and reached the best values in all observed vocal parameters (). Within this group, gender-specific differences showed significantly higher means for MPT, jitter, and VEM in men (all ), as well as for F0high in women ().

Comparison of all subgroups demonstrated that means for all vocal parameters differed significantly between them (). The post hoc analysis revealed that controls with a healthy voice were reliably distinguished from patients with an impaired voice in all examined parameters. Exemplarily, pathology-specific ranges for VEM and DSI are shown in Figure 3.

4. Discussion

In this study, we examined the pathology-induced changes of different voice disorders on the acoustic and aerodynamic parameters of the VRP with focus on the new VEM. Our results showed novel VEM outcomes concerning (1) variation between different pathology subgroups, (2) correlation with corresponding DSI values, (3) reflection of subjective voice impairment, and (4) changes with regard to voice usage, gender, and age differences. However, the heterogeneity of results regarding how pathology affected the investigated vocal measures and their relations must be critically discussed. From a clinical viewpoint, it is still not enough known about how these differences are explicable by vocal training, age, or gender and how to separate these factors from pathology.

4.1. Effects of Gender, Voice Use, and Age

The VEM exceeded the common range of 0 to 120 at both ends without showing any gender-specific differences (males: -12 to 152 [mean 88] vs. females: -13 to 147 [mean 85]). It correlated significantly with the DSI and corresponded to the same moderate subjective impairment level in men and women. The VEM showed a higher correlation with the VHI-9i total score than the DSI. Therefore, the VEM reflected the subjective perception better. Former studies stated that DSI and VHI are complementing parameters since there was no significant correlation detected [32, 33]. We conclude that VEM, DSI, and VHI represent complementing but different dimensions in voice diagnostics.

In contrast to VEM, a significant gender difference was found in F0high, SPLmin, MPT, and DSI. Females kept their physiologically higher F0high in all pathologies and in the healthy vocal state, resulting in a higher DSI (even when feeling more impaired). Other investigations confirmed a significantly higher F0high in females and the tendency for higher MPT in males, but could not detect a significant gender difference for DSI in subjects without voice complaints [34, 35]. Another study indicated that the DSI is influenced by the registration program as well as gender in patients with voice disorders [7]. The present investigation confirms these findings partially. When comparing the total cohort by gender, there was a significant difference found in DSI values, but in subgroup analysis, only patients with neurogenic lesions (V) and functional disorders (VI) differed on a significant level. Interestingly, those two groups are the ones where both genders showed no significant difference in MPT measures. Therefore, the influence of MPT as a considerable compensation mechanism in multidimensional DSI calculation must be further discussed.

Regarding VEM differences between PVU and nPVU, Caffier et al. [14] showed in patients with VF nodules a significant higher vocal capacity in PVU, with similar subjective VHI-9i impairment levels. Our study confirmed these results. Even among healthy subjects, a small proportion rated their voices mildly to moderately impaired according to the VHI-9i. All of them were female and prospective or active PVU. One explanation could be that PVU have particularly high demands and expectations regarding their vocal quality and performance. They depend on a proper function of their voices to “perform” at its best in their profession. When entering a vocal fitness examination, especially females tend to assess their voices overcritically, even when VRP measurements display a healthy or exceptional great voice. This is in line with the results of Chitguppi et al., which showed that female PVU with no obvious VF pathology reported significantly more voice-related complaints than nPVU, while reaching in all investigated objective vocal parameter values above average [36].

As expected, younger age was associated with higher VEM values. Similar to former studies, the age distribution revealed the majority of our patients in the working population, where the voice is most needed [3739]. The age difference of 8 years implies that PVU consult a physician at an earlier stage, but we did not measure the exact pathology sizes in the VLS to prove this assumption. The VEM was highest in young healthy subjects. The VEM quartiles showed a decrease of VEM with rising age. However, lifelong laryngeal training can postpone signs of vocal aging [40, 41]. Regardless of age, the presented values of VEM quartiles (Q1: ; Q2: to <93; Q3: to <108; Q4: ) could serve in the future as a reference range for the classification of vocal capacity in dysphonic patients and healthy subjects.

4.2. Effects of Pathology in all Subgroups

The VEM in all pathology subgroups differed significantly, without resulting in specific values or ranges that could (pre)determine the medical diagnoses. Every pathology influenced the intensity and frequency range on a distinct level, so that the VEM varied in one pathology from slightly to highly reduced. As known, the VRP aids to present the patients phonatory status without predicting the diagnosis [42]. Other studies confirmed that the VRP contour and its parameters are influenced by an organic voice disorder without showing a pathology-specific VRP configuration [43, 44].

Similar to previous studies, healthy subjects (VII) reached the highest VEM and DSI values while assessing their voices being not impaired [7, 16]. They could be clearly separated from patients with voice disorders by VEM and all other examined parameters. Subjects diagnosed with functional voice disorders (VI) had the best VEM and DSI values of the dysphonic patients and assessed their voices as moderately impaired. Dysfunction without organic changes is accompanied by impaired aerodynamic and acoustic measures [4547]. Within the heterogenic group of functional dysphonia, we found significant VEM differences between presbyphonic, dysodic, and hyperfunctional patients, proposing that the term “non-organic/functional” voice disorder is widely composed and needs clarification in nomenclature settings.

Patients with malignant epithelial lesions (I) had the lowest VEM and can be clearly separated from other subgroups. The VEM also differed significantly from benign changes of the epithelium (III). Likewise, Kang et al. [43] detected a lower objective voice quality (jitter, shimmer, and noise-to-harmonic ratio) as well as a higher subjective impairment (VHI) in patients with glottic carcinoma compared to benign epithelial lesions. The change of the microarchitecture of the epithelial layer leads to an irregular rough surface and voice abnormalities due to increased mass, interference with vibratory behavior, and compromised VF closure. Confirming the literature, increasing thickening and infiltration resulted in a reduced or absent mucosal wave with growing stiffness of the lesion; the glottal closure could be also progressively impaired [4850].

Patients with neurogenic voice disorders (V) assessed their voices higher impaired than the carcinoma group (I), but showed better functional measurements (except for MPT). This heterogeneous subgroup involved absent, decreased, or increased VF motions due to neurological or muscular disorders causing glottal insufficiency or dystonic spasms in varying degrees. As known from other studies, patients with unilateral VF paralysis reported vocal fatigue, dys-/aphonia and a breathy voice due to air escape; the extent depends on the position of the paralyzed VF [51, 52]. Omori et al. [53] found the glottal gap size as the main factor for reduced voice quality rather than the underlying diagnosis. Therefore, the comparable VEM in neurogenic and malignant lesions can be partly explained by the amount of glottal insufficiency, depending on the extent of infiltration, respectively, paralysis.

Benign changes of the lamina propria (II), the epithelium (III), and inflammatory lesions (IV) led to comparable moderately impaired VEM, DSI, and VHI-9i values. As expected, all included pathologies interfered with the vibratory behavior and closure pattern and resulted in roughness and/or breathiness. Confirming previous studies, more diffuse lesions with an excess of liquid, gelatinous, or fibrous material raised the VF stiffness and increased (e.g., Reinke’s edema) or decreased (e.g., laryngitis and sulcus vocalis) the mucosal wave [21, 54]. On the other hand, the VEM was able to differentiate between VF nodules, polyps, and Reinke’s edema by showing a different vocal capacity related to the pathology dimensions.

4.3. Limitations

This study involves a few limitations, which should be considered when drawing general conclusions. Firstly, auditory perceptual assessment was unfortunately not included in the initial study design. Due to the large amount of data, a retrospective blinded voice evaluation of 994 patients with 4-5 raters was not an available option. Secondly, the exact extent of the leading pathology was not classified or measured. However, the importance of the size of the findings should not be underestimated. For example, Salmen et al. found in patients with varying sizes of Reinke’s edema significant differences in VEM and DSI values depending on the Yonekawa classification [15]. Thirdly, various subgroups (e.g., II and VI) were very heterogeneous in itself, creating the false impression that the vocal capacity, respectively, impairments of different pathologies included are alike. Fourthly, the effects of age on the VEM can only be estimated since our patients above 45 years were all classified with a voice disorder. Considering the close relationship to the DSI and its proven influence on age [34, 35, 55], an age effect is likely. As there is no reliable differentiation between the effects of age and pathology, further investigations should include a larger age variety of healthy subjects. Besides, the VEM can only be calculated on the basis of a completed VRP. However, due to the parameter construction, an aphonic voice cannot be quantified by the VEM. In addition, further studies are necessary relating the presented VEM quartiles (Q1: ; Q2: to <93; Q3: to <108; Q4: ) to the auditory perceptual assessment of voices.

5. Conclusions

We conclude that the VEM proved to be a comprehensible and easy-to-use interval-scaled parameter for objective VRP evaluation in all laryngeal pathologies. By quantifying the vocal capacity, it represents vocal function as a positive measure and should therefore be established as a useful and complementary parameter in addition to the DSI. Besides, the VEM seems to reflect the subjective impairment better than the traditional DSI. However, the size of the VEM value alone does not allow to draw conclusions about the underlying diagnosis.

Data Availability

The datasets produced and investigated for this study are not publicly available, as they were obtained from a proprietary database via a licensing agreement.

Conflicts of Interest

The authors have no funding, financial relationships, or conflicts of interest to disclose.

Acknowledgments

The authors wish to thank Julius Rummich for the technical support in data acquisition and documentation, Tatiana Ermakova for the statistical advice, and Marie-Louise Freymann and Philipp Mathmann for the scientific collaboration.