Research Article

Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

Table 9

Comparison of AVPD with two publicly available voice disorder databases.

Sr. number. Characteristics: MEEI; AVPD; SVD

1. Language
   MEEI: English
   AVPD: Arabic
   SVD: German

2. Recording location
   MEEI: Massachusetts Eye & Ear Infirmary (MEEI) voice and speech laboratory, USA
   AVPD: Communication and Swallowing Disorders Unit, King Abdulaziz University Hospital, Saudi Arabia
   SVD: Saarland University, Germany

3. Sampling frequency
   MEEI: Samples are recorded at different sampling frequencies
     (i) 10 kHz
     (ii) 25 kHz
     (iii) 50 kHz
   AVPD: All samples are recorded at the same frequency
     (i) 48 kHz
   SVD: All samples are recorded at the same frequency
     (i) 50 kHz

4. Extension of recorded samples
   MEEI: Recorded samples are stored in .NSP format only
   AVPD: Recorded samples are stored in .wav and .nsp format
   SVD: Recorded samples are stored in .wav and .nsp format

5. Recorded text
   MEEI:
     (i) Vowel /a/
     (ii) Rainbow passage
   AVPD:
     (i) Vowel /a/
     (ii) Vowel /i/
     (iii) Vowel /u/
     (iv) Al-Fateha (running speech)
     (v) Arabic digits
     (vi) Common words
     (All vowels are recorded with a repetition)
   SVD:
     (i) Vowel /a/
     (ii) Vowel /i/
     (iii) Vowel /u/
     (iv) A sentence

6. Recording of vowels
   MEEI: Only the stable part of the phonation
   AVPD: Complete phonation, including onset and offset parts
   SVD: Only the stable part of the phonation

7. Length of recorded samples
   MEEI:
     Normal
       (i) Vowel: 3 sec
       (ii) Rainbow: 12 sec
     Patient
       (i) Vowel: 1 sec
       (ii) Rainbow: 9 sec
   AVPD:
     (i) Vowel: 5 sec
     (ii) Al-Fateha: 18 sec
     (iii) Digits: 10 sec
     (iv) Words: 3 sec
     (The length of a complete recorded sample is approx. 60 sec)
   SVD:
     Vowels: 1 to 3 sec
     Sentence: 2 sec

8. Ratio of normal and pathological subjects
   MEEI: Normal: 7%, Pathological: 93%
   AVPD: Normal: 51%, Pathological: 49%
   SVD: Normal: 33%, Pathological: 67%

9. Perceptual severity
   Perceptual severity is rated on a scale of 1 (low) to 3 (high)

10. Pathology types
   MEEI: Functional and organic
   AVPD: Organic
   SVD: Functional and organic

11. Evaluation of normal subjects
   No such information is available