Computational and Mathematical Methods in Medicine
Volume 2015, Article ID 316325, 11 pages
http://dx.doi.org/10.1155/2015/316325
Research Article

Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis

1 Lehrstuhl für Mustererkennung, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Martensstraße 3, 91058 Erlangen, Germany
2 Klinik für Hals-, Nasen-, Ohrenheilkunde, Universitätsklinikum Magdeburg, Leipziger Straße 44, 39120 Magdeburg, Germany
3 Phoniatrische und Pädaudiologische Abteilung, Klinikum der Universität Erlangen-Nürnberg, Bohlenplatz 21, 91054 Erlangen, Germany
4 Department of Computer Science and Engineering, University of West Bohemia in Pilsen, Univerzitní 8, 306 14 Plzeň, Czech Republic
5 Klinik für Phoniatrie und Pädaudiologie, Medizinische Hochschule Hannover, Carl-Neuberg-Straße 1, 30625 Hannover, Germany

Received 23 February 2015; Accepted 13 May 2015

Academic Editor: Zoran Bursac

Copyright © 2015 Tino Haderlein et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. Y. Maryn and N. Roy, “Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity,” Jornal da Sociedade Brasileira de Fonoaudiologia, vol. 24, no. 2, pp. 107–112, 2012.
  2. J. Oates, “Auditory-perceptual evaluation of disordered voice quality: pros, cons and future directions,” Folia Phoniatrica et Logopaedica, vol. 61, no. 1, pp. 49–56, 2009.
  3. G. B. Kempster, B. R. Gerratt, K. V. Abbott, J. Barkmeier-Kraemer, and R. E. Hillman, “Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol,” American Journal of Speech-Language Pathology, vol. 18, no. 2, pp. 124–132, 2009.
  4. J. Kreiman and B. R. Gerratt, “The perceptual structure of pathologic voice quality,” Journal of the Acoustical Society of America, vol. 100, no. 3, pp. 1787–1795, 1996.
  5. M. Hirano, Clinical Examination of Voice, Springer, New York, NY, USA, 1981.
  6. T. Nawka, L.-C. Anders, and J. Wendler, “Die auditive Beurteilung heiserer Stimmen nach dem RBH-System,” Sprache Stimme Gehör, vol. 18, no. 3, pp. 130–133, 1994.
  7. Y. Maryn, N. Roy, M. De Bodt, P. van Cauwenberge, and P. Corthals, “Acoustic measurement of overall voice quality: a meta-analysis,” Journal of the Acoustical Society of America, vol. 126, no. 5, pp. 2619–2634, 2009.
  8. V. Parsa and D. G. Jamieson, “Acoustic discrimination of pathological voice: sustained vowels versus continuous speech,” Journal of Speech, Language, and Hearing Research, vol. 44, no. 2, pp. 327–339, 2001.
  9. G. de Krom, “Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments,” Journal of Speech and Hearing Research, vol. 38, no. 4, pp. 794–811, 1995.
  10. J. Laver, The Phonetic Description of Voice Quality, Cambridge University Press, Cambridge, UK, 1980.
  11. J. Revis, A. Giovanni, and J.-M. Triglia, “Influence de l’attaque sur l’analyse perceptive des dysphonies,” Folia Phoniatrica et Logopaedica, vol. 54, no. 1, pp. 19–25, 2002.
  12. J. Revis, A. Giovanni, F. Wuyts, and J.-M. Triglia, “Comparison of different voice samples for perceptual analysis,” Folia Phoniatrica et Logopaedica, vol. 51, no. 3, pp. 108–116, 1999.
  13. I. R. Titze, Workshop on Acoustic Voice Analysis: Summary Statement, National Center for Voice and Speech, Denver, Colo, USA, 1995.
  14. A. Fourcin, “Aspects of voice irregularity measurement in connected speech,” Folia Phoniatrica et Logopaedica, vol. 61, no. 3, pp. 126–136, 2009.
  15. A. J. Fourcin and E. Abberton, “First applications of a new laryngograph,” Medical & Biological Illustration, vol. 21, no. 3, pp. 172–182, 1971.
  16. V. Zeißler, J. Adelhardt, A. Batliner et al., “The prosody module,” in SmartKom: Foundations of Multimodal Dialogue Systems, W. Wahlster, Ed., pp. 139–152, Springer, Berlin, Germany, 2006.
  17. T. Haderlein, E. Nöth, H. Toy et al., “Automatic evaluation of prosodic features of tracheoesophageal substitute voice,” European Archives of Oto-Rhino-Laryngology, vol. 264, no. 11, pp. 1315–1321, 2007.
  18. T. Haderlein, C. Moers, B. Möbius, and E. Nöth, “Automatic rating of hoarseness by text-based cepstral and prosodic evaluation,” in Text, Speech and Dialogue: 15th International Conference, TSD 2012, Brno, Czech Republic, September 3–7, 2012. Proceedings, vol. 7499, pp. 573–580, Springer, Berlin, Germany, 2012.
  19. P. H. Dejonckere, P. Bradley, P. Clemente et al., “A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS),” European Archives of Oto-Rhino-Laryngology, vol. 258, no. 2, pp. 77–82, 2001.
  20. International Phonetic Association, Handbook of the International Phonetic Association, Cambridge University Press, Cambridge, UK, 1999.
  21. A. Fourcin, E. Abberton, D. Miller, and D. Howells, “Laryngography speech pattern element tools for therapy, training and assessment,” European Journal of Disorders of Communication, vol. 30, no. 2, pp. 101–115, 1995.
  22. G. Fairbanks, Voice and Articulation Drillbook, Harper, New York, NY, USA, 2nd edition, 1960.
  23. A. E. Aronson and D. M. Bless, Clinical Voice Disorders, Thieme, New York, NY, USA, 4th edition, 2009.
  24. J. Koreman, M. Pützer, and M. Just, “Correlates of varying vocal fold adduction deficiencies in perception and production: methodological and practical considerations,” Folia Phoniatrica et Logopaedica, vol. 56, no. 5, pp. 305–320, 2004.
  25. M. Pützer and W. J. Barry, “Methodische Aspekte der auditiven Beurteilung von Stimmqualität,” Sprache-Stimme-Gehör, vol. 28, no. 4, pp. 188–197, 2004.
  26. M. Ptok, C. Schwemmle, C. Iven, M. Jessen, and T. Nawka, “Zur auditiven Bewertung der Stimmqualität,” HNO, vol. 54, no. 10, pp. 793–802, 2006.
  27. T. Haderlein, K. Riedhammer, E. Nöth et al., “Application of automatic speech recognition to quantitative assessment of tracheoesophageal speech with different signal quality,” Folia Phoniatrica et Logopaedica, vol. 61, no. 1, pp. 12–17, 2009.
  28. A. Batliner, J. Buckow, H. Niemann, E. Nöth, and V. Warnke, “The prosody module,” in Verbmobil: Foundations of Speech-to-Speech Translation, W. Wahlster, Ed., pp. 106–121, Springer, Berlin, Germany, 2000.
  29. A. Batliner, K. Fischer, R. Huber, J. Spilker, and E. Nöth, “How to find trouble in communication,” Speech Communication, vol. 40, no. 1-2, pp. 117–143, 2003.
  30. A. J. Smola and B. Schölkopf, “A tutorial on support vector regression,” Statistics and Computing, vol. 14, no. 3, pp. 199–222, 2004.
  31. I. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann, San Francisco, Calif, USA, 2nd edition, 2005.
  32. A. Maier, Speech of Children with Cleft Lip and Palate: Automatic Assessment. Dissertation, vol. 29 of Studien zur Mustererkennung, Logos, Berlin, Germany, 2009.
  33. K. Krippendorff, Content Analysis: An Introduction to Its Methodology, Sage, Thousand Oaks, Calif, USA, 2004.
  34. C. Moers, B. Möbius, F. Rosanowski, E. Nöth, U. Eysholdt, and T. Haderlein, “Vowel- and text-based cepstral analysis of chronic hoarseness,” Journal of Voice, vol. 26, no. 4, pp. 416–424, 2012.
  35. M. Ptok, C. Iven, M. Jessen, and C. Schwemmle, “Objektiv gemessene Stimmlippenschwingungsirregularität vs. subjektiver Eindruck der Rauigkeit,” HNO, vol. 54, no. 2, pp. 132–138, 2006.
  36. P. C. Bagshaw, S. M. Hiller, and M. A. Jack, “Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching,” in Proceedings of the European Conference on Speech Communication and Technology (Eurospeech '93), vol. 2, pp. 1003–1006, 1993.
  37. P. N. Carding, I. N. Steen, A. Webb, K. Mackenzie, I. J. Deary, and J. A. Wilson, “The reliability and sensitivity to change of acoustic measures of voice quality,” Clinical Otolaryngology and Allied Sciences, vol. 29, no. 5, pp. 538–544, 2004.
  38. J. Hillenbrand, “Perception of aperiodicities in synthetically generated voices,” Journal of the Acoustical Society of America, vol. 83, no. 6, pp. 2361–2371, 1988.
  39. D. M. Hartl, S. Hans, J. Vaissière, and D. F. Brasnu, “Objective acoustic and aerodynamic measures of breathiness in paralytic dysphonia,” European Archives of Oto-Rhino-Laryngology, vol. 260, no. 4, pp. 175–182, 2003.
  40. I. V. Bele, “Reliability in perceptual analysis of voice quality,” Journal of Voice, vol. 19, no. 4, pp. 555–573, 2005.
  41. S.-O. Caballero-Morales, “Estimation of phoneme-specific HMM topologies for the automatic recognition of dysarthric speech,” Computational and Mathematical Methods in Medicine, vol. 2013, Article ID 297860, 15 pages, 2013.