Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2013 (2013), Article ID 435729, 8 pages
http://dx.doi.org/10.1155/2013/435729
Research Article

Robustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments

Signal Processing Laboratory, Physics Department, Sciences Faculty of Tunis, University of Tunis ElManar, 1060 Tunis, Tunisia

Received 31 March 2013; Accepted 8 May 2013

Academic Editors: E. P. Ong and L. Silva

Copyright © 2013 Lotfi Salhi and Adnane Cherif. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. P. Yu, M. Ouaknine, J. Revis, and A. Giovanni, “Objective voice analysis for dysphonic patients: a multiparametric protocol including acoustic and aerodynamic measurements,” Journal of Voice, vol. 15, no. 4, pp. 529–542, 2001. View at Publisher · View at Google Scholar · View at Scopus
  2. B. Boyanov and S. Hadjitodorov, “Acoustic analysis of pathological voices: a voice analysis system for the screening and laryngeal diseases,” IEEE Engineering in Medicine and Biology Magazine, vol. 16, no. 4, pp. 74–82, 1997. View at Publisher · View at Google Scholar · View at Scopus
  3. V. Parsa and D. G. Jamieson, “Identification of pathological voices using glottal noise measures,” Journal of Speech, Language, and Hearing Research, vol. 43, no. 2, pp. 469–485, 2000. View at Google Scholar · View at Scopus
  4. T. Dubuisson, T. Dutoit, B. Gosselin, and M. Remacle, “On the use of the correlation between acoustic descriptors for the normal/Pathological voices discrimination,” EURASIP Journal on Advances in Signal Processing, vol. 2009, Article ID 173967, 2009. View at Publisher · View at Google Scholar · View at Scopus
  5. C. d’Alessandro, B. Bozkurt, B. Doval et al., “Phase-based methods for voice source analysis,” in Advances in Nonlinear Speech Processing, pp. 1–27, 2007. View at Google Scholar
  6. B. Boashash, “Estimating and interpreting the instantaneous frequency of a signal. I. Fundamentals,” Proceedings of the IEEE, vol. 80, no. 4, pp. 520–538, 1992. View at Publisher · View at Google Scholar
  7. L. Liu, J. He, and G. Palm, “Effects of phase on the perception of intervocalic stop consonants,” Speech Communication, vol. 22, no. 4, pp. 403–417, 1997. View at Google Scholar · View at Scopus
  8. R. M. Hegde, H. A. Murthy, and V. R. R. Gadde, “Significance of the modified group delay feature in speech recognition,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 1, pp. 190–202, 2007. View at Publisher · View at Google Scholar · View at Scopus
  9. O. Ghitza, “Auditory models and human performance in tasks related to speech coding and speech recognition,” IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 115–132, 1994. View at Google Scholar
  10. B. R. Glasberg and B. C. J. Moore, “Derivation of auditory filter shapes from notched-noise data,” Hearing Research, vol. 47, no. 1-2, pp. 103–138, 1990. View at Publisher · View at Google Scholar
  11. T. Irino and R. D. Patterson, “A time-domain, level-dependent auditory filter: the gammachirp,” Journal of the Acoustical Society of America, vol. 101, no. 1, pp. 412–419, 1997. View at Publisher · View at Google Scholar
  12. J. Kortelainen and K. Noponen, Neural Networks, Intelligent Systems, 2005.
  13. C. M. Bishop, Neural Networks for Pattern Recognition, Clarendon Press, Oxford, UK, 1996.
  14. A. Cherif, “Pitch detection and formant extraction of Arabic speech processing,” Journal of Applied Acoustics, 2001. View at Google Scholar
  15. L. Salhi, M. Talbi, S. Abid, and A. Cherif, “Performance of wavelet analysis and neural networks for pathological voices identification,” International Journal of Electronics, vol. 98, no. 7–9, pp. 1129–1140, 2011. View at Publisher · View at Google Scholar
  16. A. Kacha, F. Grenez, and J. Schoentgen, “Multiband frame-based acoustic cues of vocal dysperiodicities in disordered connected speech,” Biomedical Signal Processing and Control, vol. 1, no. 2, pp. 137–143, 2006. View at Publisher · View at Google Scholar · View at Scopus
  17. F. Bettens, F. Grenez, and J. Schoentgen, “Estimation of vocal dysperiodicities in disordered connected speech by means of distant-sample bidirectional linear predictive analysis,” Journal of the Acoustical Society of America, vol. 117, no. 1, pp. 328–337, 2005. View at Publisher · View at Google Scholar · View at Scopus
  18. S. B. Davis and P. Mermelstein, “Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357–366, 1980. View at Google Scholar · View at Scopus
  19. A. Potamianos and P. Maragos, “Time-frequency distributions for automatic speech recognition,” IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, pp. 196–200, 2001. View at Publisher · View at Google Scholar · View at Scopus
  20. K. S. R. Murty and B. Yegnanarayana, “Combining evidence from residual phase and MFCC features for speaker recognition,” IEEE Signal Processing Letters, vol. 13, no. 1, pp. 52–55, 2006. View at Publisher · View at Google Scholar · View at Scopus
  21. F. Jabloun, A. E. Çetin, and E. Erzin, “Teager energy based feature parameters for speech recognition in car noise,” IEEE Signal Processing Letters, vol. 6, no. 10, pp. 259–261, 1999. View at Publisher · View at Google Scholar · View at Scopus
  22. J. F. Kaiser, “Some observations on vocal tract operation from a fluid flow point of view,” in Vocal Fold Physiology: Biomechanics, Acoustics, and Phonatory Control, 1983. View at Google Scholar
  23. H. A. Patil and K. K. Parhi, “Novel variable length teager energy based features for person recognition from their hum,” in Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP '10), pp. 4526–44529, March 2010.
  24. J. F. Kaiser, “On a simple algorithm to calculate the ‘energy’ of a signal,” in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '90), pp. 381–384, Albuquerque, NM, USA, April 1990.
  25. D. Dimitriadis, P. Maragos, and A. Potamianos, “Auditory teager energy cepstrum coefficients for robust speech recognition,” in Proceedings of the 9th European Conference on Speech Communication and Technology, pp. 3013–3016, Lisbon, Portugal, September 2005. View at Scopus