Table of Contents Author Guidelines Submit a Manuscript
Advances in Acoustics and Vibration
Volume 2015 (2015), Article ID 170183, 9 pages
http://dx.doi.org/10.1155/2015/170183
Research Article

Eigennoise Speech Recovery in Adverse Environments with Joint Compensation of Additive and Convolutive Noise

Thai Nguyen University of Information and Communication Technology, Thai Nguyen 250000, Vietnam

Received 30 June 2015; Accepted 13 October 2015

Academic Editor: Marc Asselineau

Copyright © 2015 Trung-Nghia Phung et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113–120, 1979. View at Publisher · View at Google Scholar · View at Scopus
  2. J. S. Lim and A. V. Oppenheim, “Enhancement and bandwidth compression of noisy speech,” Proceedings of the IEEE, vol. 67, no. 12, pp. 1586–1604, 1979. View at Publisher · View at Google Scholar · View at Scopus
  3. Y. Ephraim and D. Malah, “Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 6, pp. 1109–1121, 1984. View at Publisher · View at Google Scholar · View at Scopus
  4. Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 33, no. 2, pp. 443–445, 1985. View at Publisher · View at Google Scholar · View at Scopus
  5. S. Doclo and M. Moonen, “GSVD-based optimal filtering for single and multimicrophone speech enhancement,” IEEE Transactions on Signal Processing, vol. 50, no. 9, pp. 2230–2244, 2002. View at Publisher · View at Google Scholar · View at Scopus
  6. H. Attias, J. C. Platt, A. Acero, and L. Deng, “Speech denoising and dereverberation using probabilistic models,” Advances in Neural Information Processing Systems, vol. 13, pp. 758–764, 2001. View at Google Scholar
  7. A. Mouchtaris, J. V. Spiegel, P. Mueller, and P. Tsakalides, “A spectral conversion approach to single-channel speech enhancement,” IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 4, pp. 1180–1193, 2007. View at Publisher · View at Google Scholar · View at Scopus
  8. N. Bi and Y. Qi, “Application of speech conversion to alaryngeal speech enhancement,” IEEE Transactions on Speech and Audio Processing, vol. 5, no. 2, pp. 97–105, 1997. View at Publisher · View at Google Scholar · View at Scopus
  9. K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, “Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech,” Speech Communication, vol. 54, no. 1, pp. 134–146, 2012. View at Publisher · View at Google Scholar · View at Scopus
  10. T. Hirahara, M. Otani, S. Shimizu et al., “Silent-speech enhancement using body-conducted vocal-tract resonance signals,” Speech Communication, vol. 52, no. 4, pp. 301–313, 2010. View at Publisher · View at Google Scholar · View at Scopus
  11. V.-A. Tran, G. Bailly, H. Lœvenbruck, and T. Toda, “Improvement to a NAM-captured whisper-to-speech system,” Speech Communication, vol. 52, no. 4, pp. 314–326, 2010. View at Publisher · View at Google Scholar · View at Scopus
  12. T. N. Phung, M. Unoki, and M. Akagi, “Improving bone-conducted speech restoration in noisy environment based on LP scheme,” in Proceedings of the APSIPA Annual Summit and Conference, Singapore, December 2010.
  13. D. Huy-Khoi, P. Trung-Nghia, H. C. Nguyen, V. T. Nguyen, and Q. V. Thai, “A novel spectral conversion based approach for noisy speech enhancement,” International Journal of Information and Electronics Engineering, vol. 1, no. 3, pp. 281–285, 2011. View at Google Scholar
  14. L. Sirovich and M. Kirby, “Low-dimensional procedure for the characterization of human faces,” Journal of the Optical Society of America A. Optics and Image Science, vol. 4, no. 3, pp. 519–524, 1987. View at Publisher · View at Google Scholar · View at Scopus
  15. Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, “Adaptive training for voice conversion based on eigenvoices,” IEICE Transactions on Information and Systems, vol. 93, no. 6, pp. 1589–1598, 2010. View at Publisher · View at Google Scholar · View at Scopus
  16. Y. Gong, “A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 975–983, 2005. View at Publisher · View at Google Scholar · View at Scopus
  17. D. A. Reynolds, “Speaker identification and verification using Gaussian mixture speaker models,” Speech Communication, vol. 17, no. 1-2, pp. 91–108, 1995. View at Publisher · View at Google Scholar · View at Scopus
  18. F. Rosenblatt, “The perceptron: a probabilistic model for information storage and organization in the brain,” Psychological Review, vol. 65, no. 6, pp. 386–408, 1958. View at Publisher · View at Google Scholar · View at Scopus
  19. D. A. Cohn, Z. Ghahramani, and M. I. Jordan, “Active learning with statistical models,” Journal of Artificial Intelligence Research, vol. 4, pp. 129–145, 1996. View at Google Scholar · View at Scopus
  20. H. Zen, T. Nose, J. Yamagishi et al., “The HMM-based speech synthesis system version 2.0,” in Proceedings of the 6th ISCA Tutorial and Research Workshop on Speech Synthesis (SSW '07), Bonn, Germany, August 2007.
  21. A. Kain and M. W. Macon, “Spectral voice conversion for text-to-speech synthesis,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '98), vol. 1, pp. 285–288, IEEE, May 1998. View at Scopus
  22. T. Matsui and S. Furui, “Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '92), vol. 2, pp. 157–160, IEEE, San Francisco, Calif, USA, March 1992. View at Publisher · View at Google Scholar
  23. European Telecommunications Standards Institute, “Speech processing, transmission and quality aspects (STQ); distributed speech recognition; front-end feature extraction algorithm; compression algorithms,” Technical standard 201 108, v1.1.3, 2003. View at Google Scholar
  24. S. S. Stevens, J. Volkman, and E. Newman, “A scale for the measurement of the psychological magnitude pitch,” Journal of the Acoustical Society of America, vol. 8, no. 3, pp. 185–190, 1937. View at Google Scholar
  25. H. Banno, J. Lu, S. Nakamura, K. Shikano, and H. Kawahara, “Efficient representation of short-time phase based on group delay,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '98), vol. 2, pp. 861–864, IEEE, Seattle, Wash, USA, May 1998. View at Publisher · View at Google Scholar
  26. G. Strang and T. Nguyen, Wavelets and Filter Banks, Wellesley Cambridge Press, Wellesley, Mass, USA, 1997.
  27. Q. T. Nguyen and T. N. Phung, “The perceptual wavelet feature for noise robust Vietnamese speech recognition,” in Proceedings of the 2nd International Conference on Communications and Electronics (HUT-ICCE '08), pp. 258–261, Hanoi, Vietnam, June 2008. View at Scopus
  28. P. T. Trung-Nghia, D. D. Cuong, and P. V. Binh, “A new wavelet-based wide-band speech coder,” in Proceedings of the International Conference on Advanced Technologies for Communications (ATC '08), pp. 349–352, Hanoi, Vietnam, October 2008. View at Publisher · View at Google Scholar · View at Scopus
  29. R. C. Guido, L. Sasso Vieira, S. Barbon Júnior et al., “A neural-wavelet architecture for voice conversion,” Neurocomputing, vol. 71, no. 1–3, pp. 174–180, 2007. View at Publisher · View at Google Scholar · View at Scopus
  30. E. Zwicker, “Subdivision of the audible frequency range into critical bands,” The Journal of the Acoustical Society of America, vol. 33, no. 2, article 248, 1961. View at Publisher · View at Google Scholar
  31. P. F. Craigmile and D. B. Percival, “Asymptotic decorrelation of between-scale wavelet coefficients,” IEEE Transactions on Information Theory, vol. 51, no. 3, pp. 1039–1048, 2005. View at Publisher · View at Google Scholar · View at Scopus
  32. K. Pearson, “On lines and planes of closest fit to systems of points in space,” Philosophical Magazine, vol. 2, no. 6, pp. 559–572, 1901. View at Google Scholar
  33. N. Ahmed, T. Natarajan, and K. R. Rao, “Discrete cosine transform,” IEEE Transactions on Computers, vol. 23, no. 1, pp. 90–93, 1974. View at Publisher · View at Google Scholar
  34. K. R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, Applications, Academic Press, Boston, Mass, USA, 1990. View at MathSciNet
  35. M. Chiappetta, “AMD breaks 8GHz overclock with upcoming FX processor, sets world record,” Hot Hardware, 2011. View at Google Scholar