ISRN Mechanical Engineering
Volume 2012 (2012), Article ID 919234, 9 pages
http://dx.doi.org/10.5402/2012/919234
Review Article

Single Channel Speech Enhancement Techniques in Spectral Domain

Department of Systems Innovations, Graduate School of Engineering Science, Osaka University, 1–3 Machikaneyama, Osaka, Toyonaka 560-8531, Japan

Received 13 February 2012; Accepted 30 April 2012

Academic Editor: D. Aggelis

Copyright © 2012 Arata Kawamura et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. M. Muneyasu and A. Taguchi, Nonlinear Digital Signal Processing, Asakura Publishing, Tokyo, Japan, 1999.
  2. A. Kawamura, Y. Iiguni, and Y. Itoh, “A noise reduction method based on linear prediction with variable step-size,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E88-A, no. 4, pp. 855–861, 2005.
  3. S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113–120, 1979.
  4. Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 6, pp. 1109–1121, 1984.
  5. B. Widrow, J. G. R. Glover Jr., J. M. McCool et al., “Adaptive noise cancelling: principles and applications,” Proceedings of the IEEE, vol. 63, no. 12, pp. 1692–1716, 1975.
  6. P. J. Wolfe and S. J. Godsill, “Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 10, pp. 1043–1051, 2003.
  7. R. J. McAulay and M. L. Malpass, “Speech enhancement using a soft-decision noise suppression filter,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 2, pp. 137–145, 1980.
  8. B. Chen and P. C. Loizou, “Speech enhancement using a MMSE short time spectral amplitude estimator with Laplacian speech modeling,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05), pp. I1097–I1100, March 2005.
  9. R. Martin, “Speech enhancement based on minimum mean-square error estimation and supergaussian priors,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 845–856, 2005.
  10. S. Gazor and W. Zhang, “Speech enhancement employing Laplacian-Gaussian mixture,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 896–904, 2005.
  11. T. Lotter and P. Vary, “Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model,” EURASIP Journal on Applied Signal Processing, vol. 2005, no. 7, pp. 1110–1126, 2005.
  12. I. Andrianakis and P. R. White, “Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors,” Speech Communication, vol. 51, no. 1, pp. 1–14, 2009.
  13. Y. Tsukamoto, A. Kawamura, and Y. Iiguni, “Speech enhancement based on MAP estimation using a variable speech distribution,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E90-A, no. 8, pp. 1587–1593, 2007.
  14. A. Kawamura, W. Thanhikam, and Y. Iiguni, “A speech spectral estimator using adaptive speech probability density function,” in Proceedings of EUSIPCO 2010, pp. 1549–1552, August 2010.
  15. W. Thanhikam, A. Kawamura, and Y. Iiguni, “Speech enhancement using speech model parameters refined by two-step technique,” in Proceedings of the 2nd APSIPA Annual Summit and Conference, p. 11, December 2010.
  16. W. Thanhikam, A. Kawamura, and Y. Iiguni, “Speech enhancement based on real-speech PDF in various narrow SNR intervals,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E95-A, no. 3, pp. 623–630, 2012.
  17. S. Furui, Digital Speech Processing, Tokai University Press, Tokyo, Japan, 1985.
  18. S. L. Miller and D. G. Childers, Probability and Random Processes, Elsevier/Academic Press, 2004.
  19. M. Kato, A. Sugiyama, and M. Serizawa, “Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E85-A, no. 7, pp. 1710–1718, 2002.