Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2014, Article ID 723643, 8 pages
http://dx.doi.org/10.1155/2014/723643
Research Article

A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement

1College of Computer Science and Technology, Nanjing University of Science and Technology (NUST), Nanjing 210094, China
2College of Information Technology, Jinling Institute of Technology (JIT), Nanjing 211169, China
3College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications (NUPT), Nanjing 210046, China

Received 10 February 2014; Accepted 22 April 2014; Published 12 May 2014

Academic Editor: Juan Manuel Górriz Sáez

Copyright © 2014 Yan Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. M. Sahidullah and G. Saha, “Comparison of speech activity detection techniques for speaker recognition,” Cornell University Library, 2012.
  2. J. Han, S. Yook, K. W. Nam et al., “Comparative evaluation of voice activity detectors in single microphone noise reduction algorithms,” Biomedical Engineering Letters, vol. 2, no. 4, pp. 255–264, 2012. View at Google Scholar
  3. C.-H. Hsieh, T.-Y. Feng, and P.-C. Huang, “Energy-based VAD with grey magnitude spectral subtraction,” Speech Communication, vol. 51, no. 9, pp. 810–819, 2009. View at Publisher · View at Google Scholar · View at Scopus
  4. J. L. Shen, J. W. Hung, and L. S. Lee, “Robust entropy-based endpoint detection for speech recognition in noisy environments,” in Proceedings of the 5th International Conference Spoken Language Processing (ICSLP ’98), pp. 232–235, Sydney, Australia, November 1998.
  5. N. Cho and E.-K. Kim, “Enhanced voice activity detection using acoustic event detection and classification,” IEEE Transactions on Consumer Electronics, vol. 57, no. 1, pp. 196–202, 2011. View at Publisher · View at Google Scholar · View at Scopus
  6. K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, “Noise robust voice activity detection based on periodic to aperiodic component ratio,” Speech Communication, vol. 52, no. 1, pp. 41–60, 2010. View at Publisher · View at Google Scholar · View at Scopus
  7. S. Sadjadi and J. Hansen, “Unsupervised speech activity detection using voicing measures and perceptual spectral flux,” IEEE Signal Processing Letters, vol. 20, no. 3, pp. 197–200, 2013. View at Google Scholar
  8. T. Fukuda, O. Ichikawa, and M. Nishimura, “Long-term spectro-temporal and static harmonic features for voice activity detection,” IEEE Journal on Selected Topics in Signal Processing, vol. 4, no. 5, pp. 834–844, 2010. View at Publisher · View at Google Scholar · View at Scopus
  9. N. Dhananjaya and B. Yegnanarayana, “Voiced/nonvoiced detection based on robustness of voiced epochs,” IEEE Signal Processing Letters, vol. 17, no. 3, pp. 273–276, 2010. View at Publisher · View at Google Scholar · View at Scopus
  10. D. K. Kim and J. H. Chang, “Statistical voice activity detection in kernel space,” The Journal of the Acoustical Society of America, vol. 132, no. 4, pp. 303–309, 2012. View at Google Scholar
  11. J. M. Górriz, J. Ramírez, E. W. Lang et al., “Improved likelihood ratio test based voice activity detector applied to speech recognition,” Speech Communication, vol. 52, no. 7-8, pp. 664–677.
  12. J. Wu and X.-L. Zhang, “Efficient multiple kernel support vector machine based voice activity detection,” IEEE Signal Processing Letters, vol. 18, no. 8, pp. 466–469, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. X. Bao and J. Zhu, “A novel voice activity detection based on phoneme recognition using statistical model,” Journal on Audio, Speech, and Music Processing, vol. 2012, article 1, 2012. View at Publisher · View at Google Scholar
  14. M. H. Moattar and M. M. Homayounpour, “A weighted feature voting approach for robust and real-time voice activity detection,” ETRI Journal, vol. 33, no. 1, pp. 99–109, 2011. View at Publisher · View at Google Scholar · View at Scopus
  15. S. K. Haghani and S. M. Ahadi, “Robust voice activity detection using feature combination,” in Proceedings of the 21st Iranian Conference on Electrical Engineering (ICEE '13), pp. 1–5, Mashhad, Iran, May 2013.
  16. L. M. Arslan, “Modified wiener filtering,” Signal Processing, vol. 86, no. 2, pp. 267–272, 2006. View at Publisher · View at Google Scholar · View at Scopus
  17. Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 6, pp. 1109–1121, 1984. View at Google Scholar · View at Scopus
  18. M. J. Alam, M. F. A. Chowdhury, and M. F. Alam, “Comparative study of a priori signal-to- noise ratio (snr) estimation approaches for speech enhancement,” Journal of Electrical and Electronics Engineering, vol. 9, no. 1, pp. 809–817, 2009. View at Google Scholar · View at Scopus
  19. B.-F. Wu and K.-C. Wang, “Robust endpoint detection algorithm based on the adaptive band-partitioning spectral entropy in adverse environments,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 762–774, 2005. View at Publisher · View at Google Scholar · View at Scopus
  20. V. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 1995.
  21. C. C. Chang and C. J. Lin, “LIBSVM: a library for support vector machines,” 2001, http://www.csie.ntu.edu.tw/~cjlin/libsvm/.
  22. A. Benyassine, E. Shlomot, and H. Su, “ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications,” IEEE Communications Magazine, vol. 35, no. 9, pp. 64–73, 1997. View at Google Scholar
  23. Digital Cellular Telecommunications System (Phase 2+), Adaptive Multi Rate (AMR) Speech, ANSI-C Code for AMR Speech Codec; ETSI TS 100 905 V7.0.0 (1999-08); ETSI: Sophia Antipolis, France, 1998.