Table of Contents Author Guidelines Submit a Manuscript
International Journal of Reconfigurable Computing
Volume 2011, Article ID 697080, 10 pages
http://dx.doi.org/10.1155/2011/697080
Research Article

FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition

Electronics, Communications and Information Technology (ECIT), Queens University Belfast, Northern Ireland Science Park, Belfast BT3 9DT, UK

Received 1 June 2010; Revised 19 September 2010; Accepted 27 September 2010

Academic Editor: Gustavo Sutter

Copyright © 2011 Richard Veitch et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. O. Viikki, I. Kiss, and J. Tian, “Speaker- and language-independent speech recognition in mobile communication systems,” in Proceedings of IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, pp. 5–8, May 2001. View at Scopus
  2. A. Bernard and A. Alwan, “Low-bitrate distributed speech recognition for packet-based and wireless communication,” IEEE Transactions on Speech and Audio Processing, vol. 10, no. 8, pp. 570–579, 2002. View at Publisher · View at Google Scholar · View at Scopus
  3. N. Leavitt, “Let’s hear it for audio mining,” Computer, vol. 35, no. 10, pp. 23–25, 2002. View at Google Scholar
  4. S. Douglas, D. Agarwal, T. Alonso et al., “Mining customer care dialogs for ‘daily news’,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 652–660, 2005. View at Google Scholar
  5. W. Walker, P. Lamere, P. Kwok et al., “Sphinx-4: a flexible open source framework for speech recognition,” Sun Microsystems Whitepaper, 2004.
  6. S. J. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book Version 3.4., Cambridge University Press, Cambridge, Mass, USA, 2006.
  7. E. C. Lin, K. Yu, R. A. Rutenbar, and T. Chen, “A 1000-word vocabulary, speaker-independent, continuous live-mode speech recognizer implemented in a single FPGA,” in Proceedings of the 15th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA '07), pp. 60–68, February 2007. View at Publisher · View at Google Scholar · View at Scopus
  8. O. Cheng, W. Abdulla, and Z. Salcic, “Hardware-software co-design of automatic speech recognition system for embedded real-time applications,” to appear in IEEE Transactions on Industrial Electronics.
  9. E. C. Lin and R. A. Rutenbar, “A multi-FPGA 10x-real-time high-speed search engine for a 5000-word vocabulary speech recognizer,” in Proceedings of the 7th ACM SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA '09), pp. 83–92, February 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. K. You, H. Lim, and W. Sung, “Architectural design and implementation of an FPGA softcore based speech recognition system,” in Proceedings of the 6th IEEE International Workshop on System on Chip for Real Time Applications (IWSOC '06), pp. 50–55, December 2006. View at Publisher · View at Google Scholar · View at Scopus
  11. M. Mohri, F. Pereira, and M. Riley, “Weighted finite-state transducers in speech recognition,” Computer Speech and Language, vol. 16, no. 1, pp. 69–88, 2002. View at Publisher · View at Google Scholar · View at Scopus
  12. R. Veitch, L.-M. Aubert, R. Woods, and S. Fischaber, “Acceleration of hmm-based speech recognition system by parallel fpga gaussian calculation,” in Proceedings of the 6th Southern Conference on Programmable Logic, 2010.
  13. L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989. View at Publisher · View at Google Scholar · View at Scopus
  14. J. Chong, Y. Yi, A. Faria, N. Satish, and K. Keutzer, “Dataparallel large vocabulary continuous speech recognition on graphics processors,” in Proceedings of the 1st Annual Workshop on Emerging Applications and Many Core Architectures, June 2008.
  15. S. Molau, M. Pitz, R. Schlüter, and H. Ney, “Computing mel-frequency cepstral coefficients on the power spectrum,” in Proceedings of IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, pp. 73–76, May 2001. View at Scopus
  16. B. Milner and X. Shao, “Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end,” Speech Communication, vol. 48, no. 6, pp. 697–715, 2006. View at Publisher · View at Google Scholar · View at Scopus
  17. E. Bocchieri and D. Blewett, “A decoder for LVCSR based on fixed-point arithmetic,” in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '06), pp. 1113–1116, May 2006. View at Scopus
  18. Xilinx, “Virtex-5 Family Overview,” http://www.xilinx.com/support/documentation/data_sheets/ds100.pdf.