Table of Contents Author Guidelines Submit a Manuscript
Journal of Electrical and Computer Engineering
Volume 2017, Article ID 1735698, 9 pages
https://doi.org/10.1155/2017/1735698
Research Article

Speaker Recognition Using Wavelet Packet Entropy, I-Vector, and Cosine Distance Scoring

Laboratory of Cyberspace, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China

Correspondence should be addressed to She Kun; nc.ude.ctseu@nuk

Received 16 February 2017; Revised 17 April 2017; Accepted 26 April 2017; Published 14 May 2017

Academic Editor: Lei Zhang

Copyright © 2017 Lei Lei and She Kun. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. H. Perez, J. Martinez, and I. Espinosa, “Using acoustic paraliguistic information to assess the interaction quality in speech-based system for elder users,” International Jounrnal of Human-Computer Studies, vol. 98, pp. 1–13, 2017. View at Google Scholar
  2. N. Almaadeed, A. Aggoun, and A. Amira, “Speaker identification using multimodal neural networks and wavelet analysis,” IET Biometrics, vol. 4, no. 1, pp. 18–28, 2015. View at Publisher · View at Google Scholar
  3. T. Kinnunen and H. Li, “An overview of text-independent speaker recognition: from features to supervectors,” Speech Communication, vol. 52, no. 1, pp. 12–40, 2010. View at Publisher · View at Google Scholar · View at Scopus
  4. D. A. Reynolds and R. C. Rose, “Robust text-independent speaker identification using gaussian mixture speaker models,” IEEE Transactions on Speech and Audio Processing, vol. 3, no. 1, pp. 72–83, 1995. View at Publisher · View at Google Scholar · View at Scopus
  5. K. S. Ahmad, A. S. Thosar, J. H. Nirmal, and V. S. Pande, “A unique approach in text-independent speaker recognition using mfcc feature sets and probabilistic neural network,” in Proceedings of the 8th International Conference on Advances in Pattern Recognition (ICAPR '15), Kolkata, India, 2015. View at Publisher · View at Google Scholar · View at Scopus
  6. X.-Y. Zhang, J. Bai, and W.-Z. Liang, “The speech recognition system based on bark wavelet MFCC,” in Proceedings of the 8th International Conference on Signal Processing (ICSP '06), pp. 16–20, Beijing, China, 2006. View at Publisher · View at Google Scholar · View at Scopus
  7. A. Biswas, P. K. Sahu, and M. Chandra, “Admissible wavelet packet features based on human inner ear frequency response for Hindi consonant recongnition,” Computer & Communication Technology, vol. 40, pp. 1111–1122, 2014. View at Google Scholar
  8. K. Daqrouq and T. A. Tutunji, “Speaker identification using vowels features through a combined method of formants, wavelets, and neural network classifiers,” Applied Soft Computing, vol. 27, pp. 231–239, 2015. View at Publisher · View at Google Scholar
  9. D. Avci, “An expert system for speaker identification using adaptive wavelet sure entropy,” Expert Systems with Applications, vol. 36, no. 3, pp. 6295–6300, 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. K. Daqrouq, “Wavelet entropy and neural network for text-independent speaker identification,” Engineering Applications of Artificial Intelligence, vol. 24, no. 5, pp. 796–802, 2011. View at Publisher · View at Google Scholar · View at Scopus
  11. M. K. Elyaberani, S. H. Mahmoodian, and G. Sheikhi, “Wavelet packet entropy in speaker-identification emotional state detection from speech signal,” Journal of Intelligent Procedures in Electrical Technology, vol. 20, pp. 67–74, 2015. View at Google Scholar
  12. M. Li, A. Tsiartas, M. Van Segbroeck, and S. S. Narayanan, “Speaker verification using simplified and supervised i-vector modeling,” in Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '13), pp. 7199–7203, Vancouver, Canada, 2013. View at Publisher · View at Google Scholar · View at Scopus
  13. D. Garcia-Romero and A. McCree, “Supervised domain adaptation for i-vector based speaker recognition,” in Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing, pp. 4047–4051, Florence, Italy, 2014. View at Publisher · View at Google Scholar · View at Scopus
  14. A. Kanagasundaram, D. Dean, and S. Sridharan, “I-vector based speaker recognition using advanced channel compensation techniques,” Computer Speech & Labguage, vol. 28, pp. 121–140, 2014. View at Publisher · View at Google Scholar
  15. M. H. Bahari, R. Saeidi, H. Van Hamme, and D. Van Leeuwen, “Accent recognition using i-vector, gaussian mean supervector and gaussian posterior probability supervector for spontaneous telephone speech,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 7344–7348, IEEE, Vancouver, Canada, 2013. View at Publisher · View at Google Scholar · View at Scopus
  16. B. Saha and K. Kamarauslas, “Evaluation of effectiveness of different methods in speaker recognition,” Elektronika ir Elektrochnika, vol. 98, pp. 67–70, 2015. View at Google Scholar
  17. K. K. George, C. S. Kumar, K. I. Ramachandran, and A. Panda, “Cosine distance features for robust speaker verification,” in Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH '15), pp. 234–238, Dresden, Germany, 2015. View at Scopus
  18. B. G. Nagaraja and H. S. Jayanna, “Efficient window for monolingual and crosslingual speaker identification using MFCC,” in Proceedings of the International Conference on Advanced Computing and Communication Systems (ICACCS '13), pp. 1–4, Coimbatore, India, 2013. View at Publisher · View at Google Scholar · View at Scopus
  19. M. Medeiros, G. Araújo, H. Macedo, M. Chella, and L. Matos, “Multi-kernel approach to parallelization of EM algorithm for GMM training,” in Proceedings of the 3rd Brazilian Conference on Intelligent Systems (BRACIS '14), pp. 158–165, Sao Paulo, Brazil, 2014. View at Publisher · View at Google Scholar · View at Scopus
  20. H. R. Tohidypour, S. A. Seyyedsalehi, and H. Behbood, “Comparison between wavelet packet transform, Bark Wavelet & MFCC for robust speech recognition tasks,” in Proceedings of the 2nd International Conference on Industrial Mechatronics and Automation (ICIMA '10), pp. 329–332, Wuhan, China, 2010. View at Publisher · View at Google Scholar · View at Scopus
  21. X.-F. Lv and P. Tao, “Mallat algorithm of wavelet for time-varying system parametric identification,” in Proceedings of the 25th Chinese Control and Decision Conference (CCDC '13), pp. 1554–1556, Guiyang, China, 2013. View at Publisher · View at Google Scholar · View at Scopus
  22. D. Marthnez, O. Pichot, and L. Burget, “Language recognition in I-vector space,” in Proceedongs of the Interspeech, pp. 861–864, Florence, Italy, 2011.
  23. P. Kenny, G. Boulianne, and P. Dumouchel, “Eigenvoice modeling with sparse training data,” IEEE Transactions on Speech and Audio Processing, vol. 13, no. 3, pp. 345–354, 2005. View at Publisher · View at Google Scholar · View at Scopus
  24. M. McLaren and D. Van Leeuwen, “Improved speaker recognition when using i-vectors from multiple speech sources,” in Proceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), pp. 5460–5463, Prague, Czech Republic, 2011. View at Publisher · View at Google Scholar · View at Scopus
  25. A. Biswas, P. K. Sahu, A. Bhowmick, and M. Chandra, “Feature extraction technique using ERB like wavelet sub-band periodic and aperiodic decomposition for TIMIT phoneme recognition,” International Journal of Speech Technology, vol. 17, no. 4, pp. 389–399, 2014. View at Publisher · View at Google Scholar · View at Scopus
  26. S. G. Mallat, A Wavelet Tour of Signal Processing, ELsevier, Amsterdam, Netherlands, 2012.
  27. Q. Yang and J. Wang, “Multi-level wavelet shannon entropy-based method for single-sensor fault location,” Entropy, vol. 17, no. 10, pp. 7101–7117, 2015. View at Publisher · View at Google Scholar · View at Scopus
  28. T. Ganchev, M. Siafarikas, I. Mporas, and T. Stoyanova, “Wavelet basis selection for enhanced speech parametrization in speaker verification,” International Journal of Speech Technology, vol. 17, no. 1, pp. 27–36, 2014. View at Publisher · View at Google Scholar · View at Scopus
  29. M. Seboussaoui, P. Kenny, and N. Dehak, “An i-vector extractor suitable for speaker recognition with both microphone and telephone speech,” Odyssey, vol. 6, pp. 1–6, 2011. View at Google Scholar
  30. N. Dehak, R. Dehak, P. Kenny, N. Brummer, P. Ouellet, and P. Dumouchel, “Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification,” in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH '09), pp. 1559–1562, Brighton, United Kingdom, 2009. View at Scopus
  31. M. I. Mandasari, M. McLaren, and D. Van Leeuwen, “Evaluation of i-vector speaker recognition systems for forensic application,” in 12th Annual Conference of the International Speech Communication Association (INTERSPEECH '11), pp. 21–24, Florence, Italy, 2011. View at Scopus