Table of Contents
ISRN Signal Processing
Volume 2011 (2011), Article ID 269361, 10 pages
http://dx.doi.org/10.5402/2011/269361
Research Article

Discrete and Dual Tree Wavelet Features for Real-Time Speech/Music Discrimination

1Graduate School of Natural and Applied Sciences, Dokuz Eylul University, 35160 Buca, İzmir, Turkey
2Department of Electronics and Telecommunications Engineering, Izmir University of Economics, 35330 Balçova, İzmir, Turkey
3Department of Electrical and Electronics Engineering, Yaşar University, 35100 Bornova, İzmir, Turkey

Received 5 January 2011; Accepted 1 March 2011

Academic Editor: P. C. Yuen

Copyright © 2011 Timur Düzenli and Nalan Özkurt. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. O. M. Mubarak, E. Ambikairajah, and J. Epps, “Novel features for effective speech and music discrimination,” in Proceedings of the IEEE International Conference on Engineering of Intelligent Systems (ICEIS '06), Sayfa, Islamabad, Pakistan, April 2006.
  2. K. El-Maleh, M. Klein, G. Petrucci, and P. Kabal, “Speech/music discrimination for multimedia applications,” in Proceedings of the IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, pp. 2445–2448, June 2000.
  3. A. C. Gedik and B. Bozkurt, “Pitch-frequency histogram-based music information retrieval for Turkish music,” Signal Processing, vol. 90, no. 4, pp. 1049–1063, 2010. View at Publisher · View at Google Scholar · View at Scopus
  4. J. Saunders, “Real-time discrimination of broadcast speech/music,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96), pp. 993–996, May 1996.
  5. E. Scheirer and M. Slaney, “Construction and evaluation of a robust multifeature speech/music discriminator,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), pp. 1331–1334, April 1997.
  6. E. M. Saad, M. I. El-Adawy, M. E. Abu-El-Wafa, and A. A. Wahba, “A multifeature speech/music discrimination system,” in Proceedings of the 19th National Radio Science Conference (NRSC '02), pp. 208–213, 2002.
  7. J. Ajmera, I. McCowan, and H. Bourlard, “Speech/music segmentation using entropy and dynamism features in a HMM classification framework,” Speech Communication, vol. 40, no. 3, pp. 351–363, 2003. View at Publisher · View at Google Scholar · View at Scopus
  8. C. Panagiotakis and G. Tziritas, “A speech/music discriminator based on RMS and zero-crossings,” IEEE Transactions on Multimedia, vol. 7, no. 1, pp. 155–166, 2005. View at Publisher · View at Google Scholar · View at Scopus
  9. J. Wang, Q. Wu, H. Deng, and Q. Yan, “Real-time speech/music classification with a hierarchical oblique decision tree,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '08), pp. 2033–2036, April 2008. View at Publisher · View at Google Scholar
  10. A. Pikrakis, T. Giannakopoulos, and S. Theodoridis, “A speech/music discriminator of radio recordings based on dynamic programming and Bayesian networks,” IEEE Transactions on Multimedia, vol. 10, no. 5, Article ID 4540196, pp. 846–857, 2008. View at Publisher · View at Google Scholar · View at Scopus
  11. M. Kos, M. Grašič, and Z. Kačič, “Online speech/music segmentation based on the variance mean of filter bank energy,” EURASIP Journal on Advances in Signal Processing, vol. 2009, Article ID 628570, pp. 1–13, 2009. View at Publisher · View at Google Scholar · View at Scopus
  12. G. E. G. Tzanetakis and P. Cook, “Audio analysis using the discrete wavelet transform,” in Mathematics and Simulation with Biological, Economical and Musicoacoustical Applications, pp. 318–323, WSES, 2001. View at Google Scholar
  13. M. K. S. Khan and W. G. Al-Khatib, “Machine-learning based classification of speech and music,” ACM Journal on Multimedia Systems, vol. 12, no. 1, pp. 55–67, 2006. View at Publisher · View at Google Scholar · View at Scopus
  14. S. Ntalampiras and N. Fakotakis, “Speech /music discrimination based on discrete wavelet transform,” in Proceedings of the 5th Hellenic Conference on Artificial Intelligence (SETN '08), vol. 5138 of Lecture Notes in Artificial Intelligence, pp. 205–211, Syros, Greece, October 2008.
  15. E. Didiot, I. Illina, D. Fohr, and O. Mella, “A wavelet-based parameterization for speech/music discrimination,” Computer Speech and Language, vol. 24, no. 2, pp. 341–357, 2010. View at Publisher · View at Google Scholar · View at Scopus
  16. N. G. Kingsbury, “The dual-tree complex wavelet transform: a new technique for shift invariance and directional filters,” in Proceedings of the IEEE Digital Signal Processing Workshop, 1998.
  17. I. W. Selesnick, R. G. Baraniuk, and N. G. Kingsbury, “The dual-tree complex wavelet transform,” IEEE Signal Processing Magazine, vol. 22, no. 6, pp. 123–151, 2005. View at Publisher · View at Google Scholar · View at Scopus
  18. F. Zheng, G. Zhang, and Z. Song, “Comparison of different implementations of MFCC,” Journal of Computer Science and Technology, vol. 16, no. 6, pp. 582–589, 2001. View at Google Scholar · View at Scopus
  19. S. Mallat, A Wavelet Tour of Signal Processing, Academic Press, 1999.
  20. N. Kingsbury, “The dual-tree complex wavelet transform: a new technique for shift invariance and directional filters,” in Proceedings of the IEEE Digital Signal Processing Workshop, pp. 1–4, 1998.
  21. I. W. Selesnick, “The design of Hilbert transform pairs of wavelet bases,” IEEE Signal Processing Letters, vol. 8, no. 6, pp. 170–173, 2001. View at Google Scholar
  22. C. Charalambous, “Conjugate gradient algorithm for efficient training of artificial neural networks,” IEEE Proceedings-G on Circuit Devices and System, vol. 139, no. 3, pp. 301–310, 1992. View at Google Scholar · View at Scopus
  23. S. Cai and K. Li, “Matlab Implementation of Wavelet Transforms,” 2002.