Table of Contents
ISRN Signal Processing
Volume 2011, Article ID 145758, 16 pages
http://dx.doi.org/10.5402/2011/145758
Research Article

Adaptive Variable Degree- 𝑘 Zero-Trees for Re-Encoding of Perceptually Quantized Wavelet Packet Transformed Audio and High-Quality Speech

1Department of Electrical and Computer Engineering, Shahid Beheshti University, Evin Sq., Tehran 1983963113, Iran
2Audio and Speech Processing Group, Research Center of Intelligent Signal Processing (RCISP), Tehran 1661617733, Iran

Received 13 November 2010; Accepted 23 December 2010

Academic Editors: Y. H. Chan and B. N. Chatterji

Copyright © 2011 Omid Ghahabi and Mohammad Hassan Savoji. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. VoiceAge Corporation, “Wideband speech coding standards and applications,” 2005, http://www.voiceage.com/media/WidebandSpeech.pdf.
  2. P. Loizou, Speech Enhancement: Theory and Practice, CRC Press, University of Texas, Dallas, Tex, USA, 2007.
  3. I. Daubechies, Ten Lectures on Wavelets, SIAM, Philadelphia, Pa, USA, 1992.
  4. S. Mallat, A Wavelet Tour of Signal Processing, Academic Press, New York, NY, USA, 1999.
  5. P. Srinivasan and L. H. Jamieson, “High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling,” IEEE Transactions on Signal Processing, vol. 46, no. 4, pp. 1085–1093, 1998. View at Google Scholar · View at Scopus
  6. T. Mortazavi and M. H. Savoji, “Adaptive wavelet coding of audio and high quality speech at 32 kb/s using psycho-acoustic noise masking effects,” in Proceedings of the 18th International Congress on Acoustics (ICA '04), Kyoto, Japan, April 2004.
  7. Matlab 7.6 Wavelet Toolbox Users Guide.
  8. ISO/IEC Int'l Standard 11172-3, “Information technology: coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbits/s- part3: audio”.
  9. D. Pan, “Tutorial on MPEG/audio compression,” IEEE Multimedia, vol. 2, no. 2, pp. 60–74, 1995. View at Publisher · View at Google Scholar · View at Scopus
  10. K. Brandenburg and M. Bosi, “Overview of MPEG audio: current and future standards for low-bit-rate audio coding,” Journal of the Audio Engineering Society, vol. 45, no. 1-2, pp. 4–21, 1997. View at Google Scholar · View at Scopus
  11. E. Ambikairajah, A. G. Davis, and W. T. K. Wong, “Auditory masking and MPEG-1 audio compression,” Electronics and Communication Engineering Journal, vol. 9, no. 4, pp. 165–175, 1997. View at Google Scholar · View at Scopus
  12. M. Bosi and R. E. Goldberg, Introduction to Digital Audio Coding and Standards, Kluwer Academic Publishers, Boston, Mass, USA, 2003.
  13. D. Sinha and A. H. Tewfik, “Low bit rate transparent audio compression using adapted wavelets,” IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3463–3479, 1993. View at Publisher · View at Google Scholar · View at Scopus
  14. M. Black and M. Zeytinoglu, “Computationally efficient wavelet packet coding of wide-band stereo audio signals,” in Proceedings of the 20th International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95), vol. 5, pp. 3075–3078, Detroit, Mich, USA, May 1995. View at Scopus
  15. B. Carnero and A. Drygajlo, “Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithms,” IEEE Transactions on Signal Processing, vol. 47, no. 6, pp. 1622–1635, 1999. View at Publisher · View at Google Scholar · View at Scopus
  16. M. Rosa Zurera, F. López Ferreras, M. P. Jarabo Amores, S. Maldonado Bascón, and N. Ruiz Reyes, “New algorithm for translating psycho-acoustic information to the wavelet domain,” Signal Processing, vol. 81, no. 3, pp. 519–531, 2001. View at Publisher · View at Google Scholar · View at Scopus
  17. N. R. Reyes, M. R. Zurera, F. L. Ferreras, and P. J. Amores, “Adaptive wavelet-packet analysis for audio coding purposes,” Signal Processing, vol. 83, no. 5, pp. 919–929, 2003. View at Publisher · View at Google Scholar · View at Scopus
  18. X. He and M. S. Scordilis, “An enhanced psychoacoustic model based on the discrete wavelet packet transform,” Journal of the Franklin Institute, vol. 343, no. 7, pp. 738–755, 2006. View at Publisher · View at Google Scholar · View at Scopus
  19. J. M. Shapiro, “Embedded image coding using zerotrees of wavelet coefficients,” IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3445–3462, 1993. View at Publisher · View at Google Scholar · View at Scopus
  20. A. Said and W. A. Pearlman, “A new, fast, and efficient image codec based on set partitioning in hierarchical trees,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, no. 3, pp. 243–250, 1996. View at Google Scholar · View at Scopus
  21. O. Ghahabi and M. H. Savoji, “Modified EZW and SPIHT algorithms for perceptually audio and high quality speech coding,” in Proceedings of the IEEE International Conference on Signal and Image Processing Applications (ICSIPA '09), pp. 530–535, Kuala Lumpur, Malaysia, November 2009. View at Publisher · View at Google Scholar
  22. O. Ghahabi and M. H. Savoji, “Re-encoding of perceptually quantized wavelet packet transform coefficients of audio and high quality speech,” in Proceedings of the 16th International Conference on Digital Signal Processing (DSP '09), Santorini, Greece, July 2009. View at Publisher · View at Google Scholar · View at Scopus
  23. B. E. Usevitch and M. T. Orchard, “Smooth wavelets, transform coding, and Markov-1 processes,” IEEE Transactions on Signal Processing, vol. 43, no. 11, pp. 2561–2569, 1995. View at Publisher · View at Google Scholar · View at Scopus
  24. S. Mallat and F. Falzon, “Analysis of low bit rate image transform coding,” IEEE Transactions on Signal Processing, vol. 46, no. 4, pp. 1027–1042, 1998. View at Google Scholar · View at Scopus
  25. B. E. Usevitch, “A tutorial on modern lossy wavelet image compression: foundations of JPEG 2000,” IEEE Signal Processing Magazine, vol. 18, no. 5, pp. 22–35, 2001. View at Publisher · View at Google Scholar · View at Scopus
  26. N. M. Rajpoot, R. G. Wilson, F. G. Meyer, and R. R. Coifman, “Adaptive wavelet packet basis selection for zerotree image coding,” IEEE Transactions on Image Processing, vol. 12, no. 12, pp. 1460–1471, 2003. View at Publisher · View at Google Scholar · View at PubMed · View at MathSciNet · View at Scopus
  27. N. Sprljan, S. Grgic, and M. Grgic, “Modified SPIHT algorithm for wavelet packet image coding,” Real-Time Imaging, vol. 11, no. 5-6, pp. 378–388, 2005. View at Publisher · View at Google Scholar · View at Scopus
  28. S. Strahl, H. Zhou, and A. Mertins, “An adaptive tree-based progressive audio compression scheme,” in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 219–222, New Paltz, NY, USA, October 2005. View at Publisher · View at Google Scholar · View at Scopus
  29. A. Aggarwal, V. Cuperman, K. Rose, and A. Gersho, “Perceptual zero-trees for scalable wavelet coding of wideband audio,” in Proceedings of the IEEE Workshop on Speech Coding, pp. 16–18, Porvoo, Finland, June 1999.
  30. P.-C. Chang and J.-H. Lin, “Scalable embedded zero-tree wavelet packet audio coding,” in Proceedings of the IEEE Workshop on Signal Processing Advances in Wireless Communications, pp. 384–387, Taiwan, China, March 2001.
  31. Y. Cho and W. A. Pearlman, “Quantifying the coding performance of zerotrees of wavelet coefficients: degree-κ zerotree,” IEEE Transactions on Signal Processing, vol. 55, no. 6, pp. 2425–2431, 2007. View at Publisher · View at Google Scholar · View at Scopus
  32. X. W. Yin, M. Fleury, and A. C. Downton, “Reduced bit rate uniform quantisation for SPIHT encoding,” Electronics Letters, vol. 39, no. 12, pp. 902–903, 2003. View at Publisher · View at Google Scholar · View at Scopus
  33. Z. Lu and W. A. Pearlman, “An efficient and low-complexity audio coder delivering multiple levels of quality for interactive applications,” in Proceedings of the IEEE Workshop on Multimedia Signal Processing, pp. 529–534, Redondo Beach, Calif, USA, December 1998.
  34. Z. Lu and W. A. Pearlman, “High quality scalable stereo audio coding,” 1999, http://www.cipr.rpi.edu/~pearlman/papers/scal_audio.ps.gz.
  35. M. Raad, A. Mertins, and I. Burnett, “Audio compression using the MLT and SPIHT,” in Proceedings of the 6th International Symposium on Digital Signal Processing for Communication Systems (DSPCS '02), pp. 128–131, Wollongong, Australia, 2002.
  36. M. Raad and A. Mertins, “From lossy to lossless audio coding using SPIHT,” in Proceedings of the 5th International Conference on Digital Audio Effects (DAFx '02), pp. 245–250, Hamburg, Germany, September 2002.
  37. M. Dutta and R. Vig, “An improved method of speech compression using warped LPC and MLT-SPIHT algorithm,” in Proceedings of the 6th WSEAS International Conference on Signal, Speech and Image Processing, pp. 85–94, Lisbon, Portugal, September 2006.
  38. M. Raad, A. Mertins, and I. Burnett, “Audio coding based on the modulated lapped transform (MLT) and set partitioning in hierarchical trees,” in Proceedings of the 8th International Conference on Information Systems Analysis and Synthesis (ISAS '02), 2002.
  39. N. Kitawaki, K. Nagai, and T. Yamada, “Objective quality assessment of wideband speech coding,” IEICE Transactions on Communications, vol. E88-B, no. 3, pp. 1111–1117, 2005. View at Publisher · View at Google Scholar · View at Scopus
  40. 3GPP TR 26.976 V8.0.0 (2008-12), “Performance characterization of the Adaptive Multi-Rate Wideband (AMR-WB) speech codec (Release 8)”.