Li Deng

Articles in Scholarly Journals [Incomplete List]

  1. A new look at discriminative training for hidden Markov models
    Pattern Recognition Letters, vol. 28, no. 11, pp. 1285–1294, 2007
  2. A lattice search technique for a long-contextual-span hidden trajectory model of speech?
    Speech Communication, vol. 48, no. 9, pp. 1214–1226, 2006
  3. A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from Mel-cepstral coefficients
    Speech Communication, vol. 48, no. 8, pp. 971–988, 2006
  4. Structured Speech Modeling
    IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1492–1504, 2006
  5. A Bidirectional Target-Filtering Model of Speech Coarticulation and Reduction: Two-Stage Implementation for Phonetic Recognition
    IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 1, pp. 256–265, 2006
  6. Tracking Vocal Tract Resonances Using a Quantized Nonlinear Function Embedded in a Temporal Constraint
    IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 2, pp. 425–434, 2006
  7. Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
    Computer Speech & Language, 2006
  8. Analysis and Comparison of Two Speech Feature Extraction/Compensation Algorithms
    IEEE Signal Processing Letters, vol. 12, no. 6, pp. 477–480, 2005
  9. Dynamic Speech Models: Theory, Algorithms, and Applications
    Synthesis Lectures on Speech and Audio Processing, vol. 1, no. 1, pp. 1–118, 2005
  10. Dynamic Compensation of HMM Variances Using the Feature Enhancement Uncertainty Computed From a Parametric Model of Speech Distortion
    IEEE Transactions on Speech and Audio Processing, vol. 13, no. 3, pp. 412–421, 2005
  11. Spoken language understanding
    IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 16–31, 2005
  12. A Speech-Centric Perspective for Human-Computer Interface: A Case Study
    The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology, vol. 41, no. 3, pp. 255–269, 2005
  13. A mixed-level switching dynamic system for continuous speech recognition
    Computer Speech & Language, vol. 18, no. 1, pp. 49–65, 2004
  14. Target-Directed Mixture Dynamic Models for Spontaneous Speech Recognition
    IEEE Transactions on Speech and Audio Processing, vol. 12, no. 1, pp. 47–58, 2004
  15. Enhancement of Log Mel Power Spectra of Speech Using a Phase-Sensitive Model of the Acoustic Environment and Sequential Estimation of the Corrupting Noise
    IEEE Transactions on Speech and Audio Processing, vol. 12, no. 2, pp. 133–143, 2004
  16. Estimating Cepstrum of Speech Under the Presence of Noise Using a Joint Prior of Static and Dynamic Features
    IEEE Transactions on Speech and Audio Processing, vol. 12, no. 3, pp. 218–233, 2004
  17. Speech and Language Processing for Multimodal Human-Computer Interaction
    The Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology, vol. 36, no. 2/3, pp. 161–187, 2004
  18. Challenges in adopting speech recognition
    Communications of the ACM, vol. 47, no. 1, p. 69, 2004
  19. Joint state and parameter estimation for a target-directed nonlinear dynamic system model
    IEEE Transactions on Signal Processing, vol. 51, no. 12, pp. 3061–3070, 2003
  20. Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model
    IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 590–602, 2003
  21. Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
    IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 568–580, 2003
  22. Model-based speaker normalization methods for speech recognition
    Electronics and Communications in Japan (Part II: Electronics), vol. 86, no. 2, pp. 45–56, 2003
  23. Nonstationary-state hidden Markov model representation of speech signals for speech enhancement
    Signal Processing, vol. 82, no. 2, pp. 205–227, 2002
  24. A robust compensation strategy for extraneous acoustic variations in spontaneous speech recognition
    IEEE Transactions on Speech and Audio Processing, vol. 10, no. 1, pp. 9–17, 2002
  25. Speaker clustering for speech recognition using vocal tract parameters
    Speech Communication, vol. 36, no. 3-4, pp. 305–315, 2002
  26. Distributed speech processing in miPad's multimodal user interface
    IEEE Transactions on Speech and Audio Processing, vol. 10, no. 8, pp. 605–619, 2002
  27. An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition
    The Journal of the Acoustical Society of America, vol. 111, no. 2, p. 1086, 2002
  28. A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model
    IEEE Transactions on Speech and Audio Processing, vol. 9, no. 5, pp. 549–557, 2001
  29. A Bayesian approach to the verification problem: applications to speaker verification
    IEEE Transactions on Speech and Audio Processing, vol. 9, no. 8, pp. 874–884, 2001
  30. Parameter estimation of a target-directed dynamic system model with switching states
    Signal Processing, vol. 81, no. 5, pp. 975–987, 2001
  31. A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech
    Computer Speech & Language, vol. 14, no. 2, pp. 101–114, 2000
  32. Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics
    The Journal of the Acoustical Society of America, vol. 108, no. 6, p. 3036, 2000
  33. A dynamic system approach to speech enhancement using the H/sub 8/ filtering algorithm
    IEEE Transactions on Speech and Audio Processing, vol. 7, no. 4, pp. 391–399, 1999
  34. A layered neural network interfaced with a cochlear model for the study of speech encoding in the auditory system
    Computer Speech & Language, vol. 13, no. 1, pp. 39–64, 1999
  35. HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    IEEE Transactions on Speech and Audio Processing, vol. 6, no. 5, pp. 445–455, 1998
  36. Speech trajectory discrimination using the minimum classification error learning
    IEEE Transactions on Speech and Audio Processing, vol. 6, no. 6, pp. 505–515, 1998
  37. A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
    Speech Communication, vol. 24, no. 4, pp. 299–323, 1998
  38. Game theory approach to discrete H/sub 8/ filter design
    IEEE Transactions on Signal Processing, vol. 45, no. 4, pp. 1092–1095, 1997
  39. Speech recognition using autosegmental representation of phonological units with interface to the trended HMM
    Speech Communication, vol. 23, no. 3, pp. 211–222, 1997
  40. Production models as a structural basis for automatic speech recognition
    Speech Communication, vol. 22, no. 2-3, pp. 93–111, 1997
  41. HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features
    IEEE Transactions on Speech and Audio Processing, vol. 5, no. 3, pp. 243–256, 1997
  42. Use of generalized dynamic feature parameters for speech recognition
    IEEE Transactions on Speech and Audio Processing, vol. 5, no. 3, pp. 232–242, 1997
  43. Speaker-independent phonetic classification using hidden Markov models with mixtures of trend functions
    IEEE Transactions on Speech and Audio Processing, vol. 5, no. 4, pp. 319–324, 1997
  44. Maximum likelihood in statistical estimation of dynamic systems: Decomposition algorithm and simulation results
    Signal Processing, vol. 57, no. 1, pp. 65–79, 1997
  45. Decomposition solution of H8 filter gain in singularly perturbed systems
    Signal Processing, vol. 55, no. 3, pp. 313–320, 1996
  46. Transiems as dynamically defined, sub-phonemic units of speech: A computational model
    Signal Processing, vol. 49, no. 1, pp. 25–35, 1996
  47. Tracking nonstationary targets using a dynamical system with Markov-modulated parameters
    IEEE Signal Processing Letters, vol. 2, no. 9, pp. 172–175, 1995
  48. A Markov model containing state-conditioned second-order non-stationarity: application to speech recognition
    Computer Speech & Language, vol. 9, no. 1, pp. 63–86, 1995
  49. Analysis of the correlation structure for a neural predictive model with application to speech recognition
    Neural Networks, vol. 7, no. 2, pp. 331–339, 1994
  50. Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
    IEEE Transactions on Speech and Audio Processing, vol. 2, no. 4, pp. 507–520, 1994
  51. Integrated optimization of dynamic feature parameters for hidden Markov modeling of speech
    IEEE Signal Processing Letters, vol. 1, no. 4, pp. 66–69, 1994
  52. A statistical model for formant-transition microsegments of speech incorporating locus equations
    Signal Processing, vol. 37, no. 1, pp. 121–128, 1994
  53. Waveform-based speech recognition using hidden filter models: parameter selection and sensitivity to power normalization
    IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 80–89, 1994
  54. Context-dependent Markov model structured by locus equations: Applications to phonetic classification
    The Journal of the Acoustical Society of America, vol. 96, no. 4, p. 2008, 1994
  55. A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
    The Journal of the Acoustical Society of America, vol. 95, no. 5, p. 2702, 1994
  56. Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model
    IEEE Transactions on Biomedical Engineering, vol. 40, no. 5, pp. 456–467, 1993
  57. Numerical property and efficient solution of a transmission-line model for basilar membrane wave motions
    Signal Processing, vol. 33, no. 3, pp. 269–285, 1993
  58. Hidden Markov model representation of quantized articulatory features for speech recognition
    Computer Speech & Language, vol. 7, no. 3, pp. 265–282, 1993
  59. A stochastic model of speech incorporating hierarchical nonstationarity
    IEEE Transactions on Speech and Audio Processing, vol. 1, no. 4, pp. 471–474, 1993
  60. Modeling acoustic transitions in speech by state-interpolation hidden Markov models
    IEEE Transactions on Signal Processing, vol. 40, no. 2, pp. 265–271, 1992
  61. Structural design of hidden Markov model speech recognizer using multivalued phonetic features: Comparison with segmental speech units
    The Journal of the Acoustical Society of America, vol. 92, no. 6, p. 3058, 1992
  62. Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition
    IEEE Transactions on Signal Processing, vol. 39, no. 7, pp. 1677–1681, 1991
  63. The semi-relaxed algorithm for estimating parameters of hidden Markov models
    Computer Speech & Language, vol. 5, no. 3, pp. 231–236, 1991
  64. Modeling microsegments of stop consonants in a hidden Markov model based word recognizer
    The Journal of the Acoustical Society of America, vol. 87, no. 6, p. 2738, 1990
  65. Use of vowel duration information in a large vocabulary word recognizer
    The Journal of the Acoustical Society of America, vol. 86, no. 2, p. 540, 1989