Research Article | Open Access
A. Benkrid, A. Benallal, K. Benkrid, "Real-Time Vocal Tract Modelling", Modelling and Simulation in Engineering, vol. 2007, Article ID 028456, 8 pages, 2007. https://doi.org/10.1155/2007/28456
Real-Time Vocal Tract Modelling
To date, most speech synthesis techniques have relied upon the representation of the vocal tract by some form of filter, a typical example being linear predictive coding (LPC). This paper describes the development of a physiologically realistic model of the vocal tract using the well-established technique of transmission line modelling (TLM). This technique is based on the principle of wave scattering at transmission line segment boundaries and may be used in one, two, or three dimensions. This work uses this technique to model the vocal tract using a one-dimensional transmission line. A six-port scattering node is applied in the region separating the pharyngeal, oral, and the nasal parts of the vocal tract.
- G. Bristow, Electronic Speech Synthesis, Granada, London, NY, USA, 1984.
- P. B. Johns and R. L. Beurle, “Numerical solution of 2-dimensional scattering problems using TLM,” Proceedings of IEE, vol. 1118, no. 9, pp. 15–31, 1977.
- S. El-Masri, X. Pelorson, P. Saguet, and P. Badin, “Vocal tract acoustics using the transmission line matrix (TLM) method,” http://www.asel.udel.edu/icslp/cdrom/vol2/522/a522.pdf.
- J. A. Portí and J. A. Morente, “A three-dimensional symmetrical condensed TLM node for acoustics,” Journal of Sound and Vibration, vol. 241, no. 2, pp. 207–222, 2001.
- S. Akhtarzad and P. B. Johns, “The solution of Maxwell's equations in three space dimensions and time by the TLM method of numerical analysis,” Proceedings of IEE, vol. 122, no. 12, pp. 1349–1352, 1975.
- P. B. Johns and G. Butler, “The consistency and the accuracy of the TLM method for diffusion and its relation to existing methods,” International Journal for Numerical Methods in Engineering, vol. 19, no. 10, pp. 1549–1554, 1983.
- P. B. Johns and Y. Rahal, “The use of diakoptics for numerical simulation of the penetration of time domain fields in aircraft,” in Proceedings of the International Conference on Networks and Electronic Office Systems, pp. 57–63, IERE, London, UK, September 1983.
- P. B. Johns, “A simple and unconditionally stable numerical routine for a solution of the diffusion equation,” International Journal for Numerical Methods in Engineering, vol. 11, no. 8, pp. 1307–1328, 1977.
- M. M. Sondhi, “Model for wave propagation in a lossy vocal tract,” Journal of the Acoustical Society of America, vol. 55, no. 5, pp. 1070–1075, 1974.
- A. Benkrid, Transmission line modelling of the vocal tract, Ph.D. thesis, University of Nottingham, Nottingham, UK, 1989.
- L. L. Beranek, Acoustics, McGraw-Hill, New York, NY, USA, 1975.
- P. M. Morse and K. U. Ingard, Theoretical Acoustics, McGraw-Hill, New York, NY, USA, 1968.
- M. R. Portnoff and R. W. Schaffer, “Mathematical considerations in digital simulation of the vocal tract,” Journal of the Acoustical Society of America, vol. 53, no. 1, pp. 294–, 1973.
- J. L. Flanagan, “Voices of Men and Machines,” Journal of the Acoustical Society of America, vol. 51, no. 5, pp. 1375–1377, 1972.
- G. Fant, Acoustic Theory of Speech Production, Mouton de Gruyter, Berlin, Germany, 1970.
- A. R. Greenwood, C. C. Goodyear, and P. A. Martin, “Measurements of vocal tract shapes using magnetic resonance imaging,” IEE Proceedings I: Communications, Speech & Vision, vol. 139, no. 6, pp. 553–560, 1992.
- R. B. Adler, L. J. Chu, and R. M. Fano, Electromagnetic Energy Transmission, John Wiley & Sons, New York, NY, USA, 1983.
- D. T. Paris and F. K. Hurd, Basic Electromagnetic Theory, McGraw-Hill, New York, NY, USA, 1969.
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, Englewood Cliffs, NJ, USA, 1976.
- H. W. Strube, “Determination of the instant of the glottal closure from speech wave,” Journal of the Acoustical Society of America, vol. 56, no. 5, pp. 1625–1629, 1974.
- P. M. Morse, Vibration and Sound, McGraw-Hill, New York, NY, USA, 1948.
- I. B. Crandall, Theory of Vibrating Systems and Sound, D. Van Nostrand, New York, NY, USA, 1972.
- M. R. Matausek and V. S. Batalov, “A new approach to the determination of the glottal waveform,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 6, pp. 616–622, 1980.
- M. Rothenberg, “A new inverse filtering techniques for deriving the glottal airflow waveform during voicing,” Journal of the Acoustical Society of America, vol. 53, no. 6, pp. 1632–1645, 1973.
- “TMS320 User's Guide,” 1985, Preliminary. Texas Instruments, Dallas, Tex, USA.
- “TMS32020 BOARD,” June 1985, User Manual. Loughborough Sound.
- J. M. Loasby, Real time transmission line modelling of the vacal tract using multiple digital signal processors, Ph.D. thesis, University of Nottingham, Nottingham, UK, 1992.
- A. Benkrid, T. E. Cross, M. Atmani, and M. Koudil, “A multiprocessor TMS320 platform for a real-time modelling,” IEEE Transactions on Engineering Management, vol. 46, no. 6, pp. 174–178, 1999.
Copyright © 2007 A. Benkrid et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.