Gérard Bailly

Gérard Bailly is a senior CNRS Research Director at the Institut de la Communication Parlée, Grenoble, France, where he leads the Talking Machines Team, dedicated to multimodal speech synthesis. He has worked in the field of speech communication for more than 20 years. He coedited Talking Machines: Theories, Models and Designs (Elsevier, 1992) and Improvements in Speech Synthesis (Wiley, 2002). Audiovisual Speech Processing is in preparation with MIT Press. He co-organised the ESCA Autrans Workshop (1991), the Journées d’Etudes sur la Parole (Aussois, 2000), the Smart Object Conference (Grenoble, 2003), and sOc-EUSAI (Grenoble, 2005). He is a founding member of the ISCA SynSIG and SproSIG special-interest groups. His current interest is multimodal interaction for pointing to real and virtual objects and to conversational agents using speech, hand and head movements, and eye gaze.

Biography updated on 10 January 2007

Articles in Scholarly Journals [Incomplete List]

  1. SFC: A trainable prosodic model
    Speech Communication, vol. 46, no. 3-4, pp. 348–364, 2005
  2. Analysis and synthesis of the three-dimensional movements of the head, face, and hand of a speaker using cued speech
    The Journal of the Acoustical Society of America, vol. 118, no. 2, p. 1144, 2005
  3. A model of acoustic interspeaker variability based on the concept of formant–cavity affiliation
    The Journal of the Acoustical Society of America, vol. 115, no. 1, p. 337, 2004
  4. Structural analysis of complex oxides LnMnTaO (Ln = rare earth and yttrium) with pyrochlore-related structures
    Journal of Alloys and Compounds, vol. 374, no. 1-2, pp. 177–180, 2004
  5. Tracking talking faces with shape and appearance models
    Speech Communication, vol. 44, no. 1-4, pp. 63–82, 2004
  6. International Journal of Speech Technology, vol. 6, no. 1, pp. 11–19, 2003
  7. International Journal of Speech Technology, vol. 6, no. 4, pp. 331–346, 2003
  8. Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images
    Journal of Phonetics, vol. 30, no. 3, pp. 533–553, 2002
  9. Generating prosodic attitudes in French: Data, model and evaluation
    Speech Communication, vol. 33, no. 4, pp. 357–371, 2001
  10. Linear degrees of freedom in speech production: Analysis of cineradio- and labio-film data and articulatory-acoustic modeling
    The Journal of the Acoustical Society of America, vol. 109, no. 5, p. 2165, 2001
  11. Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French
    Computer Speech & Language, vol. 12, no. 4, pp. 393–410, 1998
  12. Learning to speak. Sensori-motor control of speech movements
    Speech Communication, vol. 22, no. 2-3, pp. 251–267, 1997
  13. Characterisation of rhythmic patterns for text-to-speech synthesis
    Speech Communication, vol. 15, no. 1-2, pp. 127–137, 1994