The blurb text from my PhD thesis:
Erwin Marsi, Intonation in Spoken Language Generation, Utrecht: LOT, 2001.
Spoken Language Generation (SLG) is a combination of Natural Language Generation (NLG) and Speech Synthesis. The central question is how one can program computers to produce high-quality speech from some computer-internal representation of information (rather than from text, as in Text-to-Speech synthesis). The research on SLG described in this thesis focuses on generating Dutch speech with an optimal intonation. The motivation is primarily linguistic in the sense that SLG is used as a tool to test and improve linguistic theories about the relation between the intonation of an utterance and other aspects of its structure and context.
The research objective in this work is to design, build and evaluate a linguistically-based SLG system that generates speech with pitch accents and intonational boundaries that are most acceptable to experts. It is driven by research questions regarding the adequacy of the linguistic theories employed, as well as questions regarding the best architecture for an SLG system. While focusing on intonation (phonological phrasing, intonational phrasing, and pitch accent placement in Dutch) and system architecture, it provides an overview of all stages of the SLG process and the underlying syntactic, phonological and phonetic theory, and includes contributions to text planning, surface realization, tune choice, and speech synthesis. In addition, it reports on an evaluation experiment, comparing output of the SLG system with that of human experts and other systems at the symbolic level.
This study is of interest to theoretical and computational linguists, as well as researchers in the field of Speech Synthesis and Natural Language Generation.