TreeTalk: Memory-Based Grapheme-Phoneme Conversion Demo

The TreeTalk demo converts Dutch or English words to their phonetic transcription in the SAMPA (Dutch) or DISC (English) phonetic alphabet, and also generates speech audio. The audio is synthesized by the MBROLA speech synthesizer from the Circuit Theory and Signal Processing Lab at the Faculté Polytechnique de Mons.

Primary word stress is indicated for each word by an apostrophe (') preceding the stressed syllable. The maximum word size is limited to 50 characters.
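The two conventions above (the 50-character limit and the apostrophe stress marker) can be illustrated with a minimal Python sketch. This is not TreeTalk's own code: the function names `check_word` and `split_at_stress` are hypothetical, and the sketch assumes that a transcription contains at most one primary stress mark and that the apostrophe is not otherwise used as a phoneme symbol.

```python
MAX_WORD_LENGTH = 50   # limit stated on this page
STRESS_MARK = "'"      # apostrophe precedes the stressed syllable


def check_word(word: str) -> str:
    """Return the word with surrounding whitespace removed.

    Raises ValueError if the word is empty or longer than the limit.
    (Hypothetical helper; the demo's real input handling is not documented here.)
    """
    word = word.strip()
    if not word:
        raise ValueError("empty word")
    if len(word) > MAX_WORD_LENGTH:
        raise ValueError(f"word exceeds {MAX_WORD_LENGTH} characters")
    return word


def split_at_stress(transcription: str):
    """Split a transcription at the primary stress mark.

    Returns (text_before_mark, text_from_stressed_syllable_onward),
    or None if the transcription contains no stress mark.
    Assumes at most one primary stress mark per word.
    """
    before, mark, after = transcription.partition(STRESS_MARK)
    if not mark:
        return None
    return before, after
```

For a made-up transcription string such as `ab'cd` (not actual TreeTalk output), `split_at_stress` returns the part before the mark and the part that starts with the stressed syllable.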




TreeTalk is currently in development within the PhD project of Bertjan Busser. A short paper about the Dutch version of TreeTalk is available (first in list below).

Work in progress: we are working on predicting suprasegmental prosodic information, so that TreeTalk will be able to predict prosodic breaks, boundary tones, and sentence accents. We also plan to incorporate abbreviation expansion and more intelligent tokenization.

Previous work on grapheme-phoneme conversion and earlier descriptions of TreeTalk by the ILK group can be found in the following publications:

Copyright © 1998 ILK Research Group, Tilburg University. All rights reserved.

Last update: 14 December 1999