UNI-MB - logo
UMNIK - logo
 
(UM)
  • Designing prosodic databases for automatic modeling of Slovenian language in a multilingual TTS System
    Müller, Achim F. ; Stergar, Janez, 1968- ; Horvat, Bogomir, 1936-
    In this paper the design of a prosodic data base and the data driven prediction of phrase breaks for modeling Slovenian language in a multilingual text-to-speech (TTS) system are presented. Automatic ... learning techniques offer a sollution in adapting prosodic models to a new language, voice or a new application, because they allow prosodic regularities to be automatically extracted from a prosodic database of natural speech. Such techniques depend on the construction of a large corpus labeled with symbolic prosody labels. The labeling can be done either automatically or by hand. While automatic labeling can be less accurate than hand labeling, the later is very time consuming. Therefore an interactive tool for semi-automatic labeling that uses the segmented spoken counterpart of the text as input will be presented. The tool combines the advantage of hand labeling and automatic labeling by achieving a high consistency in labeling and reducing the time that would be needed for hand labeling. The labeled Slovenian corpus has been used to train ourphrase break prediciton module. Experiments for the data driven prediction of major and minor phrase break labels have been performed. The achieved prediction accuracy marks state-of-the art for phrase break prediction accuracy for Slovenian language.
    Source: LREC 2002 : proceedings (Vol. 1, str. 288-292)
    Type of material - conference contribution
    Publish date - 2002
    Language - english
    COBISS.SI-ID - 7141398

source: LREC 2002 : proceedings (Vol. 1, str. 288-292)

loading ...
loading ...
loading ...