Prosodic Modelling for Croatian Speech Synthesis
Abstract: In order to include prosody into text to speech systems (TTS), prosody knowledge needs to be acquired, represented and incorporated. Two main features of prosody important for modelling prosody for TTS systems are duration and F0 contour. There are various approaches to modelling those features and they can be categorized into three main groups: rule based, statistical and minimalistic. Some of the best known approaches to duration acquiring are Klatt’s model, classification and regression trees and neural networks and to F0 modelling TOBI, Fujisaki and Tilt. A procedure for automatic intonation event detection on Croatian texts based on the Tilt model was evaluated in terms of Root Mean Square Error values for generated F0 contours.
Keywords: prosody modelling, speech synthesis, TTS, duration models, F0 contour models, prosodic characteristics of Croatian
You are not authenticated to view the full text of this chapter or article.
This site requires a subscription or purchase to access the full text of books or journals.
Do you have any questions? Contact us.Or login to access all content.