ALL libraries (COBIB.SI union bibliographic/catalogue database)
  • TTS-driven expressive embodied conversation agent EVA for UMB-SmartTV
    Rojc, Matej, 1972- ...
    The main goal of using non-verbal modalities together with a general text-to-speech (TTS) system is to better emulate the human-like course of interaction between users and the UMB-SmartTV ... platform. When human-TV interaction is supported by TTS only, the interaction tends to be less functional and less human-like. To achieve more advanced interaction that is closer to human-human communication, virtual agent technology has to be introduced as a feedback interface. In this way, personification of the TTS system, named PLATTOS, lets the UMB-SmartTV produce more appropriate social responses and may invoke communicative behavior close to that between humans. Verbal and co-verbal gestures are linked through complex mental processes. Understanding attitude and emotion, together with how gestures (facial and hand) and body movements complement or, in some cases, override the verbal information produced by the TTS system, provides crucial information for modeling both the interaction and the embodied conversational agent's (ECA) socially oriented responses. The social responses of the TTS system fused with the ECA can then be presented to the user in a more human-like form, using not just audio but also facial expressions, such as facial emotions, visual animation of synthesized speech, and synchronized head, hand, and body movements. The paper proposes a novel TTS-driven behavior generation system for IPTV platforms. The behavior generation engine is implemented as a service and used by UMB-SmartTV in a service-oriented fashion. The engine fuses speech and gesture production models by using FSMs and HRG structures. The shape and alignment of co-verbal movement for the embodied conversational avatar, named EVA, are selected based on several linguistic features (automatically extracted from the input text) and several prosodic features (symbolic and acoustic features produced within the TTS engine).
    Finally, the generated speech and co-verbal behavior are animated by the embodied conversational agent's engine and presented to the user within the UMB-SmartTV user interface. In this way, the personified TTS system PLATTOS, integrated within the UMB-SmartTV system, enables a more advanced, personalized, and more natural multimodal-output-based human-machine interface.
    Material type - article, component part
    Year - 2014
    Language - English
    COBISS.SI-ID - 17544982