UNI-MB - logo
UMNIK - logo
 

Search results

Basic search    Advanced search   
Search
request
Library

Currently you are NOT authorised to access e-resources UM. For full access, REGISTER.

3 4 5 6 7
hits: 206
41.
  • Text-to-speech synthesis sy... Text-to-speech synthesis system with Arabic diacritic recognition system
    Rebai, Ilyes; BenAyed, Yassine Computer speech & language, 11/2015, Volume: 34, Issue: 1
    Journal Article
    Peer reviewed

    •We developed an Arabic text-to-speech system, including a diacritization system.•The speech synthesis system is based on statistical parametric.•We address the accuracy of diacritic and acoustic ...
Full text
42.
  • DNN-based grapheme-to-phone... DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
    Hadj Ali, Ikbel; Mnasri, Zied; Lachiri, Zied International journal of speech technology, 09/2020, Volume: 23, Issue: 3
    Journal Article
    Peer reviewed

    Arabic text-to-speech synthesis from non-diacritized text is still a big challenge, because of unique Arabic language rules and characteristics. Indeed, the diacritic and gemination signs, which are ...
Full text
43.
  • Prosodic Clustering for Phoneme-Level Prosody Control in End-to-End Speech Synthesis
    Vioni, Alexandra; Christidou, Myrsini; Ellinas, Nikolaos ... ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021-June-6
    Conference Proceeding
    Open access

    This paper presents a method for controlling the prosody at the phoneme level in an autoregressive attention-based text-to-speech system. Instead of learning latent prosodic features with a ...
Full text

PDF
44.
  • Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components
    Hono, Yukiya; Takaki, Shinji; Hashimoto, Kei ... ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021-June-6
    Conference Proceeding
    Open access

    We propose PeriodNet, a non-autoregressive (non-AR) waveform generation model with a new model structure for modeling periodic and aperiodic components in speech waveforms. The non-AR waveform ...
Full text

PDF
45.
  • An objective evaluation of ... An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
    Lőrincz, Beáta; Stan, Adriana; Giurgiu, Mircea Procedia computer science, 2021, 2021-00-00, Volume: 192
    Journal Article
    Peer reviewed
    Open access

    Multi-speaker spoken datasets enable the creation of text-to-speech synthesis (TTS) systems which can output several voice identities. The multi-speaker (MSPK) scenario also enables the use of fewer ...
Full text

PDF
46.
  • Estonian Text-to-Speech Syn... Estonian Text-to-Speech Synthesis with Non-autoregressive Transformers
    Ratsep, Liisa; Lellep, Rasmus; Fishel, Mark Baltic Journal of Modern Computing, 01/2022, Volume: 10, Issue: 3
    Journal Article
    Peer reviewed
    Open access

    While text-to-speech synthesis with non-autoregressive Transformers has achieved state-of-the-art quality for many languages, the methodology of Estonian text-to-speech synthesis has not been revised ...
Full text
47.
  • Intonation modelling using ... Intonation modelling using a muscle model and perceptually weighted matching pursuit
    Honnet, Pierre-Edouard; Gerazov, Branislav; Gjoreski, Aleksandar ... Speech communication, March 2018, 2018-03-00, 20180301, Volume: 97
    Journal Article
    Peer reviewed
    Open access

    We propose a physiologically based intonation model using perceptual relevance. Motivated by speech synthesis from a speech-to-speech translation (S2ST) point of view, we aim at a language ...
Full text

PDF
48.
  • System for Automatic Assign... System for Automatic Assignment of Lexical Stress in Croatian
    Mikelić Preradović, Nives; Nacinovic Prskalo, Lucia Electronics (Basel), 11/2022, Volume: 11, Issue: 22
    Journal Article
    Peer reviewed
    Open access

    It is very popular today to integrate voice interfaces into IoT devices. The pronunciation and proper prosody of speech play a major role in the intelligibility and naturalness of synthesized voices. ...
Full text
49.
  • Towards Lifelong Learning of Multilingual Text-to-Speech Synthesis
    Yang, Mu; Ding, Shaojin; Chen, Tianlong ... ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022-May-23
    Conference Proceeding
    Open access

    This work presents a lifelong learning approach to train a multilingual Text-To-Speech (TTS) system, where each language was seen as an individual task and was learned sequentially and continually. ...
Full text
50.
  • Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-Based Multi-Modal Context Modeling
    Li, Jingbei; Meng, Yi; Li, Chenyi ... ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022-May-23
    Conference Proceeding
    Open access

    Comparing with traditional text-to-speech (TTS) systems, conversational TTS systems are required to synthesize speeches with proper speaking style confirming to the conversational context. However, ...
Full text
3 4 5 6 7
hits: 206

Load filters