UNI-MB - logo
UMNIK - logo
 

Rezultati iskanja

Osnovno iskanje    Izbirno iskanje   
Iskalna
zahteva
Knjižnica

Trenutno NISTE avtorizirani za dostop do e-virov UM. Za polni dostop se PRIJAVITE.

1 2 3 4 5
zadetkov: 7.634
1.
  • Speech Synthesis Based on H... Speech Synthesis Based on Hidden Markov Models
    Tokuda, Keiichi; Nankaku, Yoshihiko; Toda, Tomoki ... Proceedings of the IEEE, 05/2013, Letnik: 101, Številka: 5
    Journal Article
    Recenzirano
    Odprti dostop

    This paper gives a general overview of hidden Markov model (HMM)-based speech synthesis, which has recently been demonstrated to be very effective in synthesizing speech. The main advantage of this ...
Celotno besedilo

PDF
2.
  • Conventional and contempora... Conventional and contemporary approaches used in text to speech synthesis: a review
    Kaur, Navdeep; Singh, Parminder The Artificial intelligence review, 07/2023, Letnik: 56, Številka: 7
    Journal Article
    Recenzirano

    Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human like natural sounding voice from the written text, is gaining popularity in the field of speech processing. ...
Celotno besedilo
3.
Celotno besedilo
4.
  • Statistical Parametric Spee... Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks
    Saito, Yuki; Takamichi, Shinnosuke; Saruwatari, Hiroshi IEEE/ACM transactions on audio, speech, and language processing, 2018-Jan., 2018-1-00, Letnik: 26, Številka: 1
    Journal Article
    Recenzirano
    Odprti dostop

    A method for statistical parametric speech synthesis incorporating generative adversarial networks (GANs) is proposed. Although powerful deep neural networks techniques can be applied to artificially ...
Celotno besedilo

PDF
5.
  • Incremental Text-to-Speech ... Incremental Text-to-Speech Synthesis Using Pseudo Lookahead With Large Pretrained Language Model
    Saeki, Takaaki; Takamichi, Shinnosuke; Saruwatari, Hiroshi IEEE signal processing letters, 2021, Letnik: 28
    Journal Article
    Recenzirano
    Odprti dostop

    This letter presents an incremental text-to-speech (TTS) method that performs synthesis in small linguistic units while maintaining the naturalness of output speech. Incremental TTS is generally ...
Celotno besedilo

PDF
6.
  • MID-Attribute Speaker Generation Using Optimal-Transport-Based Interpolation of Gaussian Mixture Models
    Watanabe, Aya; Takamichi, Shinnosuke; Saito, Yuki ... ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023-June-4
    Conference Proceeding
    Odprti dostop

    In this paper, we propose a method for intermediating multiple speakers' attributes and diversifying their voice characteristics in "speaker generation," an emerging task that aims to synthesize a ...
Celotno besedilo
7.
  • Statistical parametric spee... Statistical parametric speech synthesis using deep neural networks
    Zen, Heiga; Senior, Andrew; Schuster, Mike 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 05/2013
    Conference Proceeding
    Odprti dostop

    Conventional approaches to statistical parametric speech synthesis typically use decision tree-clustered context-dependent hidden Markov models (HMMs) to represent probability densities of speech ...
Celotno besedilo

PDF
8.
  • Generative emotional AI for... Generative emotional AI for speech emotion recognition: The case for synthetic emotional speech augmentation
    Latif, Siddique; Shahid, Abdullah; Qadir, Junaid Applied acoustics, July 2023, 2023-07-00, Letnik: 210
    Journal Article
    Recenzirano
    Odprti dostop

    Despite advances in deep learning, current state-of-the-art speech emotion recognition (SER) systems still have poor performance due to a lack of speech emotion datasets. This paper proposes ...
Celotno besedilo
9.
  • Investigating different rep... Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis
    Lorenzo-Trueba, Jaime; Eje Henter, Gustav; Takaki, Shinji ... Speech communication, 20/May , Letnik: 99
    Journal Article
    Recenzirano
    Odprti dostop

    •We study the impact of adding large-scale listener's perceptual annotations into the emotional speech modeling process.•We consider a number of different emotional representations that allow us to ...
Celotno besedilo

PDF
10.
  • Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
    Prajwal, K R; Mukhopadhyay, Rudrabha; Namboodiri, Vinay P. ... 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 06/2020
    Conference Proceeding
    Odprti dostop

    Humans involuntarily tend to infer parts of the conversation from lip movements when the speech is absent or corrupted by external noise. In this work, we explore the task of lip to speech synthesis, ...
Celotno besedilo

PDF
1 2 3 4 5
zadetkov: 7.634

Nalaganje filtrov