Search results

hits: 208
21.
  • Text-To-Speech Synthesis Based on Latent Variable Conversion Using Diffusion Probabilistic Model and Variational Autoencoder
    Yasuda, Yusuke; Toda, Tomoki
    ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023
    Conference Proceeding

    Text-to-speech synthesis (TTS) is a task to convert texts into speech. Two of the factors that have been driving TTS are the advancements of probabilistic models and latent representation learning. ...
22.
  • CyFi-TTS: Cyclic Normalizing Flow with Fine-Grained Representation for End-to-End Text-to-Speech
    Hwang, In-Sun; Han, Young-Sub; Jeon, Byoung-Ki
    ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 June 2023
    Conference Proceeding
    Open access

    Advanced end-to-end text-to-speech (TTS) systems directly generate high-quality speech. These systems demonstrate superior performance on the seen dataset from training. However, inferring speech ...
23.
  • Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control
    Pamisetty, Giridhar; Sri Rama Murty, K.
    Circuits, systems, and signal processing, 2023/1, Volume: 42, Issue: 1
    Journal Article
    Peer reviewed
    Open access

    End-to-end text-to-speech synthesis systems achieved immense success in recent times, with improved naturalness and intelligibility. However, the end-to-end models, which primarily depend on the ...
24.
  • The perception of artificial-intelligence (AI) based synthesized speech in younger and older adults
    Herrmann, Björn
    International journal of speech technology, 07/2023, Volume: 26, Issue: 2
    Journal Article
    Peer reviewed

    Artificial intelligence (AI) based synthesized speech has become almost human-like, ubiquitous in everyday life (e.g., smart phones, grocery self-checkouts), and relatively easy to synthesize. This ...
25.
26.
  • ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
    Wang, Xin; Yamagishi, Junichi; Todisco, Massimiliano ...
    Computer speech & language, November 2020, Volume: 64
    Journal Article
    Peer reviewed
    Open access

    We describe the protocol and design of the ASVspoof Challenge 2019 database. We detail the speech synthesis and voice conversion algorithms used in the database. We detail the carefully controlled ...
27.
  • Speaker Generation
    Stanton, Daisy; Shannon, Matt; Mariooryad, Soroosh ...
    ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022
    Conference Proceeding

    This work explores the task of synthesizing speech in non-existent human-sounding voices. We call this task "speaker generation", and present TacoSpawn, a system that performs competitively at this ...
28.
  • Improving Seq2Seq TTS Frontends With Transcribed Speech Audio
    Sun, Siqi; Richmond, Korin; Tang, Hao
    IEEE/ACM transactions on audio, speech, and language processing, 2023, Volume: 31
    Journal Article
    Peer reviewed
    Open access

    Due to the data inefficiency and low speech quality of grapheme-based end-to-end text-to-speech (TTS), having a separate high-performance TTS linguistic frontend is still commonly regarded as ...
29.
  • SR-TTS: a rhyme-based end-to-end speech synthesis system
    Yao, Yihao; Liang, Tao; Feng, Rui ...
    Frontiers in neurorobotics, 02/2024, Volume: 18
    Journal Article
    Peer reviewed
    Open access

    Deep learning has significantly advanced text-to-speech (TTS) systems. These neural network-based systems have enhanced speech synthesis quality and are increasingly vital in applications like ...
30.
  • Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks
    Ramu Reddy, V.; Sreenivasa Rao, K.
    Neurocomputing (Amsterdam), 01/2016, Volume: 171
    Journal Article
    Peer reviewed

    Prosody plays an important role in improving the quality of text-to-speech synthesis (TTS) system. In this paper, features related to the linguistic and the production constraints are proposed for ...