Akademska digitalna zbirka SLovenije - logo

Search results

Basic search    Advanced search   
Search
request
Library

Currently you are NOT authorised to access e-resources SI consortium. For full access, REGISTER.

2 3 4 5 6
hits: 14,649
31.
  • Speech emotion recognition ... Speech emotion recognition based on optimized deep features of dual-channel complementary spectrogram
    Li, Juan; Zhang, Xueying; Li, Fenglian ... Information sciences, November 2023, 2023-11-00, Volume: 649
    Journal Article
    Peer reviewed

    Speech emotion recognition (SER) is an essential field of artificial intelligence. Although the Mel spectrogram is commonly used in SER, it emphasizes low-frequency emotional components. In this ...
Full text
Available for: GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
32.
  • ECG Arrhythmia Classificati... ECG Arrhythmia Classification Using STFT-Based Spectrogram and Convolutional Neural Network
    Huang, Jingshan; Chen, Binqiang; Yao, Bin ... IEEE access, 2019, Volume: 7
    Journal Article
    Peer reviewed
    Open access

    The classification of electrocardiogram (ECG) signals is very important for the automatic diagnosis of heart disease. Traditionally, it is divided into two steps, including the step of feature ...
Full text
Available for: NUK, UL, UM, UPUK

PDF
33.
  • Heart sound classification ... Heart sound classification based on scaled spectrogram and tensor decomposition
    Zhang, Wenjie; Han, Jiqing; Deng, Shiwen Expert systems with applications, 10/2017, Volume: 84
    Journal Article
    Peer reviewed

    •First, the spectrograms of heart cycles are scaled for comparison.•Second, tensor decomposition is utilized to the scaled spectrograms.•Third, the intrinsic structure information of scaled ...
Full text
Available for: GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP
34.
  • Spectral images based envir... Spectral images based environmental sound classification using CNN with meaningful data augmentation
    Mushtaq, Zohaib; Su, Shun-Feng; Tran, Quoc-Viet Applied acoustics, 01/2021, Volume: 172
    Journal Article
    Peer reviewed

    In this study, an effective approach of spectral images based on environmental sound classification using Convolutional Neural Networks (CNN) with meaningful data augmentation is proposed. The ...
Full text
Available for: GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
35.
  • A practical guide for gener... A practical guide for generating unsupervised, spectrogram‐based latent space representations of animal vocalizations
    Thomas, Mara; Jensen, Frants H.; Averly, Baptiste ... The Journal of animal ecology, August 2022, Volume: 91, Issue: 8
    Journal Article
    Peer reviewed
    Open access

    Background: The manual detection, analysis and classification of animal vocalizations in acoustic recordings is laborious and requires expert knowledge. Hence, there is a need for objective, ...
Full text
Available for: BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK
36.
  • Bottom-up broadcast neural ... Bottom-up broadcast neural network for music genre classification
    Liu, Caifeng; Feng, Lin; Liu, Guochao ... Multimedia tools and applications, 02/2021, Volume: 80, Issue: 5
    Journal Article
    Peer reviewed
    Open access

    Music genre classification based on visual representation has been successfully explored over the last years. Recently, there has been increasing interest in attempting convolutional neural networks ...
Full text
Available for: CEKLJ, EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ

PDF
37.
  • Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions
    Shen, Jonathan; Pang, Ruoming; Weiss, Ron J. ... 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04/2018
    Conference Proceeding
    Open access

    This paper describes Tacotron 2, a neural network architecture for speech synthesis directly from text. The system is composed of a recurrent sequence-to-sequence feature prediction network that maps ...
Full text
Available for: IJS, NUK, UL, UM

PDF
38.
  • GMM and CNN Hybrid Method f... GMM and CNN Hybrid Method for Short Utterance Speaker Recognition
    Liu, Zheli; Wu, Zhendong; Li, Tong ... IEEE transactions on industrial informatics, 07/2018, Volume: 14, Issue: 7
    Journal Article

    During the last few years, the speaker recognition technique has been widely attractive for its extensive application in many fields, such as speech communications, domestics services, and smart ...
Full text
Available for: IJS, NUK, UL
39.
  • Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
    Dong, Linhao; Xu, Shuang; Xu, Bo 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018-April
    Conference Proceeding

    Recurrent sequence-to-sequence models using encoder-decoder architecture have made great progress in speech recognition task. However, they suffer from the drawback of slow training speed because the ...
Full text
Available for: IJS, NUK, UL, UM
40.
  • Sequence-to-Sequence Acoust... Sequence-to-Sequence Acoustic Modeling for Voice Conversion
    Zhang, Jing-Xuan; Ling, Zhen-Hua; Liu, Li-Juan ... IEEE/ACM transactions on audio, speech, and language processing, 03/2019, Volume: 27, Issue: 3
    Journal Article
    Peer reviewed
    Open access

    In this paper, a neural network named sequence-to-sequence ConvErsion NeTwork (SCENT) is presented for acoustic modeling in voice conversion. At training stage, a SCENT model is estimated by aligning ...
Full text
Available for: IJS, NUK, UL

PDF
2 3 4 5 6
hits: 14,649

Load filters