DIKUL - logo

Search results

Basic search    Advanced search   
Search
request
Library

Currently you are NOT authorised to access e-resources UL. For full access, REGISTER.

1 2 3 4 5
hits: 57,172
1.
  • Far-Field Automatic Speech ... Far-Field Automatic Speech Recognition
    Haeb-Umbach, Reinhold; Heymann, Jahn; Drude, Lukas ... Proceedings of the IEEE, 02/2021, Volume: 109, Issue: 2
    Journal Article
    Peer reviewed
    Open access

    The machine recognition of speech spoken at a distance from the microphones, known as far-field automatic speech recognition (ASR), has received a significant increase in attention in science and ...
Full text
Available for: UL

PDF
2.
  • Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture
    Miao, Haoran; Cheng, Gaofeng; Gao, Changfeng ... ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    Conference Proceeding
    Open access

    Recently, Transformer has gained success in automatic speech recognition (ASR) field. However, it is challenging to deploy a Transformer-based end-to-end (E2E) model for online speech recognition. In ...
Full text
Available for: UL

PDF
3.
  • BigSSL: Exploring the Front... BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
    Zhang, Yu; Park, Daniel S.; Han, Wei ... IEEE journal of selected topics in signal processing, 10/2022, Volume: 16, Issue: 6
    Journal Article
    Peer reviewed
    Open access

    We summarize the results of a host of efforts using giant automatic speech recognition (ASR) models pre-trained using large, diverse unlabeled datasets containing approximately a million hours of ...
Full text
Available for: UL
4.
  • Deep Audio-Visual Speech Re... Deep Audio-Visual Speech Recognition
    Afouras, Triantafyllos; Chung, Joon Son; Senior, Andrew ... IEEE transactions on pattern analysis and machine intelligence, 12/2022, Volume: 44, Issue: 12
    Journal Article
    Peer reviewed
    Open access

    The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of ...
Full text
Available for: UL

PDF
5.
  • Automatic Speech Recognitio... Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language
    Mukhamadiyev, Abdinabi; Khujayarov, Ilyos; Djuraev, Oybek ... Sensors (Basel, Switzerland), 05/2022, Volume: 22, Issue: 10
    Journal Article
    Peer reviewed
    Open access

    Communication has been an important aspect of human life, civilization, and globalization for thousands of years. Biometric analysis, education, security, healthcare, and smart cities are only a few ...
Full text
Available for: UL
6.
  • SpeakerBeam: Speaker Aware ... SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures
    Zmolikova, Katerina; Delcroix, Marc; Kinoshita, Keisuke ... IEEE journal of selected topics in signal processing, 08/2019, Volume: 13, Issue: 4
    Journal Article
    Peer reviewed

    The processing of speech corrupted by interfering overlapping speakers is one of the challenging problems with regards to today's automatic speech recognition systems. Recently, approaches based on ...
Full text
Available for: UL
7.
  • A Unified Convolutional Bea... A Unified Convolutional Beamformer for Simultaneous Denoising and Dereverberation
    Nakatani, Tomohiro; Kinoshita, Keisuke IEEE signal processing letters, 06/2019, Volume: 26, Issue: 6
    Journal Article
    Peer reviewed
    Open access

    This letter proposes a method for estimating a convolutional beamformer that can perform denoising and dereverberation simultaneously in an optimal way. The application of dereverberation based on a ...
Full text
Available for: UL

PDF
8.
  • Automatic speech recognitio... Automatic speech recognition for under-resourced languages: A survey
    Besacier, Laurent; Barnard, Etienne; Karpov, Alexey ... Speech communication, January 2014, 2014-1-00, 20140101, 2014, Volume: 56, Issue: Jan
    Journal Article
    Peer reviewed
    Open access

    Speech processing for under-resourced languages is an active field of research, which has experienced significant progress during the past decade. We propose, in this paper, a survey that focuses on ...
Full text
Available for: UL
9.
  • A Unified Framework for Mul... A Unified Framework for Multilingual Speech Recognition in Air Traffic Control Systems
    Lin, Yi; Guo, Dongyue; Zhang, Jianwei ... IEEE transaction on neural networks and learning systems, 2021-Aug., 2021-08-00, 2021-8-00, 20210801, Volume: 32, Issue: 8
    Journal Article

    This work focuses on robust speech recognition in air traffic control (ATC) by designing a novel processing paradigm to integrate multilingual speech recognition into a single framework using three ...
Full text
Available for: UL
10.
  • End-to-End Audiovisual Spee... End-to-End Audiovisual Speech Recognition System With Multitask Learning
    Tao, Fei; Busso, Carlos IEEE transactions on multimedia, 2021, Volume: 23
    Journal Article
    Peer reviewed

    An automatic speech recognition (ASR) system is a key component in current speech-based systems. However, the surrounding acoustic noise can severely degrade the performance of an ASR system. An ...
Full text
Available for: UL
1 2 3 4 5
hits: 57,172

Load filters