DIKUL - logo

Search results

Basic search    Expert search   

Currently you are NOT authorised to access e-resources UL. For full access, REGISTER.

1 2 3 4 5
hits: 46
1.
  • ASR is All You Need: Cross-Modal Distillation for Lip Reading
    Afouras, Triantafyllos; Chung, Joon Son; Zisserman, Andrew ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05/2020
    Conference Proceeding
    Open access

    The goal of this work is to train strong models for visual speech recognition without requiring human annotated ground truth data. We achieve this by distilling from an Automatic Speech Recognition ...
Full text
Available for: UL

PDF
2.
  • Counterfactual Multi-Agent ... Counterfactual Multi-Agent Policy Gradients
    Foerster, Jakob; Farquhar, Gregory; Afouras, Triantafyllos ... Proceedings of the ... AAAI Conference on Artificial Intelligence, 04/2018, Volume: 32, Issue: 1
    Journal Article

    Many real-world problems, such as network packet routing and the coordination of autonomous vehicles, are naturally modelled as cooperative multi-agent systems. There is a great need for new ...
Full text
Available for: UL
3.
  • Deep Audio-Visual Speech Re... Deep Audio-Visual Speech Recognition
    Afouras, Triantafyllos; Chung, Joon Son; Senior, Andrew ... IEEE transactions on pattern analysis and machine intelligence, 12/2022, Volume: 44, Issue: 12
    Journal Article
    Peer reviewed

    The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of ...
Full text
Available for: UL

PDF
4.
Full text

PDF
5.
  • Eventfulness for Interactiv... Eventfulness for Interactive Video Alignment
    Sun, Jiatian; Deng, Longxiulin; Afouras, Triantafyllos ... ACM transactions on graphics, 01/08, Volume: 42, Issue: 4
    Journal Article
    Peer reviewed

    Humans are remarkably sensitive to the alignment of visual events with other stimuli, which makes synchronization one of the hardest tasks in video editing. A key observation of our work is that most ...
Full text
Available for: UL
6.
Full text

PDF
7.
  • Localizing Visual Sounds the Hard Way
    Chen, Honglie; Xie, Weidi; Afouras, Triantafyllos ... 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021-June
    Conference Proceeding
    Open access

    The objective of this work is to localize sound sources that are visible in a video without using manual annotations. Our key technical contribution is to show that, by training the network to ...
Full text
Available for: UL

PDF
8.
  • Sub-word Level Lip Reading With Visual Attention
    Prajwal, K R; Afouras, Triantafyllos; Zisserman, Andrew 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022-June
    Conference Proceeding
    Open access

    The goal of this paper is to learn strong lip reading models that can recognise speech in silent videos. Most prior works deal with the open-set visual speech recognition problem by adapting existing ...
Full text
Available for: UL
9.
  • Scaling Up Sign Spotting Th... Scaling Up Sign Spotting Through Sign Language Dictionaries
    Varol, Gül; Momeni, Liliane; Albanie, Samuel ... International journal of computer vision, 06/2022, Volume: 130, Issue: 6
    Journal Article
    Peer reviewed
    Open access

    The focus of this work is sign spotting –given a video of an isolated sign, our task is to identify whether and where it has been signed in a continuous, co-articulated sign language video. To ...
Full text
Available for: CEKLJ, UL
10.
  • Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
    Rahimi, Akam; Afouras, Triantafyllos; Zisserman, Andrew 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022-June
    Conference Proceeding
    Open access

    The goal of this paper is speech separation and enhancement in multi-speaker and noisy environments using a combination of different modalities. Previous works have shown good performance when ...
Full text
Available for: UL
1 2 3 4 5
hits: 46

Load filters