Akademska digitalna zbirka SLovenije - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov konzorcija SI. Za polni dostop se PRIJAVITE.

1 2 3 4 5
zadetkov: 373
1.
Celotno besedilo

PDF
2.
  • Look Closer to See Better: ... Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Jianlong Fu; Heliang Zheng; Tao Mei 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 07/2017
    Conference Proceeding

    Recognizing fine-grained categories (e.g., bird species) is difficult due to the challenges of discriminative region localization and fine-grained feature learning. Existing approaches predominantly ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM
3.
  • Learning Texture Transformer Network for Image Super-Resolution
    Yang, Fuzhi; Yang, Huan; Fu, Jianlong ... 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 01/2020
    Conference Proceeding
    Odprti dostop

    We study on image super-resolution (SR), which aims to recover realistic textures from a low-resolution (LR) image. Recent progress has been made by taking high-resolution images as references (Ref), ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
4.
  • Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition
    Heliang Zheng; Jianlong Fu; Tao Mei ... 2017 IEEE International Conference on Computer Vision (ICCV), 10/2017
    Conference Proceeding

    Recognizing fine-grained categories (e.g., bird species) highly relies on discriminative part localization and part-based fine-grained feature learning. Existing approaches predominantly solve these ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM
5.
  • Learning Spatio-Temporal Transformer for Visual Tracking
    Yan, Bin; Peng, Houwen; Fu, Jianlong ... 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 01/2021
    Conference Proceeding
    Odprti dostop

    In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component. The encoder models the global spatio-temporal feature dependencies between target ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
6.
  • Multi-level Attention Netwo... Multi-level Attention Networks for Visual Question Answering
    Dongfei Yu; Jianlong Fu; Tao Mei ... 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017-July
    Conference Proceeding

    Inspired by the recent success of text-based question answering, visual question answering (VQA) is proposed to automatically answer natural language questions with the reference to a given image. ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM
7.
  • Rethinking and Improving Relative Position Encoding for Vision Transformer
    Wu, Kan; Peng, Houwen; Chen, Minghao ... 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 01/2021
    Conference Proceeding
    Odprti dostop

    Relative position encoding (RPE) is important for transformer to capture sequence ordering of input tokens. General efficacy has been proven in natural language processing. However, in computer ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
8.
  • Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition
    Zheng, Heliang; Fu, Jianlong; Zha, Zheng-Jun ... 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 06/2019
    Conference Proceeding
    Odprti dostop

    Learning subtle yet discriminative features (e.g., beak and eyes for a bird) plays a significant role in fine-grained image recognition. Existing attention-based approaches localize and amplify ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
9.
  • Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting
    Xu, Chenfeng; Qiu, Kai; Fu, Jianlong ... 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 10/2019
    Conference Proceeding
    Odprti dostop

    Dense crowd counting aims to predict thousands of human instances from an image, by calculating integrals of a density map over image pixels. Existing approaches mainly suffer from the extreme ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
10.
  • Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner
    Tseng-Hung Chen; Yuan-Hong Liao; Ching-Yao Chuang ... 2017 IEEE International Conference on Computer Vision (ICCV), 2017-Oct.
    Conference Proceeding

    Impressive image captioning results are achieved in domains with plenty of training image and sentence pairs (e.g., MSCOCO). However, transferring to a target domain with significant domain shifts ...
Celotno besedilo
Dostopno za: IJS, NUK, UL, UM

PDF
1 2 3 4 5
zadetkov: 373

Nalaganje filtrov