Akademska digitalna zbirka SLovenije - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov konzorcija SI. Za polni dostop se PRIJAVITE.

1 2 3 4 5
zadetkov: 114
21.
  • Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation
    Wang, Zhenyu; Xie, Enze; Li, Aoxue ... arXiv.org, 01/2024
    Paper, Journal Article
    Odprti dostop

    Despite significant advancements in text-to-image models for generating high-quality images, these methods still struggle to ensure the controllability of text prompts over images in the context of ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
22.
  • Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving
    Shi, Hao; Wang, Song; Zhang, Jiaming ... arXiv.org, 07/2024
    Paper, Journal Article
    Odprti dostop

    Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC), presents a significant challenge in computer vision. Previous methods, confined to onboard processing, struggle ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
23.
  • Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking
    Hou, Yunzhong; Wang, Zhongdao; Wang, Shengjin ... arXiv.org, 12/2021
    Paper, Journal Article
    Odprti dostop

    Data associations in multi-target multi-camera tracking (MTMCT) usually estimate affinity directly from re-identification (re-ID) feature distances. However, we argue that it might not be the best ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
24.
  • Synthetic Data Are as Good as the Real for Association Knowledge Learning in Multi-object Tracking
    Liu, Yuchi; Wang, Zhongdao; Zhou, Xiangxin ... arXiv (Cornell University), 10/2021
    Paper, Journal Article
    Odprti dostop

    Association, aiming to link bounding boxes of the same identity in a video sequence, is a central component in multi-object tracking (MOT). To train association modules, e.g., parametric networks, ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
25.
  • PixArt-\Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
    Chen, Junsong; Ge, Chongjian; Xie, Enze ... arXiv.org, 03/2024
    Paper, Journal Article
    Odprti dostop

    In this paper, we introduce PixArt-\Sigma, a Diffusion Transformer model~(DiT) capable of directly generating images at 4K resolution. PixArt-\Sigma represents a significant advancement over its ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
26.
  • MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
    Ge, Chongjian; Chen, Junsong; Xie, Enze ... arXiv.org, 04/2023
    Paper, Journal Article
    Odprti dostop

    Perception systems in modern autonomous driving vehicles typically take inputs from complementary multi-modal sensors, e.g., LiDAR and cameras. However, in real-world applications, sensor corruptions ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
27.
  • Generalizable Re-Identification from Videos with Cycle Association
    Wang, Zhongdao; Dou, Zhaopeng; Zhang, Jingwei ... arXiv (Cornell University), 11/2022
    Paper, Journal Article
    Odprti dostop

    In this paper, we are interested in learning a generalizable person re-identification (re-ID) representation from unlabeled videos. Compared with 1) the popular unsupervised re-ID setting where the ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
28.
  • Towards Real-Time Multi-Object Tracking
    Wang, Zhongdao; Zheng, Liang; Liu, Yixuan ... arXiv (Cornell University), 07/2020
    Paper, Journal Article
    Odprti dostop

    Modern multiple object tracking (MOT) systems usually follow the \emph{tracking-by-detection} paradigm. It has 1) a detection model for target localization and 2) an appearance embedding model for ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
29.
  • Query Adaptive Late Fusion for Image Retrieval
    Wang, Zhongdao; Zheng, Liang; Wang, Shengjin arXiv (Cornell University), 10/2018
    Paper, Journal Article
    Odprti dostop

    Feature fusion is a commonly used strategy in image retrieval tasks, which aggregates the matching responses of multiple visual features. Feasible sets of features can be either descriptors (SIFT, ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
30.
  • Softmax Dissection: Towards Understanding Intra- and Inter-class Objective for Embedding Learning
    He, Lanqing; Wang, Zhongdao; Li, Yali ... arXiv (Cornell University), 02/2020
    Paper, Journal Article
    Odprti dostop

    The softmax loss and its variants are widely used as objectives for embedding learning, especially in applications like face recognition. However, the intra- and inter-class objectives in the softmax ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
1 2 3 4 5
zadetkov: 114

Nalaganje filtrov