Object-Centric Unsupervised Image Captioning

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ECCV (17. : 2022 : Tel Aviv; Online) Computer vision – ECCV 2022 ; Part 36
1. Verfasser: Meng, Zihang (VerfasserIn)
Weitere Verfasser: Yang, David (VerfasserIn), Cao, Xuefei (VerfasserIn), Shah, Ashish (VerfasserIn), Lim, Ser-Nam (VerfasserIn)
Pages:2022
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2022
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Explicit Image Caption Editing 2022 Wang, Zhen
X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks 2022 Cai, Zhaowei
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding 2022 Hao, Jiachang
Generative Negative Text Replay for Continual Vision-Language Pretraining 2022 Yan, Shipeng
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly 2022 Whitehead, Spencer
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds 2022 Jain, Ayush
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling 2022 Yang, Zhengyuan
SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding 2022 Heisler, Morgan
Single-Stream Multi-level Alignment for Vision-Language Pretraining 2022 Khan, Zaid
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval 2022 Wang, Haoran
ASSISTER: Assistive Navigation via Conditional Instruction Generation 2022 Huang, Zanming
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding 2022 Shi, Cheng
Contrastive Vision-Language Pre-training with Limited Resources 2022 Cui, Quan
Classification-Regression for Chart Comprehension 2022 Levy, Matan
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant 2022 Wong, Benita
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation 2022 Lin, Chuang
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels 2022 Ghiasi, Golnaz
NewsStories: Illustrating Articles with Visual Summaries 2022 Tan, Reuben
Object-Centric Unsupervised Image Captioning 2022 Meng, Zihang
Learning Linguistic Association Towards Efficient Text-Video Retrieval 2022 Fang, Sheng
Alle Artikel auflisten