Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Bibliographic Details
Published in: ECCV (17th : 2022 : Tel Aviv; Online) Computer vision – ECCV 2022 ; Part 36
Main Author: Boecking, Benedikt (Author)
Other Authors: Usuyama, Naoto (Author), Bannur, Shruthi (Author), Castro, Daniel C. (Author), Schwaighofer, Anton (Author), Hyland, Stephanie (Author), Wetscherek, Maria (Author), Naumann, Tristan (Author), Nori, Aditya (Author), Alvarez-Valle, Javier (Author), Poon, Hoifung (Author), Oktay, Ozan (Author)
Pages: 2022
Format: UnknownFormat
Language: eng
Published: 2022
Title Year Author
ASSISTER: Assistive Navigation via Conditional Instruction Generation 2022 Huang, Zanming
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding 2022 Shi, Cheng
Contrastive Vision-Language Pre-training with Limited Resources 2022 Cui, Quan
Classification-Regression for Chart Comprehension 2022 Levy, Matan
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant 2022 Wong, Benita
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation 2022 Lin, Chuang
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels 2022 Ghiasi, Golnaz
NewsStories: Illustrating Articles with Visual Summaries 2022 Tan, Reuben
Object-Centric Unsupervised Image Captioning 2022 Meng, Zihang
Learning Linguistic Association Towards Efficient Text-Video Retrieval 2022 Fang, Sheng
Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing 2022 Boecking, Benedikt
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input 2022 Guo, Qingpei
Video Graph Transformer for Video Question Answering 2022 Xiao, Junbin
Rethinking Data Augmentation for Robust Visual Question Answering 2022 Chen, Long
Word-Level Fine-Grained Story Visualization 2022 Li, Bowen
Webly Supervised Concept Expansion for General Purpose Vision Models 2022 Kamath, Amita
Unifying Event Detection and Captioning as Sequence Generation via Pre-training 2022 Zhang, Qi
Fine-Grained Visual Entailment 2022 Thomas, Christopher
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection 2022 Hong, Joanna
Language-Driven Artistic Style Transfer 2022 Fu, Tsu-Jui