ADAPT: Action-Aware Driving Caption Transformer

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:CAAI International Conference on Artificial Intelligence (3. : 2023 : Fuzhou, Fujian) Artificial intelligence ; Part 1
1. Verfasser: Jin, Bu (VerfasserIn)
Weitere Verfasser: Liu, Haotian (VerfasserIn)
Pages:1
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2024
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
ViT-MPI: Vision Transformer Multiplane Images for Surgical Single-View View Synthesis 2024 Han, Chenming
LEAD: LiDAR Extender for Autonomous Driving 2024 Zhang, Jianing
3D-B2U: Self-supervised Fluorescent Image Sequences Denoising 2024 Wang, Jianan
Weakly-Supervised Grounding for VQA with Dual Visual-Linguistic Interaction 2024 Liu, Yi
Self-supervised Meta Auxiliary Learning for Actor and Action Video Segmentation from Natural Language 2024 Ye, Linwei
SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation 2024 Yu, Bin
ADAPT: Action-Aware Driving Caption Transformer 2024 Jin, Bu
Sequential Style Consistency Learning for Domain-Generalizable Text Recognition 2024 Zhang, Pengcheng
Heterogeneous Link Prediction via Mutual Information Maximization Between Node Pairs 2024 Lu, Yifan
MusicGAIL: A Generative Adversarial Imitation Learning Approach for Music Generation 2024 Liao, Yusong
Unsupervised Traditional Chinese Herb Mention Normalization via Robustness-Promotion Oriented Self-supervised Training 2024 Li, Wei
Multi-modal Dialogue State Tracking for Playing GuessWhich Game 2024 Pang, Wei
Dual-Domain Network for Restoring Images from Under-Display Cameras 2024 Wang, Di
Explicit Composition of Neural Radiance Fields by Learning an Occlusion Field 2024 Sun, Xunsen
Equivariant Indoor Illumination Map Estimation from a Single Image 2024 Ai, Yusen
Lightweight Rolling Shutter Image Restoration Network Based on Undistorted Flow 2024 Wang, Binfeng
An Efficient Graph Transformer Network for Video-Based Human Mesh Reconstruction 2024 Tang, Tao
Fast Point Cloud Registration for Urban Scenes via Pillar-Point Representation 2024 Gu, Siyuan
GLCANet: Context Attention for Infrared Small Target Detection 2024 Liu, Rui
RsMmFormer: Multimodal Transformer Using Multiscale Self-attention for Remote Sensing Image Classification 2024 Zhang, Bo
Alle Artikel auflisten