Plugging Stylized Controls in Open-Stylized Image Captioning

Bibliographic Details
Published in: PRCV (6th : 2023 : Xiamen) Pattern recognition and computer vision ; Part 1
Main author: Wang, Jie (Author)
Other authors: Zheng, Yixiao (Author), Du, Ruoyi (Author), Zhang, Yiming (Author), Liang, Kongming (Author), Ma, Zhanyu (Author)
Pages: 1
Format: UnknownFormat
Language: English
Published: 2024
Title Year Author
Unsupervised Prototype Adapter for Vision-Language Models 2024 Zhang, Yi
Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection 2024 Wang, Longzheng
Learning Adapters for Text-Guided Portrait Stylization with Pretrained Diffusion Models 2024 Yang, Mintu
Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition 2024 Xu, Yang
Multimodal Rumor Detection by Using Additive Angular Margin with Class-Aware Attention for Hard Samples 2024 Zhou, Chenyu
Location Attention Knowledge Embedding Model for Image-Text Matching 2024 Xu, Guoqing
Efficient Adversarial Training with Membership Inference Resistance 2024 Yan, Ran
Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition 2024 Xin, Wentian
Segmenting Key Clues to Induce Human-Object Interaction Detection 2024 Xue, Mingliang
Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition 2024 Luo, Jinzhao
EdgeFusion: Infrared and Visible Image Fusion Algorithm in Low Light 2024 Song, Zikun
Plugging Stylized Controls in Open-Stylized Image Captioning 2024 Wang, Jie
An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation 2024 Hu, Lingfeng
Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering 2024 Zhang, Qing
Enhancing Recommender System with Multi-modal Knowledge Graph 2024 Sun, Chengjie
Contrastive Perturbation Network for Weakly Supervised Temporal Sentence Grounding 2024 Han, Tingting
Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks 2024 Zhu, Xiaowei
Spatio-Temporal Self-supervision for Few-Shot Action Recognition 2024 Yu, Wanchuan
A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model 2024 Li, Jiulin
Image Priors Assisted Pre-training for Point Cloud Shape Analysis 2024 Li, Zhengyu