Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation

Bibliographic Details
Published in: International Conference on MultiMedia Modeling (30th : 2024 : Amsterdam) Multimedia modeling ; Part 3
Main Author: Xing, Linzi (Author)
Other Authors: Tran, Quan (Author), Caba, Fabian (Author), Dernoncourt, Franck (Author), Yoon, Seunghyun (Author), Wang, Zhaowen (Author), Bui, Trung (Author), Carenini, Giuseppe (Author)
Pages: 3
Language: English
Published: 2024
Title Year Author
Multi-task Collaborative Network for Image-Text Retrieval 2024 Qin, Xueyang
MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection 2024 Peng, Ting
Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos 2024 Zheng, Honglei
Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution 2024 Sheng, Chunyin
Co-speech Gesture Generation with Variational Auto Encoder 2024 Ka, Shinichi
Exploring Imperceptible Adversarial Examples in YCbCr Color Space 2024 Chen, Pei
CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis 2024 Liu, Junpeng
Dual-Fisheye Image Stitching via Unsupervised Deep Learning 2024 Jin, Zhanjie
C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds 2024 Chen, Xiangyu
A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction 2024 Zhang, Yiru
Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization 2024 Li, Pan
Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks 2024 Wang, Shuai
A Multidimensional Taxonomy Model for Music Tangible User Interfaces 2024 Baratè, Adriano
Semantic Transition Detection for Self-supervised Video Scene Segmentation 2024 Chen, Lu
FGENet: Fine-Grained Extraction Network for Congested Crowd Counting 2024 Ma, Hao-Yuan
Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations 2024 He, Yuanjian
Two-Stage Reasoning Network with Modality Decomposition for Text VQA 2024 Ling, Shengrong
Exploring Multi-modal Fusion for Image Manipulation Detection and Localization 2024 Triaridis, Konstantinos
Fractional-Order Image Moments and Applications 2024 Xu, Liyun
MC-TCMNER: A Multi-modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER 2024 Cao, Shan