DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PRCV (6. : 2023 : Xiamen) Pattern recognition and computer vision ; Part 6
1. Verfasser: Jing, Shicheng (VerfasserIn)
Weitere Verfasser: Xie, Liping (VerfasserIn)
Pages:6
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2024
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
DFAR-Net: Dual-Input Three-Branch Attention Fusion Reconstruction Network for Polarized Non-Line-of-Sight Imaging 2024 Liu, Hao
Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection 2024 Li, Zhangxun
EVCPP:Example-Driven Virtual Camera Pose Prediction for Cloud Performing Arts Scenes 2024 Qiu, Jucheng
Multimodal Local Feature Enhancement Network for Video Summarization 2024 Li, Zhaoyun
Flow-Guided Diffusion Autoencoder for Unsupervised Video Anomaly Detection 2024 Zhu, Aoni
Temporal-Semantic Context Fusion for Robust Weakly Supervised Video Anomaly Detection 2024 Zeng, Yuan
A Survey: The Sensor-Based Method for Sign Language Recognition 2024 Yang, Tian
Cross-Dataset Distillation with Multi-tokens for Image Quality Assessment 2024 Gao, Timin
EKGRL: Entity-Based Knowledge Graph Representation Learning for Fact-Based Visual Question Answering 2024 Ren, Yongjian
Disentangled Attribute Features Vision Transformer for Pedestrian Attribute Recognition 2024 Liu, Caihua
A High-Resolution Network Based on Feature Redundancy Reduction and Attention Mechanism 2024 Pan, Yuqing
RSID: A Remote Sensing Image Dehazing Network 2024 Li, Yuan
ContextNet: Learning Context Information for Texture-Less Light Field Depth Estimation 2024 Chao, Wentao
An Efficient Way for Active None-Line-of-Sight: End-to-End Learned Compressed NLOS Imaging 2024 Chang, Chen
WDU-Net: Wavelet-Guided Deep Unfolding Network for Image Compressed Sensing Reconstruction 2024 Wang, Xinlu
Enhancing Feature Representation for Anomaly Detection via Local-and-Global Temporal Relations and a Multi-stage Memory 2024 Li, Xuan
Asymmetric Attention Fusion for Unsupervised Video Object Segmentation 2024 Jiang, Hongfan
Unimodal-Multimodal Collaborative Enhancement for Audio-Visual Event Localization 2024 Tian, Huilin
Going Beyond Closed Sets: A Multimodal Perspective for Video Emotion Analysis 2024 Pu, Hao
Denoised Temporal Relation Network for Temporal Action Segmentation 2024 Ma, Zhichao
Alle Artikel auflisten