Deep Compression of Pre-trained Transformer Models

Bibliographic Details
Published in: NeurIPS (36th : 2022 : New Orleans, La.; Online) 36th Conference on Neural Information Processing Systems (NeurIPS 2022); Volume 19 of 50
Main Author: Wang, Naigang (Author)
Other Authors: Liu, Chi-Chun (Charlie) (Author), Venkataramani, Swagath (Author), Sen, Sanchari (Author), Chen, Chia-Yu (Author), El Maghraoui, Kaoutar (Author), Srinivasan, Vijayalakshmi (Viji) (Author), Chang, Leland (Author)
Pages: 36
Format: UnknownFormat
Language: English
Published: 2023
Title Year Author
List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering 2023 Diakonikolas, Ilias
A Theory of PAC Learnability under Transformation Invariances 2023 Shao, Han
Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees 2023 Beznosikov, Aleksandr
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning 2023 Wu, Chenyang
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models 2023 Shu, Manli
Perturbation Learning Based Anomaly Detection 2023 Cai, Jinyu
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders 2023 Li, Gang
Unsupervised Domain Adaptation for Semantic Segmentation using Depth Distribution 2023 Wu, Quanliang
Are all Frames Equal? Active Sparse Labeling for Video Action Detection 2023 Rana, Aayush
In the Eye of the Beholder: Robust Prediction with Causal User Modeling 2023 Feder, Amir
An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning 2023 Wei, Xiu-Shen
Fast Vision Transformers with HiLo Attention 2023 Pan, Zizheng
Rare Gems: Finding Lottery Tickets at Initialization 2023 Sreenivasan, Kartik
Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization 2023 Huang, Huaibo
Using Mixup as a Regularizer Can Surprisingly Improve Accuracy & Out-of-Distribution Robustness 2023 Pinto, Francesco
FP8 Quantization: The Power of the Exponent 2023 Kuzmin, Andrey
Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection 2023 Enouen, James
PALBERT: Teaching ALBERT to Ponder 2023 Balagansky, Nikita
Analyzing Data-Centric Properties for Graph Contrastive Learning 2023 Trivedi, Puja
Exploring the Whole Rashomon Set of Sparse Decision Trees 2023 Xin, Rui