Fine-tuning Language Models over Slow Networks using Activation Quantization with Guarantees

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (36. : 2022 : New Orleans, La.; Online) 36th Conference on Neural Information Processing Systems (NeurIPS 2022 ; Volume 25 of 50
1. Verfasser: WANG, Jue (VerfasserIn)
Weitere Verfasser: Yuan, Binhang (VerfasserIn), Rimanic, Luka (VerfasserIn), He, Yongjun (VerfasserIn), Dao, Tri (VerfasserIn), Chen, Beidi (VerfasserIn), Ré, Christopher (VerfasserIn), Zhang, Ce (VerfasserIn)
Pages:36
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2023
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation 2023 Gao, Bin-Bin
Most Activation Functions Can Win the Lottery Without Excessive Depth 2023 Burkholz, Rebekka
DaDA: Distortion-aware Domain Adaptation for Unsupervised Semantic Segmentation 2023 Jang, Sujin
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes 2023 Xue, Zeyue
Rapid Model Architecture Adaption for Meta-Learning 2023 Zhao, Yiren
Unsupervised Learning under Latent Label Shift 2023 Roberts, Manley
QC-StyleGAN - Quality Controllable Image Generation and Manipulation 2023 Nguyen, Dat Viet Thanh
Influencing Long-Term Behavior in Multiagent Reinforcement Learning 2023 Kim, Dong-Ki
Convergent Representations of Computer Programs in Human and Artificial Neural Networks 2023 Srikant, Shashank
Data Distributional Properties Drive Emergent In-Context Learning in Transformers 2023 Chan, Stephanie
Memory safe computations with XLA compiler 2023 Artemev, Artem
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training 2023 Yuan, Geng
Double Bubble, Toil and Trouble: Enhancing Certified Robustness through Transitivity 2023 Cullen, Andrew
Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem 2023 Kumar, Raunak
Large-Scale Differentiable Causal Discovery of Factor Graphs 2023 Lopez, Romain
Private Estimation with Public Data 2023 Bie, Alex
Exploring through Random Curiosity with General Value Functions 2023 Ramesh, Aditya
Optimal Transport-based Identity Matching for Identity-invariant Facial Expression Recognition 2023 Kim, Daeha
Quantized Training of Gradient Boosting Decision Trees 2023 Shi, Yu
Reconstruction on Trees and Low-Degree Polynomials 2023 Koehler, Frederic
Alle Artikel auflisten