Transformer Memory as a Differentiable Search Index

Bibliographic details
Published in: NeurIPS (36th : 2022 : New Orleans, La.; Online), 36th Conference on Neural Information Processing Systems (NeurIPS 2022); Volume 29 of 50
First author: Tay, Yi (author)
Other authors: Tran, Vinh (author), Dehghani, Mostafa (author), Ni, Jianmo (author), Bahri, Dara (author), Mehta, Harsh (author), Qin, Zhen (author), Hui, Kai (author), Zhao, Zhe (author), Gupta, Jai (author), Schuster, Tal (author), Cohen, William W (author), Metzler, Donald (author)
Pages: 36
Language: English
Published: 2023
Title Year Author
Insights into Pre-training via Simpler Synthetic Tasks 2023 Wu, Yuhuai
C2FAR: Coarse-to-Fine Autoregressive Networks for Precise Probabilistic Forecasting 2023 Bergsma, Shane
Respecting Transfer Gap in Knowledge Distillation 2023 Niu, Yulei
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts 2023 Ashkboos, Saleh
Probabilistic Missing Value Imputation for Mixed Categorical and Ordered Data 2023 Zhao, Yuxuan
SGAM: Building a Virtual 3D World through Simultaneous Generation and Mapping 2023 Shen, Yuan
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models 2023 Zhang, Zijian
One-shot Neural Backdoor Erasing via Adversarial Weight Masking 2023 Chai, Shuwen
Revisiting Neural Scaling Laws in Language and Vision 2023 Alabdulmohsin, Ibrahim M
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent 2023 Bai, Yu
A general approximation lower bound in L^p norm, with applications to feed-forward neural networks 2023 Achour, El Mehdi
Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel Attribute Synthesis 2023 Li, Yu-Hsuan
Exploiting Semantic Relations for Glass Surface Detection 2023 Lin, Jiaying
GREED: A Neural Framework for Learning Graph Distance Functions 2023 Ranjan, Rishabh
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning 2023 Liang, Yongyuan
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient 2023 Li, Yewen
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish 2023 Augustyniak, Lukasz
Communication Acceleration of Local Gradient Methods via an Accelerated Primal-Dual Algorithm with an Inexact Prox 2023 Sadiev, Abdurakhmon
Aligning individual brains with fused unbalanced Gromov Wasserstein 2023 Thual, Alexis
An In-depth Study of Stochastic Backpropagation 2023 Fang, Jun