BriefGPT.xyz
Apr, 2023
学习外推:一种横式学习方法
Learning to Extrapolate: A Transductive Approach
HTML
PDF
Aviv Netanyahu, Abhishek Gupta, Max Simchowitz, Kaiqing Zhang, Pulkit Agrawal
TL;DR
本文研究了利用和超参数微调相关的重新参数化策略,增强深度学习系统在特定条件下的组合泛化能力,从而解决超域外推问题。该方法在各种监督学习和模仿学习任务中均具有实用性。
Abstract
machine learning
systems, especially with
overparameterized deep neural networks
, can generalize to novel test instances drawn from the same distribution as the training data. However, they fare poorly when evalu
→