BriefGPT.xyz
Feb, 2024
精细调整增强现有机制: 实体追踪案例研究
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
HTML
PDF
Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau
TL;DR
通过对细分任务的调优,研究模型的内部计算如何受到影响,并在实体跟踪中显示出性能提升。
Abstract
fine-tuning
on
generalized tasks
such as instruction following, code generation, and mathematics has been shown to enhance
language models
→