BriefGPT.xyz
Nov, 2023
LooGLE: 长文本语言模型是否理解长文本上下文?
LooGLE: Can Long-Context Language Models Understand Long Contexts?
HTML
PDF
Jiaqi Li, Mengmeng Wang, Zilong Zheng, Muhan Zhang
TL;DR
基于LooGLE评估模型的表现,研究显示商业模型在短依赖任务上胜过开源模型,同时也揭示了长依赖任务的困难,并指出在短问答任务中检索式技术有着明显的好处,而扩展上下文窗口长度的策略对于长上下文理解的影响有限。
Abstract
large language models
(
llms
), despite their impressive performance in various language tasks, are typically limited to processing texts within context-window size. This limitation has spurred significant research
→