BriefGPT.xyz
Nov, 2022
GPT-Neo用于常识推理——理论和实践视角
GPT-Neo for commonsense reasoning-a theoretical and practical lens
HTML
PDF
Rohan Kashyap, Vivek Kashyap, Narendra C. P
TL;DR
本文评估了GPT-neo 1.3亿模型在常识推理任务上的表现,发现模型在某些任务上具有竞争力,但当数据集大小显著较小时表现会很差。研究者还使用可视化和推理测试来证实结果,并通过多种方法进行彻底的健壮性测试。
Abstract
Recent work has demonstrated substantial gains in
pre-training
large-scale
unidirectional language models
such as the GPT-2, GPT-3, and
gpt-neo
→