BriefGPT.xyz
May, 2023
弥合断层:自然语言生成中融入(人类)反馈的调查
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
HTML
PDF
Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins...
TL;DR
本文对利用人类反馈来提高自然语言生成的研究进行了综述。通过介绍反馈的形式和目标,讨论了直接使用反馈或训练反馈模型两种方法在训练和解码过程中的应用。此外,我们还探讨了与反馈收集相关的现有数据集和问题,并提供了人工智能反馈领域的概述。
Abstract
Many recent advances in
natural language generation
have been fueled by training
large language models
on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelp
→