BriefGPT.xyz
Jun, 2023
通过学习有机交互来提升开放式语言模型
Improving Open Language Models by Learning from Organic Interactions
HTML
PDF
Jing Xu, Da Ju, Joshua Lane, Mojtaba Komeili, Eric Michael Smith...
TL;DR
BlenderBot 3x是一种使用有机对话和反馈数据训练的对话模型,用于提高其技能和安全性,并采用学习技巧以避免不良行为,并针对具有挑战性的情况进行更安全的回应。
Abstract
We present
blenderbot 3x
, an update on the
conversational model
BlenderBot 3, which is now trained using organic conversation and feedback data from participating users of the system in order to improve both its
→