BriefGPT.xyz
Feb, 2024
基于动态多重奖励权重的多样式可控生成的强化学习
Reinforcement Learning with Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation
HTML
PDF
Karin de Langis, Ryan Koo, Dongyeop Kang
TL;DR
通过强化学习方法控制多种风格的生成,使用动态权重方法优于静态权重方法,并在2个和3个风格控制方面进行了实证探索。
Abstract
style
is an integral component of text that expresses a diverse set of information, including interpersonal dynamics (e.g. formality) and the author's emotions or attitudes (e.g. disgust). Humans often employ
multiple s
→