Gonzalo Martínez, José Alberto Hernández, Javier Conde, Pedro Reviriego, Elena Merino
TL;DR评估使用语言模型生成的文本的词汇丰富性及其与模型参数的相关性。
Abstract
The performance of conversational large language models (LLMs) in general, and of ChatGPT in particular, is currently being evaluated on many different tasks, from logical reasoning or maths to answering questions on a myriad of topics. Instead, much less attention is being devoted to