BriefGPT.xyz
Sep, 2023
Exploring LLM-Based Automatic Grading Methods for Short Textual Answers
Towards LLM-based Autograding for Short Textual Answers
Johannes Schneider, Bernd Schenk, Christina Niklaus, Michaelis Vlachos
TL;DR
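By assessing the feasibility of large language models (LLMs) for autograding and highlighting how LLMs can support educators in validating their grading procedures, the study shows that while "out-of-the-box" LLMs are a valuable tool for providing a complementary perspective, their readiness for independent automated grading remains a work in progress and requires human oversight.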
Abstract
Grading of exams is an important, labor intensive, subjective, repetitive and frequently challenging task. The feasibility of autograding textual responses has greatly increased thanks to the availability of large langu