BriefGPT.xyz
Jul, 2024
评估生成和判断编程反馈的语言模型
Evaluating Language Models for Generating and Judging Programming Feedback
HTML
PDF
Charles Koutcheme, Nicola Dainese, Arto Hellas, Sami Sarsa, Juho Leinonen...
TL;DR
使用开源的大型语言模型在学习编程中评估编程作业反馈的高质量和评判编程反馈的质量方面,与专有的模型相比,取得了很好的效果。
Abstract
The emergence of
large language models
(
llms
) has transformed research and practice in a wide range of domains. Within the
computing education re
→