使用大型语言模型生成程序练习的现状调查研究

May, 2024

使用大型语言模型生成程序练习的现状调查研究

A Survey Study on the State of the Art of Programming Exercise Generation using Large Language Models

Eduard Frankford, Ingo Höhn, Clemens Sauerwein, Ruth Breu

TL;DR通过调查研究，本文分析了大语言模型（LLMs）在编程练习生成能力方面的状况，并提出了一个评估矩阵，帮助研究人员和教育工作者决定哪个LLM适合编程练习生成用例。此外，本文还发现多个LLM能够生成有用的编程练习，但存在着LLMs能够解决由LLMs生成的练习的难题。该论文对LLMs在教育中的整合进行了有益的讨论。

Abstract

This paper analyzes large language models (LLMs) with regard to their programming exercise generation capabilities. Through a survey study, we defined the state of the art, extracted their strengths and weaknesse