BriefGPT.xyz
May, 2021
Measuring Coding Challenge Competence With APPS
Dan Hendrycks, Steven Basart, Saurav Kadavath, Mantas Mazeika, Akul Arora...
TL;DR
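The paper introduces the APPS benchmark to evaluate progress in code generation. It finds that current machine learning models are beginning to learn to code, but still make syntax errors when generating Python code.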
Abstract
While programming is one of the most broadly applicable skills in modern society, modern machine learning models still cannot code solutions to basic problems. It can be difficult to accurately assess