BriefGPT.xyz
Nov, 2023
ML-Bench:大型语言模型基于开源库进行机器学习任务
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks
HTML
PDF
Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang...
TL;DR
通过使用开源库完成机器学习任务,本文旨在提出一种新的评估设置,以评估大型语言模型(LLMs)在实际编程中的适用性,并介绍了ML-Bench和ML-Agent两个工具,用于评估LLMs在利用开源函数时的有效性。
Abstract
large language models
have shown promising performance in
code generation
benchmarks. However, a considerable divide exists between these benchmark achievements and their practical applicability, primarily attrib
→