BriefGPT.xyz
May, 2023
Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez
TL;DR
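This paper introduces Gorilla, a fine-tuned LLaMA-based model that surpasses the state-of-the-art GPT-4 at writing API calls. Combined with a document retriever, Gorilla adapts well to test-time documentation changes and reduces the hallucination that arises when LLMs are prompted directly, demonstrating the potential of LLMs to use tools.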
Abstract
Large language models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis. However, their potential to effectively use tools via