BriefGPT.xyz
Mar, 2023
GPT-4 在医疗挑战问题上的能力
Capabilities of GPT-4 on Medical Challenge Problems
HTML
PDF
Harsha Nori, Nicholas King, Scott Mayer McKinney, Dean Carignan, Eric Horvitz
TL;DR
通过对 USMLE 和 MultiMedQA 基准数据集的全面评估,我们发现不需要专门的提示造型来激发 GPT-4,它的表现超过了 USMLE 的合格分数约 20 分,并表现优于早期的通用模型(GPT-3.5)以及专门针对医学知识进行细化调整的模型(Med-PaLM,Flan-PaLM540B的提示调整版本)。
Abstract
large language models
(LLMs) have demonstrated remarkable capabilities in natural language understanding and generation across various domains, including medicine. We present a comprehensive evaluation of
gpt-4
,
→