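TL;DR: This paper evaluates the commercial large language models (LLMs) GPT-3.5-Turbo and GPT-4 on tasks from the 2023 BioASQ challenge, where zero-shot learning with relevant passages reached a competitive level.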
Abstract
We assessed the performance of commercial large language models (LLMs) GPT-3.5-Turbo and GPT-4 on tasks from the 2023 BioASQ challenge. In Task 11b Phase B, which is focused on →