Jul, 2024
Large Language Models as Reliable Knowledge Bases?
Danna Zheng, Mirella Lapata, Jeff Z. Pan
TL;DR
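The reliability and effectiveness of large language models (LLMs) as knowledge bases remain underexplored. This study defines reliability criteria and metrics and evaluates 26 popular LLMs. It finds that even high-performing models such as GPT-3.5-turbo are neither factual nor consistent, and that strategies such as in-context learning and fine-tuning fail to improve the performance of these models as knowledge bases.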
Abstract
The NLP community has recently shown a growing interest in leveraging large language models (LLMs) for knowledge-intensive tasks, viewing LLMs as potential knowledge bases (KBs). However, the reliability and exte