BriefGPT.xyz
Apr, 2023
ChatGPT-Crawler: Checking whether what ChatGPT says is reliable
ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about
Aman Rangapur, Haoran Wang
TL;DR
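This study analyzes the responses ChatGPT generates on several conversational question-answering corpora and compares them using BERT similarity scores to obtain natural language inference (NLI) labels. It also identifies cases where ChatGPT gave incorrect answers, offering insight into the areas where the model may be prone to error. Evaluation scores are then used to compare the overall performance of GPT-3 and GPT-4.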
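The scoring step mentioned in the TL;DR can be illustrated with a minimal sketch. This is not the authors' pipeline: `embed` is a toy bag-of-words stand-in for a BERT sentence embedding, and the two thresholds in `nli_label`, along with the way a score is mapped to a label, are assumptions made for illustration. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a BERT embedding (assumption): bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def nli_label(score, entail_at=0.8, contradict_below=0.4):
    """Map a similarity score to an NLI-style label.

    Both thresholds are assumed values, not taken from the paper.
    """
    if score >= entail_at:
        return "entailment"
    if score < contradict_below:
        return "contradiction"
    return "neutral"


def score_answer(generated, reference):
    """Return (similarity, label) for a generated answer and a reference."""
    score = cosine_similarity(embed(generated), embed(reference))
    return score, nli_label(score)
```

Calling `score_answer(generated_answer, reference_answer)` returns the similarity together with its label. In a real setup, `embed` would be replaced by an actual BERT encoder, and the thresholds would need to be tuned on labeled data.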
Abstract
Large language models have gained considerable interest for their impressive performance on various tasks. Among these models, ChatGPT developed by OpenAI has become extremely popular among early adopters who eve