BriefGPT.xyz
Mar, 2023
谁在思考?使用 XAI 操作手册推动以人为中心评估 LLMs
Who's Thinking? A Push for Human-Centered Evaluation of LLMs using the XAI Playbook
HTML
PDF
Teresa Datta, John P. Dickerson
TL;DR
本文探讨了人类中心的大型语言模型评估,并提出了心理模型,用例使用价值和认知参与三个研究重点,旨在加速人类中心式大型语言模型评估的进展。
Abstract
Deployed
artificial intelligence
(AI) often impacts humans, and there is no one-size-fits-all metric to evaluate these tools.
human-centered evaluation
of AI-based systems combines quantitative and qualitative an
→