TL;DR本研究使用adversarial prompts对Large Language Models进行度量,并分析了prompt鲁棒性及其传递性,为prompt组合提供了实用性建议。
Abstract
The increasing reliance on large language models (LLMs) across academia and industry necessitates a comprehensive understanding of their robustness to prompts. In response to this vital need, we introduce promptbench