AbstractCompanies, organizations, and governments increasingly exploit
language models' (LM) remarkable capability to display agent-like behavior. As LMs are adopted to perform tasks with growing autonomy, there exists an urgent need for reliable and scalable
→