NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. Guardrails (or rails for short) are a specific way of controlling the output of an LLM, such as not talking about topics considered harmful, following a predefined dialogue path, using a particular language style, and more. There are several mechanisms that allow LLM providers and developers to add guardrails that are embedded into a specific model at training, e.g. using model alignment. Differently, using a runtime inspired from dialogue management, NeMo Guardrails allows developers to add programmable rails to LLM applications - these are user-defined, independent of the underlying LLM, and interpretable. Our initial results show that the proposed approach can be used with several LLM providers to develop controllable and safe LLM applications using programmable rails.

NeMo Guardrails是一个开源工具包，用于向基于LLM的对话系统轻松添加可编程的防护措施。该工具包允许开发者在LLM应用中添加可编程的防护措施，使得这些措施与底层的LLM无关且可解释，通过使用多个LLM提供者的初步结果表明，该方法能够用于开发可控且安全的LLM应用。

NeMo Guardrails: 可控和安全的LLM应用程序的工具包，带有可编程Rail