Preventing harmful outputs
One of the key concerns with using LLMs is that they may generate harmful or unethical text. This is an area of active research in the field. Here we present some built-in chains inspired by this research, which are intended to make the outputs of LLMs safer.
- Moderation chain: Explicitly check if any output text is harmful and flag it.
- Constitutional chain: Prompt the model with a set of principles which should guide it's behavior.