006·342
guardrails
/ˈɡɑːrdreɪlz/ - n.
1 [colloq.] Rules that enforce a policy, written for the policy the org has not yet enforced anywhere else.Keep. Punchy.This is the problem.
Working definition
2. Programmatic constraints that filter or block model inputs and outputs against defined safety and policy rules.