Guardrail types
PII filtering
Redacts personally identifiable information from queries and responses before they reach the language model.
Write confirmation
Requires explicit confirmation before executing tools that modify data. Tools are classified as write operations based on their is_write flag.
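As a rough illustration of the write-confirmation rule, the sketch below gates tool execution on an is_write flag. Only the is_write flag comes from the description above; the Tool class, the ConfirmationRequired exception, and the execute_tool function are hypothetical names chosen for this example, not the product's actual API.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Tool:
    # Hypothetical tool descriptor; only the is_write flag is taken from the docs.
    name: str
    run: Callable[[], str]
    is_write: bool = False


class ConfirmationRequired(Exception):
    """Raised when a write tool is invoked without explicit confirmation."""


def execute_tool(tool: Tool, confirmed: bool = False) -> str:
    # Read-only tools run immediately; write tools need confirmed=True.
    if tool.is_write and not confirmed:
        raise ConfirmationRequired(
            f"Tool '{tool.name}' modifies data; confirm to proceed."
        )
    return tool.run()
```

In this sketch a caller would catch ConfirmationRequired, ask the user to confirm, and then retry with confirmed=True.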
Cost caps
Sets a monthly cost limit per workspace. Queries are rejected once the cap is reached.
Rate limits
Limits query frequency to prevent abuse.
Blocked topics
Prevents the AI from responding to specific topics.
Violation tracking
Compliance violations are recorded in conversation stats. Each violation includes:
- Rule: which guardrail was triggered
- Description: what happened
- Conversation: which conversation triggered it
- User: who triggered it
- Timestamp: when it occurred
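The fields listed above could be modelled as a simple record. The following is a minimal sketch, assuming string identifiers for conversations and users and UTC timestamps; the Violation class, its field names, and the count_by_rule helper are illustrative and not taken from the product.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Violation:
    rule: str             # which guardrail was triggered
    description: str      # what happened
    conversation_id: str  # which conversation triggered it (assumed string ID)
    user_id: str          # who triggered it (assumed string ID)
    # When it occurred; defaults to the current time in UTC.
    timestamp: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


def count_by_rule(violations):
    # Tally violations per guardrail, e.g. for a stats summary.
    counts = {}
    for v in violations:
        counts[v.rule] = counts.get(v.rule, 0) + 1
    return counts
```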
Organisation baseline
Org admins set the compliance baseline. If a connection is flagged as a security risk, the admin blocks it once and it is disabled across every workspace immediately.
Workspaces can:
- Add stricter rate limits
- Add additional blocked topics
- Enable PII filtering if the org baseline has it disabled
Workspaces cannot:
- Disable PII filtering if the org baseline requires it
- Raise cost caps above the org limit
- Remove blocked topics set at the org level
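One way to read the baseline is that workspace settings may tighten the org policy but never loosen it. The sketch below works under that assumption: it combines an org baseline with a workspace's requested settings so that the stricter value always wins. The Policy class, its field names, and effective_policy are hypothetical and do not reflect the product's real configuration schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    # Hypothetical policy shape for illustration only.
    pii_filtering: bool
    monthly_cost_cap: float
    rate_limit_per_minute: int
    blocked_topics: frozenset = frozenset()


def effective_policy(org: Policy, workspace: Policy) -> Policy:
    # Assumption: the stricter setting always wins, so a workspace
    # can tighten the org baseline but not relax it.
    return Policy(
        # PII filtering stays on if either level enables it.
        pii_filtering=org.pii_filtering or workspace.pii_filtering,
        # The cost cap can never exceed the org limit.
        monthly_cost_cap=min(org.monthly_cost_cap, workspace.monthly_cost_cap),
        # The lower (stricter) rate limit applies.
        rate_limit_per_minute=min(
            org.rate_limit_per_minute, workspace.rate_limit_per_minute
        ),
        # Org-level blocked topics are always kept; workspaces only add more.
        blocked_topics=org.blocked_topics | workspace.blocked_topics,
    )
```

Taking the minimum of the caps and the union of the topic sets means a workspace request that exceeds the baseline is clamped rather than rejected; a real system might instead reject such a request with an error.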