Warden
beta · v2.14.0
beta · v2.14.0
Documentation

How Warden Compares

vs. Guardrail Tools

FeatureWardenGuardrails AINeMo GuardrailsPrompt rules (CLAUDE.md)
EnforcementDeterministic (compiled patterns)Model-based validationLLM-based railsAdvisory (model may ignore)
LatencySub-second (local binary)100ms+ (API call)200ms+ (LLM call)0ms (static text)
ScopeTool calls + outputAPI input/outputConversation flowConversation only
Local100% localCloud API requiredCloud or local LLMLocal
Multi-agentClaude Code + Gemini CLI + Codex CLIOpenAI onlyOpenAI/NVIDIAPer-assistant
ConfigZero config (compiled rules)Schema definitionsColang DSLMarkdown
Session awarenessPhase tracking, drift, focus, loopsNoneTopic controlNone
Output efficiencyAggressive compressionNoneNoneNone
Cross-session learningResume packets, repair patternsNoneNoneNone

Key difference: Warden’s safety enforcement is deterministic — compiled pattern matching, not an LLM call. When Warden blocks rm -rf /, it’s a regex match, not a model decision. This means no latency variance, no non-determinism, and no prompt injection that can bypass the check.

vs. Multi-Agent Harnesses

Anthropic’s harness design article describes a pattern gaining traction: planner/generator/evaluator agents coordinated in a loop, with context resets between sprints. This is external orchestration — multiple agents coordinated from the outside.

Warden takes the complementary approach: runtime governance from the inside. It doesn’t orchestrate multi-agent workflows. It makes each individual agent session safer, more focused, and more context-efficient.

Multi-Agent HarnessWarden
ArchitectureExternal orchestration (planner → generator → evaluator)Runtime injection (hooks inside each agent’s loop)
Problem solvedTask decomposition, quality evaluation, context management across sprintsSafety enforcement, session degradation, context waste, tool quality
ScopeThe overall workflowEach individual session
RelationshipCoordinates agentsGoverns agents

These aren’t competing approaches. A harness coordinates the work; Warden makes each agent’s individual session safer and more efficient. A multi-agent harness would benefit from Warden running on each of its sub-agents.

vs. Prompt Rules

Prompt rules (CLAUDE.md, system prompts, .gemini instructions) are valuable but fundamentally advisory. The model may ignore them, reinterpret them, or forget them after context compaction.

Warden’s deterministic rules operate outside the model’s context window. The agent cannot bypass them. Prompt rules and Warden are complementary — use prompt rules for coding style and preferences, use Warden for safety and runtime governance.

Key Principles

Deterministic where it matters. Safety enforcement is compiled pattern matching — same input, same output, every time. No model in the loop for safety decisions.

Local-first, always. Every decision happens on your machine. No cloud API, no telemetry, no network calls during operation.

Bounded intelligence. Session guidance is heuristic and clearly labeled as such. Warden is precise about what it guarantees vs. what it approximates.