2 results for tag "skill-comply"
Automated compliance measurement tool that generates test scenarios at three prompt-strictness levels (supportive, neutral, competing), runs `claude -p` to capture tool-call traces, and reports whether agents actually follow a given skill, rule, or agent definition. Use it to verify adherence after adding new skills or rules.