Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts