Universal prompt testing methodology using RED-GREEN-REFACTOR cycle with subagents, supporting A/B testing and regression testing for commands, hooks, skills, and production prompts