Agentic Eval

Name: Agentic Eval
Author: agentic-eval

About this skill

Patterns and techniques for evaluating and improving AI agent outputs. Use this skill when:

- Implementing self-critique and reflection loops
- Building evaluator-optimizer pipelines for quality-critical generation
- Creating test-driven code refinement workflows
- Designing rubric-based or LLM-as-judge evaluation systems
- Adding iterative improvement to agent outputs (code, reports, analysis)
- Measuring and improving agent response quality
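As an illustration of the evaluator-optimizer pattern named above, here is a minimal sketch in Python. It is not taken from the skill itself: the names `Evaluation`, `refine`, `toy_generate`, and `toy_evaluate`, the keyword rubric, and the default threshold are all assumptions made for the example. In practice the generator and evaluator would typically be model calls or test runs; here they are deterministic stand-ins so the loop can run on its own.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Evaluation:
    score: float   # fraction of rubric items satisfied, 0.0 to 1.0
    feedback: str  # comma-separated rubric items still missing


# Hypothetical signatures: generate(task, previous_draft, feedback) -> draft
Generator = Callable[[str, Optional[str], Optional[str]], str]
Evaluator = Callable[[str, str], Evaluation]


def refine(task: str, generate: Generator, evaluate: Evaluator,
           threshold: float = 0.9,
           max_rounds: int = 5) -> Tuple[str, List[Evaluation]]:
    """Generate, evaluate, and regenerate until the score meets the
    threshold or max_rounds evaluations have been made."""
    output = generate(task, None, None)
    evaluation = evaluate(task, output)
    history = [evaluation]
    while evaluation.score < threshold and len(history) < max_rounds:
        output = generate(task, output, evaluation.feedback)
        evaluation = evaluate(task, output)
        history.append(evaluation)
    return output, history


# Toy rubric (an assumption): a draft must mention each of these words.
REQUIRED = ("input", "output", "error")


def toy_evaluate(task: str, output: str) -> Evaluation:
    missing = [word for word in REQUIRED if word not in output.lower()]
    return Evaluation(1 - len(missing) / len(REQUIRED), ", ".join(missing))


def toy_generate(task: str, previous: Optional[str],
                 feedback: Optional[str]) -> str:
    # Stand-in for a model call: addresses one missing item per round.
    if previous is None:
        return "Notes on input."
    first_missing = feedback.split(", ")[0]
    return f"{previous} Notes on {first_missing}."


if __name__ == "__main__":
    final, history = refine("Document the function", toy_generate, toy_evaluate)
    print(final)
    print([round(e.score, 2) for e in history])
```

Capping the number of rounds keeps the loop from running indefinitely when the evaluator never reaches the threshold, and returning the full evaluation history makes it possible to measure how quality changes from round to round.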

License: Other
Version: 1.0.0
