Claude Certification
Agentic Architecture & Orchestration
Lesson 3 · 7 min

Evaluator–Optimizer Loop

Use Claude to grade Claude's own output and iterate.

An evaluator-optimizer loop runs the generator, then a separate Claude call grades the output against a rubric, then a third call revises. This works well for outputs that are easy to judge but hard to produce in one shot — copy editing, code review, design critique.

Keep the rubric explicit and the budget bounded (max 3 iterations).

Production scenario

Real-world example: AI-generated landing-page headlines

A marketing tool generates 20 headline candidates from a product brief, then runs an evaluator call that scores each against a rubric — relevance, CTA strength, length, brand tone (1–5 each). Top three go through an optimizer pass that rewrites them tighter.

brief → generate(20 headlines) → evaluate(rubric) → top 3
top 3 → optimize(critique + rewrite) → final 3

The loop is capped at three iterations. After three passes the marginal lift on the rubric flattens and the cost stops being worth it.

Why this matters: judging is easier than producing for short copy. An evaluator-optimizer loop trades extra calls for higher quality on the final artifact — but only with an explicit rubric and a hard iteration cap.

Knowledge points in this lesson
  • Evaluator grades generator output against a rubric
  • Optimizer revises based on evaluator critique
  • Works when judging is easier than producing
  • Bound iterations to about three
  • Explicit rubric keeps evaluator consistent
Quick check
Agentic ArchitectureSelect all that apply
Which of the following are true about evaluator-optimizer loops? (Select all that apply)