Chapter 17 · companion worksheet

Model routing rule template

There are two kinds of AI model — fast/cheap and reasoning/thinking — and the most expensive mistake is using the wrong one for the job, in either direction. Use this one-page guide to map your tasks to the right tier before anyone routes by instinct.

The dependency test — use it for every task

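Imagine handing the task to a sharp employee. Would they answer off the top of their head, or would they say "give me a few minutes" and go work it out? An immediate answer points to the fast/cheap tier; "give me a few minutes" points to the reasoning/thinking tier.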

Routing reference: task type → model tier

| Task type | Model tier | Why |
| --- | --- | --- |
| Classify an email or ticket (complaint / question / other) | Fast / cheap | Single-step lookup; no dependency chain; a wrong answer is cheap to catch |
| Extract structured fields from a document | Fast / cheap | Pattern recognition; no multi-step reasoning required |
| Draft a routine reply, summary, or formatted output | Fast / cheap | Generation from given inputs; quality bar is adequate, not premium |
| Route or triage incoming work to the right queue | Fast / cheap | Classification; the whole point is speed and cost efficiency at high volume |
| Complex contract or regulatory document review | Reasoning / thinking | Must reason about how multiple clauses or rules interact; step-by-step dependency |
| Multi-step planning where step 3 depends on step 2's conclusion | Reasoning / thinking | Errors in early steps compound; deliberation earns its cost |
| Vendor or options analysis with multiple weighted factors | Reasoning / thinking | Requires holding trade-offs in mind simultaneously; fast model skips contradictions |
| Regulatory interpretation: how several rules combine for our situation | Reasoning / thinking | High stakes; answer depends on reasoning across connected constraints |
| Deep research: synthesize sources into a considered answer | Reasoning / thinking | Multi-source reasoning; cost trivial against value of a correct answer |
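If the routing reference is going to live in code rather than on paper, one way to capture it is a plain lookup table with the dependency test as the fallback for anything not yet listed. This is a minimal sketch, not a prescribed implementation: the task-type keys, the `Tier` enum, and the `route` function are hypothetical names invented for illustration, and mapping "give me a few minutes" to the reasoning tier is this sketch's reading of the dependency test.

```python
from enum import Enum
from typing import Optional


class Tier(Enum):
    FAST = "fast / cheap"
    REASONING = "reasoning / thinking"


# Hypothetical task-type labels, one per row of the routing reference.
ROUTING_REFERENCE = {
    "classify_message": Tier.FAST,
    "extract_fields": Tier.FAST,
    "draft_routine_output": Tier.FAST,
    "triage_to_queue": Tier.FAST,
    "contract_review": Tier.REASONING,
    "multi_step_planning": Tier.REASONING,
    "weighted_options_analysis": Tier.REASONING,
    "regulatory_interpretation": Tier.REASONING,
    "deep_research": Tier.REASONING,
}


def route(task_type: str, needs_working_out: Optional[bool] = None) -> Tier:
    """Return the tier for a task type.

    Known task types use the reference table. Unknown ones fall back to the
    dependency test: pass needs_working_out=True if a sharp employee would
    ask for a few minutes, False if they would answer immediately.
    """
    if task_type in ROUTING_REFERENCE:
        return ROUTING_REFERENCE[task_type]
    if needs_working_out is None:
        # Refuse to guess: routing by instinct is what the worksheet warns against.
        raise ValueError(f"No rule for {task_type!r}; run the dependency test first")
    return Tier.REASONING if needs_working_out else Tier.FAST
```

Raising an error for an unlisted task, instead of silently defaulting to one tier, forces someone to run the dependency test and record the result before that task is routed.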

Your task routing map (fill in for your workflows)

| Our task (be specific) | Model tier assigned | Why (dependency test result) | Estimated daily volume |
| --- | --- | --- | --- |
|  |  |  |  |

Routing health check

The rule of thumb

In practice, the genuinely hard tasks, the ones that earn reasoning-model prices, are roughly 10 to 20 percent of total volume. The other 80 to 90 percent are fast-model work. If your routing sends more than 20 percent of volume to the premium tier, re-run the dependency test on each premium-routed task before committing the budget.
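The 20 percent check is simple arithmetic over the routing map, and a short script can run it once the daily volumes are filled in. The sketch below assumes each row is a `(task, tier, estimated daily volume)` tuple with the tier written as `"fast"` or `"reasoning"`. The function names and the example rows are hypothetical, and the volumes are invented purely to show the calculation.

```python
def premium_share(routing_map):
    """Fraction of total daily volume routed to the reasoning tier."""
    total = sum(volume for _, _, volume in routing_map)
    if total == 0:
        return 0.0
    premium = sum(volume for _, tier, volume in routing_map if tier == "reasoning")
    return premium / total


def needs_review(routing_map, threshold=0.20):
    """True when more than `threshold` of volume goes to the premium tier."""
    return premium_share(routing_map) > threshold


# Illustrative rows only: (task, tier, estimated daily volume). Numbers are made up.
example_map = [
    ("Classify inbound tickets", "fast", 600),
    ("Extract invoice fields", "fast", 250),
    ("Review vendor contracts", "reasoning", 150),
]

share = premium_share(example_map)    # 150 / 1000 = 0.15
flagged = needs_review(example_map)   # 0.15 is not above 0.20, so False
```

Measuring the share by volume rather than by number of task types matches the rule of thumb, which is stated as a percentage of total volume.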

Owner of this routing rule: ______

Last reviewed: ______

Next review: ______
