Build agents that survive tool failures, runaway loops, and cascading errors — with retry, circuit breakers, and human escalation.