Context Management & Reliability (D5)
Context windows, prompt caching, reliability, retries, error handling.
Context Management & Reliability (D5) is where agents earn trust: keeping critical information alive across long interactions, propagating errors sanely, and knowing when to escalate to a human. It's the domain people fail most — and the one that separates a demo from something you'd put in production.
Expect questions on managing conversation context over long sessions, designing escalation and ambiguity-resolution patterns, propagating errors across multi-agent systems, handling context during large-codebase exploration, building human-review workflows with confidence calibration, and preserving provenance and uncertainty when synthesizing multiple sources.
This domain spans 6 subtopic areas, covered by 95 practice questions: 22 easy, 49 medium, and 24 hard.
Sample question: During a billing dispute, the agent calls process_refund after summarizing a long conversation. Which data type is most at risk of being lost during progressive summarization?
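To make the billing-dispute scenario concrete, here is a minimal sketch of one common mitigation: keep exact values in a small structured record that is never passed through the summarizer, and build the tool call from that record. This is an illustration under assumptions, not a reference design. Only process_refund comes from the question; the CaseFacts schema, the argument names, and the lossy_summarize stand-in are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass(frozen=True)
class CaseFacts:
    """Exact values that must reach the tool call verbatim (hypothetical schema)."""
    customer_id: str
    order_id: str
    amount_cents: int
    currency: str


@dataclass
class ConversationContext:
    facts: CaseFacts                      # pinned: never passed through the summarizer
    summary: str = ""                     # lossy: rewritten on every compaction
    recent_turns: List[str] = field(default_factory=list)

    def compact(self, summarize: Callable[[str, List[str]], str], keep_last: int = 2) -> None:
        """Fold all but the last `keep_last` turns into the running summary."""
        cut = max(len(self.recent_turns) - keep_last, 0)
        older, self.recent_turns = self.recent_turns[:cut], self.recent_turns[cut:]
        if older:
            self.summary = summarize(self.summary, older)

    def refund_arguments(self) -> dict:
        """Build process_refund arguments from the pinned facts, not the summary."""
        return {
            "customer_id": self.facts.customer_id,
            "order_id": self.facts.order_id,
            "amount_cents": self.facts.amount_cents,
            "currency": self.facts.currency,
        }


def lossy_summarize(previous: str, turns: List[str]) -> str:
    """Stand-in for a model-generated summary: keeps the gist, drops specifics."""
    return f"Customer disputes a charge and wants a refund ({len(turns)} turns condensed)."


ctx = ConversationContext(
    facts=CaseFacts("C-1001", "ORD-7731", 4999, "USD"),
    recent_turns=[
        "I was charged $49.99 twice for order ORD-7731.",
        "The second charge posted on March 3.",
        "Please refund the duplicate.",
        "Yes, that is the right card.",
    ],
)
ctx.compact(lossy_summarize)
args = ctx.refund_arguments()  # exact amount and order ID come from the pinned facts
```

In this sketch the summary no longer mentions the amount or the order ID after compaction, yet the refund arguments are still exact because they never depended on the summary. Storing the amount as integer cents is a design choice made here to avoid rounding drift; a real system would follow whatever its refund tool actually expects.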
What's covered
Subtopic areas in Context Management & Reliability, drawn from the exam blueprint: