Evaluation and Observability

Trace Analysis

A trace is the full recorded sequence of an agent's decisions, tool calls, inputs, outputs, and intermediate results across a single task execution, giving you a complete audit trail of every step the agent took. Without traces, agent failures are opaque: you see a wrong answer but cannot determine whether reasoning went wrong, a tool returned bad data, or the agent misread its context; traces make non-deterministic failures understandable and fixable. In production, traces also surface performance problems such as which tool calls are slow, which reasoning steps waste tokens, and where the agent loops unnecessarily.