FAQ

Does agentevals re-run my agent?

No. agentevals is built to score behavior from OpenTelemetry traces without re-running the agent.

agentevals works from OpenTelemetry trace data emitted by your agent system. See OTel Compatibility for more details.

Yes. agentevals now includes an initial option to delegate evals to OpenAI’s Evals API. See OpenAI Evals API backend.

Yes. The project now includes container deployment support and a Helm chart for Kubernetes. See Kubernetes & Helm.

No. There is also support for streaming-oriented workflows. See Streaming.