openai · OpenAI Platform Docs
Trace grading
A guide on how to assign structured scores to agent traces and use the evaluation dashboard to run systematic performance assessments against specific criteria.
When To Use
Use when you need to systematically evaluate agent correctness, identify regressions, or benchmark performance improvements by grading end-to-end decision logs.
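As an illustration of the regression use case, the sketch below compares per-trace scores from two graded runs and reports the cases whose score dropped. It is a local, hypothetical example: the `find_regressions` helper, the case ids, and the score values are assumptions made for illustration and are not part of the documented trace grading workflow or any OpenAI API.

```python
def find_regressions(baseline, candidate, tolerance=0.0):
    """Return the case ids whose score dropped by more than `tolerance`.

    Both arguments map a (hypothetical) case id to a numeric grade.
    Cases missing from `candidate` are skipped rather than counted.
    """
    regressions = {}
    for case_id, old_score in baseline.items():
        new_score = candidate.get(case_id)
        if new_score is not None and old_score - new_score > tolerance:
            regressions[case_id] = (old_score, new_score)
    return regressions


# Made-up scores for two graded runs over the same three traces.
baseline = {"case-1": 1.0, "case-2": 0.5, "case-3": 1.0}
candidate = {"case-1": 1.0, "case-2": 0.75, "case-3": 0.5}

regressions = find_regressions(baseline, candidate)
print(regressions)
```

Keying both runs by the same case ids keeps the comparison per trace, so an improvement on one case cannot hide a drop on another the way an averaged score can.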
Reference Files
| File | Contains | Use For |
|---|---|---|
| `SKILL.md` | Entry point: scope, routing table, and workflow. | Start here. |
| `docs/trace-grading-workflow-guide.md` | A guide explaining how to assign structured scores and labels to agent traces to evaluate correctness and quality. | Questions about assigning structured scores and labels to agent traces to evaluate correctness and quality. |
What This Skill Covers
- Trace grading is the process of assigning structured scores or labels to an agent's trace—the end-to-end log of decisions, tool calls, and reasoning steps—to...
- Main sections: Get started with traces, Evaluate traces with runs.
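To make the idea of a structured score concrete, here is a minimal sketch that models a trace as a list of steps and grades it against named criteria. Everything in it is hypothetical: the `TraceStep` and `Grade` types, the `grade_trace` function, the step kinds, and the criteria are assumptions chosen for illustration, not the schema or grader interface used by the platform.

```python
from dataclasses import dataclass, field


@dataclass
class TraceStep:
    """One entry in a trace: a decision, tool call, or reasoning step."""
    kind: str    # hypothetical labels: "decision", "tool_call", "reasoning"
    detail: str


@dataclass
class Grade:
    """A structured score plus a label for one trace."""
    score: float                 # fraction of criteria met, in [0.0, 1.0]
    label: str                   # "pass" or "fail"
    failed: list = field(default_factory=list)  # names of unmet criteria


def grade_trace(steps, criteria, pass_threshold=1.0):
    """Check a trace against named criteria and return a Grade.

    `criteria` maps a criterion name to a predicate over the list of steps.
    """
    failed = [name for name, check in criteria.items() if not check(steps)]
    score = 1.0 - len(failed) / len(criteria) if criteria else 0.0
    label = "pass" if score >= pass_threshold else "fail"
    return Grade(score=score, label=label, failed=failed)


# A made-up three-step trace and two made-up criteria.
trace = [
    TraceStep("reasoning", "User wants an order status"),
    TraceStep("tool_call", "lookup_order"),
    TraceStep("decision", "Reply with the shipping date"),
]
criteria = {
    "calls_lookup_order": lambda steps: any(
        s.kind == "tool_call" and s.detail == "lookup_order" for s in steps
    ),
    "at_most_five_steps": lambda steps: len(steps) <= 5,
}

grade = grade_trace(trace, criteria)
print(grade)
```

Returning the names of the failed criteria alongside the numeric score keeps each grade explainable when many traces are reviewed together.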
Workflow
- Open the most relevant file under `docs/` for the exact documented workflow and wording.
- Open `schemas/` files for exact structured contracts.
- Open `examples/` files for concrete requests, commands, snippets, and manifests.
- Do not add behavior or configuration that is not present in the attached source files.
Canonical source: https://developers.openai.com/api/docs/guides/trace-grading.md
