openai · OpenAI Platform Docs

Trace grading

A guide on how to assign structured scores to agent traces and use the evaluation dashboard to run systematic performance assessments against specific criteria.

Import to Prompt Buddy

Derived skill

Files assembled from official documentation

Viewing SKILL.md

Trace grading

A guide on how to assign structured scores to agent traces and use the evaluation dashboard to run systematic performance assessments against specific criteria.

When To Use

Use when you need to systematically evaluate agent correctness, identify regressions, or benchmark performance improvements by grading end-to-end decision logs.

Reference Files

File	Contains	Use For
`SKILL.md`	Entry point: scope, routing table, and workflow.	Start here.
`docs/trace-grading-workflow-guide.md`	A guide explaining how to assign structured scores and labels to agent traces to evaluate correctness and quality.	Questions about a guide explaining how to assign structured scores and labels to agent traces to evaluate correctness and quality.

What This Skill Covers

Trace grading is the process of assigning structured scores or labels to an agent's trace—the end-to-end log of decisions, tool calls, and reasoning steps—to...
Main sections: Get started with traces, Evaluate traces with runs.

Workflow

Open the most relevant file under docs/ for the exact documented workflow and wording.
Open schemas/ files for exact structured contracts.
Open examples/ files for concrete requests, commands, snippets, and manifests.
Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://developers.openai.com/api/docs/guides/trace-grading.md