Evaluate external models | OpenAI API
Provides a methodology for comparing and evaluating the performance of external large language models against OpenAI models using standardized evaluation frameworks.
When To Use
Use when you need to implement a benchmarking process to compare the quality, accuracy, or cost-effectiveness of non-OpenAI models against OpenAI's models.
Reference Files
| File | Contains | Use For |
|---|---|---|
| SKILL.md | Entry point: scope, routing table, and workflow. | Start here. |
| docs/evaluate-external-models-openai-api-workflow-guide.md | A guide explaining how to use the OpenAI Platform to evaluate third-party models, including information on billing, usage limits, and custom endpoints. | Questions about a guide explaining how to use the OpenAI Platform to evaluate third-party models, including information on billing, u... |
What This Skill Covers
- Model selection is an important lever that enables builders to improve their AI applications. When using Evaluations on the OpenAI Platform, in addition to e...
- Main sections: Third-party models, Billing and usage limits, Available third-party models, Custom endpoints, Run evals with external models.
Workflow
- Open the most relevant file under `docs/` for the exact documented workflow and wording.
- Open `schemas/` files for exact structured contracts.
- Open `examples/` files for concrete requests, commands, snippets, and manifests.
- Do not add behavior or configuration that is not present in the attached source files.
Canonical source: https://developers.openai.com/api/docs/guides/external-models
