Prompt Buddy logoPrompt Buddy

openai · OpenAI Platform Docs

Evaluate external models | OpenAI API

Provides a methodology for comparing and evaluating the performance of external large language models against OpenAI models using standardized evaluation frameworks.

Import to Prompt Buddy

Derived skill

Files assembled from official documentation

Viewing SKILL.md

Evaluate external models | OpenAI API

Provides a methodology for comparing and evaluating the performance of external large language models against OpenAI models using standardized evaluation frameworks.

When To Use

Use when you need to implement a benchmarking process to compare the quality, accuracy, or cost-effectiveness of non-OpenAI models against OpenAI's models.

Reference Files

FileContainsUse For
SKILL.mdEntry point: scope, routing table, and workflow.Start here.
docs/evaluate-external-models-openai-api-workflow-guide.mdA guide explaining how to use the OpenAI Platform to evaluate third-party models, including information on billing, usage limits, and custom endpoints.Questions about a guide explaining how to use the OpenAI Platform to evaluate third-party models, including information on billing, u...

What This Skill Covers

  • Model selection is an important lever that enables builders to improve their AI applications. When using Evaluations on the OpenAI Platform, in addition to e...
  • Main sections: Third-party models, Billing and usage limits, Available third-party models, Custom endpoints, Run evals with external models.

Workflow

  1. Open the most relevant file under docs/ for the exact documented workflow and wording.
  2. Open schemas/ files for exact structured contracts.
  3. Open examples/ files for concrete requests, commands, snippets, and manifests.
  4. Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://developers.openai.com/api/docs/guides/external-models