Prompt Buddy logoPrompt Buddy

openai · OpenAI Platform Docs

Rate limits | OpenAI API

Explains how OpenAI API rate limits work, including the different types of limits such as RPM, RPD, and TPM, and provides guidance on managing these limits through exponential backoff and error handling.

Import to Prompt Buddy

Derived skill

Files assembled from official documentation

Viewing SKILL.md

Rate limits | OpenAI API

Explains how OpenAI API rate limits work, including the different types of limits such as RPM, RPD, and TPM, and provides guidance on managing these limits through exponential backoff and error handling.

When To Use

Use when implementing error handling logic to manage 429 Too Many Requests errors or when designing request throttling and retry mechanisms for the OpenAI API.

Reference Files

FileContainsUse For
SKILL.mdEntry point: scope, routing table, and workflow.Start here.
docs/rate-limits-openai-api-workflow-guide.mdA guide explaining OpenAI API rate limit mechanics, usage tiers, and header information.Questions about a guide explaining OpenAI API rate limit mechanics, usage tiers, and header information.
examples/rate-limits-openai-api-openai-api-model-limits-curl-request.textA curl command used to retrieve fine-tuning model rate limits from the OpenAI API.Exact payloads, commands, or snippets shown in A curl command used to retrieve fine-tuning model rate limits from the OpenAI API.
examples/rate-limits-openai-api-openai-api-python-exponential-backoff-rate-limit-.textA Python code example using the tenacity library to implement exponential backoff for handling OpenAI API rate limits.Exact payloads, commands, or snippets shown in A Python code example using the tenacity library to implement exponential backoff for handling OpenAI API rate limits.
examples/rate-limits-openai-api-openai-api-python-rate-limit-backoff.textA Python code example demonstrating how to use the backoff library to handle OpenAI API RateLimitError exceptions.Exact payloads, commands, or snippets shown in A Python code example demonstrating how to use the backoff library to handle OpenAI API RateLimitError exceptions.
examples/rate-limits-openai-api-openai-api-python-exponential-backoff-retry-imple.textA Python code implementation demonstrating a custom retry decorator using exponential backoff and jitter to handle OpenAI API rate limits.Exact payloads, commands, or snippets shown in A Python code implementation demonstrating a custom retry decorator using exponential backoff and jitter to handle Op...

What This Skill Covers

  • Rate limits are restrictions that our API imposes on the number of times a user or client can access our services within a specified period of time.
  • Main sections: Why do we have rate limits?, How do these rate limits work?, Usage tiers, Rate limits in headers, Fine-tuning rate limits.

Workflow

  1. Open the most relevant file under docs/ for the exact documented workflow and wording.
  2. Open schemas/ files for exact structured contracts.
  3. Open examples/ files for concrete requests, commands, snippets, and manifests.
  4. Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://developers.openai.com/api/docs/guides/rate-limits