Prompt Buddy

openai · OpenAI Platform Docs

Voice agents

Teaches how to architect and implement voice-based AI agents using either a direct speech-to-speech model via live audio sessions or a chained workflow involving speech-to-text, text reasoning, and text-to-speech.

Derived skill

Files assembled from official documentation

When To Use

Use when deciding between a low-latency speech-to-speech architecture and a controlled chained voice pipeline, and when implementing the corresponding WebRTC or SDK-based workflows.
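
To make the chained option concrete, here is a minimal sketch of its shape: speech-to-text, then text reasoning, then text-to-speech, each as a swappable stage. Every name in it (`ChainedVoicePipeline`, `run_turn`, and the `fake_*` stages) is hypothetical and not taken from the source documentation. The stages are local stubs that make no model or network calls, so this illustrates only the data flow, not an actual SDK workflow.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical stage signatures (assumptions for illustration only).
# Real stages would call a speech-to-text model, a text model, and a
# text-to-speech model respectively.
Transcribe = Callable[[bytes], str]
Reason = Callable[[str], str]
Synthesize = Callable[[str], bytes]


@dataclass
class ChainedVoicePipeline:
    """One conversational turn: audio in -> text -> reply text -> audio out."""

    transcribe: Transcribe
    reason: Reason
    synthesize: Synthesize

    def run_turn(self, audio_in: bytes) -> Tuple[str, str, bytes]:
        # Each intermediate value is plain text, which is what makes the
        # chained design easy to log, inspect, and filter between stages.
        transcript = self.transcribe(audio_in)
        reply_text = self.reason(transcript)
        audio_out = self.synthesize(reply_text)
        return transcript, reply_text, audio_out


# Stub stages: they treat the "audio" bytes as UTF-8 text so the sketch
# runs without any external service.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_reason(text: str) -> str:
    return f"You said: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = ChainedVoicePipeline(fake_transcribe, fake_reason, fake_synthesize)
transcript, reply, audio = pipeline.run_turn(b"hello")
```

A speech-to-speech agent would collapse these three stages into one live audio session with a single model. That trades the inspectable text checkpoints shown here for lower latency.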

Reference Files

File | Contains | Use For
SKILL.md | Entry point: scope, routing table, and workflow. | Start here.
docs/voice-agents-workflow-guide.md | A guide explaining architectural choices for building speech-to-speech voice agents and chained voice workflows. | Questions about architectural choices for building speech-to-speech voice agents and chained voice workflows.

What This Skill Covers

  • Voice agents turn the same agent concepts into spoken, low-latency interactions. The key design choice is deciding whether the model should work directly wit...
  • Main sections: Choose the right architecture, Recommended starting points, Build a speech-to-speech voice agent, Build a chained voice workflow, Voice agents still use the same core agent building blocks.

Workflow

  1. Open the most relevant file under docs/ for the exact documented workflow and wording.
  2. Open schemas/ files for exact structured contracts.
  3. Open examples/ files for concrete requests, commands, snippets, and manifests.
  4. Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://developers.openai.com/api/docs/guides/voice-agents.md