Prompt Buddy logoPrompt Buddy

openai · OpenAI Platform Docs

Speech to text | OpenAI API

Explains how to implement speech-to-text transcription using the OpenAI API, including audio file requirements and parameter usage.

Import to Prompt Buddy

Derived skill

Files assembled from official documentation

Viewing SKILL.md

Speech to text | OpenAI API

Explains how to implement speech-to-text transcription using the OpenAI API, including audio file requirements and parameter usage.

When To Use

Use when you need to implement audio transcription workflows or convert spoken language into text using OpenAI models.

Reference Files

FileContainsUse For
SKILL.mdEntry point: scope, routing table, and workflow.Start here.
docs/speech-to-text-openai-api-workflow-guide.mdA guide detailing OpenAI's speech-to-text endpoints including transcriptions, translations, and speaker diarization using the Whisper model.Questions about a guide detailing OpenAI's speech-to-text endpoints including transcriptions, translations, and speaker diarization u...
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs.textA Node.js code example demonstrating how to use the OpenAI client to create an audio transcription using the transcriptions create method.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create an audio transcription using the transcri...
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription.textA Python script using the OpenAI client to transcribe an MP3 audio file using the transcription create method.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file using the transcription create method.
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl.textA curl command demonstrating how to use the OpenAI audio transcriptions endpoint to transcribe an audio file using a specific model.Exact payloads, commands, or snippets shown in A curl command demonstrating how to use the OpenAI audio transcriptions endpoint to transcribe an audio file using a...
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ.textA curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint.
examples/speech-to-text-openai-api-openai-speech-to-text-api-guide-introduction.textAn introductory text overview explaining the concept and scaling potential of speech-to-text technology within the OpenAI API.Exact payloads, commands, or snippets shown in An introductory text overview explaining the concept and scaling potential of speech-to-text technology within the Op...
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs-2.textA Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file.
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-2.textA Python script using the OpenAI client to transcribe an MP3 audio file into text.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file into text.
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-2.textA curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint.
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-dia.textA Node.js code example demonstrating how to use the OpenAI API to transcribe audio with diarization and automatic chunking.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI API to transcribe audio with diarization and automatic chu...
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-3.textA Python script using the OpenAI client to transcribe a local WAV audio file into text.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe a local WAV audio file into text.
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-3.textA curl command demonstrating a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data.Exact payloads, commands, or snippets shown in A curl command demonstrating a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data.
examples/speech-to-text-openai-api-openai-speech-to-text-audio-translation-nodejs.textA Node.js code example demonstrating how to use the OpenAI Whisper model to translate an audio file from German to English text.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to translate an audio file from German to En...
examples/speech-to-text-openai-api-openai-whisper-audio-translation-python.textA Python script using the OpenAI client to translate German audio files into English text using the whisper-1 model.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to translate German audio files into English text using the whisper-1 model.
examples/speech-to-text-openai-api-openai-speech-to-text-audio-translation-curl.textA curl command demonstrating how to use the OpenAI audio translation endpoint with the whisper-1 model.Exact payloads, commands, or snippets shown in A curl command demonstrating how to use the OpenAI audio translation endpoint with the whisper-1 model.
examples/speech-to-text-openai-api-openai-speech-to-text-transcription.textA plain text sample containing the spoken content used to demonstrate the OpenAI speech-to-text transcription process.Exact payloads, commands, or snippets shown in A plain text sample containing the spoken content used to demonstrate the OpenAI speech-to-text transcription process.
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-whi.textA Node.js code example demonstrating how to use the OpenAI Whisper model to create a transcription with word-level timestamps.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to create a transcription with word-level ti...
examples/speech-to-text-openai-api-openai-speech-to-text-python-whisper-transcrip.textA Python script using the OpenAI client to transcribe an MP3 audio file into words with timestamp granularity using the whisper-1 model.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file into words with timestamp granularity using t...
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-curl-transcripti.textA curl command demonstrating how to send an audio file to the OpenAI Whisper API with word-level timestamp granularity.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send an audio file to the OpenAI Whisper API with word-level timestamp granularity.
examples/speech-to-text-openai-api-openai-speech-to-text-pydub-audio-segmentation.textA Python script using pydub to slice an mp3 file into a ten-minute segment for speech-to-text processing.Exact payloads, commands, or snippets shown in A Python script using pydub to slice an mp3 file into a ten-minute segment for speech-to-text processing.
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs-3.textA Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file using the audio transcriptions endpoint.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file using...
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-4.textA Python script demonstrating how to use the OpenAI client to transcribe an MP3 audio file into text using the audio transcriptions endpoint.Exact payloads, commands, or snippets shown in A Python script demonstrating how to use the OpenAI client to transcribe an MP3 audio file into text using the audio...
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-4.textA curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using multipart/f...
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-str.textA Node.js code example demonstrating how to create a streaming transcription request using the OpenAI audio API.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to create a streaming transcription request using the OpenAI audio API.
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-str.textA Python script using the OpenAI client to stream transcription text from an audio file using the transcription create method.Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to stream transcription text from an audio file using the transcription creat...
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-5.textA curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using the whisper-1 model.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using the whisper...
examples/speech-to-text-openai-api-openai-speech-to-text-realtime-websocket-url.textThe WebSocket URL used for initiating real-time speech-to-text transcription via the OpenAI API.Exact payloads, commands, or snippets shown in The WebSocket URL used for initiating real-time speech-to-text transcription via the OpenAI API.
examples/speech-to-text-openai-api-openai-speech-to-text-transcription-session-up.textA JSON object representing a transcription session update configuration including audio format, model parameters, and turn detection settings.Exact payloads, commands, or snippets shown in A JSON object representing a transcription session update configuration including audio format, model parameters, and...
examples/speech-to-text-openai-api-openai-speech-to-text-inputaudiobuffer-append-.textA JSON representation of the inputaudiobuffer.append event containing Base64 encoded audio data for speech-to-text processing.Exact payloads, commands, or snippets shown in A JSON representation of the inputaudiobuffer.append event containing Base64 encoded audio data for speech-to-text pr...
examples/speech-to-text-openai-api-openai-speech-to-text-transcription-session-ob.textA text representation of the realtime transcription session object structure including model and audio format properties.Exact payloads, commands, or snippets shown in A text representation of the realtime transcription session object structure including model and audio format propert...
examples/speech-to-text-openai-api-openai-speech-to-text-realtime-transcription-s.textA text representation of a POST request to the v1/realtime/transcription_sessions endpoint for initiating speech to text sessions.Exact payloads, commands, or snippets shown in A text representation of a POST request to the v1/realtime/transcriptionsessions endpoint for initiating speech to te...
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-nodejs-transcrip.textA Node.js code example demonstrating how to use the OpenAI Whisper model to transcribe an audio file into text.Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to transcribe an audio file into text.
examples/speech-to-text-openai-api-openai-whisper-transcription-python.textA Python script demonstrating how to use the OpenAI Whisper model to transcribe an MP3 audio file into text.Exact payloads, commands, or snippets shown in A Python script demonstrating how to use the OpenAI Whisper model to transcribe an MP3 audio file into text.
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-curl-transcripti-2.textA curl command demonstrating how to send an audio file to the OpenAI Whisper API for transcription using a multipart/form-data POST request.Exact payloads, commands, or snippets shown in A curl command demonstrating how to send an audio file to the OpenAI Whisper API for transcription using a multipart/...
examples/speech-to-text-openai-api-openai-speech-to-text-system-prompt-correction.textA system prompt designed to correct spelling discrepancies and ensure product name accuracy in transcribed text for the OpenAI speech-to-text API.Exact payloads, commands, or snippets shown in A system prompt designed to correct spelling discrepancies and ensure product name accuracy in transcribed text for t...
examples/speech-to-text-openai-api-openai-speech-to-text-system-prompt-correction-2.textA system prompt designed to instruct an AI assistant to correct spelling discrepancies in transcribed text, specifically focusing on ZyntriQix product names.Exact payloads, commands, or snippets shown in A system prompt designed to instruct an AI assistant to correct spelling discrepancies in transcribed text, specifica...

What This Skill Covers

  • The Audio API provides two speech to text endpoints:
  • Main sections: Quickstart, Transcriptions, Speaker diarization, Translations, Supported languages.

Workflow

  1. Open the most relevant file under docs/ for the exact documented workflow and wording.
  2. Open schemas/ files for exact structured contracts.
  3. Open examples/ files for concrete requests, commands, snippets, and manifests.
  4. Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://developers.openai.com/api/docs/guides/speech-to-text