Speech to text | OpenAI API
Explains how to implement speech-to-text transcription using the OpenAI API, including audio file requirements and parameter usage.
When To Use
Use when you need to implement audio transcription workflows or convert spoken language into text using OpenAI models.
Reference Files
| File | Contains | Use For |
|---|---|---|
SKILL.md | Entry point: scope, routing table, and workflow. | Start here. |
docs/speech-to-text-openai-api-workflow-guide.md | A guide detailing OpenAI's speech-to-text endpoints including transcriptions, translations, and speaker diarization using the Whisper model. | Questions about a guide detailing OpenAI's speech-to-text endpoints including transcriptions, translations, and speaker diarization u... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs.text | A Node.js code example demonstrating how to use the OpenAI client to create an audio transcription using the transcriptions create method. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create an audio transcription using the transcri... |
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription.text | A Python script using the OpenAI client to transcribe an MP3 audio file using the transcription create method. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file using the transcription create method. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl.text | A curl command demonstrating how to use the OpenAI audio transcriptions endpoint to transcribe an audio file using a specific model. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to use the OpenAI audio transcriptions endpoint to transcribe an audio file using a... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ.text | A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint. |
examples/speech-to-text-openai-api-openai-speech-to-text-api-guide-introduction.text | An introductory text overview explaining the concept and scaling potential of speech-to-text technology within the OpenAI API. | Exact payloads, commands, or snippets shown in An introductory text overview explaining the concept and scaling potential of speech-to-text technology within the Op... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs-2.text | A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file. |
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-2.text | A Python script using the OpenAI client to transcribe an MP3 audio file into text. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file into text. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-2.text | A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a multipart/form-data POST request to the OpenAI audio transcriptions endpoint. |
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-dia.text | A Node.js code example demonstrating how to use the OpenAI API to transcribe audio with diarization and automatic chunking. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI API to transcribe audio with diarization and automatic chu... |
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-3.text | A Python script using the OpenAI client to transcribe a local WAV audio file into text. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe a local WAV audio file into text. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-3.text | A curl command demonstrating a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data. | Exact payloads, commands, or snippets shown in A curl command demonstrating a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data. |
examples/speech-to-text-openai-api-openai-speech-to-text-audio-translation-nodejs.text | A Node.js code example demonstrating how to use the OpenAI Whisper model to translate an audio file from German to English text. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to translate an audio file from German to En... |
examples/speech-to-text-openai-api-openai-whisper-audio-translation-python.text | A Python script using the OpenAI client to translate German audio files into English text using the whisper-1 model. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to translate German audio files into English text using the whisper-1 model. |
examples/speech-to-text-openai-api-openai-speech-to-text-audio-translation-curl.text | A curl command demonstrating how to use the OpenAI audio translation endpoint with the whisper-1 model. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to use the OpenAI audio translation endpoint with the whisper-1 model. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcription.text | A plain text sample containing the spoken content used to demonstrate the OpenAI speech-to-text transcription process. | Exact payloads, commands, or snippets shown in A plain text sample containing the spoken content used to demonstrate the OpenAI speech-to-text transcription process. |
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-whi.text | A Node.js code example demonstrating how to use the OpenAI Whisper model to create a transcription with word-level timestamps. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to create a transcription with word-level ti... |
examples/speech-to-text-openai-api-openai-speech-to-text-python-whisper-transcrip.text | A Python script using the OpenAI client to transcribe an MP3 audio file into words with timestamp granularity using the whisper-1 model. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to transcribe an MP3 audio file into words with timestamp granularity using t... |
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-curl-transcripti.text | A curl command demonstrating how to send an audio file to the OpenAI Whisper API with word-level timestamp granularity. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send an audio file to the OpenAI Whisper API with word-level timestamp granularity. |
examples/speech-to-text-openai-api-openai-speech-to-text-pydub-audio-segmentation.text | A Python script using pydub to slice an mp3 file into a ten-minute segment for speech-to-text processing. | Exact payloads, commands, or snippets shown in A Python script using pydub to slice an mp3 file into a ten-minute segment for speech-to-text processing. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-nodejs-3.text | A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file using the audio transcriptions endpoint. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI client to create a transcription from an audio file using... |
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-4.text | A Python script demonstrating how to use the OpenAI client to transcribe an MP3 audio file into text using the audio transcriptions endpoint. | Exact payloads, commands, or snippets shown in A Python script demonstrating how to use the OpenAI client to transcribe an MP3 audio file into text using the audio... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-4.text | A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using multipart/form-data. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using multipart/f... |
examples/speech-to-text-openai-api-openai-speech-to-text-nodejs-transcription-str.text | A Node.js code example demonstrating how to create a streaming transcription request using the OpenAI audio API. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to create a streaming transcription request using the OpenAI audio API. |
examples/speech-to-text-openai-api-openai-speech-to-text-python-transcription-str.text | A Python script using the OpenAI client to stream transcription text from an audio file using the transcription create method. | Exact payloads, commands, or snippets shown in A Python script using the OpenAI client to stream transcription text from an audio file using the transcription creat... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcriptions-curl-requ-5.text | A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using the whisper-1 model. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send a POST request to the OpenAI audio transcriptions endpoint using the whisper... |
examples/speech-to-text-openai-api-openai-speech-to-text-realtime-websocket-url.text | The WebSocket URL used for initiating real-time speech-to-text transcription via the OpenAI API. | Exact payloads, commands, or snippets shown in The WebSocket URL used for initiating real-time speech-to-text transcription via the OpenAI API. |
examples/speech-to-text-openai-api-openai-speech-to-text-transcription-session-up.text | A JSON object representing a transcription session update configuration including audio format, model parameters, and turn detection settings. | Exact payloads, commands, or snippets shown in A JSON object representing a transcription session update configuration including audio format, model parameters, and... |
examples/speech-to-text-openai-api-openai-speech-to-text-inputaudiobuffer-append-.text | A JSON representation of the input_audio_buffer.append event containing Base64-encoded audio data for speech-to-text processing. | Exact payloads, commands, or snippets shown in A JSON representation of the input_audio_buffer.append event containing Base64-encoded audio data for speech-to-text pr... |
examples/speech-to-text-openai-api-openai-speech-to-text-transcription-session-ob.text | A text representation of the realtime transcription session object structure including model and audio format properties. | Exact payloads, commands, or snippets shown in A text representation of the realtime transcription session object structure including model and audio format propert... |
examples/speech-to-text-openai-api-openai-speech-to-text-realtime-transcription-s.text | A text representation of a POST request to the v1/realtime/transcription_sessions endpoint for initiating speech-to-text sessions. | Exact payloads, commands, or snippets shown in A text representation of a POST request to the v1/realtime/transcription_sessions endpoint for initiating speech to te... |
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-nodejs-transcrip.text | A Node.js code example demonstrating how to use the OpenAI Whisper model to transcribe an audio file into text. | Exact payloads, commands, or snippets shown in A Node.js code example demonstrating how to use the OpenAI Whisper model to transcribe an audio file into text. |
examples/speech-to-text-openai-api-openai-whisper-transcription-python.text | A Python script demonstrating how to use the OpenAI Whisper model to transcribe an MP3 audio file into text. | Exact payloads, commands, or snippets shown in A Python script demonstrating how to use the OpenAI Whisper model to transcribe an MP3 audio file into text. |
examples/speech-to-text-openai-api-openai-speech-to-text-whisper-curl-transcripti-2.text | A curl command demonstrating how to send an audio file to the OpenAI Whisper API for transcription using a multipart/form-data POST request. | Exact payloads, commands, or snippets shown in A curl command demonstrating how to send an audio file to the OpenAI Whisper API for transcription using a multipart/... |
examples/speech-to-text-openai-api-openai-speech-to-text-system-prompt-correction.text | A system prompt designed to correct spelling discrepancies and ensure product name accuracy in transcribed text for the OpenAI speech-to-text API. | Exact payloads, commands, or snippets shown in A system prompt designed to correct spelling discrepancies and ensure product name accuracy in transcribed text for t... |
examples/speech-to-text-openai-api-openai-speech-to-text-system-prompt-correction-2.text | A system prompt designed to instruct an AI assistant to correct spelling discrepancies in transcribed text, specifically focusing on ZyntriQix product names. | Exact payloads, commands, or snippets shown in A system prompt designed to instruct an AI assistant to correct spelling discrepancies in transcribed text, specifica... |
What This Skill Covers
- The Audio API provides two speech-to-text endpoints: transcriptions and translations.
- Main sections: Quickstart, Transcriptions, Speaker diarization, Translations, Supported languages.
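As a rough orientation before opening the example files, the sketch below shows how the transcriptions and translations endpoints might be called through one small helper. It assumes a client object shaped like the official `openai` Python client (`client.audio.transcriptions.create` and `client.audio.translations.create`) and the `whisper-1` model named in the example descriptions. The helper name `speech_to_text` and its `translate` flag are hypothetical and exist only for illustration; the files under examples/ remain the authoritative snippets.

```python
from pathlib import Path


def speech_to_text(client, audio_path, model="whisper-1", translate=False):
    """Send one audio file to the transcriptions or translations endpoint.

    `client` is assumed to expose `audio.transcriptions.create` and
    `audio.translations.create`, as the official `openai` client does.
    This helper is an illustrative wrapper, not part of the API.
    """
    # Pick the endpoint: translations produce English text,
    # transcriptions keep the language spoken in the audio.
    endpoint = client.audio.translations if translate else client.audio.transcriptions

    # The file is passed as an open binary handle.
    with Path(audio_path).open("rb") as audio_file:
        result = endpoint.create(model=model, file=audio_file)

    # With the default JSON response format the result carries a `text` field.
    return result.text
```

With the `openai` package installed and `OPENAI_API_KEY` set, a call might look like `speech_to_text(OpenAI(), "speech.mp3")`, or `speech_to_text(OpenAI(), "german.mp3", translate=True)` for translation into English; both file names are placeholders.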
Workflow
- Open the most relevant file under `docs/` for the exact documented workflow and wording.
- Open `schemas/` files for exact structured contracts.
- Open `examples/` files for concrete requests, commands, snippets, and manifests.
- Do not add behavior or configuration that is not present in the attached source files.
Canonical source: https://developers.openai.com/api/docs/guides/speech-to-text
