google · Google AI Docs

Gemini API Gemini Live API overview

An introduction to the Gemini Live API covering its core capabilities, including tool use, session management, ephemeral tokens, and optimization best practices for real-time multimodal interactions.

Import to Prompt Buddy

Derived skill

Files assembled from official documentation

Viewing SKILL.md

Gemini API Gemini Live API overview

An introduction to the Gemini Live API covering its core capabilities, including tool use, session management, ephemeral tokens, and optimization best practices for real-time multimodal interactions.

When To Use

Use when implementing real-time, low-latency multimodal interactions that require session management, tool integration, or optimized WebSocket communication.

Reference Files

File	Contains	Use For
`SKILL.md`	Entry point: scope, routing table, and workflow.	Start here.
`docs/gemini-api-live-overview-workflow-guide.md`	An overview of the Gemini Live API covering use cases, key features, technical specifications, and implementation approaches for real-time voice and vision interactions.	Questions about an overview of the Gemini Live API covering use cases, key features, technical specifications, and implementation app...

What This Skill Covers

The Live API enables low-latency, real-time voice and vision interactions with Gemini. It processes continuous streams of audio, images, and text to deliver...
Main sections: Use cases, Key features, Technical specifications, Choose an implementation approach, Get started.

Workflow

Open the most relevant file under docs/ for the exact documented workflow and wording.
Open schemas/ files for exact structured contracts.
Open examples/ files for concrete requests, commands, snippets, and manifests.
Do not add behavior or configuration that is not present in the attached source files.

Canonical source: https://ai.google.dev/gemini-api/docs/live