google · Google AI Docs
Gemini API Gemini Live API overview
An introduction to the Gemini Live API covering its core capabilities, including tool use, session management, ephemeral tokens, and optimization best practices for real-time multimodal interactions.
Derived skill
Files assembled from official documentation
When To Use
Use when implementing real-time, low-latency multimodal interactions that require session management, tool integration, or optimized WebSocket communication.
Reference Files
| File | Contains | Use For |
|---|---|---|
| SKILL.md | Entry point: scope, routing table, and workflow. | Start here. |
| docs/gemini-api-live-overview-workflow-guide.md | An overview of the Gemini Live API covering use cases, key features, technical specifications, and implementation approaches for real-time voice and vision interactions. | Questions about an overview of the Gemini Live API covering use cases, key features, technical specifications, and implementation app... |
What This Skill Covers
- The Live API enables low-latency, real-time voice and vision interactions with Gemini. It processes continuous streams of audio, images, and text to deliver...
- Main sections: Use cases, Key features, Technical specifications, Choose an implementation approach, Get started.
Workflow
- Open the most relevant file under `docs/` for the exact documented workflow and wording.
- Open `schemas/` files for exact structured contracts.
- Open `examples/` files for concrete requests, commands, snippets, and manifests.
- Do not add behavior or configuration that is not present in the attached source files.
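
The lookup order in this workflow can be sketched as a small helper that lists what each directory actually contains before anything is opened. This is only an illustration: the directory names `docs`, `schemas`, and `examples` come from the list above, while the function name `list_reference_files` and its return shape are assumptions made for this sketch and are not part of the skill.

```python
from pathlib import Path

# Directory names taken from the Workflow list; everything else here
# (function name, return shape) is a hypothetical illustration.
SKILL_DIRS = ("docs", "schemas", "examples")


def list_reference_files(skill_root):
    """Map each skill directory name to a sorted list of its files.

    Paths are returned relative to skill_root, using forward slashes.
    A directory that does not exist maps to an empty list.
    """
    root = Path(skill_root)
    found = {}
    for name in SKILL_DIRS:
        folder = root / name
        if folder.is_dir():
            found[name] = sorted(
                p.relative_to(root).as_posix()
                for p in folder.rglob("*")
                if p.is_file()
            )
        else:
            found[name] = []
    return found
```

Calling `list_reference_files(".")` from the skill root returns a dictionary keyed by directory name. An empty list for a directory indicates there is nothing to consult there, in which case the last rule of the workflow applies: rely only on the files that are present.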
Canonical source: https://ai.google.dev/gemini-api/docs/live
