Gemini API in Vertex AI

Use the Gemini API in Vertex AI to access Google's most advanced AI models, built for enterprise use cases.

The API provides these key capabilities:

Text generation - Chat, completion, summarization
Multimodal understanding - Process images, audio, video, and documents
Function calling - Let the model invoke your functions
Structured output - Generate valid JSON matching your schema
Context caching - Cache large contexts for efficiency
Embeddings - Generate text embeddings for semantic search
Live Realtime API - Bidirectional streaming for low-latency voice and video interactions
Batch Prediction - Run large asynchronous prediction jobs over whole datasets

Core Directives

Unified SDK: ALWAYS use the Gen AI SDK (google-genai for Python, @google/genai for JS/TS, google.golang.org/genai for Go, com.google.genai:google-genai for Java, Google.GenAI for C#).
Legacy SDKs: DO NOT use google-cloud-aiplatform, @google-cloud/vertexai, or google-generativeai.
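
As a rough illustration of the Unified SDK directive, the sketch below points the google-genai Python client at Vertex AI and sends one text prompt. The helper names (vertex_client_options, generate_text), the default region "us-central1", and the model ID in the trailing comment are assumptions chosen for this example, not requirements of the API. It also assumes the google-genai package is installed and Application Default Credentials are configured.

```python
from typing import Optional


def vertex_client_options(project: str, location: str = "us-central1") -> dict:
    """Build keyword arguments that route the Gen AI SDK client to Vertex AI.

    Hypothetical helper for this sketch; the region default is an assumption.
    """
    return {"vertexai": True, "project": project, "location": location}


def generate_text(prompt: str, model: str, options: dict) -> Optional[str]:
    """Send a single text prompt and return the text of the reply, if any."""
    # Imported lazily so the options helper works without the SDK installed.
    from google import genai  # provided by: pip install google-genai

    client = genai.Client(**options)
    response = client.models.generate_content(model=model, contents=prompt)
    return response.text


# Example call (needs network access and credentials; values are placeholders):
# options = vertex_client_options("your-project-id")
# print(generate_text("Explain context caching in one sentence.",
#                     "gemini-2.5-flash", options))
```

Keeping the Vertex AI settings in one place means the rest of the code only touches the unified client, so nothing depends on the legacy packages listed above.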