vertex-ai-media-master

Installation
SKILL.md

Vertex AI Media Master

Overview

Multimodal media operations on Google Cloud Vertex AI covering video understanding, audio generation, image creation, and marketing campaign automation. This skill orchestrates Gemini 2.5 Pro/Flash, Imagen 4, and Lyria models to process, analyze, and generate rich media assets.

Prerequisites

  • Google Cloud project with Vertex AI API enabled
  • google-cloud-aiplatform Python SDK installed (pip install google-cloud-aiplatform[vision,audio])
  • GOOGLE_CLOUD_PROJECT and GOOGLE_APPLICATION_CREDENTIALS environment variables set
  • Service account with roles/aiplatform.user permission
  • Sufficient quota for target models (Gemini 2.5 Pro: 2M tokens/min; Imagen 4: 100 images/min)

Instructions

  1. Initialize the Vertex AI client with the target project and region (us-central1 recommended for model availability).
  2. Select the appropriate model for the task:
    • Video analysis: Gemini 2.5 Pro (up to 6 hours at low resolution, 2 hours at default).
    • Image generation: Imagen 4 for highest quality stills; Gemini 2.5 Flash Image for interleaved text+image output.
Related skills
Installs
31
GitHub Stars
2.2K
First Seen
Jan 25, 2026