vertex-ai-media-master
Installation
SKILL.md
Vertex AI Media Master
Overview
Multimodal media operations on Google Cloud Vertex AI covering video understanding, audio generation, image creation, and marketing campaign automation. This skill orchestrates Gemini 2.5 Pro/Flash, Imagen 4, and Lyria models to process, analyze, and generate rich media assets.
Prerequisites
- Google Cloud project with Vertex AI API enabled
google-cloud-aiplatformPython SDK installed (pip install google-cloud-aiplatform[vision,audio])GOOGLE_CLOUD_PROJECTandGOOGLE_APPLICATION_CREDENTIALSenvironment variables set- Service account with
roles/aiplatform.userpermission - Sufficient quota for target models (Gemini 2.5 Pro: 2M tokens/min; Imagen 4: 100 images/min)
Instructions
- Initialize the Vertex AI client with the target project and region (
us-central1recommended for model availability). - Select the appropriate model for the task:
- Video analysis: Gemini 2.5 Pro (up to 6 hours at low resolution, 2 hours at default).
- Image generation: Imagen 4 for highest quality stills; Gemini 2.5 Flash Image for interleaved text+image output.
Related skills