video-generation
Video Generation with Gemini (Veo 3.1)
Use this skill when the user asks to generate or extend videos with Gemini using the Python SDK.
Default to veo-3.1-fast-generate-preview, resolution="720p", and duration_seconds=4, unless the user asks otherwise or the task requires different settings (e.g., extension, interpolation, reference images, 1080p/4k).
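One way to keep these defaults in one place is sketched below. The helper name resolve_settings and the dictionary layout are illustrative assumptions, not part of the skill; only the model name and the two default values come from the text above.

```python
# Defaults taken from the skill text; everything else here is a hypothetical helper.
DEFAULT_MODEL = "veo-3.1-fast-generate-preview"
DEFAULT_SETTINGS = {"resolution": "720p", "duration_seconds": 4}


def resolve_settings(model=None, **overrides):
    """Return (model, settings), replacing defaults only where the caller set a value."""
    settings = dict(DEFAULT_SETTINGS)
    # Ignore overrides left as None so "not specified" falls back to the default.
    settings.update({k: v for k, v in overrides.items() if v is not None})
    return model or DEFAULT_MODEL, settings


model, settings = resolve_settings(resolution="1080p")
print(model, settings)
```

The resulting dictionary is meant to be unpacked into the SDK's config object (for example types.GenerateVideosConfig(**settings)), assuming the installed SDK version accepts those field names.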
Workflow
- Identify the task type: text-to-video, image-to-video, reference images, first/last frames (interpolation), or video extension.
- Ensure GEMINI_API_KEY is available (env or local .env), then use the Python SDK.
- When using images, pass types.Image(imageBytes=..., mimeType=...) (not PIL.Image or types.Part) to avoid input type errors.
- Call client.models.generate_videos(...) with the correct inputs/config (see references).
- Poll the operation until done, then download and save the video.
- If no videos are returned, surface a clear error and suggest checking the API key, model, and config.
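The steps above can be sketched roughly as follows, assuming the google-genai Python SDK is installed. The function names wait_until_done and generate_video, the timeout values, and the default output path are illustrative choices rather than part of the skill. The SDK calls used (genai.Client, client.models.generate_videos, client.operations.get, client.files.download, video.save) should be checked against the installed SDK version.

```python
import os
import time


def wait_until_done(operation, refresh, interval_s=10.0, timeout_s=600.0, sleep=time.sleep):
    """Poll `operation` through `refresh` until `operation.done` is truthy or time runs out."""
    waited = 0.0
    while not operation.done:
        if waited >= timeout_s:
            raise TimeoutError(f"Video generation still running after {timeout_s:.0f}s")
        sleep(interval_s)
        waited += interval_s
        operation = refresh(operation)
    return operation


def generate_video(prompt, out_path="output.mp4", image_path=None, mime_type="image/png"):
    """Text-to-video, or image-to-video when image_path is given."""
    # Imported lazily so the polling helper above works without the SDK installed.
    from google import genai
    from google.genai import types

    api_key = os.environ.get("GEMINI_API_KEY")
    if not api_key:
        raise RuntimeError("GEMINI_API_KEY is not set (check the environment or a local .env)")
    client = genai.Client(api_key=api_key)

    image = None
    if image_path is not None:
        with open(image_path, "rb") as f:
            # Raw bytes wrapped in types.Image, not PIL.Image or types.Part.
            image = types.Image(imageBytes=f.read(), mimeType=mime_type)

    operation = client.models.generate_videos(
        model="veo-3.1-fast-generate-preview",
        prompt=prompt,
        image=image,
        config=types.GenerateVideosConfig(resolution="720p", duration_seconds=4),
    )
    operation = wait_until_done(operation, client.operations.get)

    videos = getattr(operation.response, "generated_videos", None) or []
    if not videos:
        raise RuntimeError("No videos were returned; check the API key, model name, and config.")

    video = videos[0].video
    client.files.download(file=video)  # fetch the bytes before saving locally
    video.save(out_path)
    return out_path
```

A call such as generate_video("A slow pan across a foggy harbor at dawn") would then write output.mp4 to the working directory. Nothing runs at import time, so the polling helper can be exercised on its own with a stub operation.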
Use these references (by task type)