image-generation
Installation
SKILL.md
Image Generation with Gemini, OpenAI, and xAI
Generate and edit images using Google Gemini, OpenAI GPT Image 2, and xAI Grok Image APIs via shell scripts.
Available Providers
Google Gemini
- Model:
gemini-3-pro-image-preview(default, "Nano Banana Pro"). Alt:gemini-3.1-flash-image-preview(Flash, 14 ratios). - Strengths: Premium quality, up to 4K output, thinking mode, Google Search grounding, multi-turn editing with up to 14 reference images
- Aspect ratios: 10 on Pro (1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 4:5, 5:4, 21:9); Flash adds 4 extreme ratios (1:4, 4:1, 1:8, 8:1)
- Resolution:
--image-sizetakes1K,2K,4Kon both Pro and Flash; Flash additionally supports512(UPPERCASE required) - Env var:
GEMINI_API_KEY
OpenAI GPT Image 2
- Model:
gpt-image-2(default, snapshotgpt-image-2-2026-04-21);gpt-image-1.5available as previous flagship via--model - Strengths: Superior text rendering, transparent backgrounds, up to 16 input images for editing, quality tiers
- Sizes: 1024x1024, 1536x1024 (landscape), 1024x1536 (portrait)
- Quality: low (fast/cheap), medium, high (best fidelity)
- Env var:
OPENAI_API_KEY