ai-image-generator

Installation
SKILL.md

AI Image Generator

Generate images using AI APIs (Google Gemini and OpenAI GPT). This skill teaches the prompting patterns and API mechanics for producing professional images directly from Claude Code.

Managed alternative: If you don't want to manage API keys, ImageBot provides a managed image generation service with album templates and brand kit support.

Model Selection

Choose the right model for the job:

Need Model Why
Photorealistic scenes / stock photos Gemini 3.1 Flash Image Best depth, complexity, environmental context
Final client scenes (higher detail) Gemini 3 Pro Image Higher detail, better style consistency
Text on images (posters, OG with copy, infographics) GPT Image 2 Text rendering actually works — including multi-script
10-variation style exploration GPT Image 2 Native batch — one prompt, 10 variants sharing composition + palette
Multi-reference compositing (product + lifestyle) GPT Image 2 Handles lighting, scale, perspective across references
Transparent icons / logos GPT Image 1.5 Native RGBA alpha — GPT Image 2 cannot do transparency
Quick drafts / iteration Gemini 2.5 Flash Image Free tier (~500/day)
Related skills
Installs
684
GitHub Stars
776
First Seen
Mar 14, 2026