ai-image-generator
AI Image Generator
Generate images using AI APIs (Google Gemini and OpenAI GPT). This skill teaches the prompting patterns and API mechanics for producing professional images directly from Claude Code.
Managed alternative: If you don't want to manage API keys, ImageBot provides a managed image generation service with album templates and brand kit support.
Model Selection
Choose the right model for the job:
| Need | Model | Why |
|---|---|---|
| Photorealistic scenes / stock photos | Gemini 3.1 Flash Image | Best depth, complexity, environmental context |
| Final client scenes (higher detail) | Gemini 3 Pro Image | Higher detail, better style consistency |
| Text on images (posters, OG with copy, infographics) | GPT Image 2 | Text rendering actually works — including multi-script |
| 10-variation style exploration | GPT Image 2 | Native batch — one prompt, 10 variants sharing composition + palette |
| Multi-reference compositing (product + lifestyle) | GPT Image 2 | Handles lighting, scale, perspective across references |
| Transparent icons / logos | GPT Image 1.5 | Native RGBA alpha — GPT Image 2 cannot do transparency |
| Quick drafts / iteration | Gemini 2.5 Flash Image | Free tier (~500/day) |
More from jezweb/claude-skills
tailwind-v4-shadcn
|
2.7Ktanstack-query
|
2.5Kshadcn-ui
Install and configure shadcn/ui components for React projects. Guides component selection, installation order, dependency management, customisation with semantic tokens, and common UI recipes (forms, data tables, navigation, modals). Use after tailwind-theme-builder has set up the theme infrastructure, when adding components, building forms, creating data tables, or setting up navigation.
2.5Ktailwind-theme-builder
>
2.2Kfastapi
|
2.0Kcolor-palette
>
1.9K