image-generation
Image Generation
This skill enables generating images using Google's Gemini API (specifically the "Nano Banana" models) or OpenRouter.
Models
Google Gemini API
- Nano Banana (gemini-2.5-flash-image): Designed for speed and efficiency. Default model.
- Nano Banana Pro (gemini-3-pro-image-preview): Built on Gemini 3, this is the most advanced image model. Key capabilities:
  - State-of-the-art text rendering in multiple languages
  - Real-world knowledge and deep reasoning for precise, detailed results
  - Advanced controls: up to 14 input images for composition
  - Studio-quality editing: lighting, camera settings, color grading
  - High-fidelity resolutions: 1K, 2K, and 4K
  - Brand consistency and character resemblance across edits
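As a rough illustration of how the two Gemini models listed above might be selected in a script, the sketch below maps a friendly name to its model ID and falls back to the default. The alias keys (`nano-banana`, `nano-banana-pro`) and the `resolve_model` helper are hypothetical names chosen for this example; only the model IDs and the choice of default come from the list above.

```python
from typing import Optional

# Model IDs as listed above; the alias keys are hypothetical, for illustration only.
MODELS = {
    "nano-banana": "gemini-2.5-flash-image",
    "nano-banana-pro": "gemini-3-pro-image-preview",
}
DEFAULT_ALIAS = "nano-banana"  # the list above marks this one as the default


def resolve_model(name: Optional[str] = None) -> str:
    """Return a model ID for a friendly name, a raw model ID, or the default."""
    if name is None:
        return MODELS[DEFAULT_ALIAS]
    # Normalize "Nano Banana Pro" style input to the alias form.
    alias = name.strip().lower().replace(" ", "-")
    if alias in MODELS:
        return MODELS[alias]
    # Accept a raw model ID unchanged if it is one of the known ones.
    if name in MODELS.values():
        return name
    raise ValueError(f"Unknown model: {name!r}")
```

Calling `resolve_model()` with no argument yields the default model ID, while `resolve_model("Nano Banana Pro")` yields the Pro model ID.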
Prerequisites
You need an API key to use this skill. The recommended way is to create a .env file in this skill's directory (<your-skill-directory>/image-generation/), but placing it in the workspace root is also supported.
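The lookup described above could be sketched roughly as follows. This is a minimal example under stated assumptions, not the skill's actual loader: the variable names `GEMINI_API_KEY` and `OPENROUTER_API_KEY`, the helper names, and the choice to check the skill directory before the workspace root are all assumptions made for illustration.

```python
from pathlib import Path
from typing import Dict, Optional, Sequence, Tuple


def parse_env_file(path: Path) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def find_api_key(
    skill_dir: Path,
    workspace_root: Path,
    # Assumed variable names; check the skill's own documentation for the real ones.
    names: Sequence[str] = ("GEMINI_API_KEY", "OPENROUTER_API_KEY"),
) -> Optional[Tuple[str, str]]:
    """Return (variable name, key) from the first .env file that defines one."""
    # Assumed order: the skill directory first, then the workspace root.
    for directory in (skill_dir, workspace_root):
        env_path = directory / ".env"
        if not env_path.is_file():
            continue
        values = parse_env_file(env_path)
        for name in names:
            if values.get(name):
                return name, values[name]
    return None
```

A caller would pass the skill directory and the workspace root, then treat a `None` result as "no key configured".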