image-generation

Installation
SKILL.md

Image Generation

This skill enables generating images using Google's Gemini API (specifically the "Nano Banana" models) or OpenRouter.

Models

Google Gemini API

  • Nano Banana (gemini-2.5-flash-image): Designed for speed and efficiency. Default model.
  • Nano Banana Pro (gemini-3-pro-image-preview): Built on Gemini 3, this is the most advanced image model. Key capabilities:
    • State-of-the-art text rendering in multiple languages
    • Real-world knowledge and deep reasoning for precise, detailed results
    • Advanced controls: up to 14 input images for composition
    • Studio-quality editing: lighting, camera settings, color grading
    • High-fidelity resolutions: 1K, 2K, and 4K
    • Brand consistency and character resemblance across edits

Prerequisites

You need an API key to use this skill. The recommended way is to create a .env file in this skill's directory (<your-skill-directory>/image-generation/), but placing it in the workspace root is also supported.

Related skills
Installs
4
Repository
ocmrz/skills
First Seen
Jan 25, 2026