qwen-image-2
Text-to-image and multi-image editing with Alibaba Qwen-Image-2.0 models via inference.sh CLI.
- Two models available: Qwen-Image-2.0 for fast general use, and Qwen-Image-2.0-Pro for professional text rendering and fine-grained control
- Supports text-to-image generation, multi-reference image editing (up to 3 input images), custom resolutions (512–2048 pixels), and negative prompts
- Key parameters include prompt extension toggle, seed-based reproducibility, watermark control, and batch generation (1–6 images per run)
- Pro model excels at text-heavy designs like posters; standard model prioritizes speed for general creative tasks
Install the belt CLI skill:
npx skills add belt-sh/cli
Qwen-Image - Alibaba Image Generation
Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
belt app run alibaba/qwen-image-2 --input '{"prompt": "A serene mountain landscape at sunset"}'
More from inferen-sh/skills
ai-rag-pipeline
0tools-ui
0infsh-cli
Run 250+ AI apps via inference.sh CLI - image generation, video creation, LLMs, search, 3D, Twitter automation. Models: FLUX, Veo, Gemini, Grok, Claude, Seedance, OmniHuman, Tavily, Exa, OpenRouter, and many more. Use when running AI apps, generating images/videos, calling LLMs, web search, or automating Twitter. Triggers: inference.sh, infsh, ai model, run ai, serverless ai, ai api, flux, veo, claude api, image generation, video generation, openrouter, tavily, exa search, twitter api, grok
0storyboard-creation
Film and video storyboarding with shot vocabulary, continuity rules, and panel layout. Covers shot types, camera angles, movement, 180-degree rule, and annotation format. Use for: video planning, film pre-production, ad storyboards, music video planning, animation. Triggers: storyboard, storyboarding, shot list, film planning, video planning, pre production, shot composition, camera angles, scene planning, visual script, animatic, storyboard panels, video storyboard
0web-search
0nano-banana-2
Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via inference.sh CLI. Capabilities: text-to-image, image editing, multi-image input (up to 14 images), Google Search grounding. Triggers: nano banana 2, nanobanana 2, gemini 3.1 flash image, gemini 3 1 flash image preview, google image generation
0