agent-gen-image
Agent Gen Image
Use rawgenai <provider> image to generate and edit images. Always read the chosen provider's reference file before running commands.
Prerequisites
brew install WHQ25/tap/rawgenai
Before using a provider, read its setup guide at references/setup/ to configure credentials.
General Guidelines
- On first use, ask user to pick a provider. Remember for the session.
- All output is JSON. Always show file paths to the user.
- For async commands (some providers):
create->status->download. - Write image prompts descriptively: subject + action + environment + style + lighting.
- If a command fails, try a different provider or inform the user.
More from whq25/rawgenai
agent-right-brain
Give agents creative abilities using `rawgenai` — speak, listen, generate images/videos/music/sound effects, create multi-speaker dialogue, and manage voices. Use this skill when the user asks to "speak", "talk", "read aloud", "transcribe", "generate an image", "create a picture", "draw", "edit an image", "generate a video", "create a video", "animate", "generate music", "create a song", "generate sound effects", "create dialogue", "design a voice", "clone a voice", or any request involving voice, audio, image, or video creation.
7agent-speak
Give agents voice abilities using `rawgenai` — text-to-speech, multi-speaker dialogue, and voice management (design, clone, create voices). Use this skill when the user asks to "speak", "talk", "read aloud", "say this", "create dialogue", "design a voice", "clone a voice", or any request involving spoken audio output and voice creation.
1agent-gen-video
Give agents video generation abilities using `rawgenai` — create videos from text/images, extend videos, and remix existing footage. Use this skill when the user asks to "generate a video", "create a video", "animate", "video from image", "extend video", "remix video", or any request involving video creation.
1agent-gen-audio
Give agents music and sound effect generation abilities using `rawgenai` — create music from prompts or lyrics, and generate sound effects from text descriptions. Use this skill when the user asks to "generate music", "create a song", "compose music", "make a beat", "generate sound effects", "create sfx", or any request involving music or sound effect creation.
1agent-listen
Give agents listening abilities using `rawgenai` — transcribe audio/video to text with timestamps, subtitles, and speaker diarization. Use this skill when the user asks to "transcribe", "speech to text", "convert audio to text", "generate subtitles", "diarize speakers", or any request involving audio/video transcription.
1