agent-gen-audio
Agent Gen Audio
Use rawgenai <provider> music to generate music and rawgenai <provider> sfx to generate sound effects. Always read the chosen provider's reference file before running commands.
Prerequisites
brew install WHQ25/tap/rawgenai
Before using a provider, read its setup guide at references/setup/ to configure credentials.
General Guidelines
- On first use, ask user to pick a provider. Remember for the session.
- All output is JSON. Always show file paths to the user.
- For async commands:
create->status->download. - If a command fails, try a different provider or inform the user.
More from whq25/rawgenai
agent-right-brain
Give agents creative abilities using `rawgenai` — speak, listen, generate images/videos/music/sound effects, create multi-speaker dialogue, and manage voices. Use this skill when the user asks to "speak", "talk", "read aloud", "transcribe", "generate an image", "create a picture", "draw", "edit an image", "generate a video", "create a video", "animate", "generate music", "create a song", "generate sound effects", "create dialogue", "design a voice", "clone a voice", or any request involving voice, audio, image, or video creation.
7agent-gen-image
Give agents image generation and editing abilities using `rawgenai` — create images from text, edit existing images, inpaint, and generate with reference images. Use this skill when the user asks to "generate an image", "create a picture", "draw", "edit an image", "inpaint", "remove background", or any request involving image creation or manipulation.
2agent-speak
Give agents voice abilities using `rawgenai` — text-to-speech, multi-speaker dialogue, and voice management (design, clone, create voices). Use this skill when the user asks to "speak", "talk", "read aloud", "say this", "create dialogue", "design a voice", "clone a voice", or any request involving spoken audio output and voice creation.
1agent-gen-video
Give agents video generation abilities using `rawgenai` — create videos from text/images, extend videos, and remix existing footage. Use this skill when the user asks to "generate a video", "create a video", "animate", "video from image", "extend video", "remix video", or any request involving video creation.
1agent-listen
Give agents listening abilities using `rawgenai` — transcribe audio/video to text with timestamps, subtitles, and speaker diarization. Use this skill when the user asks to "transcribe", "speech to text", "convert audio to text", "generate subtitles", "diarize speakers", or any request involving audio/video transcription.
1