chat-with-anyone

Installation
Summary

Clone real voices from online video or design voices from photos, then roleplay as that person with synthetic speech.

  • Two workflows: extract voice from public video (interviews, speeches) by name, or generate a matching voice from an uploaded image of an unrecognizable person
  • Requires ffmpeg, yt-dlp, the tts skill, and a Noiz API key; includes setup verification and dependency installation steps
  • Built-in ethical guardrails: agent must refuse requests targeting non-consenting private individuals or clearly intended for deception, harassment, or fraud
  • Automated reference extraction finds the densest speech segment from downloaded video subtitles and audio, then reuses it across multiple generated replies for voice consistency
SKILL.md

Chat with Anyone

Clone a real person's voice from online video, or design a voice from a photo, then roleplay as that person with TTS.

Important: Ethical Use & Copyright

This skill synthesizes speech that imitates real voices. Before proceeding, the agent must:

  1. Never impersonate someone to deceive, defraud, or harass.
  2. Only use publicly available media (public speeches, interviews, press conferences) as reference audio.
  3. Inform the user that generated audio is synthetic and should not be presented as genuine recordings.
  4. Decline requests that target private individuals who have not consented, or that are clearly intended for deception, harassment, or defamation.

If the user's intent appears harmful, refuse politely and explain why.

Prerequisites

Related skills

More from noizai/skills

Installs
1.9K
Repository
noizai/skills
GitHub Stars
497
First Seen
Mar 3, 2026