magic-data-synthesis
Installation
SKILL.md
Natural Language Triggers
Activate this skill when the user says things like:
- "synthesize data" / "generate missing values" / "fill in the blanks with LLM"
- "translate this column" / "convert HTML to markdown"
- "annotate these records" / "label this data"
- "enrich this dataset" / "generate descriptions"
- "fill the TBD placeholders" / "replace sentinels with real content"
- "use DataDesigner" / "run data-designer"
These produce the SAME behavior as the synthesis workflow. Natural language works equally well.
When to Use
- Columns have missing values, sentinels ("X", "N/A", "TBD"), or placeholders needing contextual generation
- Format conversion (HTML→markdown), translation, annotation, labeling, summarization
- Structured field extraction from unstructured text into multiple columns
- Reference join leaves gaps → LLM fills remaining (use
enrich_from_reference.pyfirst)