skill-creator
Skill Creator
A skill for creating new skills and iteratively improving them.
At a high level, the process of creating a skill goes like this:
- Decide what you want the skill to do and roughly how it should do it
- Write a draft of the skill
- Create a few test prompts and run claude-with-access-to-the-skill on them
- Help the user evaluate the results both qualitatively and quantitatively
- While the runs happen in the background, draft some quantitative evals if there aren't any yet. If some already exist, use them as is or modify them where you feel something needs to change. Either way, explain the evals to the user
- Use the eval-viewer/generate_review.py script to show the user the results for them to look at, and also let them look at the quantitative metrics
- Rewrite the skill based on feedback from the user's evaluation of the results (and also if there are any glaring flaws that become apparent from the quantitative benchmarks)
- Repeat until you're satisfied
- Expand the test set and try again at larger scale
Your job when using this skill is to figure out where the user is in this process and then jump in and help them progress through these stages. So for instance, maybe they're like "I want to make a skill for X". You can help narrow down what they mean, write a draft, write the test cases, figure out how they want to evaluate, run all the prompts, and repeat.
On the other hand, maybe they already have a draft of the skill. In this case you can go straight to the eval/iterate part of the loop.
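As a rough illustration of "figure out where the user is in this process", the decision can be sketched as a small function that maps what the user already has to the next stage. Everything here is hypothetical: the `SkillProgress` fields, the stage labels, and `next_stage` are names made up for this sketch and are not part of the skill or its scripts.

```python
from dataclasses import dataclass


@dataclass
class SkillProgress:
    """Hypothetical snapshot of what the user already has (illustrative only)."""
    intent_clear: bool = False      # do we know what the skill should do?
    has_draft: bool = False         # is there a draft of the skill?
    has_test_prompts: bool = False  # are there test prompts to run?
    has_results: bool = False       # have the runs been reviewed with the user?
    user_satisfied: bool = False    # is the user happy with the results?


def next_stage(p: SkillProgress) -> str:
    """Return an illustrative label for the stage to jump into next."""
    if not p.has_draft:
        # No draft yet: narrow down the intent first, then write the draft.
        return "write_draft" if p.intent_clear else "clarify_intent"
    # An existing draft means we can go straight to the eval/iterate loop.
    if not p.has_test_prompts:
        return "create_test_prompts"
    if not p.has_results:
        return "run_and_evaluate"
    if not p.user_satisfied:
        return "rewrite_from_feedback"
    return "expand_test_set"


# "I want to make a skill for X": start by narrowing down what they mean.
print(next_stage(SkillProgress()))
# The user already has a draft: skip ahead to the eval/iterate part.
print(next_stage(SkillProgress(has_draft=True)))
```

In practice the stages blur together and the loop repeats, so this is only a way of picturing the entry point, not a rigid state machine.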