constitutional-ai

SKILL.md

Constitutional AI - Harmlessness from AI Feedback

Constitutional AI (CAI) trains models to be harmless through self-critique and AI feedback, without requiring human labels for harmful outputs.

Key concept: Models learn to critique and revise their own responses against a "constitution" (a written set of principles).
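The critique-and-revise idea can be sketched as a short loop. This is a minimal illustration under stated assumptions, not the skill's or any library's implementation: `critique_and_revise`, `stub_model`, the prompt templates, and the two example principles are all hypothetical, and `stub_model` stands in for a real language model call so the sketch runs offline.

```python
from typing import Callable, List

# Hypothetical principles for illustration; not taken from the skill itself.
CONSTITUTION: List[str] = [
    "Point out anything in the response that is harmful or dangerous.",
    "Point out anything in the response that is dishonest or misleading.",
]


def critique_and_revise(
    model: Callable[[str], str], prompt: str, constitution: List[str]
) -> str:
    """Draft a response, then critique and revise it once per principle."""
    response = model(prompt)
    for principle in constitution:
        # Ask the model to critique its own response against one principle.
        critique = model(
            f"Prompt: {prompt}\nResponse: {response}\n"
            f"Critique request: {principle}"
        )
        # Ask the model to rewrite the response in light of that critique.
        response = model(
            f"Prompt: {prompt}\nResponse: {response}\n"
            f"Critique: {critique}\n"
            "Revision request: Rewrite the response to address the critique."
        )
    return response


def stub_model(text: str) -> str:
    """Stand-in for a real model call (assumption), keyed on the template."""
    if "Revision request:" in text:
        return "revised response"
    if "Critique request:" in text:
        return "critique"
    return "initial draft"


if __name__ == "__main__":
    print(critique_and_revise(stub_model, "Reply to a rude email.", CONSTITUTION))
```

With a real model in place of `stub_model`, the final revised responses would typically be collected as training data; the loop makes one drafting call plus two calls (critique, revision) per principle.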

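Two phases:

1. Supervised learning: the model critiques and revises its own responses, then is fine-tuned on the revisions.
2. Reinforcement learning: the model is trained with RL from AI feedback (RLAIF), using AI-generated preference comparisons in place of human labels.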

Installs: 358
Repository: orchestra-research/ai-research-skills
GitHub Stars: 10.4K
First Seen: Feb 7, 2026