simpo-training

Installation

SKILL.md

SimPO - Simple Preference Optimization

Quick start

SimPO is a reference-free preference optimization method that outperforms DPO without needing a reference model.

Installation:

# Create environment
conda create -n simpo python=3.10 && conda activate simpo

# Install PyTorch 2.2.2
# Visit: https://pytorch.org/get-started/locally/

# Install alignment-handbook
git clone https://github.com/huggingface/alignment-handbook.git
cd alignment-handbook
python -m pip install .

Installs

Repository

firecrawl/ai-re…h-skills

GitHub Stars

First Seen

Mar 28, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykWarn

simpo-training — firecrawl/ai-research-skills