fine-tuning-with-trl

TRL - Transformer Reinforcement Learning

Quick start

TRL provides post-training methods for language models, ranging from supervised fine-tuning to alignment with human preferences.

Installation:

pip install trl transformers datasets peft accelerate
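
After installing, it can help to confirm that all five packages are visible to the interpreter you plan to train with. The following is a minimal sketch using only the standard library; the helper name `installed_versions` is hypothetical and not part of TRL.

```python
from importlib.metadata import PackageNotFoundError, version

# The packages named in the pip command above.
PACKAGES = ("trl", "transformers", "datasets", "peft", "accelerate")


def installed_versions(packages=PACKAGES):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

Any package reported as missing usually means pip installed into a different environment than the one running the script.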

Supervised Fine-Tuning (instruction tuning):

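from datasets import load_dataset
from trl import SFTTrainer

# Example dataset (an assumption, not from the original): any dataset with
# "prompt" and "completion" columns can be substituted here.
dataset = load_dataset("trl-lib/tldr", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",
    train_dataset=dataset,  # Prompt-completion pairs
)
trainer.train()  # Start fine-tuning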
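
To make "prompt-completion pairs" concrete, here is a rough sketch of how such records could be prepared as a JSON Lines file. It assumes the `prompt` and `completion` field names; the helpers `make_record` and `write_jsonl` and the sample text are hypothetical, chosen only for illustration.

```python
import json


def make_record(prompt: str, completion: str) -> dict:
    """Build one prompt-completion record, rejecting empty fields."""
    if not prompt.strip() or not completion.strip():
        raise ValueError("prompt and completion must be non-empty")
    return {"prompt": prompt, "completion": completion}


def write_jsonl(records, path) -> int:
    """Write one JSON object per line and return the number of records written."""
    count = 0
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count


# Illustrative sample data only.
records = [
    make_record("Translate to French: Hello", "Bonjour"),
    make_record("What is 2 + 2?", "4"),
]
```

A file written this way could then be loaded with `load_dataset("json", data_files="train.jsonl", split="train")` and passed as `train_dataset`.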