fine-tuning-with-trl

Originally fromovachiever/droid-tings
Installation
SKILL.md

TRL - Transformer Reinforcement Learning

Quick start

TRL provides post-training methods for aligning language models with human preferences.

Installation:

pip install trl transformers datasets peft accelerate

Supervised Fine-Tuning (instruction tuning):

from trl import SFTTrainer

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",
    train_dataset=dataset,  # Prompt-completion pairs
)
Related skills

More from orchestra-research/ai-research-skills

Installs
196
GitHub Stars
8.3K
First Seen
Feb 7, 2026