fine-tuning-with-trl

Originally from ovachiever/droid-tings

TRL - Transformer Reinforcement Learning

Quick start

TRL is a Hugging Face library of post-training methods for language models, covering supervised fine-tuning (SFT) and preference-based alignment methods such as DPO.

Installation:

pip install trl transformers datasets peft accelerate

Supervised Fine-Tuning (instruction tuning):

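from datasets import Dataset
from trl import SFTTrainer

# Toy prompt-completion pairs for illustration; replace with your own data
dataset = Dataset.from_list([{"prompt": "What is 2 + 2?", "completion": "4"}])

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",  # model ID on the Hugging Face Hub
    train_dataset=dataset,
)
trainer.train()  # run fine-tuning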
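
The trainer above expects prompt-completion pairs, so raw data usually needs reshaping first. The following is a minimal sketch of that step using only the standard library. The input field names `instruction` and `response`, the helper names, and the `train.jsonl` file name are assumptions made for illustration and do not come from TRL.

```python
import json
import tempfile
from pathlib import Path


def to_prompt_completion(records):
    """Map raw records to {"prompt", "completion"} dicts.

    Assumes each record has hypothetical "instruction" and "response" keys.
    """
    pairs = []
    for rec in records:
        prompt = rec["instruction"].strip()
        completion = rec["response"].strip()
        if prompt and completion:  # skip records with an empty side
            pairs.append({"prompt": prompt, "completion": completion})
    return pairs


def write_jsonl(pairs, path):
    """Write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for pair in pairs:
            f.write(json.dumps(pair, ensure_ascii=False) + "\n")


# Illustrative records, not real training data
raw_records = [
    {"instruction": "Translate 'bonjour' to English.", "response": "hello"},
    {"instruction": "  What is 2 + 2?  ", "response": "4"},
    {"instruction": "Incomplete example", "response": ""},
]

pairs = to_prompt_completion(raw_records)
out_path = Path(tempfile.mkdtemp()) / "train.jsonl"
write_jsonl(pairs, out_path)
print(len(pairs), "pairs written to", out_path)
```

A file written this way can typically be loaded with `load_dataset("json", data_files=str(out_path), split="train")` from the `datasets` package and passed as `train_dataset`.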
First Seen
Jan 21, 2026