unsloth-training

SKILL.md

GRPO - Reinforcement learning with reward functions (no labeled outputs needed)
SFT - Supervised fine-tuning on input/output pairs
Vision - Vision-language model (VLM) fine-tuning (Qwen3-VL, Gemma 3, Llama 3.2 Vision)
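
To make "reward functions (no labeled outputs needed)" concrete, here is a minimal sketch of a rule-based reward that scores completions on format alone. It assumes the convention used by TRL-style GRPO trainers, where a reward function receives a list of completion strings plus extra keyword arguments and returns one float per completion. The `<answer>` tag format and the name `format_reward` are hypothetical, chosen only for illustration.

```python
import re

# Hypothetical output format: the model is asked to wrap its final answer
# in <answer>...</answer> tags. This tag convention is an assumption.
ANSWER_PATTERN = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def format_reward(completions, **kwargs):
    """Return 1.0 for each completion with a non-empty <answer> block, else 0.0.

    Assumes `completions` is a list of plain strings. No reference answers
    are needed: the score depends only on the generated text.
    """
    rewards = []
    for text in completions:
        match = ANSWER_PATTERN.search(text)
        has_answer = bool(match and match.group(1).strip())
        rewards.append(1.0 if has_answer else 0.0)
    return rewards


if __name__ == "__main__":
    samples = ["<answer>42</answer>", "no tags here", "<answer>   </answer>"]
    print(format_reward(samples))
```

A function like this would typically be passed to the trainer alongside other rewards (correctness checks, length penalties), with the trainer combining the scores per completion.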

Key capabilities:

FP8 Training - 60% less VRAM, 1.4x faster (RTX 40-series and newer, H100)
3x Packing - Automatic 2-5x speedup for mixed-length data
Docker - Official unsloth/unsloth image
Mobile - Quantization-aware training (QAT) → ExecuTorch → iOS/Android (~40 tok/s)
Export - GGUF, Ollama, vLLM, LM Studio, SGLang
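
The packing speedup on mixed-length data comes from filling each fixed-length training row with several short sequences instead of padding every sequence out to the maximum length. The sketch below illustrates the idea with a greedy first-fit-decreasing heuristic. It is not Unsloth's implementation, and `pack_sequences` and `padding_fraction` are hypothetical helper names. A real trainer must also mask attention so that packed sequences do not attend to one another.

```python
def pack_sequences(lengths, max_len):
    """Group sequence indices into rows whose total length fits in max_len.

    Greedy first-fit decreasing: place the longest sequences first, each
    into the first row that still has room, opening a new row if none does.
    """
    rows = []       # each row is a list of sequence indices
    free = []       # remaining token capacity of each row
    order = sorted(range(len(lengths)), key=lambda i: lengths[i], reverse=True)
    for i in order:
        n = lengths[i]
        if n > max_len:
            raise ValueError(f"sequence {i} has length {n} > max_len {max_len}")
        for r, capacity in enumerate(free):
            if n <= capacity:
                rows[r].append(i)
                free[r] -= n
                break
        else:
            # No existing row had room, so start a new one.
            rows.append([i])
            free.append(max_len - n)
    return rows


def padding_fraction(lengths, max_len, num_rows):
    """Fraction of token slots that are padding when using num_rows rows."""
    return 1.0 - sum(lengths) / (num_rows * max_len)


if __name__ == "__main__":
    lengths = [900, 100, 500, 500, 300, 700]
    rows = pack_sequences(lengths, max_len=1000)
    print("rows:", rows)
    print("padding unpacked:", padding_fraction(lengths, 1000, len(lengths)))
    print("padding packed:", padding_fraction(lengths, 1000, len(rows)))
```

The gain depends on the length distribution: datasets where most examples are far shorter than the maximum sequence length waste the most compute on padding and so benefit the most.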

Installs

90

Repository

scientiacapital/skills

GitHub Stars

24

First Seen

Jan 23, 2026

Security Audits

Gen Agent Trust Hub: Pass
