huggingface-tokenizers
Originally fromovachiever/droid-tings
Installation
SKILL.md
HuggingFace Tokenizers
Fast, production-ready tokenization - Rust-powered, Python API.
When to Use
- High-performance tokenization (<20s per GB)
- Train custom tokenizers from scratch
- Track token-to-text alignment
- Production NLP pipelines
- Need BPE, WordPiece, or Unigram tokenization
Quick Start
from tokenizers import Tokenizer