axiom-vision

Installation
Summary

Apple Vision Framework for computer vision tasks: subject segmentation, pose detection, text recognition, barcode scanning, and document processing.

  • Covers 13+ Vision APIs across subject lifting, hand/body pose, person segmentation, text OCR, barcode detection, and document scanning with decision trees for choosing the right tool
  • Includes 15 production patterns: combining APIs to exclude hands from objects, real-time gesture recognition, multi-person segmentation, fitness action classification, and live camera scanning
  • Requires iOS 14+ minimum; instance masks and 3D body pose need iOS 17+; DataScannerViewController requires iOS 16+
  • All Vision processing must run on background queues to prevent UI freezing; confidence scores must be checked before using landmarks to avoid unreliable detections
SKILL.md

Computer Vision

You MUST use this skill for ANY computer vision work using the Vision framework.

Quick Reference

Symptom / Task Reference
Subject segmentation, lifting See skills/vision-framework.md
Hand/body pose detection See skills/vision-framework.md
Text recognition (OCR) See skills/vision-framework.md
Barcode/QR code detection See skills/vision-framework.md
Document scanning See skills/vision-framework.md
DataScannerViewController See skills/vision-framework.md
Structured document extraction (iOS 26+) See skills/vision-framework.md
Isolate object excluding hand See skills/vision-framework.md
Vision framework API reference See skills/vision-ref.md
Visual Intelligence integration (iOS 26+) See skills/vision-ref.md
Subject not detected See skills/vision-diag.md
Related skills

More from charleswiltgen/axiom

Installs
808
GitHub Stars
895
First Seen
Jan 21, 2026