axiom-vision

Installation
Summary

Apple Vision Framework for computer vision tasks: subject segmentation, pose detection, text recognition, barcode scanning, and document processing.

  • Covers 13+ Vision APIs across subject lifting, hand/body pose, person segmentation, text OCR, barcode detection, and document scanning with decision trees for choosing the right tool
  • Includes 15 production patterns: combining APIs to exclude hands from objects, real-time gesture recognition, multi-person segmentation, fitness action classification, and live camera scanning
  • Requires iOS 14+ minimum; instance masks and 3D body pose need iOS 17+; DataScannerViewController requires iOS 16+
  • All Vision processing must run on background queues to prevent UI freezing; confidence scores must be checked before using landmarks to avoid unreliable detections
SKILL.md

Computer Vision

You MUST use this skill for ANY computer vision work using the Vision framework.

Quick Reference

Related skills

More from charleswiltgen/axiom

Installs
819
GitHub Stars
901
First Seen
Jan 21, 2026