axiom-vision

Vision Framework Computer Vision

Guides you through implementing computer vision with Apple's Vision framework: subject segmentation, hand and body pose detection, person detection, text recognition, barcode detection, document scanning, and combining Vision APIs to solve complex problems.

When to Use This Skill

Use when you need to:

  • ☑ Isolate subjects from backgrounds (subject lifting)
  • ☑ Detect and track hand poses for gestures
  • ☑ Detect and track body poses for fitness/action classification
  • ☑ Segment multiple people separately
  • ☑ Exclude hands from object bounding boxes (combining APIs)
  • ☑ Choose between VisionKit and the Vision framework
  • ☑ Combine Vision with Core Image for compositing
  • ☑ Decide which Vision API solves your problem
  • ☑ Recognize text in images (OCR)
  • ☑ Detect barcodes and QR codes
  • ☑ Scan documents with perspective correction
  • ☑ Extract structured data from documents (iOS 26+)
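
As a rough illustration of two of the tasks listed above (text recognition and barcode detection), here is a minimal Swift sketch using the Vision framework's request/handler pattern. The function names `recognizeText(in:)` and `detectBarcodePayloads(in:)` are hypothetical helpers introduced for this example, not part of the skill or of Vision itself. It assumes you already have a `CGImage` and an SDK where `results` is typed per request (iOS 15 / macOS 12 or later).

```swift
import CoreGraphics
import Vision

/// Hypothetical helper: returns the top candidate string for each
/// line of text that Vision recognizes in `image`.
func recognizeText(in image: CGImage) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate      // `.fast` trades accuracy for speed
    request.usesLanguageCorrection = true

    // One handler wraps one image; perform(_:) runs synchronously,
    // so call this off the main thread in an app.
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])

    let observations = request.results ?? []
    return observations.compactMap { $0.topCandidates(1).first?.string }
}

/// Hypothetical helper: returns the decoded payload strings of any
/// barcodes or QR codes that Vision detects in `image`.
func detectBarcodePayloads(in image: CGImage) throws -> [String] {
    let request = VNDetectBarcodesRequest()
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])

    return (request.results ?? []).compactMap { $0.payloadStringValue }
}
```

Both helpers follow the same shape: create a request, hand it to a `VNImageRequestHandler`, then read typed observations from `request.results`. A single handler can also perform several requests in one `perform(_:)` call, which is one way to combine Vision APIs on the same image.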

Related skills

More from fotescodev/ios-agent-skills

Installs
5
First Seen
Feb 23, 2026