Skip to main content
Back to registry

axiom-vision

charleswiltgen/axiom

Guides you through implementing computer vision: subject segmentation, hand/body pose detection, person detection, text recognition, barcode detection, document scanning, and combining Vision APIs to solve complex problems.

Installs344
Install command
npx skills add https://github.com/charleswiltgen/axiom --skill axiom-vision
Security audits
Gen Agent Trust HubPASS
SocketPASS
SnykPASS
About this skill
Guides you through implementing computer vision: subject segmentation, hand/body pose detection, person detection, text recognition, barcode detection, document scanning, and combining Vision APIs to solve complex problems. Use when you need to: "How do I isolate a subject from the background?" "I need to detect hand gestures like pinch" "How can I get a bounding box around an object without including the hand holding it ?" "Should I use VisionKit or Vision framework for subject lifting?" "How do I segment multiple people separately?" "I need to detect body poses for a fitness app" "How do I preserve HDR when compositing subjects on new backgrounds?" "How do I recognize text in an image?" "I need to scan QR codes from camera" "How do I extract data from a receipt?" "Should I use DataScannerViewController or Vision directly?" "How do I scan documents and correct perspective?" "I need to extract table data from a document" Signs you're making this harder than it needs to be: Before implementing any Vision feature: NEVER run Vision on main thread : Processing video frames? Use VNSequenceRequestHandler (maintains inter-frame state for temporal smoothing). For single images, use VNImageRequestHandler . Creating a new VNImageRequestHandler per frame discards temporal context and causes jittery results. See axiom-vision-ref for full comparison and code examples. User's original...

Source description provided by the upstream skill listing. Community reviews and install context appear in the sections below.

Community Reviews

Latest reviews

Sign in to review

No community reviews yet. Be the first to review.

Browse this skill in context
FAQ
What does axiom-vision do?

Guides you through implementing computer vision: subject segmentation, hand/body pose detection, person detection, text recognition, barcode detection, document scanning, and combining Vision APIs to solve complex problems.

Is axiom-vision good?

axiom-vision does not have approved reviews yet, so SkillJury cannot publish a community verdict.

What agent does axiom-vision work with?

axiom-vision currently lists compatibility with codex, gemini-cli, opencode, cursor, amp, github-copilot.

What are alternatives to axiom-vision?

Skills in the same category include telegram-bot-builder, flutter-app-size, sharp-edges, iterative-retrieval.

How do I install axiom-vision?

npx skills add https://github.com/charleswiltgen/axiom --skill axiom-vision

Related skills

More from charleswiltgen/axiom

Related skills

Alternatives in Software Engineering