lingbot-map-3d-reconstruction

LingBot-Map 3D Reconstruction Skill

Skill by ara.so — Daily 2026 Skills collection.

LingBot-Map is a feed-forward 3D foundation model that reconstructs scenes from streaming images or video using a Geometric Context Transformer. It reaches about 20 FPS at 518×378 resolution on sequences longer than 10,000 frames by using paged KV cache attention.
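To make the "paged KV cache" idea concrete, here is a minimal, generic sketch of the data structure: cached entries live in fixed-size pages, and a new page is allocated only when the last one fills up, so memory grows in small steps as frames stream in. This is not LingBot-Map's implementation. The class name `PagedKVCache`, the `page_size` value, and the string placeholders for keys and values are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class PagedKVCache:
    """Toy paged cache (hypothetical): (key, value) entries stored in fixed-size pages."""

    page_size: int = 4                          # entries per page; illustrative value
    pages: list = field(default_factory=list)   # each page is a list of (key, value)
    length: int = 0                             # total number of cached entries

    def append(self, key, value):
        # Allocate a fresh page only when the previous one is full.
        if self.length % self.page_size == 0:
            self.pages.append([])
        self.pages[-1].append((key, value))
        self.length += 1

    def get(self, index):
        # Translate a logical index into (page number, offset within page).
        if not 0 <= index < self.length:
            raise IndexError(index)
        page, offset = divmod(index, self.page_size)
        return self.pages[page][offset]


# Usage: stream 10 entries into pages of 4.
cache = PagedKVCache(page_size=4)
for i in range(10):
    cache.append(f"k{i}", f"v{i}")
```

Because pages are fixed-size, appending never requires copying earlier entries, which is the property that makes this layout attractive for very long streams.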

What It Does

  • Streaming 3D reconstruction from image sequences or video
  • Feed-forward inference (no iterative optimization needed)
  • Outputs: point clouds with per-point confidence, camera poses, depth maps
  • Key features: anchor context, pose-reference window, trajectory memory for drift correction
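The three outputs listed above relate to each other in a standard way: a depth map plus a camera pose can be unprojected into world-space points, and per-point confidence can be used to drop unreliable ones. The sketch below shows that relationship with a plain pinhole camera model. It is an assumption-laden illustration, not the skill's API: the function name `unproject`, the `(fx, fy, cx, cy)` intrinsics tuple, the 3×4 camera-to-world pose layout, and the `min_conf` threshold are all hypothetical, and the source does not specify the formats LingBot-Map actually returns.

```python
def unproject(depth, conf, intrinsics, cam_to_world, min_conf=0.5):
    """Turn a depth map into world-space points, keeping only confident pixels.

    depth, conf   : 2D lists (rows of floats) with the same shape
    intrinsics    : (fx, fy, cx, cy) pinhole parameters (assumed format)
    cam_to_world  : 3x4 matrix [R | t] as nested lists (assumed format)
    """
    fx, fy, cx, cy = intrinsics
    points = []
    for v, (depth_row, conf_row) in enumerate(zip(depth, conf)):
        for u, (z, c) in enumerate(zip(depth_row, conf_row)):
            if c < min_conf or z <= 0:
                continue  # skip low-confidence or invalid depth
            # Pixel (u, v) with depth z -> point in camera coordinates.
            cam = ((u - cx) * z / fx, (v - cy) * z / fy, z)
            # Camera -> world: R @ p + t, one output coordinate per matrix row.
            world = tuple(
                sum(r * p for r, p in zip(row[:3], cam)) + row[3]
                for row in cam_to_world
            )
            points.append((world, c))
    return points


# Usage: a 2x2 depth map, identity rotation, camera shifted 1 unit along x.
depth = [[2.0, 2.0], [2.0, 2.0]]
conf = [[0.9, 0.1], [0.8, 0.7]]
pose = [[1, 0, 0, 1], [0, 1, 0, 0], [0, 0, 1, 0]]
cloud = unproject(depth, conf, (1.0, 1.0, 0.0, 0.0), pose)
```

In practice this would be vectorized with an array library; the loop form is used here only to keep each step visible.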

Installation

# 1. Create environment
conda create -n lingbot-map python=3.10 -y
conda activate lingbot-map

First seen: Apr 17, 2026