Video-to-3D Reconstruction
Upload a video and reconstruct a 3D scene from it using a multi-agent ML pipeline.
Status: Scaffolded. The Cloud Function startVideoReconstruction creates job documents, but the ML orchestrator that would run the stages below is not yet built.
Pipeline Stages
| Stage | ML Model | Purpose |
|---|---|---|
| segment | SAM2 | Segment video frames into objects |
| depth | Monocular depth estimation | Estimate per-pixel depth |
| classify | Object classification | Identify what each segment is |
| rig | Auto-rigging | Add armatures to characters/vehicles |
| compose | Scene composer | Place objects in 3D space |
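
Since the orchestrator is not yet built, the following is only a minimal sketch of how the stage table could be expressed as data and run in order. The stage names, models, and purposes come from the table; `StageHandler`, `Artifacts`, and `runPipeline` are hypothetical names introduced for illustration and are not part of the existing code.

```typescript
// Stage names and order are taken from the table above.
type StageName = "segment" | "depth" | "classify" | "rig" | "compose";

interface StageSpec {
  name: StageName;
  model: string;
  purpose: string;
}

const PIPELINE_STAGES: readonly StageSpec[] = [
  { name: "segment", model: "SAM2", purpose: "Segment video frames into objects" },
  { name: "depth", model: "Monocular depth estimation", purpose: "Estimate per-pixel depth" },
  { name: "classify", model: "Object classification", purpose: "Identify what each segment is" },
  { name: "rig", model: "Auto-rigging", purpose: "Add armatures to characters/vehicles" },
  { name: "compose", model: "Scene composer", purpose: "Place objects in 3D space" },
];

// Hypothetical shapes: each handler receives everything produced so far
// and returns the artifacts it adds.
type Artifacts = Record<string, unknown>;
type StageHandler = (input: Artifacts) => Promise<Artifacts>;

// Runs the stages sequentially in table order, merging each stage's output
// into the accumulated artifacts. onStageStart is called before each stage.
async function runPipeline(
  handlers: Record<StageName, StageHandler>,
  initial: Artifacts,
  onStageStart: (stage: StageName) => void = () => {},
): Promise<Artifacts> {
  let artifacts: Artifacts = { ...initial };
  for (const { name } of PIPELINE_STAGES) {
    onStageStart(name);
    artifacts = { ...artifacts, ...(await handlers[name](artifacts)) };
  }
  return artifacts;
}
```

Keeping the stages in a single ordered array means a real orchestrator could use the `onStageStart` hook to record progress before each model runs, without the handlers needing to know about one another.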
Job Lifecycle
uploaded → segmenting → depth → classifying → rigging → composing → done
└→ failed
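
The lifecycle above can be sketched as a small state machine. The status names are taken from the diagram; the helper names (`nextStatus`, `canTransition`, `isTerminal`) are hypothetical. The diagram does not make clear which states can branch to failed, so this sketch assumes any non-terminal status may fail.

```typescript
type JobStatus =
  | "uploaded"
  | "segmenting"
  | "depth"
  | "classifying"
  | "rigging"
  | "composing"
  | "done"
  | "failed";

// The success path, in the order shown in the lifecycle diagram.
const HAPPY_PATH: readonly JobStatus[] = [
  "uploaded",
  "segmenting",
  "depth",
  "classifying",
  "rigging",
  "composing",
  "done",
];

// "done" and "failed" have no outgoing transitions.
function isTerminal(status: JobStatus): boolean {
  return status === "done" || status === "failed";
}

// Returns the next status on the success path, or null for terminal states.
function nextStatus(current: JobStatus): JobStatus | null {
  if (isTerminal(current)) return null;
  return HAPPY_PATH[HAPPY_PATH.indexOf(current) + 1] ?? null;
}

// Assumption: any non-terminal status may move to "failed";
// otherwise only the immediate successor is allowed.
function canTransition(from: JobStatus, to: JobStatus): boolean {
  if (isTerminal(from)) return false;
  return to === "failed" || to === nextStatus(from);
}
```

A check like `canTransition` could guard status writes so that a job never skips a stage or leaves a terminal state.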
Each stage updates the videoJobs/{jobId} Firestore document. The VideoPanel subscribes via onSnapshot and shows real-time progress bars.
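
As a rough illustration of the read side, the sketch below maps a job document to a progress value that a progress bar could render. It assumes the document carries a `status` field holding one of the lifecycle names; that field name, `VideoJobDoc`, and `toProgress` are assumptions made for this example, not the actual schema or VideoPanel code.

```typescript
const STATUS_ORDER = [
  "uploaded",
  "segmenting",
  "depth",
  "classifying",
  "rigging",
  "composing",
  "done",
] as const;

type ProgressStatus = (typeof STATUS_ORDER)[number] | "failed";

// Hypothetical document shape; the real videoJobs/{jobId} schema may differ.
interface VideoJobDoc {
  status: ProgressStatus;
}

interface JobProgress {
  state: "running" | "done" | "failed";
  percent: number | null; // null when the job failed
}

// Converts the current status into a percentage of the success path completed.
function toProgress(doc: VideoJobDoc): JobProgress {
  const { status } = doc;
  if (status === "failed") return { state: "failed", percent: null };
  const index = STATUS_ORDER.indexOf(status);
  const percent = Math.round((index / (STATUS_ORDER.length - 1)) * 100);
  return { state: status === "done" ? "done" : "running", percent };
}

// With the Firebase modular SDK, the subscription might look like this
// (db, jobId, and render are placeholders):
//
//   import { doc, onSnapshot } from "firebase/firestore";
//   const unsubscribe = onSnapshot(doc(db, "videoJobs", jobId), (snap) => {
//     if (snap.exists()) render(toProgress(snap.data() as VideoJobDoc));
//   });
```

Keeping the mapping in a pure function leaves the snapshot listener thin and makes the progress logic testable without a Firestore connection.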