See it in action
One upload. One pipeline. One finished video.
Outbox takes your source material through analysis, scripting, voiceover, alignment, captions, editing, metadata, and publish without you bouncing between tools.
Core pipeline
Every stage handled inside one production system.
Analyze
Understand the footage before a single word is written
Scene detection, topic extraction, and structure mapping give the pipeline context before it drafts script or metadata.
Script
Generate an editable draft from the actual content
Outbox writes a context-aware script from your upload, so the next stages are grounded in what is really on screen.

Voice
Sync narration without bouncing between tools
Choose your voice provider, match tone to the format, and let the system keep the voiceover aligned with the cut.
Captions
Frame-perfect captions generated straight from the script
Alignment and captioning run as dedicated stages so every word lands on the right frame without manual syncing.
Publish
Ship with metadata, chapters, and publish-ready outputs
Metadata, chapters, and a publish-ready package come out of the same run. YouTube integration shipping next.
Workflow modes
Autopilot when you're moving fast. Manual control when it matters.
Autopilot
Upload once and let the full pipeline run
Best for recurring formats, changelog videos, and any workflow where you want the system to carry footage through every stage.
Advanced
Pause after script, edit, and rerun only downstream stages
Keep control over the high-leverage decisions while preserving the speed of the rest of the pipeline.
Assets
Generate visuals for thumbnails and distribution
Use the same run to produce thumbnail concepts, reuse key frames, and keep visual output consistent with the final edit.
Visibility
Track every artifact and inspect every stage
Revision history, stored artifacts, and stage-level status make it easy to review what changed before you rerun or publish.



