Product roadmap

Where Clip Foundry is heading: from a faceless short engine to a full production platform. We're at v1; here's the path to v5.

  1. v1

    Faceless engine

    You are here

    One API call (or MCP) turns a script into a ready-to-post short.

    • Script → scenes, AI voiceover, visuals, beat-synced render
    • 9 styles, multi-voice TTS, 9:16 / 1:1 / 16:9
    • REST + SSE + native MCP server, signed asset delivery
    • Resumable jobs, token billing, cost-safe pricing
  2. v1.5

    Production quality core

    Up next

    Make the output repeatably good, not just fast, and editable scene by scene.

    • Scene consistency engine: shared visual DNA across scenes
    • Quality tiers: draft → standard → premium → cinematic
    • Asset re-ranking: best of several candidates per scene
    • Scene-level rerender (image / voice / pacing): fix one scene, not the whole film
    • Premium audio: voice profiles, mastering, advanced captions
  3. v2

    Story engine & control

    Planned

    From topic → video to a steerable story engine with deliberate structure.

    • Inputs: topic / URL / document / outline → script
    • Story templates (hook → reveal → payoff, myth vs fact, timeline, …)
    • Editable scene planner + retention-aware hook optimizer
    • Expanded visual style families with their own grammar
  4. v3

    Brand kits & mid-form

    Planned

    Your own assets & branding, and the jump from shorts to multi-minute explainers.

    • Brand kits: logo, colors, fonts, intro/outro, CTA, watermark
    • Asset library + per-scene overrides (mix AI with your footage)
    • Extended shorts: 30 / 45 / 60 / 90 / 120s with pacing curves
    • Chapter engine for 3–10 min explainers
  5. v4

    Production platform

    Planned

    project → chapters → scenes → assets → renders, fully API-first and editable.

    • Projects / chapters / scenes / brand-kits as first-class API resources
    • Scene & chapter editor, revisions, branching
    • Timeline patch API: patch a render without a full rerender
    • Job graph, partial retries, cost estimate before run
  6. v5

    Long-form & feedback loop

    Planned

    10–20 min+ video, one-click publishing, and optimization that learns from results.

    • Long-form engine: 10–20 min and beyond
    • Publish connectors: YouTube, TikTok, Instagram Reels
    • Retention / hook / style / voice analytics ingestion
    • Auto-optimization: suggest better hook, style, pacing, voice

Directional, not a dated commitment: priorities shift with what users need most.
