Sora 2 launched September 30, 2025 with synced audio, 25-second clips, and a TikTok-style social app. The leaderboard moved overnight. Runway answered with Gen-4, Pika shipped 2.0, Luma released Dream Machine 2, and Google's Veo 3 arrived inside Gemini for free. The 39 startups on this list ($4.35B in disclosed funding) split clean into three buckets after that: model labs, enterprise avatars, and editor wrappers.
Model labs vs everything else
Runway's $1.43B raised through Series E makes it the longest-tenured creative-AI lab; Gen-4 added consistent characters across shots, the missing primitive for narrative work. Pika ($215M Series B) leans consumer and viral. Kling AI's $1.2B corporate round from Kuaishou keeps a Chinese frontier alive while Western VC pauses. Luma AI ($54M Series B) keeps Dream Machine in the mix despite the smallest cheque on this tier.
Enterprise avatars are a separate business that doesn't need a Sora-class base model. Synthesia's $802M Series E (UK) compounds on 230+ language coverage and Fortune 500 training-and-comms contracts. HeyGen ($65.6M, but reportedly past $100M ARR) is the fast follower with consumer-friendly pricing. D-ID and DeepBrain AI (Korea) own pieces of the same buyer.
Editors and prosumer wrappers
Captions ($100M Series C) automates podcast and short-form cuts, captions, and clip-finding. Descript ($50M Series C) edits video by editing the transcript. Riverside owns the studio-grade recording slot. InVideo AI (India, $35M Series A) wraps text-to-video for non-creator end-users; Leonardo.Ai (Australia) and Krea AI brought real-time generation into image-and-video creative suites.
Music-rights and compute
The music-rights tension is real and unresolved — Sora 2 and Veo 3 both ship with synthesized audio, and licensing for soundtracks is the next litigation front after Suno and Udio. Compute is the structural margin headwind: video is many frames of image-class inference plus temporal-coherence work, and consumer ARPU is not multiples higher than image generation. The teams that survived raised capital before the bar moved (Runway, Synthesia) or wrap generation in workflow software where AI is one feature (Captions, Descript).