Back to Home11/19/2025, 3:28:04 PM

Launch HN: Mosaic (YC W25) – Agentic Video Editing

76 points
53 comments

Mood

calm

Sentiment

positive

Category

tech

Key topics

AI

video editing

YC startup

Hey HN! We’re Adish & Kyle from Mosaic (https://edit.mosaic.so, https://docs.mosaic.so/, https://mosaic.so). Mosaic lets you create and run your own multimodal video editing agents in a node-based canvas. It’s different from traditional video editing tools in two ways: (1) the user interface and (2) the visual intelligence built into our agent.

We were engineers at Tesla and one day had a fun idea to make a YouTube video of Cybertrucks in Palo Alto. We recorded hours of cars driving by, but got stuck on how to scrub through all this raw footage to edit it down to just the Cybertrucks.

We got frustrated trying to accomplish simple tasks in video editors like DaVinci Resolve and Adobe Premiere Pro. Features are hidden behind menus, buttons, and icons, and we often found ourselves Googling or asking ChatGPT how to do certain edits.

We thought that surely now, with multimodal AI, we could accelerate this process. Better yet, an AI video editor could automatically apply edits based off what it sees and hears in your video. The idea quickly snowballed and we began our side quest to build “Cursor for Video Editing”.

We put together a prototype and to our amazement, it was able to analyze and add text overlays based on what it saw or heard in the video. We could now automate our Cybertruck counting with a single chat prompt. That prototype is shown here: https://www.youtube.com/watch?v=GXr7q7Dl9X0.

After that, we spent a chunk of time building our own timeline-based video editor and making our multimodal copilot powerful and stateful. In natural language, we could now ask chat to help with AI asset generation, enhancements, searching through assets, and automatically applying edits like dynamic text overlays. That version is shown here: https://youtu.be/X4ki-QEwN40.

After talking to users though, we realized that the chat UX has limitations for video: (1) the longer the video, the more time it takes to process. Users have to wait too long between chat responses. (2) Users have set workflows that they use across video projects. Especially for people who have to produce a lot of content, the chat interface is a bottleneck rather than an accelerant.

That took us back to first principles to rethink what a “non-linear editor” really means. The result: a node-based canvas which enables you to create and run your own multimodal video editing agents. https://screen.studio/share/SP7DItVD.

Each tile in the canvas represents a video editing operation and is configurable, so you still have creative control. You can also branch and run edits in parallel, creating multiple variants from the same raw footage to A/B test different prompts, models, and workflows. In the canvas, you can see inline how your content evolves as the agent goes through each step.

The idea is that canvas will run your video editing on autopilot, and get you 80-90% of the way there. Then you can adjust and modify it in an inline timeline editor. We support exporting your timeline state out to traditional editing tools like DaVinci Resolve, Adobe Premiere Pro, and Final Cut Pro.

We’ve also used multimodal AI to build in visual understanding and intelligence. This gives our system a deep understanding of video concepts, emotions, actions, spoken word, light levels, shot types.

We’re doing a ton of additional processing in our pipeline, such as saliency analysis, audio analysis, and determining objects of significance—all to help guide the best edit. These are things that we as human editors internalize so deeply we may not think twice about it, but reverse-engineering the process to build it into the AI agent has been an interesting challenge.

Some of our analysis findings: Optimal Safe Rectangles: https://assets.frameapp.ai/mosaicresearchimage1.png Video Analysis: https://assets.frameapp.ai/mosaicresearchimage2.png Saliency Analysis: https://assets.frameapp.ai/mosaicresearchimage3.png Mean Movement Analysis: https://assets.frameapp.ai/mosaicresearchimage4.png

Use cases for editing include: - Removing bad takes or creating script-based cuts from videos / talking-heads - Repurposing longer-form videos into clips, shorts, and reels (e.g. podcasts, webinars, interviews) - Creating sizzle reels or montages from one or many input videos - Creating assembly edits and rough cuts from one or many input videos - Optimizing content for various social media platforms (reframing, captions, etc.) - Dubbing content with voice cloning and lip syncing.

We also support use cases for generating content such as motion graphic animations, cinematic captions, AI UGC content, adding contextual AI-generated B-Rolls to existing content, or modifying existing video footage (changing lighting, applying VFX).

Currently, our canvas can be used to build repeatable agentic workflows, but we’re working on a fully autonomous agent which will be able to do things like: style transfer using existing video content, define its own editing sequence / workflow without needing a canvas, do research and pull assets from web references, and so on.

You can try it today at https://edit.mosaic.so. You can sign up for free and get started playing with the interface by uploading videos, making workflows on the canvas, and editing them in the timeline editor. We do paywall node runs to help cover model costs. Our API docs are at https://docs.mosaic.so. We’d love to hear your feedback!

Mosaic, a YC W25 startup, has launched an agentic video editing platform, likely leveraging AI for automated editing.

Snapshot generated from the HN discussion

Discussion Activity

Very active discussion

First comment

47m

Peak period

23

Hour 3

Avg / period

12.5

Comment distribution50 data points

Based on 50 loaded comments

Key moments

  1. 01Story posted

    11/19/2025, 3:28:04 PM

    4h ago

    Step 01
  2. 02First comment

    11/19/2025, 4:14:58 PM

    47m after posting

    Step 02
  3. 03Peak activity

    23 comments in Hour 3

    Hottest window of the conversation

    Step 03
  4. 04Latest activity

    11/19/2025, 7:16:23 PM

    13m ago

    Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (53 comments)
Showing 50 comments of 53
tonyoconnell
3h ago
1 reply
This is so cool. Good luck with your venture.
adishj
3h ago
Thank you :)
callamdelaney
3h ago
2 replies
Hey, good luck with Mosaic.

Some feedback initially on the landing page, looks great but I thought that there is, for me, too much motion going on on the homepage and the use cases page. May be an unpopular opinion!

cjbarber
3h ago
1 reply
Agreed, homepage was confusing for me also. I tried to scroll around and see a demo. For a product like this that is so visual, I expected to be able to find a 30s demo clip somewhere but couldn't see one on the homepage or product page (and the scrolling on the product page was annoying for me).
adishj
3h ago
the sad part is spent so long on the product page scrolling animation haha

very valid point though — I think a demo clip of a BEFORE vs AFTER immediately somewhere in the hero even or right below it would be helpful

thanks for the feedback

adishj
3h ago
valid points, thanks for the feedback. i had gone for a certain aesthetic but you're right in that it may be a bit too overwhelming.
cjbarber
3h ago
1 reply
I think this is a great endeavor. I was thinking about a channel that I like watching on YouTube. They travel to exotic places by boat and film themselves, nature documentary style. To make good videos requires going to these places, a ton of filming, AND a ton of editing. They put out a video every 2 weeks or so on their trips. I imagine the editing is the hard part.

This is a long winded way of saying that I think creators need what you're making! People who have hours of awesome footage but have to spend dozens of hours cutting it down need this. Then also people who have awesome footage but aren't good at editing or hiring an editor, same thing. I'd love to see someone solve this so that 90th percentile editing is available to all, and then it can be more about who has the interesting content, rather than who has the interesting content and editing skills.

adishj
2h ago
1 reply
thanks! Mosaic can already do the rough cuts for you — so you can upload all your footage from your travel, and prompt it to "make a 2 minute highlight reel of your trip to Japan", for instance.

soon, we also plan to incorporate style transfer, so you could even give it a video from the channel you enjoy watching + your raw footage, and have the agent edit your footage in the same style of the reference video.

mrbluecoat
1h ago
> you can upload all your footage from your travel, and prompt it to "make a 2 minute highlight reel of your trip to Japan"

In relation to the demo requests below, I think this would be a good example of how an average person might use your platform.

penne_pastaa
3h ago
1 reply
this is so cool, can we see some demos of edits you'd make with it?
adishj
2h ago
thanks! check out the demo video here of the latest version of the interface: https://screen.studio/share/SP7DItVD

i playback parts of the cinematic edit I made to the conversation between Dwarkesh Patel and Satya Nadella (e.g. added cinematic captions, motion graphics)

i can post the full edit as well if you're interested

jaccola
3h ago
2 replies
Very cool. It definitely feels to me that the power of pro tools should be available to more people with AI.

Would have been nice if there was a killer demo on your landing page of a video made with Mosaic.

bluelightning2k
1h ago
1 reply
The problem is, any video demo of a tool like this is just an entirely unrelated video.
adishj
1h ago
can you clarify what you mean here? check out this demo video: https://screen.studio/share/SP7DItVD
adishj
2h ago
that's our perspective as well.

a lot of tooling is being built around generative AI in particular, but there's still a big gap for people that want to share their own stories / experiences / footage but aren't well-versed with pro tools.

valid feedback on the landing page — something we'll add in.

BolexNOLA
2h ago
2 replies
> We got frustrated trying to accomplish simple tasks in video editors like DaVinci Resolve and Adobe Premiere Pro. Features are hidden behind menus, buttons, and icons, and we often found ourselves Googling or asking ChatGPT how to do certain edits.

Hidden behind a UI? Most of the major tools like blade, trim, etc. are right there on the toolbars.

> We recorded hours of cars driving by, but got stuck on how to scrub through all this raw footage to edit it down to just the Cybertrucks.

Scrubbing is the easiest part. Mouse over the clip, it starts scrubbing!

I’m being a bit tongue in cheek and I totally agree there is a learning curve to NLE’s but those complaints were also a bit striking to me.

adishj
2h ago
1 reply
hey! You're right that most of the basic tools like splitting / trimming are available right in the timeline. but things like adding a keyframe to animate a counter, for instance, I had no idea where to go or how to start.

Scrubbing is easy enough when you have short footage, but imagine scrubbing through the footage we had of 5 hours of cars driving by, or maybe a bunch of assets. This quickly becomes very tedious.

BolexNOLA
1h ago
I don’t need to imagine, I do it haha but again I was being tongue in cheek. I personally would love an effective tool that can mark and favorite clips for me based on written prompts. Would save me an awful amount of time!
andrewmlevy
1h ago
teddyh
2h ago
1 reply
Not related to NCSA Mosaic (RIP).
adishj
2h ago
if you take a snippet of Ben Horowitz's interview out of context, he has a lot of good things to say about our product :)
shivvtrivedi
2h ago
1 reply
Mosaic team dev here Hanging in the comments all day and pushing updates as fast as we can -really appreciate the feedback!
mberlove
1h ago
1 reply
Is there a way to keep up to date on updates and new announcements? TIA.
adishj
1h ago
yes! please join our discord https://discord.gg/26SAZzBTaP or follow us on X https://x.com/mosaic_so to keep up to date on updates
lava123
2h ago
1 reply
YOOOO, this is super awesome. Love this for you all. Lets make life easier for more creators.
adishj
1h ago
thanks!
bluelightning2k
1h ago
1 reply
Good luck. I've dabbled with this myself and ultimately decided that DaVinci Resolve would end up doing this natively. But then again they haven't yet so who knows!

Good luck with it, sincerely.

adishj
1h ago
thanks! curious what you started dabbling with and if you have any thoughts to share :)
zkmon
1h ago
3 replies
I just clicked the link and encountered a non-scrollable, dark, fixed content pane with loads of flickering images and scrolling text with random font sizes without much meaning. I felt imprisoned, subjected to unexpected suffering, can't scroll away, got scared and raced for the window close button, and then breathed easy.
pelagicAustral
1h ago
1 reply
They really managed to handcraft a unique user experience, that's for sure.
adishj
1h ago
we did but the landing page seems to be detracting from it — head directly to https://edit.mosaic.so to try the actual canvas interface
adishj
1h ago
2 replies
seems like the landing page is detracting from the main product, this is good feedback so thanks! For now, avoid the scaries and head directly to https://edit.mosaic.so to try the actual canvas interface
dang
1h ago
I've put the /edit and /docs links in the first sentence above to soften the blow as well :)
conductr
1h ago
Since video is your thing, I feel like you need to just make a very edited demo reel and put all your energy into trying to get people to watch that video. Meaning, remove almost all text and bloat from the site and just show us all the cool stuff the product does for/to video editing. Distill it to 60-120 seconds and put that on your landing, hell put it on auto play if you want to, so long as it's clear that is the one thing I'm supposed to be paying attention to
deepspace
1h ago
2 replies
I had the same reaction. About what you would expect from a team steeped in the Tesla mindset.
dang
1h ago
Please don't cross into personal attack. We're trying for the opposite on this site.

https://news.ycombinator.com/newsguidelines.html

adishj
1h ago
thanks for the feedback — you can head directly to https://edit.mosaic.so to try the actual canvas interface
ack210
1h ago
2 replies
I just signed up for a Creator plan, but it looks like the automated "Thank you for being a Mosaic Creator" email going out is not configured correctly. Instead of having my company name, it referenced a different business name and description (that seems to exist/be accurate, so not a placeholder).
adishj
1h ago
This has been fixed now.
adishj
1h ago
Hey! Thanks for calling this out — looking into what happened here & fixing right now.
shambu2k
1h ago
1 reply
Damn, you beat me to it. I was building something similar but got too caught up optimizing the context extraction. I actually ended up building a full spec for it—basically a PoC of "grep for videos."

My end goal was to let an agent make semantic changes (e.g., "remove the parts where the guy in the blue dress is seen") by simply grepping the context spec for the relevant timestamps and using ffmpeg to cut them out.

How are you extracting context from videos?

adishj
59m ago
how would this be different from vector embeddings / semantic search?
heyyfurqan
35m ago
1 reply
Damn this is good.
adishj
30m ago
Thank you! :)
sashagoncharov
13m ago
best of luck guys!!
sails
55m ago
I’ve had a lot of fun with Remotion and Claude Code for CLI video editing. I’ve been impressed with how much traditional video editing I can manage.

I will be checking this out!

moinism
43m ago
Hey, this is super cool. congrats on the product and the launch!

I'm building something exactly similar and couldn't believe my eyes when I saw the HN post. What i'm building (chatoctopus.com) is more like a chat-first agent for video editing, only at a prototype stage. But what you guys have achieved is insane. Wishing you lots of success.

to healthy competition!

danishSuri1994
54m ago
Really interesting direction. The node-based canvas feels like a more scalable abstraction for video automation than the usual chat-only interface. I’m curious how you’re handling long-form content where temporal context matters (e.g., emotional shifts, pacing, narrative cues).

Multimodal models are good at frame-level recognition, but editing requires understanding relationships between scenes, have you found any methods that work reliably there?

anthonySs
50m ago
As a creator who films long form content, editing (specifically clipping for short form) is such a nightmare - this solves such a huge problem and the ui is insanely clean.

Will be using this a ton in the future

echelon
1h ago
Can you make this a desktop app?

I'm really tired of editing videos in the cloud. I'm also also tired of all these AI image and video tools that make you work over a browser. Your workflow seems so second class buried amongst all the other browser tabs.

I understand that this is how to deploy quickly to customers, but it feels so gross working on "heavy" media in a browser.

3 more comments available on Hacker News

ID: 45980760Type: storyLast synced: 11/19/2025, 7:26:56 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.