Browser FFmpeg Studio for AI Editing

Fetch supported video URLs (X/Twitter, YouTube, Dailymotion, Vimeo), extract frames, and convert media directly in your browser.

If a source requires login, the fetch may fail. In that case, download the media yourself and use local upload as a fallback.

Drop a video here

or use the picker

Source is shared across all tabs. Choose media once here, then switch tabs for trim, transcode, frames, audio, or advanced workflows.

Output name

FFmpeg args (leave out the input and output; the input file is always named input)

-vf "scale=1280:-2" -c:v libx264 -crf 23 -preset medium

Presets

Examples:
extract audio: -vn -c:a libmp3lame -q:a 2
trim: -ss 00:00:01 -to 00:00:04 -c copy
GIF: -vf fps=12,scale=640:-1:flags=lanczos -loop 0
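
As a rough illustration of what "args without input/output" means, the sketch below wraps a user-supplied argument string with a fixed input name and the chosen output name. It only assumes the behavior described above; `build_command` and its parameters are hypothetical names for this example, not the page's actual code.

```python
import shlex


def build_command(user_args: str, output_name: str, input_name: str = "input") -> list[str]:
    """Wrap user-supplied FFmpeg args with a fixed input and an output name.

    Hypothetical helper: assumes the input is always called "input" and the
    output name comes from a separate field, as the form labels suggest.
    """
    if not output_name:
        raise ValueError("an output name is required")
    # shlex.split honors shell-style quoting, so "scale=1280:-2" stays one token.
    return ["ffmpeg", "-i", input_name, *shlex.split(user_args), output_name]


if __name__ == "__main__":
    print(build_command('-vf "scale=1280:-2" -c:v libx264 -crf 23 -preset medium', "out.mp4"))
```

Building a list of tokens rather than one long string avoids a second round of shell quoting, which is where filter expressions usually break.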

This exposes most capabilities compiled into this ffmpeg.wasm build (filters, codecs, trimming, muxing, etc.). Availability depends on what codecs/options are present in the bundled build.

Run FFmpeg in your browser without the CLI

Use URL fetch + yt-dlp server-side and ffmpeg.wasm client-side to pull media, scrub timelines, extract frames, and convert image sequences to MP4/WEBM/GIF.

Same result, less friction

yt-dlp 
ffmpeg -i input.mp4 -vf "select=eq(n\,42)" -vframes 1 frame.png
ffmpeg -framerate 12 -i frame%04d.png -c:v libx264 out.mp4
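
To unpack the two ffmpeg commands above: `select=eq(n\,42)` keeps the frame whose zero-based index is 42, and `frame%04d.png` is a printf-style pattern for numbered files. The sketch below shows how a timestamp might map to that index and how the arguments fit together. It assumes a constant frame rate, and the function names are hypothetical.

```python
import math


def frame_index(seconds: float, fps: float) -> int:
    """Zero-based index of the frame shown at `seconds`, assuming constant fps."""
    return math.floor(seconds * fps)


def select_frame_args(n: int, output: str = "frame.png") -> list[str]:
    """Arguments that export only frame `n`, mirroring the first command above."""
    # The backslash escapes the comma so the filter parser does not treat it
    # as a separator between filters.
    return ["-vf", f"select=eq(n\\,{n})", "-vframes", "1", output]


def sequence_name(pattern: str, n: int) -> str:
    """Expand a printf-style pattern such as frame%04d.png for frame number n."""
    return pattern % n


if __name__ == "__main__":
    n = frame_index(3.5, 12)
    print(n, select_frame_args(n), sequence_name("frame%04d.png", n))
```

For variable-frame-rate sources the index computed this way is only an approximation, so seeking by timestamp is usually the safer choice.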

No installs, no flags, no setup. Just the frame you want, fast.

When this makes sense

You’re on a phone, tablet, or locked-down machine
You don’t have ffmpeg handy
You’re away from your main box
You only need one frame
You want to send it straight to ChatGPT or Grok

How it works

The server uses yt-dlp to resolve and fetch media from supported platforms. From there, everything runs locally in your browser with ffmpeg.wasm: decode, scrub, export PNG.

The server never touches frames or images. It only retrieves the video.
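
For comparison, the "scrub, export PNG" step corresponds roughly to a timestamp seek followed by a single-frame export on the command line. The sketch below builds that equivalent command; it is not the page's internal code, and `format_timestamp`, `png_export_args`, and the file names are assumptions made for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS.mmm, a time syntax ffmpeg accepts for -ss."""
    if seconds < 0:
        raise ValueError("timestamp must be non-negative")
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def png_export_args(seconds: float, source: str = "input.mp4", output: str = "frame.png") -> list[str]:
    """CLI equivalent of scrubbing to a time and saving one PNG frame."""
    # Placing -ss before -i seeks in the input before decoding starts.
    return ["ffmpeg", "-ss", format_timestamp(seconds), "-i", source, "-frames:v", "1", output]


if __name__ == "__main__":
    print(png_export_args(83.5))
```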

Why PNG for AI editing

AI tools like ChatGPT and Grok work best with still images. A clean PNG frame is easy to annotate, modify, or remix with a prompt.

FAQ

Why not just use yt-dlp and ffmpeg?
If you’re set up already, do it. This is for when you want the same result with zero friction.

Why might “Load URL” fail?
Some sources need auth/geo access. If so, download the media yourself and drop it above.

What runs server-side?
Only yt-dlp to fetch the MP4. All decoding and frame extraction happens in your browser via ffmpeg.wasm. Frames never leave your device.

Is this free?
Yes. No account, no signup.
