Initializing... drag & drop files here
Supports: 3G2, 3GP, 3GPP, ASF, AV1, AVCHD +31 more
2.100 (2 seconds and 100 ms) to capture exactly one still. Switch to Multiple Screenshots to extract a sequence at a chosen capture rate (0.1s, 0.2s, 0.3s, 0.5s, 1s, 2s, 3s, 4s, 5s, 6s, 7s, 8s, 9s, or 10s per frame).Every video — whether a 4K iPhone clip, an old AVI camcorder dump, a Zoom recording, a security DVR export, or a YouTube download — is just a sequence of still images played fast. Pulling those stills out as JPGs gives you images you can email, post, embed, archive, and edit anywhere a JPG is understood (which is everywhere). JPG is the right pick when file size matters and the source is photographic; for sharp text and graphics use video to PNG instead.
12.450 seconds for an insurance claim, police report, or HOA submission. JPG keeps the file small enough to email.| Property | Video (MP4 / MOV / AVI / MKV / WebM) | JPG (JPEG) |
|---|---|---|
| Type | Multi-frame container with audio | Single still image |
| Typical codecs | H.264, H.265/HEVC, AV1, VP9, MPEG-2 | DCT-based lossy compression |
| Audio track | Yes | No |
| Plays in browsers | Codec-dependent (some MP4/WebM yes, AVI/MKV often no) | Universal |
| File size, 1 min 1080p | 50-300 MB | 150-500 KB per frame |
| Embeds in docs and slides | Poor — varies by platform | Universal |
| Best for | Storage and playback of full motion | Thumbnails, evidence, references, archives |
| Goal | Frame selection mode | Capture rate / time |
|---|---|---|
| One thumbnail / poster | Specific Frame | Pick the timestamp (e.g. 00:05.000) |
| Evidence still from CCTV / dashcam | Specific Frame | Exact incident time, e.g. 12.450 |
| Storyboard contact sheet | Multiple Screenshots | 5 or 10 seconds per frame |
| Editing image sequence | Multiple Screenshots | 0.1s (10 fps) or 0.2s (5 fps) |
| Rough video summary | Multiple Screenshots | 1 second per frame |
| Frame-by-frame sports analysis | Multiple Screenshots | 0.1s (10 fps) |
| Long meeting / lecture review | Multiple Screenshots | 5 or 10 seconds per frame |
Use Specific Frame mode and enter the time in seconds with millisecond precision. For example, 2.100 means 2 seconds and 100 milliseconds into the video. Use this when you need the exact moment of a sports highlight, the frame just before a crash on a dashcam, the moment a person enters a CCTV view, or the instant a goal crosses the line.
Depends on the capture rate. At 1 second per frame you'll get 600 JPGs. At 0.1s per frame (10 fps) you'll get 6,000 JPGs — fine for editing pipelines but heavy in a browser. At 5 seconds per frame you'll get 120 stills — a manageable contact sheet. Pick the slowest rate that still captures the moments you need; you can always re-run with a denser interval on a clip you've trimmed.
MP4, MOV, AVI, MKV, WebM, WMV, FLV, MTS, M2TS, M2V, M4V, MPG, MPEG, MPEG-2, 3GP, 3GPP, ASF, AV1, AVCHD, DIVX, DV, F4V, MJPEG, MXF, OGV, RM, RMVB, SWF, TS, VOB, WTV, XVID, and more — 35+ containers and the codecs inside them. If your media player can play it, frame extraction usually works even when in-browser preview can't.
JPG for photographic content (live-action footage, faces, landscapes, drone shots, sports) and when file size matters. PNG for screenshots, screen recordings, slide-deck exports, computer-generated content, and when you need pixel-exact text or graphics. PNG is lossless but typically 3-5x larger. For lossless extraction see video to PNG.
No — JPG is a still image format with no audio support. The audio track is discarded during frame extraction. If you need the audio separately, see video to MP3 for a parallel audio export.
The extracted frame matches the video's actual encoded resolution, not the upscaled playback dimensions. A "1080p" web download might really be 1280x720 stretched, and a "720p" old-camcorder AVI is often 720x404 letterboxed. Use the resolution presets to upscale to a larger output, or pick a higher Image Quality preset (Very High / Highest) to keep more detail. Upscaling interpolates pixels — it can't add detail that wasn't captured.
Set Multiple Screenshots to 0.1 seconds per frame (10 fps) for the densest extraction available. For true every-frame extraction matching native rates (24, 25, 29.97, 30, 50, or 60 fps), 0.1s is close enough for most editing workflows. A 5-minute clip at 30 fps native equals 9,000 frames — plan storage and ZIP-download time accordingly.
Files are processed in your browser session via secure WebAssembly decoding. Frames are extracted client-side wherever possible. No watermarks, no sign-up. If you'd rather have an animated output instead of stills, see video to GIF.
Modern browsers don't natively play every container — AVI, MKV, and older MPEG-2 streams especially. Frame extraction decodes the video stream directly, independently of any browser playback layer, so codec quirks that block preview don't usually block extraction here.