Initializing... drag & drop files here
Supports: MP4, M4V
2.100 (2 seconds and 100 ms) to capture exactly one still. Switch to Multiple Screenshots to extract a sequence at a chosen capture rate (0.1s, 0.2s, 0.3s, 0.5s, 1s, 2s, 3s, 4s, 5s, 6s, 7s, 8s, 9s, or 10s per frame).MP4 is the default video container almost everywhere — iPhones and Androids record to MP4, screen recorders save MP4, YouTube and Vimeo serve MP4, and most editing tools export MP4. Inside, the video is usually H.264 or H.265/HEVC; newer iPhones (since iOS 11) prefer HEVC for smaller files. Pulling JPEG stills out of an MP4 gives you images that drop cleanly into any document, slide, post, or archive — without re-encoding the whole video. JPEG is the right pick when file size matters and the source is photographic; for sharp text and screen recordings, see MP4 to PNG.
12.450 seconds for an insurance claim, police report, or HOA submission. JPEG keeps the file small enough to email or attach to a portal.| Property | MP4 (MPEG-4 Part 14) | JPEG (JPG) |
|---|---|---|
| Type | Multi-frame video container with audio | Single still image |
| Released | 2003 (ISO/IEC 14496-14) | 1992 (JPEG standard) |
| Typical codecs | H.264, H.265/HEVC, AV1, MPEG-4 | DCT-based lossy compression |
| Audio track | Yes (AAC, AC-3, ALAC) | None |
| Plays in browsers | Universal for H.264; H.265 limited to Safari and newer Edge | Universal |
| File size, 1 minute 1080p | 50-200 MB | 150-500 KB per frame |
| Embeds in docs and slides | Inconsistent across platforms and email | Universal |
| Best for | Storage and playback of full motion | Thumbnails, evidence, references, archives |
| Goal | Frame selection mode | Capture rate / time |
|---|---|---|
| One thumbnail / poster | Specific Frame | Pick the timestamp (e.g. 00:05.000) |
| Evidence still from dashcam / doorbell | Specific Frame | Exact incident time, e.g. 12.450 |
| Single photo from a phone clip | Specific Frame | The exact moment, e.g. 3.700 |
| Storyboard contact sheet | Multiple Screenshots | 5 or 10 seconds per frame |
| Editing image sequence | Multiple Screenshots | 0.1s (10 fps) or 0.2s (5 fps) |
| Rough video summary | Multiple Screenshots | 1 second per frame |
| Frame-by-frame sports analysis | Multiple Screenshots | 0.1s (10 fps) |
| Long meeting / lecture review | Multiple Screenshots | 5 or 10 seconds per frame |
Use Specific Frame mode and enter the time in seconds with millisecond precision. For example, 2.100 means 2 seconds and 100 milliseconds into the MP4. This is the cleanest way to grab a sports highlight, the frame just before a crash on a dashcam, the exact moment a person walks past a camera, or a single still from a phone clip without scrubbing the whole timeline.
Depends on the capture rate. At 1 second per frame you'll get 600 JPEGs. At 0.1s per frame (10 fps) you'll get 6,000 JPEGs — fine for editing pipelines but heavy in a browser. At 5 seconds per frame you'll get 120 stills — a manageable contact sheet. Pick the slowest rate that still captures the moments you need; you can always re-run with a denser interval on a clip you've trimmed.
Yes. iPhones running iOS 11 or later record HEVC by default unless you switch to "Most Compatible" in Camera settings, and many newer Android phones do the same. HEVC decodes here for frame extraction even on browsers (like Firefox on Linux) that can't natively play HEVC video. 10-bit HDR sources also extract — note that JPEG is 8-bit per channel, so HDR highlights are tone-mapped to standard dynamic range.
The extracted frame matches the video's actual encoded resolution, not the upscaled playback dimensions. A "1080p" web download might really be 1280×720 stretched, and a YouTube re-upload often re-encodes lower than the original. Use the resolution presets to upscale to a larger output, or pick a higher Image Quality preset (Very High / Highest) to keep more detail. Upscaling interpolates pixels — it can't add detail that wasn't captured.
JPEG for photographic content (live-action footage, faces, landscapes, drone shots, sports) and when file size matters. PNG for screenshots, screen recordings, slide-deck exports, computer-generated content, and when you need pixel-exact text or graphics. PNG is lossless but typically 5-10× larger than the equivalent JPEG. See MP4 to PNG for lossless extraction.
No — JPEG is a still image format with no audio support. The audio track (typically AAC in an MP4) is discarded during frame extraction. If you need the audio separately, see MP4 to MP3 for a compact audio export or MP4 to WAV for an uncompressed one.
Files are processed in your browser session via secure WebAssembly decoding wherever possible. Frames are extracted client-side — no watermarks, no sign-up. If you'd rather have an animated output instead of stills, see MP4 to GIF.
Set Multiple Screenshots to 0.1 seconds per frame (10 fps) for the densest extraction available. For true every-frame extraction matching native rates (24, 25, 29.97, 30, 50, or 60 fps), 0.1s is close enough for most editing workflows. A 5-minute 30 fps clip equals 9,000 frames at native rate — plan storage and ZIP-download time accordingly.
Frames extract in your browser session via WebAssembly. Smaller MP4s (under ~2 GB, like a phone clip or short screen recording) extract quickly. Multi-hour 4K recordings are bound by your device's RAM and CPU rather than a server upload limit. For very large files, consider trimming the relevant scene first or extracting at a sparser interval (5s, 10s) to keep the JPEG count manageable.