Initializing... drag & drop files here
Supports: MJPEG
This page extracts the audio track from an MJPEG file and decodes it to uncompressed WAV. Read this first, because there is an important catch: MJPEG (Motion JPEG) is a video-only codec — it compresses each frame as a separate JPEG image and carries no audio of its own. You only get sound out if your file is an MJPEG-in-a-container (an .avi or QuickTime .mov from an old camcorder, webcam, or IP camera) that also muxed a separate audio track. If the source is a pure MJPEG video stream, the resulting WAV will be silent — and that is the format, not a bug in the converter.
WAV is a container for raw pulse-code-modulation (PCM) samples, so the settings that matter are bit depth, sample rate, and channel count. Most old MJPEG camcorders and webcams stored audio as uncompressed PCM at a low sample rate (often 8–24 kHz), so re-sampling upward gains nothing — it just makes a bigger file from the same information.
Keep in mind WAV stores file size in a 32-bit header field, so a single WAV is capped at roughly 4 GiB — about 6.8 hours of 44.1 kHz 16-bit stereo. For an extracted audio track that is rarely a concern, but very long surveillance recordings can hit it.
.avi, our AVI to WAV extractor reads the AVI container directly.If your MJPEG genuinely has no audio track, no tool can extract sound from it — the data does not exist in the file. Check the source: MJPEG from IP cameras, dashcams, microscopes, and many screen recorders is silent by design, while MJPEG muxed into an AVI or .mov from a consumer camcorder usually does carry a PCM track. If you also want to keep the video, convert the whole file to a modern format with our MJPEG to MP4 converter instead of stripping the audio out. And if the file is DRM-protected or corrupted, it may fail to decode entirely; in that case repair or re-export the source before converting.
Not by itself. MJPEG is a video-only codec that stores each frame as an independent JPEG image, with no audio stream. You only get sound when the MJPEG video is wrapped in a container — typically an AVI or QuickTime .mov — that also holds a separate audio track. The U.S. Library of Congress format registry describes MJPEG as video that is "wrapped" with other chunks, including audio, inside formats like AVI and QuickTime. A bare MJPEG stream produces a silent WAV.
WAV stores uncompressed PCM samples, so it is lossless — nothing is thrown away. That makes it the right choice when you want a master copy for editing, transcription, or archiving. MP3 and AAC are smaller because they discard inaudible detail, which is better when you only need to share or stream the audio. Since the audio inside old MJPEG files is already PCM, exporting to WAV simply re-wraps it without a second lossy pass.
Leave both on "Original" if you want an exact copy. Old MJPEG camcorders and webcams typically recorded PCM audio at a low sample rate, so up-sampling to 48 kHz adds file size without adding information. If you need CD-quality output for a project, choose PCM S16LE at 44.1 kHz; for editing headroom, PCM S24LE or S32LE preserves more bit depth.
No. WAV with PCM is uncompressed, so the extracted audio is a bit-faithful copy of whatever the container stored. If the result sounds low-fidelity, that reflects how the original device captured the sound — not the conversion. WAV cannot add detail the source never had.
PCM is uncompressed, so size scales with sample rate, bit depth, channels, and length. As a rough guide, 44.1 kHz 16-bit stereo is about 10 MB per minute. A single WAV is also capped near 4 GiB by its 32-bit size header — roughly 6.8 hours of CD-quality stereo — which only matters for very long recordings.
In our testing, a short MJPEG-in-AVI clip with a PCM track extracts to WAV in a couple of seconds because the audio is simply demuxed and re-wrapped rather than re-encoded. Your file is uploaded over an encrypted connection, processed on our servers, and deleted automatically a few hours after conversion — no sign-up, no watermark, and never shared or made public.