Initializing... drag & drop files here
Supports: XVID
.avi file into the drop zone or click "+ Add Files." Multiple files can be queued and converted in one batch.Xvid is an open-source video codec library — forked from OpenDivX in 2001 and released under the GPL — that implements MPEG-4 Part 2 Advanced Simple Profile, the codec used inside most legacy .avi rips from the 2000s. The audio track inside an Xvid AVI is almost always MP3 (often VBR) or AC-3 (Dolby Digital) — both lossy. Re-extracting that audio as WAV does not recover what the lossy encoder discarded, but it does freeze the current waveform as uncompressed PCM so downstream edits, mixes, and exports do not stack additional generation loss on top.
| Property | Xvid in AVI | WAV |
|---|---|---|
| Type | Video container (Microsoft AVI) with MPEG-4 ASP video + MP3/AC-3 audio | Audio container (Microsoft RIFF) holding raw PCM |
| Released | Xvid forked from OpenDivX: 2001 (GPL); AVI: November 1992 | RIFF/WAVE specification: August 1991 (Microsoft / IBM) |
| Compression | Lossy video; lossy audio (typically MP3 VBR or AC-3) | None (uncompressed PCM); optional ADPCM is rarely used |
| Typical bitrate | 1–2 Mbps video + 128–192 kbps MP3 audio | 1411 kbps for 44.1 kHz / 16-bit / stereo |
| File size (1 min stereo, 44.1 kHz / 16-bit) | ~12–18 MB total (video + audio) | ~10.54 MB audio only |
| Hard size cap | 4 GiB (legacy AVI 32-bit index) | 4 GiB (RIFF 32-bit size field; ~6.8 h of CD-quality) |
| Editing friendliness | Decode + re-mux required | Sample-accurate; native DAW format |
| When to pick it | Playback of legacy video rips | Audio editing, mastering, archival, ML training |
| Target use | Sample rate | Channels | File size per minute |
|---|---|---|---|
| Voice memo / speech-to-text | 16,000 Hz | Mono | ~1.83 MB |
| Podcast dialogue editing | 44,100 Hz | Mono | ~5.04 MB |
| Music / general DAW work (CD spec) | 44,100 Hz | Stereo | ~10.54 MB |
| Video post-production / film | 48,000 Hz | Stereo | ~11.44 MB |
| ML voice training (RVC / XTTS) | 22,050–48,000 Hz | Mono | ~2.52–5.72 MB |
(Sizes are 16-bit PCM; multiply by 1.5× for 24-bit, 2× for 32-bit float.)
Xvid AVIs ship lossy audio — usually MP3 around 128–192 kbps, sometimes AC-3 at 192–448 kbps. WAV stores raw PCM at roughly 1,411 kbps for 44.1 kHz / 16-bit / stereo, so the audio-only WAV is typically 7–10× the size of the embedded MP3. Picking mono and 22,050 Hz roughly quarters the WAV size when source quality does not warrant CD-grade.
No. The Xvid container's audio was already encoded lossily, so the data missing after MP3 or AC-3 quantization stays missing. WAV preserves whatever waveform comes out of the decoder — meaning subsequent edits, normalization, EQ, and exports do not pile on additional generation loss. That is the practical reason to convert.
Match the sample rate to where the audio is going. Music distribution, CD masters, and most streaming services target 44,100 Hz. Anything that has to lock back to picture (YouTube uploads, Premiere / Resolve / Final Cut sessions, broadcast) wants 48,000 Hz. Resampling between the two adds tiny but non-zero artifacts, so pick once and stick with it.
Keep "Original" if you do not know the source rate — the converter reads it from the AVI header and avoids resampling. Force a rate when you are normalizing a batch (every podcast episode at 44.1 kHz, every voice-training clip at 22.05 kHz) or when the destination tool only accepts one rate.
Yes. Use the Trim controls to set a start time and duration before converting. This matters because RIFF/WAVE uses a 32-bit unsigned size field, so any single WAV is hard-capped at 4 GiB — about 6.8 hours of 44.1 kHz / 16-bit stereo. Long lectures, concerts, or surveillance captures will hit that ceiling; trimming or splitting first avoids a corrupt or truncated output.
WAV when the destination tool wants uncompressed PCM (most DAWs, sample players, ML training scripts). FLAC when you want bit-identical lossless audio at roughly 40–60% of the WAV size for archival or transfer. The two are mathematically equivalent in audio content; FLAC just packs it with an entropy coder.
Pick Xvid to MP3 when the result is going to a phone, a podcast feed, or any room where storage and bandwidth matter more than fidelity. Pick WAV when the file is heading into an editor, a sampler, or long-term storage where you do not want to re-encode again later.
Yes — WAV/PCM is decoded natively by iOS Files and Voice Memos, Android, every major desktop OS, and by <audio> elements in Chrome, Firefox, Edge, and Safari. The two practical caveats are file size (a 1-hour stereo CD-quality WAV is roughly 600 MB) and the 4 GiB ceiling. For phone playback, WAV to MP3 afterward keeps file size manageable.
.divx files or other AVI rips, not just Xvid?Yes. Xvid and DivX are sibling MPEG-4 ASP encoders and ship in the same AVI container; the audio track is decoded the same way. If your file extension is .avi rather than .xvid, AVI to WAV is the same pipeline. After conversion you can also trim the WAV further or compress it if it overruns disk.
It depends on the channel setting. Leave "Audio Channel" on Original to preserve the source layout where the decoder supports it; pick Stereo to downmix 5.1 to L/R using standard ITU-R BS.775 coefficients, or Mono to sum everything into a single channel. Most older Xvid TV rips ship stereo MP3 anyway, so the surround question only comes up with DVD-sourced AC-3 captures.