Initializing... drag & drop files here
Supports: CAF
CAF is Apple's Core Audio Format — an audio container, not a video file. MKV (Matroska) is an open, royalty-free multimedia container, announced in December 2002, that wraps a moving picture — usually an H.264, H.265, VP9, or AV1 video track — alongside audio and subtitle streams. A standard .caf holds only sound — a voice memo, a GarageBand loop, an iMessage audio clip — so there is no picture or motion inside it to encode into an MKV video. This tool is wired as a video target (it defaults to a video codec and shows controls for Quality Preset, resolution, bitrate, and trim), and an audio-only file has no frames to fill those with. This page explains what a CAF really is and points you to the tools that do what you actually want.
| Property | Value |
|---|---|
| Format name | Core Audio Format (CAF) |
| Type | Audio container (no video, no image) |
| Developer | Apple |
| Introduced | 2005, with Mac OS X 10.4 (Tiger) |
| Internet media type | audio/x-caf |
| Typical payloads | Linear PCM, Apple Lossless (ALAC), AAC, and other audio codecs |
| Maximum size | No 4 GB cap — uses 64-bit file offsets, so one file can hold an extremely long recording |
| Common sources | GarageBand / Logic Pro / Soundtrack Pro loops and sound effects, iMessage audio messages |
| Best replacement for | AIFF and WAV when a lossless container without the 4 GB limit is needed |
In the CAF specification, the word "frame" means an audio sample frame — "a set of samples representing one sample for each channel," intended to be played together (simultaneously) — not a video frame. Apple's spec even notes that this definition "might be different from the use of the term 'frame' by codecs, video files, and audio or video processing applications." That naming overlap, plus MKV's job as a video container, is the only reason a "CAF to MKV" route appears in some format lists at all. There is no sequence of pictures inside an audio file to turn into video.
| Property | Value |
|---|---|
| Format name | Matroska Multimedia Container (MKV) |
| Type | Video container (video track plus audio and subtitle tracks) |
| Developer | The Matroska project, forked from the Multimedia Container Format (MCF) |
| Announced | 6 December 2002 |
| Built on | EBML (Extensible Binary Meta Language), a binary derivative of XML |
| Internet media type | video/matroska (also seen as video/x-matroska) |
| Common video codecs | H.264, H.265 (HEVC), VP9, AV1, and many others |
| Related extensions | .mka is the audio-only Matroska container; .mkv is the video container |
| Needs a video stream? | For a .mkv, yes — it is the picture-carrying variant; an audio-only input has nothing to interleave |
Because MKV (.mkv) is the picture-carrying member of the Matroska family and a standard CAF describes only sound, there is no video payload to transcode between them. The audio-only Matroska variant is .mka, not .mkv — and this tool targets .mkv, which expects a video track.
.caf does not get on Windows..caf) and convert that to MKV. A Core Audio file has no frames, so a video must already exist to wrap as Matroska.Because an MKV (.mkv) is the picture-carrying Matroska container and a standard .caf is pure audio — there are no images inside to wrap as a video track. This tool defaults to a video codec, so an audio-only input has nothing to fill the picture with. The job only produces a usable MKV if the specific file you uploaded happens to carry a real video stream, which ordinary Core Audio recordings do not.
CAF (Core Audio Format) is an audio container developed by Apple and introduced in 2005 with Mac OS X 10.4 (Tiger), using the media type audio/x-caf. It can hold Linear PCM, Apple Lossless (ALAC), AAC, and other audio codecs. It is not a video or image format, so it cannot be turned into an MKV video.
The route comes from a format-list quirk: MKV is one of xconvert's video output targets, and the tool's metadata pairs the .caf input with it the same way it pairs every video format. That pairing does not mean a Core Audio recording contains video — it does not. We keep the page to explain the mismatch and send you to the audio targets that actually do what you want, rather than silently producing an empty file.
Yes — the Matroska family includes .mka, an audio-only container, which is a sensible home for sound. But this tool targets .mkv, the video variant that expects a picture track, not .mka. For a portable audio file that nearly every player reads, convert to MP3, WAV, or FLAC instead — these are far more widely supported for music than a Matroska audio file.
For everyday use, convert to MP3 for the smallest, most compatible file, or to WAV to keep it lossless and uncompressed. In our testing, a short ALAC-based .caf voice clip converted to MP3 lands in the low hundreds of kilobytes while staying clearly intelligible. See the full set of audio targets on the audio converter.
No. WAV is effectively capped near 4 GB by its 32-bit size fields — which, as Apple notes, "might represent as little as 15 minutes of audio" — but CAF uses 64-bit file offsets, eliminating that practical limit, so a single .caf file can store an extremely long recording. That capacity is one of the reasons Apple created the format as a successor to AIFF and WAV.
Uploads travel over an encrypted connection, are processed on our servers, and are deleted automatically a few hours after conversion. There is no sign-up and no watermark, and your files are never shared or made public.