Question 1

What image formats does XConvert's OCR tool support?

Accepted Answer

XConvert supports all common image formats including PNG, JPG/JPEG, WebP, BMP, and TIFF. PNG and TIFF are recommended for documents because they use lossless compression that preserves text clarity. JPG compression can introduce artifacts around text edges that reduce OCR accuracy, especially at low quality settings.

Question 2

How accurate is the OCR text extraction?

Accepted Answer

Accuracy depends on image quality, text style, and language. High-quality scans of printed text in common fonts typically achieve 95–99% character accuracy. Photographs with good lighting and focus achieve 85–95%. Handwritten text varies widely from 60–85% depending on legibility. Low-resolution or noisy images will produce lower accuracy.

Question 3

Can I extract text from handwritten notes?

Accepted Answer

Yes, but accuracy varies significantly based on handwriting clarity. Neatly printed handwriting in dark ink on white paper produces the best results. Cursive writing, light pencil marks, and cramped text are more challenging. For critical handwritten content, always review and correct the extracted text manually.

Question 4

Does the OCR tool support multiple languages?

Accepted Answer

Yes. XConvert's OCR engine supports text recognition in multiple languages including English, Spanish, French, German, Portuguese, Italian, and other Latin-script languages. Character recognition for non-Latin scripts (Chinese, Japanese, Korean, Arabic, Cyrillic) is also supported with varying accuracy levels depending on the script complexity.

Question 5

How do I improve OCR accuracy for poor quality images?

Accepted Answer

Before uploading, preprocess the image using any image editor: increase contrast, convert to grayscale, sharpen the text, remove background noise, and crop to the text region. Increasing the image resolution (upscaling) can also help, though it won't add detail that wasn't in the original. Straightening skewed text and correcting perspective distortion also improve results significantly.

Question 6

Can I extract text from a PDF with this tool?

Accepted Answer

XConvert's OCR tool processes image files. If your PDF contains scanned pages (image-based PDF), you'll need to export each page as an image first, then upload the images for OCR. If your PDF already contains selectable text (text-based PDF), you can copy the text directly without OCR.

Question 7

Is my uploaded image stored on XConvert's servers?

Accepted Answer

XConvert processes images securely and does not permanently store your uploaded files. Images are processed for text extraction and then discarded. For sensitive documents containing personal information, financial data, or confidential content, this approach ensures your data remains private.

Question 8

Why does OCR confuse certain characters?

Accepted Answer

OCR confusion occurs when characters have similar visual shapes. Common confusions include: 0 (zero) and O (letter O), 1 (one) and l (lowercase L) and I (uppercase i), rn (r-n) and m, cl and d, and 5 and S. These ambiguities are inherent to the visual similarity of these characters in many fonts. Context and language models help resolve most cases, but manual verification is recommended for critical data.

Question 9

Can I extract text from tables in images?

Accepted Answer

Yes, but table structure preservation depends on the complexity of the table. Simple tables with clear grid lines and well-separated cells produce good results, with text extracted in a readable order. Complex tables with merged cells, nested headers, or minimal borders may have text extracted in an unexpected order. For structured table data, you may need to manually reorganize the extracted text or use the JSON/YAML/XML converter to structure the data.

Question 10

What is the maximum image size I can upload?

Accepted Answer

XConvert accepts images up to several megabytes in size. Very large images (above 10 MB) may take longer to process and could be limited by your browser's memory. For optimal performance, resize extremely large images to a reasonable resolution before uploading — 300 DPI at the document's physical size is sufficient for excellent OCR accuracy.

Image Type	Expected Accuracy	Key Factors	Tips for Best Results
Printed text (high contrast)	95–99%	Clean fonts, good lighting	Use original digital files when possible
Screenshots	95–99%	Consistent rendering	Capture at native resolution
Scanned documents (300+ DPI)	90–98%	Scan quality, paper condition	Scan at 300 DPI minimum
Photographs of text	80–95%	Lighting, angle, focus	Shoot straight-on with even lighting
Handwritten text	60–85%	Handwriting clarity	Print clearly, use dark ink
Low-resolution images	50–80%	Pixel density	Upscale image before OCR
Stylized/decorative fonts	60–85%	Font complexity	Standard fonts yield better results
Text on textured backgrounds	70–90%	Contrast ratio	Increase contrast before processing

Online Image to Text Extractor

How to Extract Text from Images with XConvert

What is Image to Text OCR?

OCR Accuracy by Image Type

Common Use Cases

How Optical Character Recognition Works

Tips for Best Results

Frequently Asked Questions