Why Scanned Documents Look Bad

Most office scanners and phone scanning apps are optimised for speed, not quality. The default settings produce images that are "good enough" to read but are far from clean. The grey background comes from the scanner glass picking up ambient light, slight paper yellowing, or incorrect white balance. The blurring comes from scanner bed vibration, slight document movement, or optical lens limitations at cheaper price points. The speckle and noise comes from the JPEG compression that most scanners apply by default.

Phone-scanned documents have additional challenges: uneven lighting, lens distortion at page edges (causing curved text), and shadows cast by your hand or the phone case. Apps like CamScanner correct some of these automatically, but the results are often over-processed, producing an artificial "too white" look with blown-out areas where the original had light text or fine detail.

Cleaning and enhancing scanned PDFs for professional output raw scanshadows · skew · noise Clean & enhanceDeskew · DenoiseRemove shadowsWhiten background clean scan Print-ready quality
Cleaning and enhancing scanned PDFs for professional output
📌 Before vs. After — What Scan Cleanup Actually Does

A typical office scan of a typed letter: grey background (#d8d8d8 approximately), text at 60% contrast, small speckling dots throughout, and slightly soft edges on characters. After cleanup: white background, text at near-100% contrast, noise removed, character edges crisp. The resulting PDF is smaller (grey areas compress worse than white), looks professional when printed, and OCRs accurately.

What Scan Cleanup Tools Actually Do

A good scan cleanup tool applies several image processing operations, ideally in the right sequence:

Step-by-Step: Clean a Scanned PDF with Rifix

  1. Open Rifix — Scan Cleanup in your browser. Works on desktop and mobile.

  2. Load your scanned PDF or image. The file is processed locally — no upload.

  3. Choose a cleanup preset — Light Doc (for well-lit scans), Dark Scan (for low-contrast photocopies), or Aggressive (for very poor quality scans).

  4. Use the side-by-side preview to compare the original and cleaned versions page by page.

  5. Adjust individual sliders (contrast, brightness, sharpening) if the preset doesn't achieve the result you want.

  6. Click Apply to All Pages and download your cleaned PDF.

Recommended Workflow for Best Results

Scan cleanup rarely works in isolation — it's most powerful as part of a sequence. Here's the ideal workflow for a poorly scanned document:

  1. Rotate any sideways pages first using Rifix Rotate PDF. Cleanup works on each page individually, so correct orientation before cleaning.
  2. Run Scan Cleanup to remove background, boost contrast, and sharpen text.
  3. Run OCR on the cleaned version using Rifix OCR. Accuracy will be significantly higher on a cleaned scan than on the original grey image.
  4. Compress the final result with Rifix Compress PDF. Cleaned scans (mostly white background) compress much more efficiently than grey-background originals.

This four-step sequence transforms a difficult, low-quality scan into a professional, searchable, compact PDF — all using browser-based tools with no uploads at any stage.

💡 Pro Tip — Scan Quality at the Source

If you're scanning new documents rather than cleaning existing ones, these settings produce the best results from the start: scan at 300 DPI (not 150 DPI which most copiers default to), choose Black and White mode for text-only documents (not Greyscale), and save as PDF or TIFF rather than JPEG to avoid compression artefacts. A document scanned this way often needs no cleanup at all.

When Scan Cleanup Isn't Enough

Some scans are simply too damaged to recover fully through digital processing. Physical damage — water stains, torn edges, faded ink, handwritten corrections over printed text — can only be corrected to a limited degree by software. If a document is critically important and in poor physical condition, professional document restoration services use specialised hardware (drum scanners, IR and UV imaging) that can recover information invisible to a standard office scanner.

For everyday use, however, the browser-based approach handles the vast majority of scan quality issues that office workers, students, and small business owners encounter regularly. It's private, free, and fast — which is the combination that matters most.

Scan Cleanup for Legal and Archival Documents

When cleaning documents that will be submitted to legal proceedings, immigration authorities, or archival systems, keep the original unmodified scan as the primary record. The cleaned version is a working copy only. Some authorities require "true copies" of original documents — a digitally enhanced scan may not qualify as an unaltered reproduction for these purposes. When in doubt, submit both the original scan and the cleaned version, noting that the latter has been processed for legibility.

Why Scanned Documents Need Cleaning

Scanning a physical document rarely produces a perfect result. Paper is not uniformly white — aging, handling, and lighting create variations in background tone. Flatbed scanners capture shadows at page edges and binding gutters. Mobile phone scans introduce perspective distortion from holding the phone at an angle. Pages are rarely perfectly aligned in a scanner, producing slightly crooked pages that look unprofessional. These imperfections are invisible on paper but obvious in the digital scan — a grey, mottled background instead of clean white, text that runs slightly off-horizontal, and dark shadows in the margins. Cleaning these issues transforms a rough scan into a professional digital document.

OCR converts scanned images into selectable, searchable text scanned image OCRThe quick brown foxjumps over the lazydog. Page 1 of 5.searchable text Copy · Search · Edit Text extracted
OCR converts scanned images into selectable, searchable text

Common Scan Problems and Their Fixes

Background noise (grey or mottled backgrounds instead of clean white): caused by paper texture, age, or scanner sensitivity settings. Fix: increase contrast and apply background whitening to push near-white pixels to pure white while preserving the dark ink. Skewed pages (text runs at an angle): caused by pages not placed perfectly straight in the scanner, or by phone scanning at an angle. Fix: auto-deskew detects the angle of text lines and rotates the page to straighten them — usually corrects angles up to 10–15 degrees automatically. Dark edges and shadows: caused by the gap between the page and scanner lid, or by shadows cast when phone-scanning. Fix: crop to the content area, removing the dark border regions. Bleed-through (content from the reverse side showing faintly): caused by thin paper or high scanner sensitivity. Fix: contrast adjustment lightens the bleed-through while preserving the front-side content.

Cleaning Scanned Documents at rifix.xyz

Open rifix.xyz/scanclean. Upload your scanned PDF. The tool analyses each page and applies a combination of cleaning operations: background normalisation to produce clean white backgrounds, deskewing to straighten tilted pages, contrast enhancement to make text crisper and more readable, and edge cleanup to remove scanner shadow borders. Download the cleaned version and compare it against the original in two browser tabs — the difference is typically immediately visible as a cleaner, more professional appearance. The cleaned version prints more sharply than the raw scan and is more legible on screen, particularly at smaller sizes.

Before or After OCR?

Cleaning should be done before OCR, not after. OCR accuracy is significantly higher on clean, high-contrast scans with straight pages than on raw scans with background noise and skew. Running scanclean first, then OCR on the cleaned version, produces better text recognition results. The workflow: scan → clean at rifix.xyz/scanclean → OCR at rifix.xyz/ocr → searchable, clean PDF. For scans that will not be OCR'd but will be archived or shared as visual documents, cleaning alone is sufficient — the result looks more professional without needing the searchable text layer.

Mobile Scans vs Flatbed Scans

Mobile phone scans have different cleaning challenges than flatbed scanner output. Phone scans often have more significant perspective distortion (the phone was not held perfectly overhead), more variable lighting (shadows from the hand or ambient lighting variation), and lower resolution than a dedicated scanner. Good scanning apps (Adobe Scan, Microsoft Lens, and the native iOS Documents scanner in Files) handle perspective correction automatically but may not correct all lighting issues. Flatbed scanner output is typically more consistent but often has scanner-bed colour cast (a slight blue or grey tint) and edge shadows from the lid. Cleaning tools address both categories — the algorithms adapt to the specific artefacts present in each scan.

Colour vs Black and White Scans

For text documents without colour elements — standard office documents, contracts, letters, forms — scanning in black and white mode rather than colour produces smaller files and cleaner results. Black and white scans are more amenable to automatic cleaning because the algorithm has a clear target: white background, black text. Colour scans of colour documents — forms with coloured headers, annotated documents with coloured highlights, brochures — need to preserve the colour information and are handled differently by cleaning tools. For text-only documents, if you have already scanned in colour, converting to greyscale as part of the cleaning process produces a cleaner result at a smaller file size.

Archiving Cleaned Scans

Cleaned, OCR'd scans are the ideal form for long-term document archiving. They are visually clean and professional, searchable by keyword, small in file size compared to raw colour scans, and compatible with all PDF viewers indefinitely. For important documents that should be retained — contracts, certificates, financial records, identification documents — scan once at good quality, clean, OCR, and archive in a named folder with a date-stamped filename. Keeping both the raw scan and the cleaned version provides a fallback if you ever need the unprocessed original. The cleaned version is what you share and reference; the raw scan is your backup.

NR
Nowsath Rifaya · Founder, Rifix PDF Editor
Operations professional based in Singapore. Built Rifix to solve a real work problem — handling confidential PDF documents without uploading them to unknown servers. Writes from direct experience using these tools daily.

Try It Free — Right Now

Remove grey backgrounds, fix contrast, sharpen text — your scanned documents, professionally cleaned.

Clean Up a Scanned PDF Free →