How to Set Up an OCR Workflow With Windows Scan App
🔍 WiseChecker

How to Set Up an OCR Workflow With Windows Scan App

Quick fix: The Windows Scan app from Microsoft Store doesn’t include OCR. For OCR: scan to PDF, then open the PDF in Microsoft OneNote → right-click image → Copy text from picture. Or use the free Microsoft PowerToys → Text Extractor (Win+Shift+T). Or Adobe Acrobat’s free OCR tier. Or use online tools like onlineocr.net for free occasional OCR.

Windows Scan app handles scanning but not OCR (Optical Character Recognition). For making scanned documents searchable: pair Windows Scan with an OCR tool. Common stack: Scan → PDF → OCR via OneNote or Acrobat.

Symptom: Want an OCR workflow on Windows 11 using Windows Scan app or similar.
Affects: Windows 11 with scanner connected.
Fix time: ~10 minutes setup.

ADVERTISEMENT

What causes this need

Scanning a paper document gives an image. The text in the image isn’t searchable, can’t be copied. OCR makes text selectable / searchable. Useful for: receipts (search by date), books (full-text search), legal docs (find quotes), notes (re-flow into a new format).

Method 1: Scan with Windows Scan, OCR with OneNote

The Microsoft-native workflow.

  1. Open Windows Scan app (Microsoft Store). Free.
  2. Pick scanner from list.
  3. Configure: source (flatbed/feeder), color/grayscale, resolution (300dpi for OCR).
  4. Click Scan. Image saved as PNG or PDF (depending on settings).
  5. For PDF output: pick File type: PDF.
  6. Open the resulting image in Microsoft OneNote.
  7. In OneNote: right-click the inserted image → Copy text from picture.
  8. OneNote runs OCR on the image. Extracted text appears in clipboard.
  9. Paste anywhere (Word, Notepad, email).
  10. For multi-page documents: insert all pages into OneNote, then OCR each.

This is the OneNote route.

ADVERTISEMENT

Method 2: PowerToys Text Extractor for selective OCR

For quick screen OCR.

  1. Install Microsoft PowerToys from Microsoft Store.
  2. Open PowerToys → Text Extractor. Enable.
  3. Default shortcut: Win + Shift + T.
  4. Press shortcut. Screen dims with crosshair cursor.
  5. Drag a rectangle over text you want to extract. OCR runs.
  6. Text copied to clipboard.
  7. Paste anywhere.
  8. For scanned documents: open the scan image full-screen, press shortcut, drag over text. Works on any image on screen.
  9. Supports many languages. Settings → Text Extractor → pick languages to include.
  10. For batch: still per-selection. For full-page batch, use Method 3.

This is the live-screen approach.

Method 3: Adobe Acrobat or third-party for full-page OCR

For comprehensive document OCR.

  1. Adobe Acrobat (paid or free trial):
    • Open PDF in Acrobat. Tools → Scan & OCR.
    • Pick Recognize Text. Acrobat OCRs the document.
    • Resulting PDF has searchable text overlay.
    • Use Find (Ctrl+F) to search. Copy text by selection.
  2. ABBYY FineReader (paid, top-tier): best quality OCR. Worth it for heavy use.
  3. Free: Tesseract (open-source). Command-line OCR. Use with GUI wrappers like gImageReader.
  4. Online: onlineocr.net (free): upload image, get text. Caveat: data sent to their server. Avoid for sensitive docs.
  5. Online: Microsoft Lens (mobile app + Microsoft account): scan with phone camera, OCR via Microsoft 365.
  6. For batch processing: a script using Tesseract:
    for /f %i in ('dir /b *.png') do tesseract %i %~ni

    Converts each PNG to text file.

  7. For Office 365 subscribers: built-in Lens in OneDrive. Phone capture → OCR → PDF.

This is the comprehensive route.

How to verify the fix worked

  • Scanned text is selectable/searchable in PDF (after OCR).
  • Copy text from picture (OneNote) returns recognizable text.
  • PowerToys Text Extractor copies text to clipboard.
  • Pasted text matches scanned content (with normal OCR error rate).

If none of these work

If OCR quality poor: Resolution too low: scan at 300dpi minimum. Skew / rotation: scan straight; or deskew via image tool first. Color issues: scan grayscale, contrast set to ~50% for typed docs. Hand-written text: OCR is poor for handwriting. Use specialized tools like Pen and Ink OCR. Non-English text: ensure OCR tool supports the language. Tesseract has packages for 100+ languages. For low-quality smartphone photos: rotate, crop, increase contrast first. For PDFs with embedded text already: not all PDFs need OCR; check Ctrl+F first. For very long docs: batch OCR via Tesseract or ABBYY. For chronic OCR errors: train Tesseract on specific font (advanced). Or pay for ABBYY for consistent quality.

Bottom line: Windows Scan for scanning — pair with OneNote (Copy text from picture) or PowerToys Text Extractor (Win+Shift+T) for basic OCR. Use Adobe Acrobat or Tesseract for full-page document OCR.

ADVERTISEMENT