The OCR extracted the wrong fields

all Updated Tue Jul 14 2026 00:00:00 GMT+0000 (Coordinated Universal Time)

Edit any field directly on the document detail page — the correction saves immediately and trains the system for that vendor. If a whole document went wrong: click Re-scan (3 retries per document, no quota cost) or re-upload a clearer version. If still wrong after re-scan + re-upload, write to [email protected] with the document attached.

When to use this article

The AI extracted fields from your document and one or more came out wrong: wrong vendor name, wrong amount, wrong VAT rate, wrong date, wrong currency. This article walks through the triage (what's most likely wrong, what to fix first), the per-field-edit flow, the document-level re-scan flow, and when to escalate.

For the broader story of what the AI extracts and how, see what the AI reads, and how to correct it. For the confidence-score-meaning context, see understanding confidence scores. For the per-vendor learning loop, see how to fix a misread vendor name. For pre-extraction problems (upload failed entirely), see upload failed — troubleshooting.

Quick triage — what's most likely wrong

Symptom	Most likely cause	First fix
One or two fields wrong	AI misread a detail; the document's overall confidence was still high	Click and edit the specific fields
All fields wrong or garbled	Poor document quality (blurry, tilted, shadowed)	Re-upload a clearer version
Wrong vendor on a known sender	Invoice layout the AI hasn't seen from this vendor	Edit vendor; the system learns vendor patterns from repeated corrections
Wrong VAT rate	Country-defaults misapplied, or rate on document is unusual	Edit rate; net / VAT amount / gross recompute
Wrong currency	Ambiguous symbol (e.g. `$` could be USD/CAD/AUD/SGD)	Edit currency; conversion re-applies
Date wrong (year off by one)	Common with December invoices read in January (ambiguous date format)	Edit date; conversion-rate re-fetched
Multi-page PDF only extracted page 1	Some scanners produce malformed PDFs	Re-save the PDF in any viewer, re-upload
Line items wrong but totals right	AI saw totals clearly but struggled with table layout	Edit line items individually, or accept totals + leave items blank
Totals wrong but vendor right	Misread digits (often a faded "0" or "8") — inconsistent totals get flagged for review automatically	Edit totals
Forwarded email classified as `irrelevant`	Classifier saw no invoice signals	Upload the attachment directly, or re-forward with "invoice" in the subject

The fix path depends on which of these you're hitting. Most are one-click corrections; whole-document issues are re-scan or re-upload.

Step-by-step — edit fields

The most common fix: just edit the wrong field.

Open the document detail page (from the Documents list, or from the in-app notification when extraction finished).
Click any field on the right side. A text input appears in place.
Type the correction. Autocomplete shows existing values where applicable (e.g. vendor names suggest from your Trading Partners).
Tab or click outside the field to save. The change is immediate and recorded in your audit trail.
The save feeds the vendor-learning loop — after you've corrected the same thing for the same vendor a few times (3 confirmations), the learned pattern (category, currency, VAT rate, net-vs-gross layout) is applied to future scans of that vendor automatically.

See how to fix a misread vendor name for the vendor-learning loop in detail.

Cascading edits

Some edits cascade. Changing the VAT rate recomputes the dependent amounts; changing the currency re-applies the exchange-rate conversion. The editor shows you the recomputed values before you save — and it never silently "fixes" a line-items-vs-total mismatch; you get an explicit choice instead.

Manual override of high-confidence extractions

A scan can come back with a high overall confidence score and still have one wrong field. Click and edit anyway — the system doesn't argue with you. Your manual edit replaces the AI's value, and the change is recorded in the audit trail.

Step-by-step — re-scan a document

If editing field-by-field is too tedious (e.g. nearly everything is wrong, or you've made vendor corrections elsewhere and want them retroactively applied), force a fresh extraction:

Open the document detail page.
Click Re-scan in the top action bar.
The original file is sent back through the AI pipeline, including whatever vendor patterns the system has learned since the first scan. The new extraction replaces the old one — so if you've already made careful manual edits, prefer editing over re-scanning.
Up to 3 re-scans per document, no quota cost.

When re-scan helps:

You've corrected several invoices from the same vendor since the first scan; re-scanning applies the vendor patterns learned in the meantime.
You've updated the company country (Settings → Company), which changed the VAT-rate defaults; re-scan picks up the new defaults.
The original was processed during a transient AI hiccup (rare, but happens) and you want a clean retry.

When re-scan doesn't help:

Same document content + same model = same result. If you haven't made corrections between scans, re-scan returns the same fields.
Document is genuinely unreadable. Re-scan can't see better than it did the first time.

After 3 re-scans the option greys out. At that point either: re-upload a cleaner version (different file hash = new document), or escalate to support.

Step-by-step — re-upload a cleaner version

If the original photo is blurry, tilted, shadowed, or otherwise low-quality:

Take a new photo with better conditions. Tips: flat surface, even lighting (avoid direct flash on glossy receipts), phone parallel to the document, fill the frame with the document, one document per photo.
Upload the new version via the standard upload flow (drag-and-drop or file picker).
Our duplicate-detector compares file hashes — the new version won't match the blurry one (different bytes), so it's a separate document. Quota counts the new one.
Delete the blurry one from the Documents list. The trashed document also stops counting toward this month's document limit.

You end up with one clean document instead of two. The vendor-learning from the blurry document's corrections (if any) still applies — the system associates corrections with vendors, not with specific document IDs.

When to write to support

If after edit + re-scan + re-upload it's still wrong, and the document is a normal-looking invoice from a normal vendor: write to [email protected] with subject [BUG] OCR extraction wrong. Include:

The document itself (attach the PDF / image; redact specific sensitive numbers if you want, but the layout and format matter for diagnosis).
What field is wrong and what the correct value should be.
What the AI returned (you can copy from the document detail page).
The document ID (visible in the URL).

We use these reports to improve the model — your specific case can help fix the same pattern for others. Typically 1–3 working days for response.

Common causes per field

Wrong vendor name

A vendor the system hasn't learned yet → edit it; after three confirmed corrections the vendor's learned pattern applies automatically.
Vendor's invoice is consolidated (e.g. "Stripe Inc" on the header, "Stripe Payments Europe" elsewhere) → the AI picked the wrong one. Edit to your preferred canonical name.
OCR mistake on similar letters (e.g. "Hetzner" read as "Helzner") → edit.

Wrong VAT rate

Country defaults misapplied (German 19% suggested when the invoice is actually French 20%) → edit.
Mixed-rate invoice with multiple line items at different rates → use the line-items view to set per-line rates, or accept the dominant rate and adjust manually.
Reverse-charge invoice (intra-EU B2B) where the rate is 0% with the reverse-charge clause → edit rate to 0%, optionally mark as reverse-charge in the document.

Wrong currency

Ambiguous symbol like $ could be USD, CAD, AUD, NZD, SGD, HKD → the AI uses address and vendor context to disambiguate. If it picked wrong, edit.
Multiple currencies on one invoice (line items in one currency, totals in another) → edit to the dominant currency.
Cryptocurrency or non-ECB currency → conversion field is left blank; supply manually. See multi-currency and official ECB rates.

Wrong date

Year off by one — common with December invoices uploaded in January. EU date format 01.12.2025 is ambiguous if year is partly hidden. Edit.
Format misread — 05/04/2026 could be May 4 or April 5 depending on locale. Edit.
Date too far in the past or future — outliers are worth double-checking against the original document. Confirm or correct.

Wrong totals

Faded number in the PDF (printer ran out of toner) → OCR misread. Edit.
Currency-symbol confusion ("€1,234.50" misread as "1234,50") → edit.
Comma vs decimal in European notation (1.234,50 vs 1,234.50) → the AI handles both but rare edge cases trip it. Edit.

Edge cases

My re-scan didn't help. Same document + same model = same result. The improvement on re-scan comes from corrections you made between scans (vendor learning). If you didn't correct anything since the first scan, re-scan returns the same fields. Either correct first, then re-scan, or re-upload a cleaner version.

I want to re-extract with a different OCR model. The AI model isn't user-configurable (low-confidence scans are already retried with a stronger model automatically). If you have a document type that consistently fails, write [TECHNICAL] to support with a sample — that's the most useful signal for us.

Multi-page PDF only shows page 1. Some scanners produce PDFs with JBIG2 compression or other quirks that confuse the multi-page reader. Open the PDF in any viewer (Preview / Acrobat / Skim), choose File → Export → PDF (or Save As → PDF), upload the re-saved version. The re-save normalises the format.

Forwarded email classified as irrelevant. Upload the attachment directly, or re-forward the email with "invoice" in the subject. See what happens when you forward a newsletter.

I typed the right value but the field changed back. A re-scan or a workflow event (vendor merge, currency change) replaced your manual edit — a re-scan runs a full fresh extraction. Make manual corrections after your last re-scan, not before. The audit trail shows exactly which change happened when; if something looks wrong there, write to support.

Line items extract but totals are blank. Some invoices print totals as images (rendered text rather than selectable PDF text) while line items are in selectable text. The AI handles both but image-rendered totals score lower. Edit totals manually; line items can stay as-extracted.

Hand-written annotations on a printed invoice. The AI reads printed text most reliably; handwriting (a tip added to a restaurant receipt, a scribbled note) may or may not be picked up. Always double-check totals on documents with handwritten amendments, and correct manually where needed.

The document is correct but the bank-match scored it against the wrong invoice. Different problem — see how the matching pipeline works. Extraction-vs-matching are separate stages with separate scoring.

What the AI reads, and how to correct it — broader correction context
Understanding confidence scores — score-meaning context
How to fix a misread vendor name — vendor-specific learning
Upload failed — troubleshooting — when extraction never started