Skip to content

Receipt processor

ID MOD-091
System SD08
Repo bank-app
Build status Not started
Deployed No

What it does

MOD-091 is the receipt processor. It ingests receipt images from two sources — camera capture in the customer app, and email forwarding — and attaches the extracted data to the corresponding transaction record.

Ingestion paths

App capture — customer photographs a receipt immediately after purchase. The image is uploaded from the app and queued for OCR processing.

Email ingestion — customer forwards a receipt email to a dedicated bank email address. MOD-091 parses the email (text and HTML), extracts the receipt data, and attempts to match it to a transaction.

OCR and extraction

Receipt images are processed using a document OCR pipeline (vendor TBD during Phase 1 build evaluation — candidates include AWS Textract, Google Document AI). Extracted fields:

  • Merchant name
  • Transaction date and time
  • Total amount (and per-item amounts where available)
  • GST amount (where shown separately on the receipt)
  • Payment method (for cross-reference validation)

Transaction matching

Extracted receipt data is matched to a candidate transaction by: 1. Amount (exact match within rounding tolerance) 2. Date (within a configurable window, default ±3 days) 3. Merchant name similarity (fuzzy match against enriched merchant name from MOD-087)

A confidence score is computed for the match. Above the high-confidence threshold the receipt is automatically attached. Below threshold, the customer is prompted to confirm or select from candidate transactions.

Design phase

This module is in design. Build begins in Phase 2 of the Expense Intelligence Platform. See the Expense Intelligence Platform summary for the full implementation roadmap.


Module dependencies

Depends on

Module Title Required? Contract Reason
MOD-087 Transaction enrichment engine Required contract/events/ Transaction records from the enrichment layer provide the merchant and amount context used to match extracted receipt data.
MOD-104 AWS shared infrastructure bootstrap Required contract/ssm/ AWS S3 bucket provisioned by MOD-104 is required for receipt image and OCR output storage; the bucket ARN is consumed from SSM.

Required by

Module Title As Contract
MOD-093 Accounting mapper Optional enhancement contract/api/

Policies satisfied

Policy Title Mode How
PRI-001 Privacy Policy LOG Receipt images and extracted data are classified as personal financial data; retention and access are logged per PRI-001.

Capabilities satisfied

Capability Title Mode How
CAP-139 CAP-139 AUTO Ingests receipt photos (app capture) and email receipts; runs OCR extraction; matches to the corresponding transaction by amount, date, and merchant; stores the matched receipt against the transaction record.

Part of SD08 — Customer App & Back Office Platform Compiled 2026-05-22 from source/entities/modules/MOD-091.yaml