~/py-automation
~/automation/invoice_parser.py

Invoice PDF parser → Google Sheets

$50 per script

Reads invoice PDFs from a folder, extracts key fields (number, dates, vendor, amounts, currency) using regex heuristics, and appends each invoice as a new row in a Google Sheet. It includes a Claude AI fallback for non-standard layouts and a processed-files log to prevent duplicates, and it moves processed PDFs to an archive folder.

pdfplumber · gspread · google-auth · pandas · regex · claude-ai-fallback
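
To give a feel for the regex-heuristics step, here is a minimal Python sketch. The patterns, the field names, the helpers `read_pdf_text` and `extract_fields`, and the "first non-empty line is the vendor" rule are all assumptions made for illustration, not the code shipped with the script. Only `pdfplumber.open` and `page.extract_text` are real library calls.

```python
import re

# Hypothetical patterns: real invoices vary, so treat these as a starting point.
PATTERNS = {
    "invoice_number": re.compile(
        r"\bInvoice\s*(?:No\.?|Number|#)\s*:?\s*([A-Z0-9][\w/-]*)", re.I),
    "invoice_date": re.compile(
        r"\bInvoice\s*Date\s*:?\s*(\d{4}-\d{2}-\d{2})", re.I),
    "due_date": re.compile(
        r"\bDue\s*Date\s*:?\s*(\d{4}-\d{2}-\d{2})", re.I),
    # \b keeps "Subtotal" from matching; the currency code is optional here.
    "total": re.compile(
        r"\bTotal\s*:?\s*(?:[A-Z]{3}\s*)?([\d,]+\.\d{2})", re.I),
    # Only the word "Total" is case-insensitive; the code must be upper case.
    "currency": re.compile(r"(?i:\bTotal)\s*:?\s*([A-Z]{3})\b"),
}


def read_pdf_text(path):
    """Concatenate the text layer of every page (text-based PDFs only)."""
    import pdfplumber  # imported lazily so the regex part runs without it

    with pdfplumber.open(path) as pdf:
        return "\n".join(page.extract_text() or "" for page in pdf.pages)


def extract_fields(text):
    """Return a dict of extracted fields; anything not found is None."""
    fields = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1).strip() if match else None

    # Assumed heuristic: the vendor name is the first non-empty line.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    fields["vendor"] = lines[0] if lines else None

    # Normalise "1,234.50" to a float so the sheet can sum the column.
    if fields["total"] is not None:
        fields["total"] = float(fields["total"].replace(",", ""))
    return fields
```

Fields that come back as `None` are the natural signal that a layout is non-standard and needs a second pass.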
📤
Upload any PDF
Drop a real invoice PDF — the demo extracts fields live in your browser.
🧠
AI fallback
Claude (claude-sonnet-4-20250514) handles unusual layouts when regex falls short.
📊
Google Sheets
Each invoice becomes a new row — dates, totals, vendor, client, all structured.
🔁
No duplicates
Processed-files log ensures each PDF is only processed once.
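
The fallback, sheet append, and duplicate check described above could fit together roughly as in the sketch below. It is an illustration under stated assumptions: the function and variable names, the JSON log format, the choice to key the log on a SHA-256 hash of the file contents, and the `REQUIRED` and `COLUMNS` tuples are all hypothetical. The extractor, the AI fallback, and the sheet writer are passed in as plain callables so the flow can run without credentials.

```python
import hashlib
import json
import shutil
from pathlib import Path

# Assumed minimum for a usable row; if either is missing, try the fallback.
REQUIRED = ("invoice_number", "total")
# Assumed column order of the target sheet.
COLUMNS = ("invoice_number", "invoice_date", "due_date",
           "vendor", "total", "currency")


def load_log(log_path):
    """Return the set of content hashes that were already processed."""
    if log_path.exists():
        return set(json.loads(log_path.read_text()))
    return set()


def process_folder(inbox, archive, log_path, extract, fallback, append_row):
    """Process each new PDF once; return the number of rows appended."""
    archive.mkdir(parents=True, exist_ok=True)
    done = load_log(log_path)
    added = 0
    for pdf in sorted(inbox.glob("*.pdf")):
        digest = hashlib.sha256(pdf.read_bytes()).hexdigest()
        if digest in done:
            continue  # same bytes seen before, even under a new file name

        fields = extract(pdf)
        if any(fields.get(key) is None for key in REQUIRED):
            fields = fallback(pdf)  # e.g. ask an LLM for the same fields

        append_row([fields.get(col) for col in COLUMNS])

        # Persist the log after every file so a crash cannot cause re-runs.
        done.add(digest)
        log_path.write_text(json.dumps(sorted(done)))
        shutil.move(str(pdf), str(archive / pdf.name))
        added += 1
    return added
```

In a real run, `append_row` could be the bound `append_row` method of a gspread worksheet, and `fallback` could wrap a call to `messages.create` in the Anthropic Python SDK using the model named above. Hashing file contents rather than names is a design choice of this sketch: it also catches a renamed copy of an invoice that was already processed.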

Works on text-based PDFs. For scanned/image PDFs, the full Python script supports OCR via pytesseract.
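
One way the OCR path might be wired in is shown below, as a sketch only. The helper name `page_text_with_ocr`, the `min_chars` threshold, and the 300 dpi render resolution are assumptions. The sketch relies on pdfplumber's `page.extract_text()` and `page.to_image(...).original` (a PIL image) and on `pytesseract.image_to_string`, which needs the Tesseract binary installed.

```python
def page_text_with_ocr(page, ocr=None, min_chars=20):
    """Return the page's text layer, or OCR output if the layer is too thin.

    `page` is expected to behave like a pdfplumber page. `ocr` is any
    callable that takes an image and returns a string; it defaults to
    pytesseract.image_to_string.
    """
    text = page.extract_text() or ""
    if len(text.strip()) >= min_chars:
        return text  # text-based page: no OCR needed

    if ocr is None:
        import pytesseract  # imported lazily; only scanned pages need it
        ocr = pytesseract.image_to_string

    # Render the page to a PIL image and hand it to the OCR engine.
    image = page.to_image(resolution=300).original
    return ocr(image)
```

The `min_chars` cutoff is a heuristic: scanned pages sometimes carry a few stray characters in their text layer, so an empty-string check alone can miss them.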

Need a customised version of this?

Describe your exact use case and I'll give you a fixed price within 30 minutes.