Posts
Published articles on document extraction, tutorials, and best practices.
-
Building a Robust Document Parsing System with Cloudflare Queues and Workflows
Learn how to build a production-ready document parsing system using Cloudflare Queues, Workflows, and Mistral AI for OCR processing at scale.
-
Build a Resume Database: Extract and Query Hundreds of Resumes as SQLite
Transform unstructured resume PDFs into a searchable SQLite database. Extract candidate information, skills, experience, and education into structured tables for recruiting and talent management.
-
Build an Invoice Extractor with Child Tables and Foreign Keys
A step-by-step guide to modeling invoices with parent/child tables and foreign keys so your extracted data lands in clean relational SQLite tables.
-
Build a Bank Statement Extractor with JSON Array Transactions
Model bank statements with statement-level columns plus a JSON array of transactions for flexible extraction and querying.
-
Introducing PdfParse: Transform Documents into Structured Databases
Launch announcement for PdfParse: a novel, robust, and affordable platform for extracting structured data from PDFs, with automatic data normalization and SQLite database generation.