Extract structured data from construction PDFs. Convert specifications, BOMs, schedules, and reports from PDF to Excel/CSV/JSON. Use OCR for scanned documents and pdfplumber for native PDFs.
Major update with enhanced extraction features and expanded documentation. - Added detailed README covering ETL workflow, quick start, and advanced use cases. - Now supports both native (pdfplumber) and scanned (OCR) PDF extraction. - Includes construction-specific examples for specifications, BOMs, and schedules. - Provides code samples for extracting tables, text, and specific areas from PDFs. - Outlines recommended packages and installation instructions for various extraction scenarios.