Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.
- Initial release of the OCR skill using PaddleOCR. - Supports text extraction from PDF and image files in both Chinese and English. - Output includes recognized text and confidence scores in JSON format. - Typical use cases covered: scanned PDFs, images, invoices, contracts, and multi-page documents. - Includes quick start guide and dependency installation instructions.