Native Ruby gem for parsing documents (PDF, DOCX, XLSX, images with OCR) with zero runtime dependencies. Statically links MuPDF for PDF extraction and Tesseract for OCR.
Required Ruby Version
>= 3.1, < 3.5.dev
Authors
Chris Petersen
Versions
- 0.2.0 June 20, 2026 arm64-darwin (20.3 MB)
- 0.2.0 June 20, 2026 x86_64-linux (24.2 MB)
- 0.2.0 June 20, 2026 (19.5 KB)
- 0.1.3 March 24, 2026 (19.5 KB)
- 0.1.2 September 06, 2025 (5.71 MB)