Native Ruby gem for parsing documents (PDF, DOCX, XLSX, images with OCR) with zero runtime dependencies. Statically links MuPDF for PDF extraction and Tesseract for OCR.

Required Ruby Version

>= 3.0.0

Authors

Chris Petersen

Versions

  1. 0.2.0 June 20, 2026 arm64-darwin (20.3 MB)
  2. 0.2.0 June 20, 2026 x86_64-linux (24.2 MB)
  3. 0.2.0 June 20, 2026 (19.5 KB)
  4. 0.1.3 March 24, 2026 (19.5 KB)
  5. 0.1.2 September 06, 2025 (5.71 MB)
Show all versions (8 total)

Pushed by

SHA 256 checksum