OCRmyPDF adds an invisible text layer to PDF documents after passing it through the Tesseract OCR engine. The output will be PDF/A with a selectable but invisible text layer above scanned image-documents. This allows later searching and archiving.
minor feature: Remove gs.py (spoofers entirely removed) and update copyright. Additional size increase reasons.. Document use of mmap.. Approve pdfminer.six 20200726.. test breakage in validation.. v10.3.2 release notes.