Description: OCRmyPDF is an open source command-line program and Python library that applies optical character recognition (OCR) to PDF documents. It takes an existing PDF as input and generates a new searchable PDF as output with an invisible text layer over images.
Type: software
Pricing: Open Source
Description: Tesseract is an open source optical character recognition (OCR) engine. It can recognize text in images and convert it into editable text. It supports over 100 languages and can handle distorted or low-quality images.
Type: software
Pricing: Open Source