PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF or image. This is particularly useful for dealing with PDFs and images that were created via a scan-to-PDF function in a scanner or photo copier. It uses the Tesseract engine to perform OCR, and currently supports over 20 languages for OCR.
| Tags | OCR pdf to text optical character recognition |
|---|---|
| Licenses | Commercial |
| Operating Systems | Mac OS X |
| Implementation | Java |
| Translations | English |
Recent releases


Changes: User preferences are now retained so that conversion settings are remembered between uses. This version also includes minor bugfixes in the OCR algorithm.


Changes: A bug causing blank output on some versions of Snow Leopard was fixed.


Changes: This release fixes a bug causing blank output in single column mode. This bug affected versions 1.8.5-1.8.7.


Changes: This release prepares the UI for internationalization.


Changes: Updated to the latest version of the OCR library.
An application to create, resize, delete, copy, backup, and restore partitions.