Projects / PDF OCR X

PDF OCR X

PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF or image. This is particularly useful for dealing with PDFs and images that were created via a scan-to-PDF function in a scanner or photo copier. It uses the Tesseract engine to perform OCR, and currently supports over 20 languages for OCR.

Tags
Licenses
Operating Systems
Implementation
Translations

Tweet this project Short link

Rss Recent releases

  • Rrelease-mid
  •  13 Aug 2010 23:00
  • Rrelease-after

    Changes: User preferences are now retained so that conversion settings are remembered between uses. This version also includes minor bugfixes in the OCR algorithm.

    • Rrelease-mid
    •  30 Jul 2010 08:16
    • Rrelease-after

      Changes: A bug causing blank output on some versions of Snow Leopard was fixed.

      • Rrelease-mid
      •  27 Jul 2010 22:12
      • Rrelease-after

        Changes: This release fixes a bug causing blank output in single column mode. This bug affected versions 1.8.5-1.8.7.

        • Rrelease-mid
        •  26 Jul 2010 21:50
        • Rrelease-after

          Changes: This release prepares the UI for internationalization.

          • Rrelease-mid
          •  17 Jul 2010 00:38
          • Rrelease-after

            Changes: Updated to the latest version of the OCR library.

            807578f6d9dbbcc292f0c7f9539f8cb8_thumb

            Project Spotlight

            KDE Partition Manager

            An application to create, resize, delete, copy, backup, and restore partitions.

            No-screenshot

            Project Spotlight

            curl and libcurl

            A command line tool and library for client-side URL transfers.