Ocrad is the GNU OCR (Optical Character Recognition) program, implemented as a filter and based on a feature extraction method. It reads images in pbm (bitmap), pgm (greyscale), or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. It also includes a layout analyzer that is able to separate the columns or blocks of text normally found on printed pages. Ocrad can be used as a stand-alone console application, or as a backend to other programs.
| Tags | OCR |
|---|---|
| Licenses | GPLv3 |
| Operating Systems | OS Independent |
| Implementation | C++ |
Recent releases


Changes: The layout analyzer is now able to separate blocks of text of arbitrary shape as long as they are clearly delimited by white space. The new option "--quiet" has been added. The "--layout" option no longer accepts an argument. The "--crop" option now accepts negative coordinates, interpreted as relative to the right-bottom corner of the image. A problem that prevented compilation with GCC 4.3.0 has been fixed. "make install-info" should now work on Debian and OS X. The man page is now installed by default.


Changes: Recognition of files with a single big character has been fixed. "make install-info" should now work on Debian and OS X. The man page is now installed by default.


Changes: The new option "--quiet" has been added. Verbosity control of messages has been modified.


Changes: This version recognizes the letter 'a' with a ring above it. Some missing headers that prevented compilation with GCC 4.3.0 have been included.


Changes: The "--crop" option now accepts negative coordinates, interpreted as relative to the right-bottom corner of the image.