Projects / catdoc

catdoc

Catdoc is a MS Word file decoding tool that doesn't attempt to analyze file formatting (it just extracts readable text), but is able to handle all versions of Word and convert character encodings. A Tcl/Tk graphical viewer is also included. It can also read RTF files and convert Excel and PowerPoint files.

Tags Text Processing
Licenses GPL

Tweet this project Short link

Rss Recent releases

  • Rrelease-mid
  •  09 Dec 2005 23:48
  • Rrelease-after

Changes: A catppt utility for viewing PowerPoint files was added. Processing of Mac charsets and dates was improved.

  • Rrelease-mid
  •  05 Jan 2005 19:08
  • Rrelease-after

Changes: A lot of bugs concerning the RTF parser and xls3csv have been fixed. The ability to define a customizable page separator for multi-page spreadsheets and command line switch to specify desired maximal precision of floating point numbers (the default now is to output as many digits as it is) have been added. A bug with reading pre-OLE word/write files and text files (Debian bug #255625) has been fixed.

22e154b62235932ddda2cc7c08c93b52_thumb

Project Spotlight

fxmoviemanager

A file manager and playlist with movie thumbnails.

B47850b5497cfd5cc997b983047cd8bf_thumb

Project Spotlight

dANN

A library to develop with dynamic artificial neural networks, AI, and genetic algorithms.