harvest

Harvest is a system to collect information and make it searchable using a Web interface. It can collect information using HTTP, FTP, NNTP, and local files. Supported formats include HTML, DVI, PS, fulltext, mail, man pages, news, troff, WordPerfect, C sources, and many more. Adding support for new formats is easy due to Harvest's modular design.

Tags: Text Processing, Indexing, Internet, Web, Indexing/Search, Z39.50
Operating Systems: Unix
Implementation: C, Perl

Recent releases

  •  30 Nov 2003 16:59

Changes: This release features a search time display in zquery.pl and updated components. It is a further step toward replacing the default full-text engine with Index Data's Zebra.

  •  24 Nov 2003 02:33

Changes: This release improves the integration of Zebra as a full-text engine and simplifies broker administration and the creation of new gatherers.

  •  10 Nov 2003 02:03

Changes: This release features a Russian translation of the user's manual, bug fixes in the PowerPoint summarizer, an improved SOIF-to-XML filter, a fix for a crash bug in essence, and an improved user interface. The broker can now hold duplicate data under different URIs.

  •  25 Oct 2003 12:46

Changes: This release adds Dutch, French, Italian, and Swedish user interfaces and support for PowerPoint presentations. It also improves portability to FreeBSD and fixes compilation issues with GCC 3.3.1.

  •  09 Oct 2003 14:31

Changes: This release adds a Dutch user interface and addresses GCC 3.3.1 compilation problems.
