Projects / Anthracite

Anthracite

Anthracite is a collection of Web mining power tools combined in an easy-to-use graphical environment that lets users quickly and seamlessly extract data from Internet sources, modify it to suit their needs, and export it to templates or databases, e.g. for RSS feeds.

Tags Text Processing Markup XML HTML/XHTML Filters Internet Web Indexing/Search Information Management Workflow Frameworks Metadata/Semantic Models
Operating Systems Mac OS X
Implementation Objective C

Tweet this project Short link

Rss Recent releases

  • Rrelease-mid
  •  21 Jan 2009 07:11
  • Rrelease-after

Changes: A problem with the license key manager was fixed.

  • Rrelease-mid
  •  19 Jun 2007 11:40
  • Rrelease-after

Changes: This release has new XSLT and Filter Tag processors, a new solution "Web History to Tag Cloud", and a handful of other updates, enhancements, and fixes.

  • Rrelease-mid
  •  31 Oct 2006 09:13
  • Rrelease-after

Changes: This release increases the document canvas size to 2K square for working with more objects, adds an overview inspector window for navigating the larger document size, adds per-source User-Agent settings and stores options in an external file, includes several more text encoding settings options, and improves plugin support, plus ships with the first available plugin that enables the use of Safari History files.

  • Rrelease-mid
  •  07 Sep 2006 11:19
  • Rrelease-after

Changes: This release fixes a user-reported issue with regular expression find/replace parenthetical back expressions, and adds new sample documents showing how to convert RSS feeds to CSV for use with databases.

  • Rrelease-mid
  •  24 Aug 2006 15:33
  • Rrelease-after

Changes: This release adds a ready to run solution that converts SEC filings into the hCard microformat, as well as an example of integrating Python scripts and speaking RSS headlines. It also fixes two issues for users, one related to MySQL file descriptor resource starvation and the other in the Column Excerpt processor when using negative indexes.

Ee32f835e0097414c4d3f6846fa8e064_thumb

Project Spotlight

Stella

An Atari 2600 VCS emulator.

No-screenshot

Project Spotlight

check_procs_multi

A Nagios plugin like check_procs, but able to check several processes at once.