Jericho HTML Parser

Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognized or invalid HTML. It also provides high-level HTML form manipulation functions.

Tags Text Processing Markup HTML/XHTML Software Development Libraries Java Libraries Internet Web Dynamic Content
Licenses LGPL
Operating Systems OS Independent
Implementation Java

Tweet this project Short link

Rss Recent releases

  • Rrelease-mid
  •  11 Jun 2009 12:09
  • Rrelease-after

    Changes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.

    • Rrelease-mid
    •  10 Apr 2009 10:06
    • Rrelease-after

      Changes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.

      • Rrelease-mid
      •  25 Jun 2008 06:56
      • Rrelease-after

      Changes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.

      • Rrelease-mid
      •  02 Sep 2007 05:41
      • Rrelease-after

      Changes: This version includes important bugfixes and introduces the following minor enhancements: elements inside SCRIPT elements are ignored. Encoding detection and analysis were improved. Parsing of attributes containing server tags was improved.

      • Rrelease-mid
      •  20 May 2007 04:30
      • Rrelease-after

      Changes: This version has been released under a dual licence system, allowing a choice between the Eclipse Public License (EPL) and the LGPL. It includes important bugfixes and introduces the following major features: simple rendering of HTML markup into text, integrated logging with various logging frameworks, and easier parsing of HTML tags containing server tags.

      30608e18bc89fb17b2b8c944c325e5aa_thumb

      Project Spotlight

      Mac Mass Mailer

      A fully-featured mass mailer to work with mailing lists.

      36273b11553c2a68206b853ce007139f_thumb

      Project Spotlight

      Mutt Folder List

      A mutt patch that adds a sidebar showing all mail folders.