strip_html is a simple Perl filter that reads an mbox-style email message from stdin and writes the same message to stdout with all HTML stripped out. It uses w3m to format the HTML content.
| Tags | Communications, Email, Filters |
|---|---|
| Licenses | GPL |
| Implementation | Perl |
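
The filtering described above can be sketched roughly as follows. This is an illustrative Python sketch, not the tool's Perl source: the function names `render_with_w3m` and `strip_html_parts` are hypothetical, and calling w3m as `w3m -dump -T text/html` is an assumption about how the renderer might be invoked.

```python
import subprocess
from email import message_from_string, policy


def render_with_w3m(html: str) -> str:
    """Render HTML as plain text by piping it through w3m (assumed to be installed)."""
    result = subprocess.run(
        ["w3m", "-dump", "-T", "text/html"],
        input=html,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def strip_html_parts(raw_message: str, render=render_with_w3m) -> str:
    """Return the message with every text/html part replaced by rendered plain text."""
    msg = message_from_string(raw_message, policy=policy.default)
    for part in msg.walk():
        if part.get_content_type() == "text/html":
            # set_content() clears the old Content-* headers and stores text/plain.
            part.set_content(render(part.get_content()))
    # Keep the mbox "From " line only if the input message had one.
    return msg.as_string(unixfrom=bool(msg.get_unixfrom()))
```

Used as a filter, the sketch would be driven by something like `sys.stdout.write(strip_html_parts(sys.stdin.read()))`. The `render` parameter exists only so the renderer can be swapped out, for example when w3m is not available.
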
Recent releases


Changes: In this release, the code was simplified, and as a result it should now catch any MIME-compliant text/html part. Multipart/alternative MIME parts that provide a text/plain alternative part can now be skipped.
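
One possible reading of this change, sketched in Python rather than the tool's Perl: text/html parts are collected for conversion, except those inside a multipart/alternative container that already carries a text/plain part. The function name `html_parts_needing_conversion` is hypothetical, and the exact skipping rule is an assumption.

```python
from email import message_from_string, policy


def html_parts_needing_conversion(msg):
    """Yield text/html parts, skipping multipart/alternative containers
    that already offer a text/plain alternative."""
    if msg.get_content_type() == "multipart/alternative":
        subtypes = [part.get_content_type() for part in msg.iter_parts()]
        if "text/plain" in subtypes:
            return  # a plain-text version exists, so leave this container alone
    if msg.is_multipart():
        for part in msg.iter_parts():
            yield from html_parts_needing_conversion(part)
    elif msg.get_content_type() == "text/html":
        yield msg
```
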


Changes: strip_html was rewritten in Perl. It no longer depends on mimedecode. strip_html now also adds missing <html>.*</html> tags.
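
As a rough illustration of adding missing tags (a Python sketch, not the Perl code itself), a body can be wrapped only when the opening or closing tag is absent. The helper name `ensure_html_wrapper` and the exact patterns are assumptions.

```python
import re

HTML_OPEN = re.compile(r"<html\b[^>]*>", re.IGNORECASE)
HTML_CLOSE = re.compile(r"</html\s*>", re.IGNORECASE)


def ensure_html_wrapper(body: str) -> str:
    """Add <html> and </html> around the body if either tag is missing."""
    if not HTML_OPEN.search(body):
        body = "<html>\n" + body
    if not HTML_CLOSE.search(body):
        body = body.rstrip("\n") + "\n</html>\n"
    return body
```
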


Changes: The regexp to catch the "HTML" tag was extended so that it also catches "HTML" tags with embedded attributes.
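
A pattern of this kind might look like the following Python sketch. The actual regexp used by strip_html is not shown in this listing, so the pattern and the name `has_html_tag` are assumptions.

```python
import re

# Matches <html> as well as <html ...attributes...>, in any letter case.
HTML_TAG = re.compile(r"<html(\s[^>]*)?>", re.IGNORECASE)


def has_html_tag(text: str) -> bool:
    """Return True if the text contains an opening html tag, with or without attributes."""
    return HTML_TAG.search(text) is not None
```
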


Changes: The HTML tag to enclose clear text was changed from "pre" to "listing", so that it is not necessary to escape special characters in the clear text section.


Changes: A link to the mimedecode tool was added to the README. The procmail rule now saves a copy of the unstripped message in the folder unstripped.backup.