With LinkChecker, you can check HTML documents and Web sites for broken links. It features recursion, robots.txt exclusion protocol support, HTTP proxy support, i18n support, multithreading, regular expression filtering rules for links, and user/password checking for authorized pages. Output can be colored or normal text, HTML, SQL, CSV, or a sitemap graph in DOT, GML, or XML format. Supported link types are HTTP/1.1 and 1.0, HTTPS, FTP, mailto:, news:, nntp:, Telnet, and local files.
| Tags | Internet Web Site Management Link Checking |
|---|---|
| Licenses | GPL |
| Operating Systems | OS Independent |
| Implementation | Python C |
Recent releases


Changes: A new GUI client for checking has been added, and invalid handling of persistent connections has been fixed. Python >= 2.5 is now required.


Changes: This release adds options to check the HTML and CSS syntax of downloaded pages, and parses Shockwave Flash (SWF) files for URLs to check. Pressing Ctrl-C twice aborts the program immediately. Also included are minor bugfixes regarding HTTP pipelining, HTML parsing, and mail address checks.


Changes: This release correctly handles HTTP/1.1 persistent connections, ignores commented out CSS links, and fixes open file leaks. The documentation has been improved by fixing typographic errors, documenting the --configfile option, and adding a linkcheckerrc(5) manual page.


Changes: Log output with certain Unicode characters is fixed now. XML output has been improved. Gopher URLs are now deprecated.


Changes: Errors during log file opening are now ignored. A problem with certain URL redirects has been fixed. The default config file syntax no longer has broken comment lines.