HTMLDOC
| Developer(s) | Michael R Sweet | 
|---|---|
| Stable release | 1.8.29
   / January 3, 2016 | 
| Preview release | NA
   / NA | 
| Development status | Active | 
| Written in | C, C++ | 
| Operating system | Windows 2000+, Mac OS X v10.4+, Linux 2.4+, Solaris 8+ | 
| License | GNU GPLv2 (source) | 
| Website | www | 
HTMLDOC is a previously commercially developed open-source program that converts HTML web pages and files to indexed HTML, PostScript, and PDF files, complete with a table-of-contents and (in the current development snapshot) index. HTMLDOC can be used from the command-line, a simple GUI, or from a web server. Development originally occurred through the author's now-defunct business, Easy Software Products, and now continues on the author's personal web site.
Features and limitations
HTMLDOC 1.8.x supports most of HTML 3.2 with some elements of HTML 4.01, with no support for CSS.[1] HTMLDOC 1.9.x will support most of HTML 4.01 and CSS1, with some support for selected CSS2.x properties.[2]
HTMLDOC 1.8.x supports the following character sets: Windows-874, Windows-1250, Windows-1251, Windows-1252, Windows-1253, Windows-1255, Windows-1256, Windows-1257, Windows-1258, ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5, ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-14, ISO-8859-15, KOI8-R; you cannot mix characters from different code pages. There is no support for CJK and Arabic characters, and support for ISO-8859-13 is missing.[3] HTMLDOC 1.9.x improves the character set support and includes limited support for UTF-8.[4]
License and availability
Licensed under the terms of the GNU General Public License version 2. It is legal to compile the sources and distribute the program, and various versions can be found on the Internet. For example, HTMLDOC is included as part of the Debian operating systems.[5]