Its source code is written in ANSI C for maximum portability and precompiled binaries are available for a variety of platforms. It is available under the W3C license (a permissive, BSD-style license). Up-to-date versions are currently only available only as source code, cloned from its Github git version control repository.
Examples of fixes it can make to bad HTML:
Straighten mixed-up tags
Fix missing or mismatched end tags
Add missing items (some tags, quotes, ...)
Report proprietary HTML extensions
Change layout of markup to predefined style
Transform characters from some encodings into HTML entities