HtmlUnit

From Wikipedia, the free encyclopedia
Jump to: navigation, search
HtmlUnit
Htmlunit logo.png
Initial release May 22, 2002 (2002-05-22)
Stable release
2.23 / July 27, 2016 (2016-07-27)
Written in Java
Operating system Cross-platform (JVM)
Available in English
Type Web browser
License Apache License 2.0
Website htmlunit.sourceforge.net

HtmlUnit is a headless web browser written in Java. It allows high-level manipulation of websites from other Java code, including filling and submitting forms and clicking hyperlinks. It also provides access to the structure and the details within received web pages. HtmlUnit emulates parts of browser behaviour including the lower-level aspects of TCP/IP and HTTP. A sequence such as getPage(url), getLinkWith("Click here"), click() allows a user to navigate through hypertext and obtain web pages that include HTML, JavaScript, Ajax and cookies. This headless browser can deal with HTTPS security, basic HTTP authentication, automatic page redirection and other HTTP headers. It allows Java test code to examine returned pages either as text, an XML DOM, or as collections of forms, tables, and links.[1]

The goal is to simulate real browsers; namely Chrome, Firefox ESR 38, Internet Explorer 8 and 11, and Edge (experimental).

The most common use of HtmlUnit is test automation of web pages, but sometimes it can be used for web scraping, or downloading website content.

Benefits[edit]

  • Provides high-level API, taking away lower-level details away from the user.[2]
  • Compared to other WebDriver implementations, HtmlUnitDriver is the fastest to implement.[3]
  • It can be configured to simulate specific Browser.

Drawbacks[edit]

  • Elements layout and rendering can not be tested.[4]
  • The JavaScript support is still not complete. And this is usually the main area of enhancements.

Used technologies[edit]

Libraries using HtmlUnit[edit]

See also[edit]

References[edit]

Bibliography[edit]

Further reading[edit]

External links[edit]