jsoup

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Antonio3.14 (talk | contribs) at 20:30, 15 November 2020 (See also: Added link to Flying Saucer). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

jsoup Java HTML Parser
Developer(s): Jonathan Hedley
Stable release: 1.13.1 / February 29, 2020[1]
Written in: Java
Operating system: Cross-platform
Platform: Java (JVM)
Type: HTML parser
License: MIT license
Website: jsoup.org

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.
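
As a rough illustration of the three operations named above (parse, extract, manipulate), the following is a minimal sketch and not an excerpt from the jsoup documentation. It assumes a reasonably recent jsoup release (1.11.1 or later, for `selectFirst`) is on the classpath. The class name `JsoupSketch`, the sample markup, and the `https://example.com/` base URI are all hypothetical and chosen only for the example.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class JsoupSketch {
    // Hypothetical sample markup, invented for this example.
    static final String HTML =
        "<html><head><title>Sample page</title></head>"
        + "<body><p>Hello <a href=\"/about\">about us</a></p></body></html>";

    // Parse: build a document tree from an HTML string.
    // The base URI (an assumption here) is used to resolve relative links.
    static Document parse() {
        return Jsoup.parse(HTML, "https://example.com/");
    }

    // Extract: locate the first link with a CSS selector and
    // return its href resolved against the base URI.
    static String firstLinkUrl(Document doc) {
        Element link = doc.selectFirst("a[href]");
        return link == null ? "" : link.absUrl("href");
    }

    // Manipulate: add a rel attribute to the first link and replace its text.
    static Element retargetFirstLink(Document doc) {
        Element link = doc.selectFirst("a[href]");
        if (link != null) {
            link.attr("rel", "nofollow").text("About");
        }
        return link;
    }

    public static void main(String[] args) {
        Document doc = parse();
        System.out.println(doc.title());
        System.out.println(firstLinkUrl(doc));
        System.out.println(retargetFirstLink(doc).outerHtml());
    }
}
```

Selecting elements with CSS selectors, as done here, is only one of the ways jsoup exposes the parsed tree; the example keeps to a handful of calls to stay short.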

History

jsoup was written in 2009 by Jonathan Hedley, a software development manager at Amazon in Seattle. He distributes it under the MIT License, a permissive free software license comparable to the Creative Commons Attribution license.

Hedley's stated intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup

jsoup is used in a number of current projects,[2] including the OpenRefine data-wrangling tool (formerly Google Refine).

References

  1. "jsoup Java HTML Parser release 1.13.1". Retrieved 7 March 2020.
  2. "Jsoup". MVNRepository / F. Rodriguez. 8 March 2015.