User:RotlinkBot

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Technical details[edit]

RotlinkBot is written in Scala. It uses Wiki.java to work with the MediaWiki API and Sweble to parse wiki markup and manipulate Wiki DOM. It uses Apache Accumulo for persistent data storage.

Task[edit]

Find dead links (mostly by looking for {{dead link}} marks next to them) and try to recover them by searching web archives using Memento protocol; see also Wikipedia:Bots/Requests for approval/RotlinkBot.

The current version of the bot software does not work with the other, non Memento-compatible, archives (WebCite, WikiWix, Archive.pt, ...).

During the test run, about 3/4 of recovered links were found on Internet Archive (because it has the biggest and oldest database), about 1/4 on Archive.is (because of its proactive archiving of the new links appearing on the Wikis) and only few links on the other archives (because of their smaller size and regional specific).