Wikipedia Bans Archive.today: DDoS Attack & Content Alteration Concerns
- Wikipedia has effectively banned Archive.today, a web archiving service, removing links to the site from over 695,000 pages.
- Archive.today, also known as archive.is and archive.ph, gained popularity as a tool to circumvent paywalls and access archived versions of web pages.
- The immediate trigger for the ban was the discovery that Archive.today was used to launch a distributed denial-of-service (DDoS) attack against the blog Gyrovague, run by Jani Patokallio.
Wikipedia has effectively banned Archive.today, a web archiving service, removing links to the site from over 695,000 pages. The decision, finalized on , stems from concerns over malicious activity and data integrity, marking a reversal of previous policies regarding the archive site.
Archive.today, also known as archive.is and archive.ph, gained popularity as a tool to circumvent paywalls and access archived versions of web pages. Its utility for providing verifiable sources on Wikipedia was previously acknowledged, but recent events have led to a decisive shift in policy.
The immediate trigger for the ban was the discovery that Archive.today was used to launch a distributed denial-of-service (DDoS) attack against the blog Gyrovague, run by Jani Patokallio. Beginning on , visitors to Archive.today’s CAPTCHA page were unknowingly executing JavaScript code that repeatedly sent requests to Patokallio’s blog. This action, intended to overwhelm the blog’s server and increase hosting costs, raised serious concerns about the site’s ethical and security practices.
However, the DDoS attack wasn’t the sole reason for the ban. Evidence emerged indicating that Archive.today’s operators had altered archived webpages, specifically inserting Patokallio’s name into snapshots. This manipulation of archived content directly undermined the site’s reliability as a source for Wikipedia, a platform heavily reliant on verifiable information. The Wikipedia community determined that directing readers to a site capable of altering historical records was unacceptable.
Patokallio’s investigation into Archive.today, dating back to , had previously highlighted the opacity surrounding the site’s ownership. He speculated that the service was likely a “one-person labor of love, operated by a Russian of considerable talent and access to Europe.” His attempts to understand the site’s infrastructure and motivations were met with resistance, culminating in what he described as “an increasingly unhinged series of threats” from the webmaster after he declined a request to remove his investigative blog post.
According to emails shared by Patokallio, the Archive.today webmaster expressed frustration with media coverage, claiming that journalists were selectively quoting his statements and misrepresenting his intentions. He argued that the site’s value wasn’t primarily about bypassing paywalls, but rather “the ability to offload copyright issues.”
This isn’t the first time Archive.today has faced scrutiny from Wikipedia. The site was previously blacklisted in , only to be removed from the blacklist in . The current decision represents a firm rejection of the site, with Wikipedia editors stating there is “consensus to immediately deprecate archive.today” and add it to the spam blacklist.
The deprecation process involves removing existing links to Archive.today across the encyclopedia and preventing the addition of new ones. Editors are encouraged to replace these links with references to the original source material or alternative archiving services, such as the Wayback Machine.
The apparent owner of Archive.today, writing on a LiveJournal blog, acknowledged the situation, stating they would “scale down the ‘DDoS’.” They also questioned why the media hadn’t reported on the events earlier, suggesting a bias influenced by Patokallio’s involvement. They framed the situation as a matter of copyright and the site’s ability to navigate complex legal issues surrounding online content.
The ban on Archive.today underscores the ongoing challenges of maintaining data integrity and security in the digital age. For Wikipedia, it highlights the importance of carefully vetting external sources and prioritizing the reliability of information presented to its readers. The incident also raises broader questions about the responsibility of web archiving services and the potential for malicious use of these platforms.
