Waybackpack might be a bit more popular than Archive-It. We know about 3 links to it since March 2021 and only 3 links to Archive-It. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Other projects include Open Library & archive-it.org. Source: over 1 year ago
The biggest hurdle in DH of the early internet and manipulating the data at scale is creating collections from archived material that fit your needs. The internet archive has a subscription-based archival tool Archive-It but the collection starts when you (or some other person) creates it. There isn't a collection for anti-vaccine material from 1996-. With Covid, many collections have been started but these will... Source: almost 2 years ago
Archive-It is a subscription web archiving service from the Internet Archive that helps organizations to harvest, build, and preserve collections of digital content. Through our user friendly web application Archive-It partners can collect, catalog, and manage their collections of archived content with 24/7 access and full text search available for their use as well as their patrons. Source: almost 2 years ago
Thank you! But the script but the only thing that really deserves credit is Jeremy Singer-Vine's https://github.com/jsvine/waybackpack library. Pretty much made this a very straightforward task. - Source: Hacker News / 10 months ago
> Is there some straightforward way to list all of archive.org's snapshots (of a particular site) without a javascript-enabled browser? I use https://github.com/jsvine/waybackpack.- Source: Hacker News / almost 2 years ago$ waybackpack --list https://diziet.dreamwidth.org/11840.html.
Which paid services are you referring to? It is likely that these services aren't distributing the projects they are based on, if so, then they are in compliance with the licenses of the open source projects, which don't require attribution unless you distribute them. This project started in 2015 btw. Another similar project called waybackpack started in 2016. There are probably more projects. IMO... - Source: Hacker News / almost 3 years ago
Archive.md - archive.is allows you to create a copy of a webpage that will always be up even if the original link is down
Famous First Websites - Discover what popular startup websites looked like at launch
Wayback Machine - Browse through over 150 billion web pages archived from 1996 to a few months ago.
Perma.cc - Perma.
Cached View - CachedView claimed to be the provider of Google caches pages of any website at the moment listed in the index of the Google search engine.
ArchiveBox - The open-source, self-hosted internet archiving solution