ArchiveBot, an IRC bot for archiving websites
Python 382 76
Wget-compatible web downloader and crawler.
HTML 583 78
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
Shell 89 22
Making a reusable toolkit for writing seesaw scripts
Python 70 31
URLTeam's second generation of URL shortener archiving tools
Python 75 15
Grabbing all news.
Python 62 32
Managing items for twitch-grab.
Managing items for cdnc-grab.
Archiving California Digital Newspaper Collection (CDNC).
Archiving Twitch (comments and webpages for now)
Archiving Voice of America (all sites).
Archiving URLs (outlinks) from a variety of sources.
Warrior virtual machine appliance (version 4)
Nondestructive warc-in-tar to warc conversion
Loading…