A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Updated May 10, 2025 - Python
A supercharged version of paperless: scan, index and archive all your physical documents
Continuous Archiving for Postgres
Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.
Backup and recovery manager for PostgreSQL
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
[moved to codeberg] Archive all your favorite podcasts
DocFlow is a powerful Document Management API designed to streamline document handling, including seamless uploading, downloading, organization, versioning, sharing, and more.
A command line tool to archive a git repository from GitHub to the Internet Archive.
A simple, fast, platform-independent tool for downloading Twitch streams, videos, and chat logs.
A Jupyter/Jupyterlab extension to make, download and extract archive files.
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
A dockerized, queued, high-fidelity web archiver based on Squidwarc
Tools for tracking stories on news homepages
Last.fm successor. "Weeb.fm" is definitely a temporary name.