archiving

Here are 118 public repositories matching this topic...

paperless-ngx / paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

pdf machine-learning django angular ocr archiving dms document-management optical-character-recognition document-management-system

Updated May 10, 2025
Python

the-paperless-project / paperless

Scan, index, and archive all of your paper documents

search ocr paper archiving documents

Updated Apr 6, 2021
Python

jonaswinkler / paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents

search machine-learning django angular ocr archiving full-text-search dms document-management-system

Updated Feb 14, 2023
Python

wal-e / wal-e

Continuous Archiving for Postgres

python backups postgres backup replication blob-store archiving postgresql s3 google-cloud-storage openstack-swift recovery azure-blob pitr

Updated Dec 20, 2023
Python

ArchiveTeam / grab-site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

crawler spider archiving crawl warc

Updated Jul 7, 2024
Python

JosephLai241 / URS

Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.

Updated Oct 18, 2023
Python

postgrespro / pg_probackup

Backup and recovery manager for PostgreSQL

backup restore archiving postgresql wal recovery incremental-backups

Updated Oct 10, 2024
Python

ArchiveTeam / ArchiveBot

ArchiveBot, an IRC bot for archiving websites

javascript ruby python irc archiving haxe

Updated Apr 6, 2025
Python

VIDA-NYU / reprozip

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

python linux docker science vagrant reproducible-research archiving reproducible-science scientific-computing ptrace reproducibility computational-science reprozip reprounzip hacktoberfest nyu

Updated Dec 2, 2024
Python

palewire / archiveis

A simple Python wrapper for the archive.is capturing service

python api news archiving

Updated Feb 11, 2025
Python

palewire / savepagenow

A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service

python api news archiving internetarchive command-line-interface

Updated Oct 15, 2024
Python

janw / podcast-archiver

[moved to codeberg] Archive all your favorite podcasts

python rss podcast archiving podcasts feedparser moved-to-codeberg

Updated Mar 17, 2025
Python

jiisanda / docflow

DocFlow is a powerful Document Management API designed to streamline document handling, including seamless uploading, downloading, organization, versioning, sharing, and more.

api docker nginx rest ec2 docker-compose archiving postgresql s3 versioning jwt-authentication document-management access-control-list document-management-system pydantic fastapi document-sharing

Updated Dec 27, 2024
Python

gdamdam / iagitup

A command line tool to archive a git repository from GitHub to the Internet Archive.

github git cli archiving archive internet-archive internetarchive

Updated Feb 15, 2021
Python

Brisppy / twitch-archiver

A simple, fast, platform-independent tool for downloading Twitch streams, videos, and chat logs.

python downloader twitch archiving

Updated Apr 30, 2025
Python

jupyterlab-contrib / jupyter-archive

A Jupyter/JupyterLab extension to make, download, and extract archive files.

archiving jupyterlab jupyterlab-extensions jupyterlab-extension

Updated Mar 27, 2025
Python

Own-Data-Privateer / hoardy-web

Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.