awesome-web-archiving
An Awesome List for getting started with web archiving
2058cc0-1.0
16 days ago
awesomeawesome-listwebarchiving
wayback
An archiving tool with an IM-style interface that prioritizes privacy and access
Go1719gpl-3.0
4 months ago
archiveharheroku
pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
JavaScript1413gpl-3.0
8 days ago
pythonpywbwayback
acts_as_archival
An ActiveRecord plugin for atomic archiving and unarchiving of object trees. Ins
Ruby128mit
7 months ago
scoop
🍨 High-fidelity, browser-based, single-page web archiving library and CLI for w
JavaScript117mit
2 days ago
archiveweb.page
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
TypeScript863agpl-3.0
15 hours ago
archivingbrowser-extensionchromium
wail
:whale2: Web Archiving Integration Layer: One-Click User Instigated Preservation
Roff345mit
6 months ago
guiheritrixopenwayback
nb
CLI and local web plain text note‑taking, bookmarking, and archiving with linkin
Shell6699agpl-3.0
16 days ago
archivingbashbookmark-manager
ganymede
Twitch VOD and Live Stream archiving platform. Includes a rendered and real-time
Go494gpl-3.0
3 days ago
archivearchivedchat
node-archiver
a streaming interface for archive generation
JavaScript2823mit
22 days ago
archiverjavascriptnodejs
ArchiveSpark
An Apache Spark framework for easy data processing, extraction as well as deriva
Scala145mit
2 months ago
archivesparkinternet-archivespark
auto-archiver
Automatically archive links to videos, images, and social media content from Goo
Python570mit
2 months ago
archivedockeropen-source-research
archiver
Easily create & extract archives, and compress & decompress files of various for
Go4296mit
5 months ago
7ziparchivesbrotli
browsertrix
Browsertrix is the hosted, high-fidelity, browser-based crawling service from We
TypeScript149agpl-3.0
4 months ago
archivingcloudkubernetes
browsertrix-crawler
Run a high-fidelity browser-based web archiving crawler in a single Docker conta
TypeScript653agpl-3.0
12 hours ago
crawlercrawlingwacz
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/P
Python22394mit
2 days ago
archiveboxbackupsbookmark-archiver
buttercup-mobile
Buttercup is an open-source password manager, available on all ma
TypeScript400gpl-3.0
2 months ago
buttercuphacktoberfestmobile
playcanvas-rest-api-tools
A set of tools to use with the PlayCanvas REST API for common jobs such as downl
JavaScript23mit
3 months ago
arch
Web application for distributed compute analysis of Archive-It web archive colle
Scala15agpl-3.0
3 months ago
react-native-zip-archive
Zip archive utility for react-native
Java428mit
18 days ago
androidiosreact-native
Share-2-Archive-Today
Simple Android app to add an icon to your share menu to publicly archive a url
Kotlin6gpl-3.0
19 days ago
androidarchive
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archi
Scala137apache-2.0
9 months ago
analysisapache-sparkbig-data
RVD
Robot Vulnerability Database. An archive of robot vulnerabilities and bugs.
Python168gpl-3.0
5 months ago
bountybugcybersecurity
freezefs
Create self-extracting compressed or self-mounting archives for MicroPython
Python21mit
7 months ago
Openlib
An Open source app to download and read books from shadow library (Anna’s Archiv
Dart1178gpl-3.0
19 days ago
androidannas-archivebooks
Mink
Chrome extension that uses Memento to indicate that a page a user is viewing on
JavaScript49mit
last month
chromeextensioninternet-archive
geojsonhint
IMPORTANT: This repo will be archived. Use @placemarkio/check-geojson instead
JavaScript258isc
6 months ago
thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cc
C++4926other
9 months ago
algorithmscppcpp11
archivenow
A Tool To Push Web Resources Into Web Archives
Python398mit
10 months ago
internet-archiveweb-archiving
ipwb
InterPlanetary Wayback: A distributed and persistent archive replay system using
Python604mit
4 months ago
dockeripfsmemento
otwarchive
The Organization for Transformative Works (OTW) - Archive Of Our Own (AO3) Proje
Ruby1404gpl-2.0
2 days ago
warcworker
A dockerized, queued high fidelity web archiver based on Squidwarc
Python54gpl-3.0
4 months ago
archivinghigh-fidelity-preservationpreservation
relate
[ARCHIVED] experimenting web app with React + GraphQL + Next.js
JavaScript333gpl-3.0
6 months ago
apollographqlgraphqlnextjs
ReflectionMagic
Framework to drastically simplify your private reflection code using C# dynamic
C#331apache-2.0
10 months ago
papermerge
Open Source Document Management System for Digital Archives (Scanned Documents)
Python2434apache-2.0
8 months ago
archivesdjangodms
usage
A Google Analytics wrapper for command-line, web, and Flutter apps.
Dart146bsd-3-clause
5 months ago
demo_web_zip_wasm
A simple example program for creating password archives in ZIP format, running i
Rust4
6 months ago
libcudacxx
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.c
C++2297other
9 months ago
cppcpp11cpp14
JMeterHARImporterPlugin
A JMeter plugin that lets you select a HAR (HTTP Archive) file to import into JM
Java8mit
29 days ago
http-archivejmeterjmeter-plugin
algo
WireGuard is a fast, modern, and secure VPN tunnel. This app allo
Jinja28999agpl-3.0
3 months ago
ansibleencryptionikev2
PDF-Archiver
Organize your documents digitally. Just scan and tag them with th
Swift305other
16 days ago
archivearchivingarchivist
library
90+ CLI tools to build, browse, and blend your media library: an index for your
Python374bsd-3-clause
8 days ago
broadcatchingclicommand-line
chatnoir-resiliparse
A robust web archive analytics toolkit
Cython64apache-2.0
4 months ago
bigdatacppcython
load-testing
A collection of best practices, workflows, scripts and scenarios that Cloud Poss
JavaScript54apache-2.0
11 months ago
docker-composegrafanak6
tfmask
Terraform utility to mask select output from `terraform plan` and `terraform app
Go202apache-2.0
11 months ago
maskmaskingregex
go-unarr
Go bindings for unarr (decompression library for RAR, TAR, ZIP and 7z archives)
Go280zlib
7 months ago
7z-archives7zipdecompression-library
Omeka
A flexible web publishing platform for the display of library, museum and schola
PHP487gpl-3.0
8 days ago
raar
RAAR is a ruby application to manage and browse an audio archive.
Ruby20agpl-3.0
2 months ago
archiveradiorails
annotate-pull-request-from-checkstyle
cs2pr - Annotate a GitHub Pull Request based on a Checkstyle XML-report within y
PHP192mit
last month
annotationscheckstylecheckstyle-xml-report
omeka-s
Omeka S is a web publication system for universities, galleries, libraries, arch
PHP408gpl-3.0
20 hours ago
cmslinked-dataphp
awesome-ai4lam
A list of awesome AI in libraries, archives, and museum collections from around
SCSS74cc0-1.0
5 months ago
aiai-in-librariesartificial-intelligence
LinkAce
LinkAce is a self-hosted archive to collect links of your favorite websites.
PHP2646gpl-3.0
2 days ago
archivearchivingbookmark-manager
UnifiedArchive
UnifiedArchive - an archive manager with unified interface for different formats
PHP274mit
3 months ago
7ziparchivesarchiving
archivesspace
ArchivesSpace, the archives management tool
Ruby354other
21 hours ago
archivesarchivesspacecultural-heritage
Cortex-Command-Community-Project-Source
[ARCHIVED] Cortex Command - Open Source under GNU AGPL v3 (no game data included
C++201agpl-3.0
11 months ago
cortex-command
global-indicators
ARCHIVE COPY (see https://github.com/healthysustainablecities/global-indicators)
Jupyter Notebook0mit
5 months ago
paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive
Python18423gpl-3.0
4 months ago
angulararchivingdjango
wikiteam
Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to
Python709gpl-3.0
6 months ago
archive-wikisbackupdigital-preservation
Save-app-android-old
Save preserves & safeguards your media and identity against inter
Kotlin96gpl-3.0
2 days ago
beagle-im
BeagleIM is a lightweight and powerful XMPP client developed by Tigase,
Swift184gpl-3.0
22 days ago
appappleapplication
quick-look-plugins
Accelerate your workflow with the Quick Look conveniences that on
17951
12 months ago
tutanota
PROTECT YOUR PRIVACY WITH TUTA MAIL FOR FREE: SECURE, PRIVATE AND
TypeScript6114gpl-3.0
5 hours ago
emailencryptionjavascript
fbarc
A commandline tool and Python library for archiving data from Facebook using the
Python77cc0-1.0
7 years ago
code4libfacebook-graph-api
ArchiveTools
A collection of tools for archiving and analysing the internet.
Python68gpl-3.0
2 years ago
zotero-memento
Zotero extension that combats link rot by archiving webpages and journal article
JavaScript283mit
2 years ago
awscli-cookbook
ARCHIVED: Installs the AWS Command Line Interface tools and provides a set of LW
Ruby38apache-2.0
7 years ago
JavaScriptServices
[Archived] This repository has been archived
C#3039apache-2.0
5 years ago
aspnet-productdotnet-template
PivotalR-archive
A convenient R tool for manipulating tables in PostgreSQL type databases and a
R126
2 years ago
notebooks
Various examples of notebooks for working with web archives with the Archives Un
Jupyter Notebook22apache-2.0
2 years ago
juypter-notebooknotebookspyspark-notebook
archive
A Common Lisp library for reading archive (tar, cpio, etc.) files
Common Lisp30bsd-3-clause
7 years ago
aws-dynamodb-session-tomcat
ARCHIVED: Amazon DynamoDB based session store for Apache Tomcat
Java95apache-2.0
7 years ago
aws-tvm-anonymous
ARCHIVED: Token Vending Machine for Anonymous Registration
Java34apache-2.0
7 years ago
aws-tvm-identity
ARCHIVED: Token Vending Machine for Identity Registration
Java37apache-2.0
7 years ago
kinesis-log4j-appender
ARCHIVED: Log4J Appender for writing data into a Kinesis Stream
Java62apache-2.0
6 years ago
hatchet-v1-archived
An all-in-one Terraform management tool.
Go156mit
last year
continuous-deliverycontinuous-integrationdevops
Karabiner-archived
Karabiner (KeyRemap4MacBook) is a powerful utility for keyboard customization.
C++3818unlicense
5 years ago
monkeylearn
:no_entry: ARCHIVED :no_entry: Accesses the Monkeylearn API for Text Classifiers
R93
3 years ago
classifierextractormonkeylearn
aws-apigateway-importer
Tools to work with Amazon API Gateway, Swagger, and RAML
Java518apache-2.0
8 years ago
ShowcaseView
[Archived] Highlight the best bits of your app to users quickly, simply, and coo
Java5604
7 years ago
pcap2har
A converter from .pcap network capture files to HTTP Archive files.
Python234bsd-2-clause
6 years ago
amp
** THIS PROJECT IS STOPPED ** An open source CaaS for Docker, batteries included
Go81apache-2.0
7 years ago
caascloudcluster
twut
An open-source toolkit for analyzing line-oriented JSON Twitter archives with Ap
Scala9apache-2.0
2 years ago
apache-sparksparkspark-packages
Razor
[Archived] Parser and code generator for CSHTML files used in view pages for MVC
C#883apache-2.0
6 years ago
aspnet-product
SignalR
[Archived] Incredibly simple real-time web for ASP.NET Core. Project moved to ht
C#2382apache-2.0
6 years ago
aspnet-product
tsarchive
Consume data streams from a Kafka topic, archive the data packets into the TDEng
C3gpl-3.0
5 years ago
kafkaminiseedtdengine
DeadCScroll
An assembly tutorial for Game Boy showing how the scroll registers can be exploi
21unlicense
4 years ago
Web2Warc
An easy-to-use and highly customizable crawler that enables you to create your o
Scala24mit
7 years ago
emojione
[Archived] The world's largest independent emoji font. Maintained at https://git
PHP4458other
5 years ago
ui
Highly customizable, themeable components for React Native
JavaScript106mit
5 years ago
androidcomponentsios
vue-routisan
Archived – new package coming soon!
JavaScript203other
2 years ago
laravelnavigation-guardsrouter
CoreDataDandy
A feature-light wrapper around Core Data that simplifies common database operati
Swift34other
5 years ago
gatekeeper
Rate limiting middleware for Vapor 👮
Swift74mit
3 years ago
rate-limitingrate-limitsserver-side-swift
submissions
Provides a common structure to deal with data based API requests
Swift14mit
3 years ago
validationvaporvapor-3
fontfinder
GTK application for browsing and installing fonts from Google's font archive
Rust276mit
2 years ago
font-findergtkrust
awsbox
INACTIVE - http://mzl.la/ghe-archive - A featherweight PaaS on top of Amazon EC2
JavaScript809other
6 years ago
inactiveunmaintained
pyramid_ipauth
INACTIVE - http://mzl.la/ghe-archive - a pyramid authentication policy based on
Python11
5 years ago
inactiveunmaintained
iprfc
RFC downloader and archiver that stores RFCs on IPFS, and indexes them against L
Go4agpl-3.0
5 years ago
AthenaX
SQL-based streaming analytics platform at scale
Java1224apache-2.0
4 years ago
analyticscalcitedata
makisu
Fast and flexible Docker image building tool, works in unprivileged containerize
Go2410apache-2.0
4 years ago
ci-cdcontainerdocker
admiral
Container management solution with an accent on modeling containerized applicati
Java255other
3 years ago
cscore
Camera access and streaming library (ARCHIVED, merged into allwpilib)
C++24other
7 years ago
cameracamera-accesscameraserver
floatlabelededittext
Floating hint from edit text - inspired by Matt D. Smith's design: http://dribbb
Java1141
9 years ago
awesome-website-change-monitoring
A curated list of awesome tools for website diffing and change monitoring.
488cc0-1.0
2 years ago
awesome-listchange-detectordiff
classicswarm
Swarm Classic: a container clustering system. Not to be confused with Docker Swa
Go5756apache-2.0
4 years ago
paperless
Scan, index, and archive all of your paper documents
Python7854gpl-3.0
4 years ago
archivingdocumentsocr
warclight
A Rails engine supporting the discovery of web archives.
Ruby48other
last year
blacklightdiscoveryrails
marathon
Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Scala4065apache-2.0
2 years ago
dcosdcos-orchestration-guild
stagehand
Dart project generator - web apps, console apps, servers, and more.
Dart649bsd-3-clause
3 years ago
mlem
🐶 A tool to package, serve, and deploy any ML model on any platform. Archived t
Python717apache-2.0
last year
clidata-sciencedeployment
Serpent
A protocol to serialize Swift structs and classes for encoding and decoding.
Swift286mit
3 years ago
alamofirecarthagecocoapods
GenesisEngine
Experiments with procedurally-generated worlds, XNA, and design patterns
C#41ms-pl
6 years ago
aws-sdk-unity
ARCHIVED: The aws sdk for unity is now distributed as a part of aws sdk for dotn
C#105other
9 years ago
aws-sdkc-sharpunity
awsmobile-cli
CLI experience for Frontend developers in the JavaScript ecosystem.
JavaScript142apache-2.0
5 years ago
awsaws-apigatewayaws-cloudfront
aws-model-validators
Tools for validating the AWS service JSON model files.
Ruby9apache-2.0
10 years ago
aws-sdk-arduino
An experimental SDK for working with AWS Services on Arduino-compatible devices.
C++91apache-2.0
8 years ago
amazon-web-servicesarduino-ideaws
aws-sdk-react-native
AWS SDK for React Native (developer preview)
JavaScript631apache-2.0
6 years ago
aws-sdkcloud
dynamodb-janusgraph-storage-backend
The Amazon DynamoDB Storage Backend for JanusGraph
Java446apache-2.0
3 years ago
dynamodb-online-index-violation-detector
A tool for Amazon DynamoDB to find violations on an online GSI's hash key and ra
Java9apache-2.0
5 years ago
ecs-cloudwatch-logs
This repository provides the assets referred to in the blog post on using Amazon
69apache-2.0
10 years ago
logstash-output-cloudwatchlogs
A Logstash plugin for sending logs to the AWS CloudWatch Logs service.
Ruby37other
7 years ago
opsworks-capistrano
This repository has examples of using Capistrano with instances managed by AWS O
Ruby8apache-2.0
5 years ago
opsworks-first-cookbook
AWS OpsWorks cookbook used to demonstrate simple recipes to get started
Ruby8apache-2.0
5 years ago
service-discovery-ecs-consul
This repository provides the assets referred to in the blog post "Service Discov
HTML108apache-2.0
4 years ago
skill-sample-nodejs-calendar-reader
An Alexa Skill Sample showing how to import calendar data from an .ICS file.
JavaScript75other
6 years ago
timely-security-analytics
Demo code for the Timely Security Analytics and Analysis 2015 Re:Invent presenta
Scala29other
5 years ago
todo-sample-app
Sample Ruby on Rails application demonstrating integration with AWS services.
Ruby27other
5 years ago
Identity
[Archived] ASP.NET Core Identity is the membership system for building ASP.NET C
C#1967apache-2.0
6 years ago
aspnet-product
Security
[Archived] Middleware for security and authorization of web apps. Project moved
C#1266apache-2.0
6 years ago
aspnet-product
terraform-aws-jenkins
Terraform module to build Docker image with Jenkins, save it to an ECR repo, and
HCL256apache-2.0
last year
cicdcodebuildcodepipeline
gtfsrdb
GTFSrDB is a tool to archive gtfs-realtime data to a database.
Python38other
2 years ago
databasegtfsgtfs-realtime
zzarchive-Vulpes
Vulpes: a Deep Belief Net written in F#, and using Alea.cuBase to access the GPU
JavaScript116mit
7 years ago
gambatte
Gambatte source public mirror after official upstream has been made private
Assembly4gpl-2.0
3 years ago
objective-c-style-guide
**Archived** Style guide & coding conventions for Objective-C projects
1675
7 years ago
swift-style-guide
**Archived** Style guide & coding conventions for Swift projects
4776cc0-1.0
7 years ago
twitter-together
⚠️ Archived. See new version at https://github.com/twitter-together/action
JavaScript4mit
2 years ago
GTFS-route-shapes
A simple script to generate a single geoJSON shape for each transit route in a G
Python19
9 years ago
har-rs
An HTTP Archive format (HAR) serialization & deserialization library, written in
Rust41mit
last year
charles-proxydeserializationhar
Bauglir-WebSocket-2
Copy of https://code.google.com/archive/p/bauglir-websocket/
Pascal16
6 years ago
freepascallazaruspascal
keychain
Easily scaffold a keychain using JWT for Vapor ⛓
Swift40mit
3 years ago
jwtkeychainserver-side-swift
storage
Eases the use of multiple storage and CDN services 🗄
Swift66mit
4 years ago
brtoserver-side-swiftstorage
StringDB
StringDB is a modular, key/value pair archival DB designed to consume *tiny* amo
C#71mit
2 years ago
databaseeasy-to-uselightweight
mozfest-program
INACTIVE - http://mzl.la/ghe-archive - Where we're reviewing and scheduling the
45
6 years ago
inactiveunmaintained
node-warc
Parse And Create Web ARChive (WARC) files with node.js
JavaScript93mit
2 years ago
chrome-remote-interfacepupeteerwarc
noc-book
The Nature of Code book (archived repo, see README for new repo / build system!)
JavaScript1938
5 years ago
sbt-docker-compose
Integrates Docker Compose functionality into sbt (archived as unmaintained)
Scala178bsd-3-clause
2 years ago
dockerdocker-composesbt
kwin-lowlatency
archived - X11 full-screen unredirection and lots'a settings for KWin
C++372
2 years ago
kdekwin
lametric-notify-plus
A better way to send Android notifications to a LaMetric Time smart clock.
Java2gpl-3.0
4 years ago
plato-research-dialogue-system
This is the Plato Research Dialogue System, a flexible platform for developing c
Python976apache-2.0
4 years ago
conversational-agentconversational-aiconversational-ui
anonymizer
**ARCHIVED** An anonymizer to obfuscate faces and license plates.
Python262apache-2.0
3 years ago
py-wasapi-client
A client for the Archive-It And Webrecorder WASAPI Data Transfer API
Python14bsd-3-clause
5 years ago
webcommander
Powerful, flexible, intuitive and most importantly simple. That is what a real a
PowerShell165mit
6 years ago
amazon-cognito-streams-sample
Sample demonstrating consuming Amazon Cognito Streams
Java9mit-0
4 years ago
amazon-dsstne
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed libra
C++4410apache-2.0
5 years ago
route.dart
MOVE to https://github.com/dart-lang/angular/tree/master/angular_router
Dart29bsd-3-clause
7 years ago
CodeXL
CodeXL is a comprehensive tool suite that enables developers to harness the bene
C++995other
5 years ago
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome
JavaScript166apache-2.0
5 years ago
browser-automationchromechrome-headless
build-podcast
[ARCHIVED] :nut_and_bolt: Build Podcast is a show about technology tools for des
HTML299
6 years ago
Carlos
What defines the WELT Edition: 365 issues per year, daily from
Swift644mit
last year
appscacheios
amazon-kinesis-data-visualization-sample
Amazon Kinesis Data Visualization Sample Application
JavaScript171mit-0
6 years ago
cloudwatch-logs-subscription-consumer
A specialized Amazon Kinesis stream reader (based on the Amazon Kinesis Connecto
Java397other
7 years ago
dynamodb-import-export-tool
Exports DynamoDB items via parallel scan into a blocking queue, then consumes th
Java90apache-2.0
5 years ago
django-graphiql
[DEPRECATED | Use graphene-django package] Integrate GraphiQL easily into your D
CSS35
8 years ago
libbson
ARCHIVED - libbson has moved to https://github.com/mongodb/mongo-c-driver/tree/m
C347apache-2.0
4 years ago
glab
The GitLab CLI tool. Archived: now officially adopted by GitLab as the official
Go2075mit
2 years ago
clicommand-linecustom-gitlab-cli
react-native-google-analytics
Google Analytics for React Native! Compatible with react-native-ab
JavaScript383mit
5 years ago
callisto
A control toolkit for Windows 8 XAML applications. Contains some UI controls to
C#338other
9 years ago
GitHub-Dark-Script
Archived - Please use https://github.com/StylishThemes/GitHub-Dark directly
JavaScript550mit
5 years ago
dark-themegithubgreasemonkey
twitter-kit-ios
Twitter Kit is a native SDK to include Twitter content inside mobile apps.
Objective-C690apache-2.0
6 years ago
logstash-input-dynamodb
This input plugin for Logstash scans a specified DynamoDB table and then reads c
Ruby105apache-2.0
3 years ago
terraform-provider-docker
As part of our introduction to self-service publishing in the Terraform Registry
Go132mpl-2.0
4 years ago
dockerterraformterraform-provider