browsertrix-crawler

Run a high-fidelity browser-based crawler in a single Docker container

License

GNU Affero General Public License v3.0

Creator

webrecorder

Related apps

archiveweb.page

A High-Fidelity Web Archiving Extension for Chrome and Chromium-based browsers!

JavaScript · 798 · agpl-3.0

3 months ago

archiving, browser-extension, chromium

pywb

Core Python Web Archiving Toolkit for replay and recording of web archives

JavaScript · 1342 · gpl-3.0

4 months ago

python, pywb, wayback

warcio

Streaming WARC/ARC library for fast web archive IO

Python · 364 · apache-2.0

5 months ago

python, pywb, warc

browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from We

TypeScript · 149 · agpl-3.0

3 months ago

archiving, cloud, kubernetes