https://github.com/ikreymer/webarchive-indexing
Python40
6 years ago
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
MIT License