https://github.com/ikreymer/webarchive-indexing
Python41
7 years ago
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
MIT License