Reviews
A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz
Search similar apps
License
Related apps
ArchiveSpark
An Apache Spark framework for easy data processing, extraction as well as deriva
Scala145mit
2 months ago
archivesparkinternet-archivespark
WarcPartitioner
Partition (W)ARC Files by MIME Type and Year
Java1mit
8 years ago
hadoopwarcweb-archiving
Web2Warc
An easy-to-use and highly customizable crawler that enables you to create your o
Scala24mit
7 years ago