node-crawler
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
JavaScript6617mit
4 months ago
cheeriocrawlerextract-data
storm-crawler
A scalable, mature and versatile web crawler based on Apache Storm
HTML841apache-2.0
2 months ago
apache-stormcrawlerdistributed
browsertrix-crawler
Run a high-fidelity browser-based crawler in a single Docker container
JavaScript500agpl-3.0
2 months ago
crawlercrawlingwacz
grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic igno
Python1233other
3 months ago
archivingcrawlcrawler
librengine
Privacy Web Search Engine (not meta, own crawler)
C++58agpl-3.0
10 months ago
cppcrawlerencryption
Photon
Incredibly fast crawler designed for OSINT.
Python10503gpl-3.0
4 months ago
crawlerinformation-gatheringosint
InfinityCrawler
A simple but powerful web crawler library for .NET
C#229mit
4 months ago
crawlerrobots-txtspider
pyspider
A Powerful Spider(Web Crawler) System in Python.
Python16128apache-2.0
10 months ago
crawlerpython
webmagic
A scalable web crawler framework for Java.
Java11022apache-2.0
5 months ago
crawlerframeworkjava
wombat
Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structur
Ruby1300mit
3 months ago
crawlerdslruby
colly
Elegant Scraper and Crawler Framework for Golang
Go21293apache-2.0
5 months ago
crawlercrawlingframework
bitmagnet
A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent se
Go1908mit
6 days ago
bittorrentdhtprowlarr
pagser
Pagser is a simple, extensible, configurable parse and deserialize html page to
Go94mit
6 months ago
collycrawlerdeserialization
shattered-pixel-dungeon
Description Shattered Pixel Dungeon is a traditional roguelike dungeon crawle
Java4313gpl-3.0
last month
androidgamegame-development
Appium-Native-Crawler
Appium Native Crawler CLI - Features include: Screenshots, Performance, Accessib
Ruby47apache-2.0
5 years ago
goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler
Go210apache-2.0
4 years ago
crawlergogolang
domfind
A Python DNS crawler to find identical domain names under different TLDs.
Python20other
5 years ago
crawlercybersecuritydns
Web2Warc
An easy-to-use and highly customizable crawler that enables you to create your o
Scala24mit
7 years ago
dungeons-of-noudar
A first person dungeon-crawler for DOS, written in C++, using software rendering
C++44bsd-2-clause
2 years ago
djgppdungeondungeons
open-source-search-engine
Nov 20 2017 -- A distributed open source search engine and spider/crawler writte
C++1481apache-2.0
2 years ago
Ragpicker
Ragpicker is a Plugin based malware crawler with pre-analysis and reporting func
Python90
9 years ago
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome
JavaScript162apache-2.0
4 years ago
browser-automationchromechrome-headless
Voight-Kampff
Voight-Kampff is a Ruby gem that detects bots, spiders, crawlers and replicants
Ruby176mit
last year
astray
Astray is a lua based maze, room and dungeon generation library for dungeon craw
Lua136zlib
7 years ago
dungeon-crawlerlove2dlua
cloudflare-block-bad-bot-ruleset
:vertical_traffic_light: Block malicious crawlers with Cloudflare Firewall Rules
190mit
4 years ago
cloudflarecloudflare-firewall-rulescrawler-detector