simplecrawler

Very straightforward, event-driven web crawler. Features a flexible queue interface and a basic cache mechanism with an extensible backend.

spiderhunt

Very straightforward web crawler built on EventEmitter. Based on simplecrawler, but uses a distributed queue system.
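
Both descriptions above combine the same two ideas: a crawler that reports progress through events, and a queue that can be swapped for another backend. The following is a minimal sketch of that pattern using only Node's built-in EventEmitter. It is not the API of either package: the class names (`MemoryQueue`, `SketchCrawler`), the event names (`fetched`, `queued`, `failed`, `done`), and the injected `fetchPage` function are all hypothetical, and the "site" is an in-memory object so the example runs without network access.

```javascript
const { EventEmitter } = require("node:events");

// Hypothetical in-memory queue. A distributed backend could expose the
// same two methods (add, next) and be passed to the crawler instead.
class MemoryQueue {
  constructor() {
    this.items = [];
    this.seen = new Set();
  }

  // Queues a URL once; returns true only if it was not seen before.
  add(url) {
    if (this.seen.has(url)) return false;
    this.seen.add(url);
    this.items.push(url);
    return true;
  }

  // Returns the oldest pending URL, or undefined when the queue is empty.
  next() {
    return this.items.shift();
  }
}

// Hypothetical crawler: fetches queued URLs one at a time and emits events.
class SketchCrawler extends EventEmitter {
  // fetchPage(url) is injected and must resolve to { body, links }.
  constructor(fetchPage, queue = new MemoryQueue()) {
    super();
    this.fetchPage = fetchPage;
    this.queue = queue;
  }

  async start(seedUrl) {
    this.queue.add(seedUrl);
    let url;
    while ((url = this.queue.next()) !== undefined) {
      try {
        const page = await this.fetchPage(url);
        this.emit("fetched", url, page.body);
        for (const link of page.links) {
          if (this.queue.add(link)) this.emit("queued", link);
        }
      } catch (err) {
        // A custom event name avoids EventEmitter's special "error" handling.
        this.emit("failed", url, err);
      }
    }
    this.emit("done");
  }
}

// Stand-in for real HTTP: a tiny made-up site held in memory.
const site = {
  "/": { body: "home", links: ["/a", "/b"] },
  "/a": { body: "page a", links: ["/", "/b"] },
  "/b": { body: "page b", links: ["/missing"] },
};

async function fakeFetch(url) {
  if (!(url in site)) throw new Error("not found: " + url);
  return site[url];
}

// Crawls the fake site and collects which URLs succeeded or failed.
async function runDemo() {
  const fetched = [];
  const failed = [];
  const crawler = new SketchCrawler(fakeFetch);
  crawler.on("fetched", (url) => fetched.push(url));
  crawler.on("failed", (url) => failed.push(url));
  await crawler.start("/");
  return { fetched, failed };
}

runDemo().then((result) => console.log(result));
```

Because the crawler only calls `add` and `next` on its queue, a different backend can be substituted by passing another object with those two methods as the second constructor argument. A real distributed queue would most likely need asynchronous versions of both.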