simplecrawler

Very straightforward, event driven web crawler. Features a flexible queue interface and a basic cache mechanism with extensible backend.

cache-proxy-plus

A method proxy library that makes it easy to implement local and remote cache with built-in usage statistics.