syphonx

SyphonX is a tool that extracts data from HTML and transforms it into JSON of any shape or size. It combines the power of CSS selectors and jQuery, regular expressions, and JavaScript into a declarative template format to elegantly solve the simplest

crawlbug

A Node.js web crawler that can be deployed to multiple machines and writes page data to a Firebase database.